Staff AI Scientist specializing in machine learning fundamentals at HackerRank. Leading rigorous AI evaluation and dataset construction efforts across teams in a hybrid setting.
Responsibilities
Design, prepare, and curate high-quality evaluation datasets with defensible methodology.
Define criteria for dataset construction, ensuring statistical rigor, reproducibility, and fairness.
Develop new metrics and evaluation frameworks to measure model performance in nuanced ways.
Evaluate LLMs and other pre-trained models using carefully chosen datasets and metrics.
Build scalable pipelines for training, fine-tuning, and benchmarking models.
Contribute to projects involving fine-tuning, retrieval-augmented generation (RAG), and other adaptation methods.
Partner with product and engineering to align scientific rigor with business outcomes.
Define evaluation standards and ML lifecycle practices that raise the bar across the company.
Mentor scientists and engineers, guiding best practices in experimentation, statistics, and ML development.
Requirements
Master’s degree (PhD preferred) in Computer Science, Statistics, Machine Learning, or a related quantitative field.
Strong background in mathematical and statistical foundations of machine learning (probability, linear algebra, optimization, experimental design).
Demonstrated experience in end-to-end ML lifecycle: dataset preparation, model training, evaluation, deployment, and monitoring.
Proven expertise in evaluation dataset design and metric creation, not just using existing benchmarks but knowing when and how to improve them.
Experience with LLM evaluation, fine-tuning, and RAG, with the engineering skills to build production-ready pipelines.
Track record of strategic impact at a staff or principal level setting evaluation and research standards across teams.
Lead AI research initiatives at Galileo, focusing on generative AI and machine learning models. Collaborate with cross - functional teams to enhance AI - driven products and tools in an innovative environment.
Lead Data & AI Scientist delivering innovative AI solutions and insights for Business & Commercial Banking. Responsible for shaping the capability roadmap and mentoring the Data Science & AI team.
Lead Data & AI Scientist at Lloyds Banking Group delivering AI - driven solutions for Banking. Shaping technical roadmaps and mentoring teams in advanced AI practices and deployments.
AI Research Scientist at Lendbuzz developing conversational AI and agentic AI systems. Leading research direction and collaborating with cross - functional teams in a hybrid work environment.
Machine Learning Researcher at Longshot Systems designing and implementing predictive models for sports betting analytics. Involvement in all aspects of R&D from design to implementation.
Lead AI Researcher at Lloyds Banking Group advancing AI transformation through innovative technologies and ethical practices. Collaborates across teams to solve complex financial challenges.
Machine Learning Researcher developing novel AI solutions for impactful products at RBC Borealis. Conducting publishable research and collaborating with development teams to transfer research to production.
Lead AI Scientist at Lloyds Banking Group leading technical delivery of AI solutions. Collaborate with diverse teams to solve financial challenges and create innovative banking solutions.
AI Research Scientist exploring quantum advantage in AI applications for real - world problems. Collaborating across disciplines to develop innovative solutions at Thales in Montreal.
Data & Evaluation Applied AI Scientist at SES AI Corp., focusing on integrating AI into battery technology for energy transition. Collaborating with experts in a dynamic, innovative environment for impactful scientific projects.