Staff AI Scientist specializing in machine learning fundamentals at HackerRank. Leading rigorous AI evaluation and dataset construction efforts across teams in a hybrid setting.
Responsibilities
Design, prepare, and curate high-quality evaluation datasets with defensible methodology.
Define criteria for dataset construction, ensuring statistical rigor, reproducibility, and fairness.
Develop new metrics and evaluation frameworks to measure model performance in nuanced ways.
Evaluate LLMs and other pre-trained models using carefully chosen datasets and metrics.
Build scalable pipelines for training, fine-tuning, and benchmarking models.
Contribute to projects involving fine-tuning, retrieval-augmented generation (RAG), and other adaptation methods.
Partner with product and engineering to align scientific rigor with business outcomes.
Define evaluation standards and ML lifecycle practices that raise the bar across the company.
Mentor scientists and engineers, guiding best practices in experimentation, statistics, and ML development.
Requirements
Master’s degree (PhD preferred) in Computer Science, Statistics, Machine Learning, or a related quantitative field.
Strong background in mathematical and statistical foundations of machine learning (probability, linear algebra, optimization, experimental design).
Demonstrated experience in end-to-end ML lifecycle: dataset preparation, model training, evaluation, deployment, and monitoring.
Proven expertise in evaluation dataset design and metric creation, not just using existing benchmarks but knowing when and how to improve them.
Experience with LLM evaluation, fine-tuning, and RAG, with the engineering skills to build production-ready pipelines.
Track record of strategic impact at a staff or principal level setting evaluation and research standards across teams.
Finance Intern supporting AI research by building and executing financial models. Collaborating with senior professionals to enhance AI's understanding of financial markets.
Machine Learning Researcher enhancing AI in K - 12 education at Kiddom. Driving significant improvements in teaching experiences and student outcomes with innovative AI applications.
Build neural networks for autonomous vehicle technology at Mobileye, focusing on deep learning model design and deployment. Collaborate with teams to ensure high - impact research solutions.
As a Research Student at LILT, you'll evaluate AI models for multilingual tasks and work with leading global labs. Opportunity for publishing and innovative research in AI.
Join Saviynt's AI Research Team to develop identity security solutions for AI agents. Collaborate to innovate and implement AI technologies for identity management.
ML/AI Scientist developing and optimizing computational drug discovery models at Orion Pharma. Collaborating on innovative ML/AI methodologies impacting drug discovery decisions.
AI Research Engineer developing AI - first products for Homeprotect's insurance services. Building and automating AI solutions for internal productivity and claims handling.
Director leading applied AI research initiatives for Snorkel AI. Overseeing team management and collaboration across product, engineering, and research departments.
AI Research Engineer at PostHog working on achieving product autonomy through data - driven model training. Collaborating closely with teams and utilizing extensive data for product development.
AI Scientist at Xaira Therapeutics leveraging AI to advance drug discovery and develop innovative therapeutics. Collaborating with a diverse team to solve complex scientific challenges.