Staff AI Scientist specializing in machine learning fundamentals at HackerRank. Leading rigorous AI evaluation and dataset construction efforts across teams in a hybrid setting.
Responsibilities
Design, prepare, and curate high-quality evaluation datasets with defensible methodology.
Define criteria for dataset construction, ensuring statistical rigor, reproducibility, and fairness.
Develop new metrics and evaluation frameworks to measure model performance in nuanced ways.
Evaluate LLMs and other pre-trained models using carefully chosen datasets and metrics.
Build scalable pipelines for training, fine-tuning, and benchmarking models.
Contribute to projects involving fine-tuning, retrieval-augmented generation (RAG), and other adaptation methods.
Partner with product and engineering to align scientific rigor with business outcomes.
Define evaluation standards and ML lifecycle practices that raise the bar across the company.
Mentor scientists and engineers, guiding best practices in experimentation, statistics, and ML development.
Requirements
Master’s degree (PhD preferred) in Computer Science, Statistics, Machine Learning, or a related quantitative field.
Strong background in mathematical and statistical foundations of machine learning (probability, linear algebra, optimization, experimental design).
Demonstrated experience across the end-to-end ML lifecycle: dataset preparation, model training, evaluation, deployment, and monitoring.
Proven expertise in evaluation dataset design and metric creation: not just using existing benchmarks, but knowing when and how to improve them.
Experience with LLM evaluation, fine-tuning, and RAG, with the engineering skills to build production-ready pipelines.
Track record of strategic impact at a staff or principal level, setting evaluation and research standards across teams.
Machine Learning Engineer at Salesforce AI Research working closely with teams to design agentic AI systems. Innovating at the frontier of AI with transformative solutions for Salesforce customers.
Principal Data Scientist leading design, development, and deployment of AI solutions for PEMCO's insurance portfolio. Working closely with teams to drive measurable impact through data science.
Cloud & AI Researcher joining PointFive to optimize cloud costs through research and innovative solutions. Collaborating with product teams to enhance platform features and publish technical content.
AI Data Scientist driving research and development for Pearson’s learner models and next-gen AI-driven learning products. Collaborating with teams to implement advanced AI strategies.
AI Research Lead defining research agenda for 1mind’s multimodal AI models. Leading exploratory research and building a world-class applied research team in a high-impact role.
AI Research Resident exploring foundational AI systems from first principles at Maincode, an AI research lab building systems that move humans forward.
AI Research Engineer developing sophisticated workflows leveraging LLMs at Cisco. Collaborating with teams to ensure security and scalability of AI solutions.
AI Scientist at Preply applying deep learning and NLP for personalized learning solutions. Collaborating with cross-functional teams to translate AI research into impactful educational tools.
Senior Applied AI Scientist focusing on AI Learning at Preply, leveraging deep learning for personalized education solutions. Collaborating with teams to enhance learning experiences globally.