Research Scientist focusing on reinforcement learning for training large language models at Snorkel AI. Collaborating with research and engineering teams to advance RL data capabilities.
Responsibilities
Research and implement reinforcement learning techniques including GRPO, RLHF, RLAIF, DPO, and reward modeling
Design and build data pipelines that generate high-quality training signal for RL workflows
Prototype and iterate on end-to-end RL training recipes
Work closely with research scientists, ML engineers, and delivery teams
Stay current with the latest developments in large-scale muli-node LLM training
Requirements
Deep expertise in reinforcement learning from human or AI feedback
Experience training or fine-tuning 30B+ large language models at scale
Strong proficiency in Python and ML frameworks, especially PyTorch and HuggingFace
Solid software engineering fundamentals
Familiarity with ML infrastructure and cloud platforms
Comfort operating in a high-iteration environment
Ph.D. in machine learning, reinforcement learning, or a related field strongly preferred
Research Assistant supporting social science research initiatives at Vanderbilt University. Involves assisting with study procedures, data collection, and collaboration with research teams.
Research Assistant part of Vanderbilt School of Nursing conducting studies following research guidelines. Requires strong communication and organizational skills to manage research activities.
Research Associate at HHL Leipzig Graduate School focusing on economic psychology and leadership. Engaging in research projects while supporting teaching and supervising students' theses.
Senior Research Scientist focusing on quantum algorithms, driving near - term quantum advantage. Working on cutting - edge applications in a multidisciplinary environment.
Staff/Senior Research Scientist driving development of AI frontier benchmarks and datasets at Snorkel AI. Collaborating across teams to scale production and impact research community.
Research Scientist conducting original research in AI and machine learning for Spotify's Personalization team. Developing methodologies and collaborating with partners to advance personalization systems.
Research Scientist at Valence Labs developing ML models for predicting cellular responses in drug discovery. Building generative models based on massive multiomics datasets with collaborative research.
Research Assistant responsible for statistical data analysis under a Principal Investigator for health science projects. Involves data management, statistical analyses, and documentation of workflows.
Temporary Research Assistant supporting data collection for Medical Ethics at University of Pennsylvania. Engaging in programming activities and organizing research - related documentation.
Machine Learning Research Scientist conducting applied AI/ML research at SEI. Developing prototype capabilities for government workflows with a focus on mission context.