Research Engineer developing agentic systems at Anthropic focused on LLMs and AI applications. Collaborating with researchers to enhance agent performance and tackle complex tasks.
Responsibilities
Ideate, develop, and compare the performance of different agent harnesses (eg memory, context compression, communication architectures for agents)
Design and implement rigorous quantitative benchmarks for large scale agentic tasks
Assist with automated evaluation of Claude models and prompts across the training and product lifecycle
Work with our product org to find solutions to our most vexing challenges applying agents to our products
Help create and optimize data mixes for model training that maximize Claude’s performance or ease of use on agentic tasks
Requirements
Have experience developing complex agentic systems using LLMs
Have significant software engineering and ML experience
Have spent time prompting and/or building products with language models
Have good communication skills and an interest in working with other researchers on difficult tasks
Have a passion for making powerful technology safe and societally beneficial
Stay up-to-date and informed by taking an active interest in emerging research and industry trends.
Enjoy pair programming (we love to pair!)
Strong candidates may also have experience with large-scale RL on language models and multi-agent systems
System Modelling Innovation Engineer at Electrolux developing advanced product development system models. Enhancing modeling techniques and optimizing product development for better consumer experiences.
R&D Engineer developing estimation and control strategies for Electrolux appliances. Collaborating with global teams to innovate product features and drive sustainability in consumer electronics.
Principal Research Engineer leading engineering activities in behavior autonomy for Scientific Systems. Overseeing critical technology deliverables, team management, and proposal efforts.
Staff Research Engineer involved in creating a neurosymbolic AI agent at Onton. Focused on optimal decision - making processes and addressing challenges in current AI systems.
Research Engineer focusing on decentralized AI training stack for Prime Intellect. Engaging in novel research, optimizing workloads, and contributing to open - source frameworks.
Post - Training Research Engineer at Baseten developing tooling for efficient AI model training. Collaborating on diverse architectures and systems - level concepts to enhance performance in AI applications.
AI Data Innovation Engineer developing and validating AI capabilities tied to governed enterprise data products at U.S. Bank. Collaborating on AI readiness efforts and supporting data product initiatives.
Research Engineer at Yooz, specializing in AI - driven document automation. Collaborating with R&D to develop innovative technologies and enhance document management solutions.
System Test & Research Engineer developing testing protocols and supporting improvements in Precision Agriculture solutions at Topcon. Collaborating with teams to ensure product quality and performance.
Senior Research Engineer developing mechanical designs for engine demonstrators at GKN Aerospace. Leading technology integration and collaborating across engineering disciplines in aeronautics.