Build, design, and maintain foundational LLM infrastructure, tooling, and reusable libraries to enable product teams
Operationalize observability tools like LangSmith and create shared patterns for orchestration frameworks
Deploy and build pipelines for vector databases to enable Retrieval-Augmented Generation (RAG) across product teams
Contribute to proofs-of-concept and architectural groundwork for future-state LLM capabilities
Add logging, monitoring, and alerting to platform services to ensure stability, performance, and cost-effectiveness
Partner with feature teams as a consultant and enabler to understand needs and unblock AI roadmaps
Deliver roadmap items on schedule and produce high-quality technical designs and maintainable code
Mentor and share knowledge to improve internal processes and developer enablement
Requirements
Demonstrated experience building and delivering production-grade software, with hands-on experience in LLM or Generative AI engineering
Experience building internal tools, libraries, or platforms and strong API design and documentation skills
Strong proficiency in Python
Familiarity with modern AI/ML stack, including cloud services (AWS, GCP, or Azure) and CI/CD pipelines
Hands-on experience with orchestration frameworks (e.g., LangChain, LangGraph) and observability tools (e.g., LangSmith)
Experience with Retrieval-Augmented Generation (RAG) infrastructure and vector databases
Ability to design, build, and maintain LLM infrastructure, tooling, logging, monitoring, and alerting
Pragmatic problem-solving, risk identification, and sprint planning skills
High degree of grit, ownership, and ability to work in fast-paced high-growth environments
Bachelor's degree in a technical field or equivalent practical experience
Deep understanding of handling sensitive data, security and privacy awareness
Benefits
Health, Dental, Vision benefits start on your first day
One Medical access
HSA and FSA plans available with Spring contributing up to $1K for HSAs
Employer sponsored 401(k) match of up to 2%
Yearly allotment of no cost visits to the Spring Health network of therapists, coaches, and medication management providers for you and your dependents
Competitive paid time off including vacation, sick leave and company holidays
Parental leave at 6 months: 18 weeks for birthing parents and 16 weeks for non-birthing parents
Access to Noom weight management program
Fertility care support through Carrot and $4,000 reimbursement for related fertility expenses
Access to Wellhub subscription for fitness, mindfulness, nutrition, and sleep
Access to BrightHorizons for sponsored child care, back-up care, and elder care
Up to $1,000 Professional Development Reimbursement per year
Machine Learning Engineer responsible for optimizing AI pipelines at Easy2Parts. Join a growing team to revolutionize component sourcing with AI technology.
AI/ML Engineer developing and deploying machine learning solutions for Nokia's network optimization projects. Collaborating with cross - functional teams to enhance network planning capabilities.
Machine Learning Platform Engineer for Coinbase, building foundational components for ML at scale. Collaborating on fraud combat, personalizing user experiences, and blockchain analysis.
Machine Learning Engineer focused on building sophisticated models to protect Coinbase users from fraud. Engaging in hands - on technical role with modern AI/ML methodologies.
Senior ML Platform Engineer developing and maintaining scalable ML infrastructure at GEICO. Focused on Large Language Models and collaborating with data science and engineering teams.
Staff ML Engineer developing GenAI infrastructure at Zendesk. Leading design and optimization of ML platforms while fostering technical excellence and collaboration.
Senior Deep Learning Engineer developing deep learning models for wireless communications. Working on next - gen signal processing and radio access technologies at NVIDIA's Vietnam R&D center.
Leading a team of ML Engineers to design and deploy AI - driven solutions at Welldoc. Overseeing critical ML projects while collaborating with international teams.
Senior ML Platform Engineer building and scaling machine learning infrastructure for AI applications. Responsible for LLM deployment, Kubernetes management, and mentoring engineering teams.
Internship in AI and machine learning for internal process optimization at Dräger. Collaborate with cross - functional teams and develop predictive models in Lübeck.