MLOps Engineer responsible for managing PyTorch-based training and inference workloads at Menlo HQ. Building and maintaining robust infrastructure for AI models and optimization processes.
Responsibilities
Own and evolve the infrastructure behind PyTorch-based training and inference workloads
Build and maintain training and inference pipelines using PyTorch
Own and evolve inference serving infrastructure
Write and maintain robust tooling in Python and C++
Optimize compute workloads for bare-metal environments
Troubleshoot low-level networking issues
Set up and manage ML environments
Establish CI/CD patterns for AI workloads
Integrate monitoring, alerting, and incident response
Requirements
Deep expertise in PyTorch internals
Strong programming skills in Python and C++
Solid computer science fundamentals
Hands-on experience with vLLM and SGLang
Experience with RLHF and PPO training pipelines
Strong understanding of distributed training setups
Experience debugging and tuning bare-metal Linux servers
Familiarity with job schedulers such as Airflow
Strong grasp of containerized and cloud-native environments
Senior Developer building and evolving ML/AI applications on AWS for Valorem Reply. Collaborating closely with product, architecture, and engineering teams for quality solutions.
Senior Developer at Valorem Reply delivering ML/AI applications on AWS. Collaborating with product and engineering teams to provide high - quality tech solutions.
Senior Software Engineer designing and operating ML infrastructure for Plaid's AI initiatives. Collaborating with product teams to accelerate AI - powered financial experiences and ensure scalable ML systems.
Senior ML Engineer serving as an individual contributor in generative AI at GEICO. Collaborating with teams to design, develop, and deploy AI systems that drive business value.
Senior Staff Machine Learning Engineer at GEICO, enhancing service productivity through AI technologies. Collaborating with dynamic teams to develop and deploy scalable AI workflows across Geico.
Staff AI Engineer at GEICO designing and deploying AI platforms for virtual agent workflows. Collaborating with teams to improve service for millions of customers.
Machine Learning Engineer at Tilt, developing personalisation solutions across various app surfaces. Collaborate with teams to enhance recommendation systems on a video - first shopping platform.
Senior Machine Learning Engineer architecting next - generation AI platforms for healthcare and fintech with Nitra's diverse team. Focused on data pipelines, ML infrastructure, and production - ready AI systems.
Senior Machine Learning Engineer architecting and building Nitra's data and AI platform. Driving intelligent products across healthcare and fintech industries with applied AI and platform engineering.
Machine Learning Engineer developing and implementing ML models for lending at Blue Whale Lending LLC. Collaborating with teams to enhance data insights and validate model performance.