Senior ML Platform Engineer building and scaling machine learning infrastructure for AI applications. Responsible for LLM deployment, Kubernetes management, and mentoring engineering teams.
Responsibilities
Build and scale machine learning infrastructure focused on Large Language Models (LLMs) and AI applications
Design and implement scalable infrastructure for training, fine-tuning, and serving open source LLMs
Architect and manage Kubernetes clusters for ML workloads
Ensure 99.9%+ uptime for ML platforms through robust monitoring
Mentor junior engineers and data scientists on platform best practices
Collaborate with data scientists and product engineering teams
Present technical solutions and platform roadmaps to leadership
Requirements
Bachelor’s degree in computer science, Engineering, or related technical field (or equivalent experience)
5+ years of software engineering experience with focus on infrastructure, platform engineering, or MLOps
2+ years of hands-on experience with machine learning infrastructure and deployment at scale
1+ years of experience working with Large Language Models and transformer architectures
Proficient in Python; strong skills in Go, Rust, or Java preferred
Proven experience working with open source LLMs (Llama 2/3, Qwen, Mistral, Gemma, Code Llama, etc.)
Proficient in Kubernetes including custom operators, helm charts, and GPU scheduling
Deep expertise in Azure services (AKS, Azure ML, Container Registry, Storage, Networking)
Benefits
Comprehensive Total Rewards program that offers personalized coverage
Health insurance
401(k) savings plan vested from day one that offers a 6% match
Performance and recognition-based incentives
Tuition assistance
Workplace flexibility as well as GEICO Flex program allowing work from anywhere in the US for up to four weeks per year
Machine Learning Engineer responsible for implementing and maintaining data science models in bpx’s machine learning studio. Bridging data science and computational needs to achieve business outcomes.
Machine Learning Engineer at DentalMonitoring developing AI solutions for orthodontics. Responsibilities include model development, evaluation, deployment, and performance monitoring.
Machine Learning Engineer at Hiscox working on fraud detection and generative AI projects. Collaborating closely with data scientists and engineers to solve complex business challenges.
Internship focusing on programming robotic arms and using machine learning in simulations at Fraunhofer IIS. Opportunity to gain practical experience and contribute to innovative research.
Senior Machine Learning Engineer at Itaú, driving innovation with data and AI solutions. Collaborating across teams to implement robust machine learning architectures and ensure scalable deployments.
Machine Learning Engineer responsible for developing and deploying advanced ML and AI solutions at Zendesk. Collaborating with stakeholders to deliver impactful business outcomes using latest machine learning technologies.
Lead advanced machine learning model development and optimization at PayPal. Collaborate with teams to deploy scalable ML solutions in production environments.
Senior Machine Learning Engineer at Pivotal Health developing ML systems for healthcare reimbursement. Collaborating across teams to build and maintain reliable, production - grade machine learning systems.
Machine Learning Engineer working with Algorithm team on customer onboarding processes. Focus on execution and automation of models using computer vision and AI in sports industry.
Senior Machine Learning Engineer at Troveo designing and optimizing machine learning pipelines for AI video models. Collaborating with cross - functional teams to build scalable video data solutions.