Designs, builds, and maintains scalable ML infrastructure and pipelines for model training, deployment, and monitoring.
Optimizes orchestration processes to ensure efficient deployment and management of predictive models.
Optimizes resource usage to minimize infrastructure expense while maximizing performance.
Monitors and maintains the performance, security, and scalability of the ML infrastructure.
Collaborates with data scientists and software engineers to streamline the ML lifecycle from development to production.
Develops and maintains tools for data analysis, experimentation, model versioning, and artifact management.
Supports data and model governance requirements as needed.
Creates robust monitoring systems to measure and trend model performance, detect model drift, and ensure optimal performance of models in production.
Develops automation scripts and tools to improve the efficiency and reliability of MLOps processes.
Optimizes ML workflows for efficiency, scalability, and reliability.
Provides technical assistance and mentorship to all team members; troubleshoots complex issues and escalates issues as necessary.
Supports the company commitment to risk management and protecting the integrity and confidentiality of systems and data.
Requirements
Minimum 5 years’ experience in Data Science, ML Engineering or ML Ops capacity.
Strong programming skills in Python and experience with Data Science and ML packages and frameworks.
Experience with AWS services.
Proficiency with containerization technologies (Docker, Kubernetes) and CI/CD practices.
Experience deploying models with MLOps tools such as MLflow, Kubeflow, or similar platforms.
Expert understanding of data management, distributed computing, and software architecture principles.
Proven experience delivering real-time models in production environments.
Background and drug screen.
Benefits
Healthcare Coverage – Competitive medical (PPO/HDHP), dental, and vision plans as well as company contributions to your Health Savings Account (HSA) or pre-tax savings through flexible spending accounts (FSA) for commuting, health & dependent care expenses.
401(k) Retirement Plan – Featuring a 100% Company Safe Harbor Match on your first 6% deferral immediately upon eligibility.
Paid Time Off – Unlimited Time Off for Exempt (salaried) employees, as well as generous PTO for Non-Exempt (hourly) employees, plus 11 paid company holidays and a paid volunteer day.
12 weeks of Paid Parental Leave
Maven Family Planning – provides support through your Parenting journey including egg freezing, fertility, adoption, surrogacy, pregnancy, postpartum, early pediatrics, and returning to work.
Machine Learning Engineer responsible for optimizing AI pipelines at Easy2Parts. Join a growing team to revolutionize component sourcing with AI technology.
AI/ML Engineer developing and deploying machine learning solutions for Nokia's network optimization projects. Collaborating with cross - functional teams to enhance network planning capabilities.
Machine Learning Platform Engineer for Coinbase, building foundational components for ML at scale. Collaborating on fraud combat, personalizing user experiences, and blockchain analysis.
Machine Learning Engineer focused on building sophisticated models to protect Coinbase users from fraud. Engaging in hands - on technical role with modern AI/ML methodologies.
Senior ML Platform Engineer developing and maintaining scalable ML infrastructure at GEICO. Focused on Large Language Models and collaborating with data science and engineering teams.
Staff ML Engineer developing GenAI infrastructure at Zendesk. Leading design and optimization of ML platforms while fostering technical excellence and collaboration.
Senior Deep Learning Engineer developing deep learning models for wireless communications. Working on next - gen signal processing and radio access technologies at NVIDIA's Vietnam R&D center.
Leading a team of ML Engineers to design and deploy AI - driven solutions at Welldoc. Overseeing critical ML projects while collaborating with international teams.
Senior ML Platform Engineer building and scaling machine learning infrastructure for AI applications. Responsible for LLM deployment, Kubernetes management, and mentoring engineering teams.
Internship in AI and machine learning for internal process optimization at Dräger. Collaborate with cross - functional teams and develop predictive models in Lübeck.