MLOps Engineer designing and maintaining cloud infrastructure for large-scale computer vision model training. Collaborating with Data Scientists and AI Engineers to streamline model development lifecycle.
Responsibilities
Build & Maintain ML Infrastructure: Design, implement, and maintain our cloud-based infrastructure for large-scale computer vision model training and data management.
Automate ML Pipelines: Engineer and deploy automated, production-grade ML pipelines for seamless data processing, model training, validation, and deployment.
Enable AI/ML Teams: Collaborate directly with Data Scientists and AI Engineers to streamline and accelerate the entire model development lifecycle.
Ensure Scalability & Reliability: Architect and operate robust, secure, and efficient infrastructure for our large-scale AI solutions.
Requirements
Production MLOps Experience: Strong, relevant work experience operating and scaling machine learning systems and AI workflows in a production environment.
Kubernetes Mastery: Deep, hands-on proficiency with Kubernetes for scheduling and scaling ML training jobs and complex workloads.
ML Pipeline Expertise: Proven ability to build, manage, and troubleshoot ML pipelines and serving infrastructure. Direct experience with Argo Workflows and ArgoCD is an advantage.
MLOps Tooling: Proficiency with modern MLOps tools, especially MLFlow for experiment tracking and model management.
Infrastructure as Code (IaC): Solid practical experience managing cloud infrastructure using Terraform.
Pragmatic Problem-Solver: Demonstrated ability to quickly and independently solve complex technical challenges with reliable, scalable solutions.
AI/ML Engineer III designing and architecting AI solutions at Hewlett Packard Enterprise. Collaborating with teams to drive innovation and tackle complex problems.
AI/ML Engineer deploying state - of - the - art AI models to solve real - world problems at Brain Co. Working in healthcare, government, and energy sectors for impactful results.
Trainer at WeAndTheMany facilitating learning by sharing experiences and creating interactive sessions. Engaging with students to enhance their skills and knowledge through dynamic teaching methods.
Machine Learning Manager leading experienced team to drive data - driven AI/ML solutions at Ford. Overseeing strategies for product development focused on analytics in various domains.
Software Engineer I developing machine learning models and applications at Smart Data Solutions. Collaborating to improve infrastructure and automate processes using AI technology.
Intermediate Machine Learning Engineer at Aviva Canada implementing ML pipelines with required collaboration in AI/ML Operations. Join a team dedicated to operationalizing ML models for optimizing solutions.
Machine Learning Engineer developing AI - first dating solutions at Hinge, enhancing user matchmaking and conversation experience. Collaborating with cross - functional teams to move ML models to production.
Senior ML Engineer designing and developing machine learning models for national security. Collaborating with cross - functional teams to deliver scalable solutions in defense applications.
Machine Learning Engineer developing and deploying ML planning algorithms for autonomous trucks. Join Plus, a leader in AI - based virtual driver software for autonomous trucking.
Intern for Servo Engineering at Seagate, integrating AI/ML into precision servo design. Collaborating on research and optimization of control algorithms for hard disk systems.