DevOps Engineer responsible for building and maintaining scalable AI systems on Azure cloud. Collaborating with teams to ensure operational excellence for enterprise-grade AI solutions.
Responsibilities
Design, implement, and maintain MLOps pipelines for AI/ML assets deployed as managed online endpoints in Azure Machine Learning.
Implement CI/CD workflows for AI solutions using Azure DevOps and Azure CLI.
Ensure compliance, security, and scalability of AI systems across environments.
Build monitoring dashboards to track performance, data drift, and system health, and implement alerting and remediation strategies.
Manage promotion of AI/ML assets (models, apps, containers) across development, staging, and production environments.
Automate deployment, monitoring, and lifecycle management of AI systems, including Azure Speech Services and OpenAI models.
Assist in deploying and maintaining Flask-based applications that consume Azure ML endpoints.
Requirements
5+ years of experience in MLOps or related roles with a focus on Azure cloud platform
Strong proficiency in Azure AI services (Azure Machine Learning, Cognitive Services, Azure Speech Services, Azure OpenAI)
Hands-on experience with CI/CD pipelines using Azure DevOps and Azure CLI
Proficiency in Python and experience with Azure SDKs for AI services
Solid understanding of containerization (Docker) and orchestration (Kubernetes, AKS)
Experience with monitoring and logging solutions for AI systems (Azure Monitor, Application Insights) and building dashboards
Knowledge of identity and access management in Azure (Managed Identity, RBAC)
Strong understanding of AI/ML asset lifecycle management, including promotion across environments.
Benefits
25 days holiday, increasing through length of service, with option to buy or sell
Bupa health insurance as a benefit in kind
An enhanced pension plan and life insurance
Onsite gyms or local discounts where no onsite gym available
Junior MLOps Engineer helping to design and maintain AI/ML systems at Bupa. Collaborating with teams to operationalize machine learning models and automate workflows.
DevOps Engineer developing and managing scalable AWS infrastructures for a PropTech startup. Collaborating within a growing tech team to achieve ambitious goals in the legal conveyancing space.
Senior DevOps Engineer leading the design and optimization of cloud infrastructure at Growth Acceleration Partners. Ensuring secure and cost - effective deployments within fast - paced product development environment.
Advanced Dev Ops Engineer optimizing infrastructure solutions for engineering teams at a consulting and technology services company. Ensuring secure and cost - effective deployments in a fast - paced environment.
Entry - level DevOps Engineer at Nokia focusing on building and maintaining CI environment for LTE and 5G solutions. Engage with high - end telecommunication technologies and support development workflows.
AI Security Control Developer/Site Reliability Engineer for RBC's enterprise AI ecosystem. Design, implement, and validate security controls to protect AI systems with 24/7 reliability.
Senior Site Reliability Engineer ensuring scalability and reliability for NGINX systems and SaaS platforms. Collaborating across teams to drive automation and system performance.
Site Reliability Engineer ensuring reliability and performance of data platform services for Veepee. Collaborating on cloud migration, Kubernetes operations, and observability best practices.
Senior Lead Site Reliability Engineer overseeing critical systems stability and incident management. Leading Java applications reliability and supporting a dynamic technology environment.
Infrastructure Architect connecting clients and Kyndryl. Leading projects from start to finish, ensuring technical solutions meet client needs at Kyndryl.