Platform Engineer leading the development of cloud application platforms for Allstate. Responsible for cloud infrastructure for ML experimentation and production deployments.
Responsibilities
Lead the design, build, and operation of cloud infrastructure supporting ML experimentation, training, and production deployments
Define technical direction and best practices for ML platforms, MLOps, reliability, and cloud infrastructure
Architect ML platforms for high availability, fault tolerance, and resiliency across supported environments
Build and oversee CI/CD pipelines and automation for infrastructure and ML workflows
Champion MLOps best practices including model versioning, validation, promotion, monitoring, and rollback strategies
Mentor engineers through design reviews, code reviews, and hands-on technical leadership
Requirements
Proven experience leading cloud platform or infrastructure initiatives
Strong hands-on experience with cloud platforms (Azure, AWS, and/or GCP)
Deep knowledge of infrastructure as code, automation, CI/CD, and reliability engineering
Experience designing highly available and resilient distributed systems
Experience with ML platforms or MLOps tooling (e.g., MLflow, Kubeflow, Azure ML, SageMaker, Vertex AI)
Familiarity with observability tools (e.g., Datadog, ELK, New Relic, Prometheus)
Strong communication skills and a leadership mindset
6 or more years of experience (Preferred)
Benefits
Joining our team isn’t just a job — it’s an opportunity
One that takes your skills and pushes them to the next level
One that encourages you to challenge the status quo
One where you can shape the future of protection while supporting causes that mean the most to you
DevOps Engineer designing and operating AWS infrastructure within industrial IoT environments. Working on systems that ensure security, resilience, and end - to - end observability.
Sr. Site Reliability Engineer (SRE) III providing technical solutions for the federal government. Collaborating in a high - performing team focused on reliability and application scalability.
Senior Linux System Engineer developing and maintaining Linux server infrastructure for Th. Geyer GmbH. Collaborating on ERP systems and CI/CD processes while ensuring system performance and security.
Cloud Platform Engineer (ML DevOps) developing and managing CI/CD pipelines for ML workflows in a leading insurance company. Collaborating with data scientists and ensuring infrastructure security and compliance.
DevOps Engineer developing and managing container platforms for client solutions at Booz Allen Hamilton. Utilizing cloud technologies to enhance capabilities and secure deployments.
Senior DevOps/Platform Engineer automating cloud infrastructure and optimizing delivery pipelines at S&P Global Mobility. Collaborating with teams to enhance product reliability and security.
DevOps Engineer responsible for maintaining and enhancing AWS/EKS platform for energy transition products. Ensuring platform stability, security compliance, and streamlined deployment processes.
Suspension Design and Release Engineer for Ford, impacting vehicle ride, handling, and NVH. Collaborating with cross - functional teams to deliver quality systems and components.
DevOps Engineer at TeamViewer driving DevOps excellence by building CI/CD pipelines and managing Kubernetes. Collaborate within a diverse team to optimize digital processes with cloud infrastructure.
Senior DevOps Engineer managing DevOps processes and tooling for customer - facing platforms at Luminor. Building CI/CD pipelines and providing production support with a focus on mentoring and collaboration.