Senior Engineering Manager for Hybrid Services & Reliability within AV Core Infrastructure at GM. Leading a team for the measurable availability of hybrid cloud systems for autonomous vehicle development.
Responsibilities
Define, measure, and enforce strict SLOs/SLIs for critical hybrid cloud services, including network connectivity and compute readiness.
Own and manage core on-prem utilities, such as DHCP, PXE, and CDN, to ensure seamless server auto-provisioning across the global fleet.
Manage the entire data flow path, from initial ingestion at the test bench through the secure cloud network into production staging.
Guarantee the 99%+ availability and stability of remote CI-based Hardware-in-the-Loop (HIL) benches required for AV safety validation.
Actively lead the recruitment and technical mentorship of Senior and Staff ICs as part of the team's expansion.
Requirements
Extensive background in Site Reliability Engineering (SRE) and defining SLO/SLI frameworks for hybrid cloud environments.
Technical proficiency in managing on-prem Linux utilities (DHCP/PXE/NTP) and core development services.
Opinionated view on automated observability, incident response, and MTTR reduction.
Proven leadership experience.
Experience with configuration management tools (e.g., Chef, Ansible) for large-scale, remote hardware fleets.
Benefits
GM offers a variety of health and wellbeing benefit programs.
Benefit options include medical, dental, vision, Health Savings Account, Flexible Spending Accounts, retirement savings plan, sickness and accident benefits, life insurance, paid vacation & holidays, tuition assistance programs, employee assistance program, GM vehicle discounts and more.
DevOps Specialist creating and overseeing Azure hybrid cloud infrastructures for EVLO's battery energy storage solutions. Collaborating with teams to implement cutting - edge technologies in a dynamic environment.
Software Quality and Release Engineer developing and maintaining C++/Python software solutions for aerospace and defense industry. Collaborating on CI/CD automation and feedback documentation.
Site Reliability / DevOps Engineer developing Big Data platforms for clients in Telco and Retail industries. Focus on stability, scalability, and performance of large - scale data processing systems.
Senior DevOps Engineer building and managing big data platforms for clients in telecommunications and finance industries. Ensuring stability, scalability, and performance across cloud and on - premise environments.
Site Reliability Engineer ensuring reliability, automation, and observability across cloud infrastructures for Diligent. Leading initiatives to improve performance in fast - paced environments.
Senior DevOps Engineer leading DevOps design and implementation for gaming projects at Stillfront. Collaborating with international teams to enhance gaming infrastructure and reduce costs.
Mainframe DevOps Engineer at Kyndryl enhancing mainframe delivery practices and migrating SCM to Azure DevOps. Requires extensive Mainframe development experience and DevOps skills.
DevOps/MLOps Engineer designing, automating, and maintaining scalable infrastructure for federal client. Collaborating with software engineers and data scientists for resilient solutions.