Senior Site Reliability Engineer responsible for reliability and performance of Ford Service Reservation Platform. Leading SRE practices and technical initiatives in a hybrid role based in Dearborn, MI.
Responsibilities
Ensure the reliability, performance, and scalability of the Ford Service Reservation Platform and its associated applications.
Lead the implementation and continuous evolution of Site Reliability Engineering (SRE) practices.
Define, implement, and maintain Service Level Objectives (SLOs) and Service Level Indicators (SLIs).
Collaborate with engineering teams to prioritize reliability work and incident follow-ups.
Own, evolve, and optimize observability solutions using Dynatrace.
Develop and deploy infrastructure as code using Terraform scripts.
Establish and refine Incident Management and Problem Management processes.
Requirements
Bachelor’s degree in Computer Science, Computer Engineering, Systems Engineering or equivalent combination of relevant education and experience.
7+ years of experience in Software Engineering, DevOps, or Systems Administration.
5+ years of dedicated experience in a Site Reliability Engineering (SRE) or Platform Engineering role.
2+ years of experience leading technical initiatives or mentoring junior engineers in an SRE context.
Master’s degree in Computer Science, Computer Engineering, Systems Engineering or related field (even better).
Certifications:
Google Professional Cloud Architect or Google Professional Cloud DevOps Engineer.
Dynatrace Professional Certification.
Terraform Associate Certification.
Platform experience working on high-traffic reservation systems, e-commerce platforms, or automotive service applications.
Benefits
Visa sponsorship is available for this position.
Equal Opportunity Employer.
Reasonable accommodation for the online application process due to disability.
DevOps Engineer automating software delivery processes for energy systems in Sweden. Collaborating with development teams and enhancing operational environments for a growing organization.
Senior Site Reliability Engineer focused on developing and maintaining OpenShift - based platform solutions at Red Hat. Responsible for software automation, onboarding new services, and maintaining service reliability.
Site Reliability Engineer at Red Hat designing Python and Golang solutions for managed services. Involves onboarding services, maintaining reliability, and fostering team excellence.
Development Operations Engineer supporting enterprise application development in Java and/or C. Ensuring high availability and operational excellence in modern payment solutions.
Site Reliability Engineer designing and supporting Kubernetes environments for F5's UDF platform. Collaborating with cross - functional teams to ensure reliability and operational excellence.
Senior Site Reliability Engineer ensuring operational excellence for multi - datacenter infrastructure at F5. Developing automation tools and APIs in Python and Go.
DevOps Engineer needed to develop a new OpenXDR solution on AWS, processing security data from multiple sources. Join a leading cybersecurity company in Slovakia.
DevOps Engineer at Castalia Systems automating and optimizing toolchain and CI/CD pipelines. Designing Azure infrastructure and ensuring collaboration between development and operations teams.