Site Reliability/Operations Engineering Lead focused on cloud infrastructure reliability and performance. Joining Pfizer's engineering team dedicated to innovative solutions and operational excellence.
Responsibilities
Ensure high availability and performance of cloud infrastructure and services (AWS, Azure)
Build and maintain monitoring, alerting, and observability systems (e.g., Prometheus, Grafana, ELK)
Automate operational tasks using Terraform, Ansible, and scripting languages
Manage incident response, root cause analysis, and postmortems
Collaborate on CI/CD pipelines and deployment strategies using GitHub Actions
Maintain and improve container orchestration platforms (Kubernetes, Docker)
Administer systems, databases, and networks with a focus on reliability and security
Implement and enforce security and compliance best practices
Continuously evaluate and integrate tools to improve operational efficiency
Lead and grow a high-performing team of reliability and operations engineers
Requirements
Bachelor’s degree in a relevant field (e.g., Computer Science, Data Science, Bioinformatics, Engineering, or related discipline)
6+ years of experience in site reliability, operations, or infrastructure engineering
Strong experience with AWS or Azure
Proficiency in Terraform, Ansible, and GitHub
Solid understanding of Kubernetes, Docker, and container orchestration
Senior Site Reliability Engineer at T-Mobile enhancing system reliability and resilience while facilitating software development and deployment. Ensures performance of Network Supply Chain ecosystem including multiple systems.
Technical Lead Manager for DevSecOps at Atom Computing, innovating in quantum computing with a hybrid infrastructure. Leading a team and enhancing developer workflows across on-premises and cloud systems.
Tax/legal paralegal (Steuer-/Rechtsanwaltsfachangestellte) for insolvency (InsO) final accounts at a tax consultancy with more than 50 years of success. Supporting clients and preparing and reviewing final accounts.
Senior DevSecOps Engineer at Rockwell Automation designing and managing cloud-based infrastructures. Collaborating cross-functionally to streamline DevOps processes and enhance system reliability.
Reliability Engineer at Ferrara responsible for continuous improvement and maintenance programs. Supporting factory operations and leading initiatives to enhance reliability and reduce downtime.
Site Reliability/Operations Engineer ensuring reliability and operational excellence within Pfizer’s cloud-native platforms. Collaborating with the team to monitor systems, deploy infrastructure, and troubleshoot issues.
DevSecOps Platform Engineer delivering an Enterprise Management solution for Satellite Communications at KBR in a collaborative environment. Ensuring security and optimizing defense systems for national security.
Site Reliability/Operations Engineering Lead at Pfizer ensuring performance and reliability of cloud-native platforms. Overseeing engineering excellence with cloud services and team leadership.
Senior Electrical Reliability Engineer at Cargill providing technical support in electrical engineering standards and governance for a bioindustrial plant.