SRE Lead developing scalable cloud-native solutions for mission-critical systems supporting the USAF. Managing teams, collaborating with cross-functional units, and ensuring high service reliability standards.
Responsibilities
Responsible for developing scalable cloud-native solutions, ensuring best practices across architecture, development, deployment, and security.
Manage and mentor the SRE team, providing guidance and fostering professional development.
Work collaboratively with SRE Resource Managers to maintain engineering resources.
Meet regularly with team members, participate in performance reviews and development planning.
Oversee the reliability, availability, and performance of critical systems by leading SRE teams.
Ensure team adheres to best practices for system reliability, automation, and operational efficiency.
Drive continuous improvement initiatives by analyzing performance metrics.
Collaborate with operations, quality, cybersecurity, and other SRE engineering teams.
Act as a liaison between the SRE team and other departments to prioritize reliability and operational needs.
Collaborate with senior leadership to define SRE strategy and set long-term reliability goals.
Lead efforts to reduce operational toil through automation.
Oversee development and adoption of Infrastructure as Code (IaC) tools, CI/CD pipelines, and other automation processes.
Ensure SRE practices align with organizational security policies and compliance requirements.
Collaborate with security teams to integrate reliability-focused security practices.
Ensure systems meet or exceed agreed-upon service levels.
Work within an SRE team to continuously deliver products and increase value for the organization and customers.
Embrace and champion Agile development processes and provide technical guidance.
Requirements
Bachelor's degree and twelve (12) or more years of experience, or
Master's degree and ten (10) or more years of experience.
Secret clearance required
US citizenship required
Certifications: CompTIA Security+ or equivalent (IAT-2)
Familiarity with DevSecOps principles and practices.
Familiarity with Agile methodologies such as Scrum and/or Kanban.
DevOps Lead at Leidos managing platform engineering, SRE, and application security functions. Driving operational excellence and ensuring scalability for federal government applications.
Junior DevOps / Platform Engineer at DieEnergiekoppler GmbH managing AWS/EKS platform operations. Collaborating with team members to improve platform functionalities and security compliance.
DevOps Engineer responsible for AWS infrastructures and backend development at Allguth GmbH. Engaging in greenfield projects with modern solutions in a collaborative team.
Cloud DevOps Specialist responsible for building scalable infrastructure solutions in AWS at SONDA. Focusing on automation, containerization, and data management in a collaborative environment.
DevOps Engineer maintaining and evolving deployment pipelines for Docebo’s AI-powered learning platform. Collaborating with cross-functional teams to ensure efficient software releases and infrastructure management.
DevOps Engineer optimizing CI/CD pipelines for Docebo, an AI-powered learning platform. Involves managing multi-tenant infrastructure using AWS, Docker, and Kubernetes.
DevOps Engineer maintaining and automating infrastructure and CI/CD processes for cybersecurity solutions by NordLayer. Collaborating with teams to ensure performance and scalability of cloud services.
DevOps Engineer maintaining and improving infrastructure and CI/CD processes for a cybersecurity solutions provider. Collaborating with cross-functional teams for reliable and scalable cloud solutions.
DevOps Engineer maintaining and automating infrastructure and CI/CD processes at NordLayer. Collaborating with Senior Engineers to implement best practices in a dynamic cybersecurity environment.
Secure DevOps Engineer responsible for integrating security into CI/CD pipelines and strengthening AWS infrastructure. Key expertise in AWS security and container management.