Reliability Engineer responsible for availability and performance of U.S. Air Force Cloud services. Collaborates with teams to deliver reliable mission-critical systems in a hybrid environment.
Responsibilities
This role supports the U.S. Air Force Cloud One Architecture and Common Shared Services contract and currently has an opening for a **Reliability Engineer**.
The Reliability Engineer is responsible for ensuring the availability, performance, scalability, and resiliency of mission‑critical systems.
This role applies software engineering principles to infrastructure and operations, with a strong emphasis on automation, monitoring, incident response, and continuous reliability improvement.
The reliability engineer serves as the bridge between development, operations, and platform teams to ensure production systems consistently meet defined service level objectives (SLOs) while supporting rapid, safe delivery of new capabilities.
Requirements
Bachelors and eight (8) years or more of experience; Masters and six (6) years or more of experience. Additional experience may be accepted in lieu of degree.
Active Secret clearance at a minimum required to start
US citizenship required
Experience with cloud platforms (AWS, Azure, OCI, or GCP), including managed services
Experience with containerized environments (Docker, Kubernetes)
Familiarity with CI/CD pipelines and deployment automation
SLOs and error budgets
Capacity modeling and performance testing
Strong understanding of:
Distributed systems and high‑availability architectures
Entry - level DevOps Engineer assisting in cloud infrastructure automation for AI - powered security operations platform. Seeking passionate candidates with foundational knowledge in Terraform, Kubernetes, and CI/CD pipelines.
DevSecOps Engineer responsible for security in CI/CD pipelines for a global client network. Collaborating on security hardening of applications and automation processes.
DevSecOps Engineer maintaining CI/CD security pipelines at SQA Consulting. Collaborating with teams to automate processes and ensure security best practices are followed.
DevSecOps Engineer for SQA Consulting focusing on CI/CD automation and security hardening. Collaborating with teams on cloud solutions in a hybrid work environment.
DevSecOps Engineer managing CI/CD pipelines and ensuring application security for SQA Consulting. Collaborating across teams while focusing on continuous improvement and automation in cloud environments.
Site Reliability Engineer focused on designing and maintaining observability platform for dLocal. Collaborating with global teams and optimizing system performance for major clients.
Staff Site Reliability Engineer focused on product engineering for Civica. Leading technical practices and architectural alignment while improving service delivery and quality.
Senior Cloud Operations Engineer at CELUM focusing on cloud infrastructure and system security. Collaborating on IT projects and optimizing hosting environments.
DevOps Engineer at FormativGroup focusing on Kubernetes management and automation solutions. Designing, implementing, and securing infrastructure for efficient application deployment in a remote setting.
Senior AWS Cloud Engineer designing and building cloud infrastructure at Emergn. Collaborating with global teams to enhance scalable and reliable delivery of products.