Sr. Site Reliability Engineer III delivering technical solutions within the highest levels of federal government. Collaborating in a high-performing team with a focus on mission-critical application workloads.
Responsibilities
Design, deploy, and maintain mission-critical application workloads on virtualized or containerized environments (e.g., VMWare or Kubernetes), ensuring scalability, availability, and compliance with government requirements.
Develop and sustain automated CI/CD pipelines, monitoring, and configuration management workflows to support reliable software delivery and operational observability across development, integration, staging, and production environments.
Provision, configure, and maintain developer environments and toolchains to support rapid, secure, and efficient development workflows, enabling mission-aligned software delivery.
Identify developer friction across the software development lifecycle and implement solutions to reduce that friction and provide developer-first environments.
Establish and maintain a high level of customer trust and confidence through deep technical expertise, and use creativity to provide innovative solutions that fit the customer’s mission needs.
Requirements
Active Top Secret with SCI eligibility security clearance.
Certification meeting DoD 8140 (e.g., Security+, or higher).
Bachelor’s degree in Computer Science or related engineering field is preferred; relevant experience may substitute.
7+ years of experience in software development, systems engineering, or operations roles with responsibility for availability, performance, and reliability of production systems.
Demonstrated experience blending software engineering and systems administration practices to support highly available, scalable applications.
Experience designing and managing monitoring, alerting, and observability solutions to meet defined Service Level Objectives.
Experience leading or participating in incident response, root cause analysis, and continuous improvement activities.
Experience with Ansible and Desired State Configuration.
Experience with GitLab CI/CD automation and Bash scripting.
Experience supporting container-native storage and object storage solutions (e.g., MinIO, S3-compatible services, and PortWorx).
Experience with enterprise load-balancing solutions (e.g., F5 or similar platforms).
Ability to contribute immediately with minimal ramp-up in a mission-critical operational environment.
Senior DevOps Engineer designing and maintaining CI/CD pipelines for Solace Cloud. Collaborating with teams on AWS and Kubernetes to enhance developer experiences.
Analyzing vulnerabilities and implementing security strategies within the software development cycle at Redbelt Security. Ensuring compliance with security requirements and providing guidance to the development team.
Data Center Network Deployment Engineer for NVIDIA's HPC/AI Infrastructure team. Deploying and managing large scale AI Data Centers with a focus on networking and automation.
Deployment Engineer at Megaport expanding global network using technology with collaborative team culture and problem solvers. Engage with stakeholders to deliver effective networking solutions.
Senior DevOps/Infrastructure Engineer at Thndr focusing on cloud infrastructure and DevOps best practices. Leading initiatives to improve scalable and secure financial applications.
DevOps Engineer assisting developers in leveraging DevOps tooling and best practices for Cat Digital applications. Collaborating closely with development teams to optimize delivery and troubleshooting.
Reliability Engineer providing strategic support at Y12 National Security Complex. Enhancing equipment reliability and maintainability through proactive maintenance strategies.
Upper Steering System Design and Release Engineer responsible for managing steering components and suppliers. Engaging in design and development of upper steering systems for Ford vehicles in a hybrid capacity.
Senior DevOps Engineer implementing CI/CD solutions for software projects. Requires expertise in Docker, Azure, and IAC tools in a hybrid work environment.