Senior Platform DevOps Engineer at Code Metal, designing and implementing cloud and hybrid infrastructure to support customer deployments and internal platforms, and collaborating with software and security teams to deliver reliably.
Responsibilities
Design and implement cloud and hybrid infrastructure to support internal platforms and customer deployments.
Build and manage deployment automation across cloud, on-prem, and hybrid environments.
Maintain infrastructure-as-code repositories and deployment tooling using Terraform, Helm, Ansible, or similar tools.
Implement monitoring, alerting, rollback, and recovery systems to improve platform reliability.
Partner with infrastructure engineers to support hybrid connectivity and environment consistency.
Work with security teams to implement guardrails, secrets management, and deployment hardening controls.
Support customer-hosted, air-gapped, on-prem, and government deployment requirements as needed.
Ensure infrastructure and deployment processes are traceable, auditable, and aligned with compliance needs.
Requirements
Bachelor’s degree in Computer Science, Software Engineering, or a related field, or equivalent professional experience.
5+ years of experience in DevOps, platform engineering, or cloud infrastructure roles.
Experience with public cloud platforms such as AWS, Azure, or GCP.
Experience automating infrastructure using tools such as Terraform, Ansible, Helm, or similar technologies.
Proficiency with containerization and orchestration technologies such as Docker and Kubernetes.
Experience building and maintaining deployment workflows across cloud and on-prem environments.
Strong scripting and automation skills using Bash, Python, or similar languages.
Strong understanding of networking, secrets management, environment design, and operational reliability.
Eligible to obtain and maintain an active U.S. Top Secret security clearance.
Experience supporting customer-hosted, regulated, or government deployment environments.
Familiarity with GitOps and deployment tools such as ArgoCD or similar platforms.
Experience with observability tooling, incident response practices, and platform SRE concepts.
Knowledge of compliance-oriented automation and auditability requirements.
Experience working in hybrid environments that span SaaS and on-prem delivery models.
Experience operating in multi-cloud environments, for example Azure, AWS, or GCP.
Active U.S. security clearance (Secret or Top Secret).
Benefits
Pay depends on experience, but we strive to be at the upper end of the salary range
Health care plan with 100% premium coverage, including medical, dental, and vision
401(k) with 5% matching
Paid Time Off (uncapped vacation, plus sick and public holidays)