Senior DevOps/Infrastructure Engineer designing and implementing cloud and hybrid infrastructure for early-stage deep-tech company. Leading automation, security, and reliability initiatives at founding-team level.
Responsibilities
Own the Infrastructure Stack: Lead the design and implementation of our infrastructure using Terraform (IaC) across cloud and hybrid setups
Build and administer VMs and Containers using Docker and Kubernetes, ensuring rigorous security and permission models are enforced
Design effective CI/CD pipelines and implement GitOps practices to accelerate development velocity
Establish best practices for IAM, secrets management, and overall infrastructure hygiene to mitigate reliability and security risks
Implement a full observability setup (OTEL) for telemetry across distributed and remote systems
Design and maintain secure, efficient network topologies suitable for data-intensive applications
Requirements
7+ years of experience building and maintaining production-grade infrastructure
Extensive experience managing cloud (AWS/GCP), on-prem, and hybrid environments
Advanced understanding of Docker (including security/permissions) and hands-on experience administering Kubernetes
Deep experience designing CI/CD pipelines and implementing GitOps workflows
Strong understanding of modern network topologies and security protocols
Strong experience with Terraform/Pulumi
Demonstrated experience with IAM, secrets management, and security policies
Cloud Site Reliability Engineer managing Solace Cloud services across leading cloud providers. Ensuring reliability, handling incidents, and collaborating with customers for operational excellence.
Senior Cloud Site Reliability Engineer ensuring reliability and health of Solace Cloud Services with hands - on cloud operations expertise. Lead incident management and customer support for high - impact environments.
DevOps Engineer designing and operating AWS infrastructure within industrial IoT environments. Working on systems that ensure security, resilience, and end - to - end observability.
Sr. Site Reliability Engineer (SRE) III providing technical solutions for the federal government. Collaborating in a high - performing team focused on reliability and application scalability.
Senior Linux System Engineer developing and maintaining Linux server infrastructure for Th. Geyer GmbH. Collaborating on ERP systems and CI/CD processes while ensuring system performance and security.
Platform Engineer leading the development of cloud application platforms for Allstate. Responsible for cloud infrastructure for ML experimentation and production deployments.
Cloud Platform Engineer (ML DevOps) developing and managing CI/CD pipelines for ML workflows in a leading insurance company. Collaborating with data scientists and ensuring infrastructure security and compliance.
DevOps Engineer developing and managing container platforms for client solutions at Booz Allen Hamilton. Utilizing cloud technologies to enhance capabilities and secure deployments.
Senior DevOps/Platform Engineer automating cloud infrastructure and optimizing delivery pipelines at S&P Global Mobility. Collaborating with teams to enhance product reliability and security.