DevOps Engineer for designing, automating, and optimizing cloud-native infrastructures across AWS, Azure, and GCP. Collaborating with teams to improve delivery workflows, reliability, and performance.
Responsibilities
Design, build, and maintain cloud-native infrastructures across AWS, Azure, and (optionally) GCP.
Implement scalable, secure, and highly available systems using Kubernetes, Terraform, and CI/CD pipelines.
Automate cloud provisioning and deployments, improve platform reliability, and ensure cost and performance optimization.
Integrate observability tools (Datadog, Grafana, Prometheus, Splunk) into applications and support teams in monitoring and troubleshooting.
Collaborate with developers, QA, and cross-functional teams to enable DevOps practices, streamline workflows, and improve delivery processes.
Support AI/ML workloads by designing infrastructure for training, inference, and MLOps pipelines (SageMaker, Azure ML, Vertex AI).
Maintain documentation, build self-service DevOps tools, and contribute to platform best practices.
Requirements
4+ years of experience in DevOps, SRE, or cloud platform engineering.
Strong expertise in AWS or Azure cloud architectures, networking, and security.
Skilled in Kubernetes (EKS/AKS), Docker, Helm, and modern infrastructure-as-code (Terraform).
Solid understanding of Linux systems, distributed systems, and scalable architecture design.
Hands-on experience with CI/CD tools (Jenkins, GitHub Actions, Azure DevOps) and GitOps (ArgoCD).
Comfortable with observability tooling (Datadog, Splunk, Prometheus, Grafana).
Experience with AI/ML platforms or ML-driven workloads is a strong plus.
Ability to work well with cross-functional teams, communicate clearly, and enjoy building reliable, automated, developer-friendly platforms.
Site Reliability Engineer responsible for infrastructure supporting AI platform. Safeguarding US customer data and ensuring compliance in the Aerospace and Defense sector.
Senior Infrastructure Engineer managing Azure platform for a SaaS product at Rillion. Focused on automation, security, reliability, and scalability in a hybrid work environment.
Statistician/Reliability Engineer applying statistical analysis for satellite systems at Aerospace Corporation. Leading projects on system reliability and working closely with interdisciplinary teams in a full - time on - site role.
DevOps Engineer designing and implementing solutions to optimize operations in media technology at Mediagenix. Collaborating with cross - functional teams to enhance user experiences.
Senior DevOps Engineer at SimCorp managing cloud environments and automating builds using Azure. Collaborating with cross - functional teams to ensure high service availability and compliance.
DevOps Senior Software Engineer at SimCorp developing high - quality software solutions for financial technology. Responsible for mentoring junior engineers and solving complex technical challenges.
DevOps Engineer designing, building, and operating software development infrastructure for CodeMettle. Leading automation and best practices to enhance value delivery across teams.
DevOps Engineer maintaining scalable infrastructure for VOX's telecom services. Implementing automation and CI/CD pipelines in a fast - paced environment with significant growth potential.
DevOps Engineer focused on designing and managing CI/CD pipelines using Azure DevOps. Collaborating with teams for application deployment and ensuring DevSecOps practices.
DevOps Engineer working closely with engineering and security teams to optimize CI/CD pipelines and manage infrastructure. Ensuring security and compliance for mission - critical financial applications.