DevOps Engineer for designing, automating, and optimizing cloud-native infrastructures across AWS, Azure, and GCP. Collaborating with teams to improve delivery workflows, reliability, and performance.
Responsibilities
Design, build, and maintain cloud-native infrastructures across AWS, Azure, and (optionally) GCP.
Implement scalable, secure, and highly available systems using Kubernetes, Terraform, and CI/CD pipelines.
Automate cloud provisioning and deployments, improve platform reliability, and ensure cost and performance optimization.
Integrate observability tools (Datadog, Grafana, Prometheus, Splunk) into applications and support teams in monitoring and troubleshooting.
Collaborate with developers, QA, and cross-functional teams to enable DevOps practices, streamline workflows, and improve delivery processes.
Support AI/ML workloads by designing infrastructure for training, inference, and MLOps pipelines (SageMaker, Azure ML, Vertex AI).
Maintain documentation, build self-service DevOps tools, and contribute to platform best practices.
Requirements
4+ years of experience in DevOps, SRE, or cloud platform engineering.
Strong expertise in AWS or Azure cloud architectures, networking, and security.
Skilled in Kubernetes (EKS/AKS), Docker, Helm, and modern infrastructure-as-code (Terraform).
Solid understanding of Linux systems, distributed systems, and scalable architecture design.
Hands-on experience with CI/CD tools (Jenkins, GitHub Actions, Azure DevOps) and GitOps (ArgoCD).
Comfortable with observability tooling (Datadog, Splunk, Prometheus, Grafana).
Experience with AI/ML platforms or ML-driven workloads is a strong plus.
Ability to work well with cross-functional teams and communicate clearly, and enthusiasm for building reliable, automated, developer-friendly platforms.
Cloud Engineer at Agility Technologies leading the design of scalable eLearning infrastructure. Collaborating on technical design and implementation involving cloud-based platforms and secure integrations.
Senior Hardware Reliability Engineer overseeing reliability testing and analysis of outdoor electronic assemblies at Gridware. Collaborating with mechanical engineers and contributing to product lifetime modeling.
Senior Manager leading SRE, Virtualization, Networking, and AI Infrastructure teams at F5. Overseeing mission-critical infrastructure and driving operational excellence across hybrid compute environments.
Senior Software Release Engineer managing software release trains at GM. Owning integration activities and defining software release scopes with a focus on collaboration with suppliers.
Software Release Engineer managing VCU and CCU software release trains for automotive solutions. Overseeing release readiness, integration, and building processes for embedded software.
Senior DevOps Engineer at Broadridge developing fully automated pipelines for Python applications. Collaborating on LTX Trading applications with a focus on cloud infrastructure and deployment automation.
DevOps Azure Developer specializing in end-to-end application development with Python, Azure, and CI/CD practices at Abbott. Working in collaborative environments and building secure cloud applications.
Release Engineer enhancing end-to-end build and deployment pipelines for Ironclad's AI contracting platform. Collaborating with Engineering, QE, and Product teams to manage releases and deployment processes.
DevOps Engineer focused on CI/CD and cloud operations for a leading financial services client. Ensuring high-quality, automated deployments and promoting DevOps practices within the team.
DevOps Engineer maintaining cloud infrastructure and automation for clinical trials at Teckro. Collaborating with development and operations teams to optimize performance and ensure system reliability.