Senior DevOps Engineer enhancing AWS infrastructure for climate risk technology company. Leading DevOps practices and managing CI/CD pipelines for scalable cloud-based products.
Responsibilities
Enhance and evolve our AWS platform: Evolve our AWS infrastructure and delivery pipelines to support faster, more reliable deployments and developer workflows.
Shape the architectural blueprint for scaling: Design scalable, secure CI/CD processes and infrastructure to accelerate software delivery.
Drive best practices in observability and monitoring: building robust logging, metrics, and alerting to ensure systems are transparent and issues are caught early.
Act as DevOps leadership: Provide DevOps leadership in modern deployment strategies (Kubernetes, GitOps, CI/CD), fostering knowledge sharing across the engineering org.
Design, build, and maintain CI/CD pipelines (e.g., GitHub Actions, ArgoCD) to support secure, fast, and automated software delivery.
Manage containerized and serverless workloads across AWS, including Kubernetes (EKS), ECS/Fargate, and Lambda, ensuring reliability, scalability, and cost-efficiency.
Enhance and evolve our AWS platform: improve performance across Kubernetes workloads, serverless systems, and data-driven services.
Lead migration initiatives from ECS to EKS, enabling more standardized and reliable deployments across environments.
Requirements
Strong experience designing and managing AWS infrastructure, including EKS, ECS/Fargate, Lambda, RDS, CloudFront, and VPC.
Solid expertise with Kubernetes (EKS preferred) - including scaling, monitoring, and migrating workloads.
Proven experience with CI/CD tooling and DevOps automation (e.g., GitHub Actions, ArgoCD).
Hands-on experience with Infrastructure as Code using Terraform.
Deep understanding of cloud security best practices, IAM policies, and secrets management.
DevSecOps Engineer focusing on automation and Active Directory management at Saab. Collaborating within the IAM team to enhance secure access and infrastructure management.
Technology Lead - SRE at Broadridge managing service delivery agreements and client satisfaction through project management. Focusing on improving processes for efficient service delivery in financial solutions.
Director of DevSecOps and SRE at Allegion overseeing infrastructure reliability and CI/CD pipelines. Leading and mentoring SRE and cloud infrastructure teams in a global organization.
Platform Engineer (SRE) responsible for implementing cloud - native infrastructure and automation. Join UOL EdTech to transform education using technology in Brazil.
Site Reliability Engineer ensuring smooth operations for banking systems at GFT. Working on production system access, deployment, and observability in AWS and Kubernetes environments.
DevOps Engineer ensuring stability, scalability, and reliability of justtrack's SaaS platform. Collaborate with development teams, manage cloud infrastructure, and enhance CI/CD processes.
Cloud DevOps Engineer designing and optimizing secure cloud infrastructure on Azure. Collaborating closely with developers for reliable CI/CD processes on cloud - based products.
Staff Site Reliability Engineer responsible for cloud infrastructure implementation and reliability improvements at Auror. Collaborating with engineering teams to enhance production code understanding.
Own availability and strive for operational excellence of Sumo Logic’s observability. Collaborate with global SRE team to optimize operations and improve developer velocity.