Site Reliability Engineer focusing on AWS cloud services and SRE practices. Collaborating on performance, availability, and observability within a hybrid work environment.
Responsibilities
Work on SRE initiatives and activities in an AWS cloud environment;
Define and monitor Service Level Agreements (SLAs), Service Level Indicators (SLIs) and performance metrics;
Expand and consolidate Site Reliability Engineering (SRE) practices;
Assess service maturity and define optimization strategies and process adjustments;
Monitor technical and business metrics, ensuring availability, resilience and performance of IT services;
Participate in modernization and cloud migration projects;
Work on projects and design architectures focused on Observability.
Requirements
Experience with Observability and APM tools such as Grafana, AppDynamics, Dynatrace, Prometheus, DataDog, ELK and Zabbix;
Experience in log analysis and troubleshooting connectivity and integrations between applications and partners;
Experience optimizing cost and performance of cloud services on AWS;
Focused on reliability, availability and security of services.
Benefits
Multi-benefits card – you choose how and where to use it.
Scholarships for Undergraduate, Postgraduate, MBA and language courses.
Certification incentive programs.
Flexible working hours.
Competitive salaries.
Annual performance review with a structured career plan.