Site Reliability Engineer focusing on AWS cloud environments, SRE practices, and system reliability within GFT's team. Collaborating on cloud migrations and observability initiatives.
Responsibilities
Work on Site Reliability Engineering (SRE) initiatives and activities in AWS cloud environments;
Define and track Service Levels (SLAs), Service Level Indicators (SLIs) and performance metrics;
Contribute to the expansion and evolution of SRE practices across the organization;
Assess environment maturity and propose optimization strategies and process improvements;
Monitor technical and business metrics and indicators to ensure availability, resilience and performance of IT services;
Participate in modernization projects and migrations of environments to the cloud;
Design solutions and architectures focused on Observability.
Requirements
Experience with SRE practices and operating environments in AWS cloud;
Knowledge of Observability and APM tools such as Grafana, AppDynamics, Dynatrace, Prometheus, DataDog, ELK or Zabbix;
Experience in log analysis and investigation of connectivity and integration scenarios between applications and partners;
Knowledge of service monitoring and defining operational metrics;
Focus on cost optimization and service performance;
Orientation toward ensuring application reliability and security;
Experience in modernization and cloud transformation projects;
Experience designing Observability architectures;
Experience in high-availability environments and critical systems.
Benefits
Multi-benefits card – choose how and where to use it.
Scholarships for undergraduate, graduate, MBA and language courses.
Certification incentive programs.
Flexible working hours.
Competitive salaries.
Annual performance review with a structured career plan.
Senior DevOps Analyst enhancing infrastructure automation in a transformative technology firm. Collaborating on innovative projects in sectors like healthcare, finance, and utilities in Brazil.
Consultant at Minsait supporting technical decisions in infrastructure automation and developing solutions. Collaborating with teams for maintaining and evolving automation platforms.
Practical Trainee focusing on hardware reliability engineering at Sonova. Support reliability improvement initiatives and work closely with experienced engineers on real - life product challenges.
Configuration Management Engineering Technician supporting naval shipbuilding projects with engineering documentation and configuration integrity. Establishing and maintaining relationships with stakeholders in the shipbuilding community.
Principal Configuration Management Engineering Technician contributing to major shipbuilding programs for national security. Leading Configuration Management teams and ensuring data integrity for advanced naval vessels.
Senior Configuration Management Engineering Technician at Babcock supporting naval engineering programmes across multiple ship configurations. Influencing critical decisions and contributing to engineering outcomes for national defence.
DevOps Engineer designing and managing scalable Azure cloud infrastructure for a financial technology company. Collaborating with teams to enhance system reliability and automate application delivery pipelines.
DevOps Engineer responsible for designing and managing Azure cloud infrastructure for a financial services provider. Collaborating with development teams to optimize system reliability and security.
Senior DevOps Engineer responsible for scaling and securing infrastructure behind healthcare AI platform. Collaborating with teams to deliver integrations and drive automation best practices.
Config/DevOps Engineer industrializing delivery for Microsoft Dynamics 365 and Azure within Sanlam Group. Focus on establishing repeatable CI/CD pipelines and operational controls across platforms.