SRE / DevOps professional specializing in cloud automation and observability to ensure operational excellence and collaboration with Development and Infrastructure teams at GFT.
Responsibilities
We are looking for an SRE/DevOps professional with solid experience in automation, observability and reliability practices, working in cloud environments with a strong focus on AWS.
This professional will play a strategic role in ensuring availability, performance, security and operational efficiency, working closely with Development and Infrastructure teams.
Requirements
Work in partnership with the Development team, supporting the building, maintenance and evolution of applications;
Expand, optimize and evolve CI/CD pipelines;
Troubleshoot and analyze incidents using APM and observability tools;
Work daily with cloud technologies, primarily AWS;
Lead initiatives to optimize costs and performance of services;
Ensure reliability, availability, security and scalability of applications and infrastructures;
Analyze existing architectures and propose structural improvements;
Identify processes that can be automated and implement them;
Promote best practices and support DevOps and SRE culture across the organization;
Strong experience with AWS, including: Cognito, Aurora PostgreSQL, EKS, Lambda, S3, API Gateway, DynamoDB, EC2, DocumentDB, SNS, and OpenSearch;
Experience with messaging and streaming: RabbitMQ, SQS, Kafka, Kinesis;
Experience with Infrastructure as Code (IaC): CloudFormation and Terraform;
Experience in rightsizing resources, provisioning new services, optimizing workloads and cluster architecture;
Knowledge of Windows Server (Active Directory, IIS, Windows Services);
Experience with CI/CD tools such as Jenkins and Azure DevOps;
Knowledge of Shell scripting and Python;
Advanced experience in SRE (Site Reliability Engineering) practices;
Experience with complex automations or large-scale pipelines;
AWS, Kubernetes, DevOps or SRE certifications;
Previous experience in high-criticality financial or corporate environments;
Experience with other clouds: GCP and Azure;
Knowledge of CI/CD with the AWS stack and GitLab CI;
Experience with SQL and NoSQL databases, including PostgreSQL;
Development experience with Kotlin, Java, Go and Spring Boot;
Experience with observability tools: Datadog, Grafana, Prometheus, Zabbix, New Relic, Dynatrace;
Knowledge of Big Data, especially the AWS stack.
Benefits
Multi-benefit card – you choose how and where to use it.
Tuition assistance for undergraduate, graduate, MBA and language courses.
Certification incentive programs.
Flexible working hours.
Competitive salaries.
Annual performance review with a structured career plan.
Possibility of international career opportunities.
Senior DevOps Engineer responsible for cloud infrastructure and deployments. Optimizing AWS services and ensuring system security and reliability for Verizon.
Senior DevOps Engineer responsible for automating infrastructure and building CI/CD pipelines for collaborative robotics company. Collaborating with global engineering teams from the Bangalore office.
Site Reliability Engineer Intern at Tencent working on gaming services and cloud native solutions. Collaborating with global teams to eliminate toil and enhance reliability.
Cloud/DevOps Specialist at N5X managing and optimizing critical cloud infrastructures for Brazilian energy trading. Collaborating with a multidisciplinary team to ensure high availability and performance.
Cloud/Devops Specialist responsible for designing a hybrid architecture combining cloud and on - premises infrastructure for energy trading systems. Collaborating with a multidisciplinary team in a dynamic environment.
Reliability Engineering Specialist utilizing reliability tools and models to improve asset performance at Enbridge. Collaborating across teams to guide investment decisions for safe operations.
DevOps Engineer responsible for structuring and supporting cloud DevOps architecture in Brazil. Working strategically on automation and CI/CD practices with development teams in Pernambuco.
DevSecOps Software Engineer developing secure CI/CD pipelines for Boeing's military software systems. Collaborate with cross - functional teams and implement automation and security best practices.
DevOps Manager responsible for managing a team for multi - cloud solutions supporting the USAF Cloud One project. Focus on scalable cloud - native solutions and CI/CD practices.
Lead Site Reliability Engineer overseeing SRE practices across Azure and GCP platforms. Driving reliability improvements and leading a team at Lloyds Banking Group.