About the role

  • SRE Specialist ensuring reliability and stability of critical products and services at GFT. Seeking a professional with systemic vision and strong analytical skills for hybrid work in São Paulo.

Responsibilities

  • Define, maintain, and evolve SLIs and SLOs for critical APIs and services;
  • Manage and communicate error budget consumption, guiding release decisions;
  • Serve as a reference for balancing agility and operational stability;
  • Implement and improve monitoring, metrics, logging, and tracing practices;
  • Ensure actionable alerts and clear dashboards for service tracking;
  • Lead or support incident responses and war rooms;
  • Structure incident response processes with a blameless approach;
  • Conduct postmortems and ensure execution of corrective actions;
  • Work to reduce MTTA, MTTR, and incident recurrence;
  • Automate operational workflows and eliminate repetitive tasks (toil);
  • Create runbooks, automations, and improvements in CI/CD pipelines;
  • Standardize rollout, rollback processes, and resilience testing;
  • Work in environments with Kubernetes/EKS, Azure DevOps, Kafka, and databases;
  • Support technical decisions together with Engineering and Architecture teams;
  • Optimize performance, capacity, and costs in infrastructure environments;
  • Promote best practices and raise SRE maturity across squads;
  • Collaborate with Architecture, DevOps/SRE Enablement, and Security teams;
  • Influence technical decisions based on data and metrics;

Requirements

  • Experience with SLIs, SLOs, error budgets, and incident management;
  • Strong troubleshooting and root cause analysis (RCA) skills;
  • Kubernetes / EKS;
  • Observability: Prometheus, Grafana, ELK, CloudWatch, X-Ray;
  • Messaging and data: Kafka, Oracle, MySQL;
  • Operational security and IAM;
  • Bash;
  • PowerShell;
  • Python;
  • Ansible;
  • Terraform;
  • Helm;
  • Ability to teach, influence, and mentor;
  • Clear, concise, data-oriented communication;
  • Strong cross-functional collaboration;
  • Product mindset and blameless culture;
  • Knowledge of .NET Framework / .NET Core;
  • Experience with Chaos Engineering;
  • Familiarity with Progressive Delivery;
  • Experience optimizing cloud costs.

Benefits

  • Multi-benefit card – you choose how and where to use it.
  • Scholarships for undergraduate, graduate, MBA, and language courses.
  • Certification incentive programs.
  • Flexible working hours.
  • Competitive salaries.
  • Annual performance review with a structured career plan.
  • Possibility of international career opportunities.
  • Wellhub and TotalPass.
  • Private pension plan.
  • Childcare assistance.
  • Health insurance.
  • Dental insurance.
  • Life insurance.

Job title

DevOps SRE Specialist

Job type

Experience level

Mid levelSenior

Salary

Not specified

Degree requirement

Bachelor's Degree

Location requirements

Report this job

See something inaccurate? Let us know and we'll update the listing.

Report job