Analista de SRE Sênior focado em automação e otimização de ambientes complexo. Atuação em equipe técnica com ênfase em resiliência e melhoria contínua.
Responsibilities
Design, implement, and maintain CI/CD pipelines, promoting automation and efficiency in deployment processes.
Ensure reliability and availability of systems through observability, monitoring, and alerting practices.
Participate in incident resolution and continuous improvement of infrastructure and applications.
Implement SRE best practices such as SLOs, SLIs, and SLAs to ensure operational excellence.
Collaborate with development and operations teams to foster a DevOps culture.
Optimize cost, performance, and security in cloud environments (AWS, Azure, or GCP).
Support architecture improvement initiatives and infrastructure automation (IaC).
Requirements
Experience with OpenTelemetry, Grafana, Prometheus, Loki, Tempo, Jaeger, and Dynatrace.
Creating performance dashboards and indicators (SLIs/SLOs).
Knowledge of Kubernetes, CI/CD pipelines, and automation.
Building probes, alerts, and anomaly detection mechanisms.
Ability to standardize instrumentation across multiple services.
DevSecOps Engineer responsible for enhancing Thales' secure hosting platforms in public and private clouds. Collaborating with teams to apply modern practices and build resilient infrastructures.
Develops high - automation services in Golang or Java within AWS, Kubernetes, and Azure. Supports teams in building secure applications while working in a hybrid environment.
DevOps Engineer specializing in AWS Cloud Infrastructure in a hybrid position. Collaborating within a supportive team to build modern infrastructure for VM - based applications.
Leading DevOps platform strategy for KIPMI Software's next - generation digital trust products. Collaborating with teams to implement scalable infrastructure and DevSecOps practices.
Join our DevOps team to build and manage GitHub pipelines and cloud - native Azure solutions. Collaborate with teams to drive DevOps best practices and optimize deployments.
Site Reliability Engineer enhancing system reliability and deployment practices at OpenLoop. Collaborating with cross - functional teams for incident management and performance tuning.
Senior DevOps Engineer enhancing Azure application reliability for a healthcare fintech platform. Collaborating closely with engineering teams to ensure deploy safety and observability.
DevOps Engineer contributing to tooling changes and leading a community of practice at Totara. Focused on collaboration, development, and support for internal teams.
Site Reliability Engineer responsible for infrastructure supporting AI platform. Safeguarding US customer data and ensuring compliance in the Aerospace and Defense sector.