Senior Site Reliability Engineer responsible for reliability, automation, and optimizing complex architecture for a logistics platform. Collaborating with engineering, infrastructure, and database teams in Brazil.
Responsibilities
Define, implement and monitor Service Level Objectives (SLOs), Service Level Indicators (SLIs) and error budgets for core services.
Build and maintain observability stacks using New Relic (or other tools) to ensure full visibility into system health.
Automate operational tasks through Infrastructure as Code (IaC) and CI/CD pipelines.
Collaborate with Infrastructure and DBA teams to optimize performance and improve fault tolerance.
Develop incident response processes, runbooks and postmortems to enhance system reliability.
Manage and tune Kafka-based systems, ensuring high throughput and low latency.
Participate in capacity planning, load testing and scalability strategies for high-demand scenarios.
Continuously improve deployment pipelines, monitoring and recovery procedures.
Provide technical mentorship and guide engineering teams on reliability best practices.
Requirements
Bachelor’s degree in Computer Science, Engineering or equivalent experience.
5–8 years of proven experience in Site Reliability or DevOps roles.
DevOps Engineer maintaining scalable infrastructure for VOX's telecom services. Implementing automation and CI/CD pipelines in a fast - paced environment with significant growth potential.
DevOps Engineer focused on designing and managing CI/CD pipelines using Azure DevOps. Collaborating with teams for application deployment and ensuring DevSecOps practices.
DevOps Engineer working closely with engineering and security teams to optimize CI/CD pipelines and manage infrastructure. Ensuring security and compliance for mission - critical financial applications.
Build and scale cloud infrastructure that powers Heidi's healthcare AI platform. Work with AWS and Azure while enhancing automation and reliability in an innovative healthtech startup.
Infrastructure - as - Code DevOps Engineer designing and managing cloud - native platforms at Vodafone. Collaborating with agile teams for digital transformation and business success.
Director of Data Engineering leading a strategic DevOps team within Enterprise AI. Balancing leadership with hands - on expertise to enable AI technology adoption.
Join a Data Engineering Team as a Senior DevOps to support multiple Data & AI initiatives. Utilize cloud technologies and enhance data pipelines in a collaborative environment.
Principal Site Reliability Engineer at Early Warning designing performance and resiliency patterns for applications and infrastructure. Collaborating with development teams to improve systems and data integrity.
DevOps Engineer contributing to CI/CD setup and Azure services management. Collaborates with teams to ensure efficient project delivery in a hybrid environment.
IT DevOps Specialist at BMW responsible for analyzing requirements and implementing software solutions in AWS cloud environments. Collaborating internationally within agile teams for digital transformation projects.