Implement tools and processes for deployment and industrialization (CI/CD, blue/green, canary, rollback, etc.)
Automate provisioning of a resilient infrastructure that meets product needs
Work with development teams to facilitate regular releases
Maintain services in operational condition; analyze and resolve performance and scalability issues (including load testing) for current and historical deployments
Oversee the application portfolio in collaboration with the Network Operations Center (NOC); manage access and security
Contribute to the evolution of the IT infrastructure (e.g., VMware-to-KVM migration and service offering) and reduce technical debt
Act as a DevOps advocate and help build a cross-team SRE community across the company
Share company information and communicate team activities
Define and maintain a clear, relevant team organization
Develop the team while avoiding micromanagement
Requirements
Minimum 3 years’ experience in a similar role
Proven managerial experience
Knowledge of industrialization processes, agile methodologies, GitFlow and DevOps best practices, with a solid understanding of system administration
Experience maintaining high-availability systems
Experience with on-call organization and incident response
Strong Linux skills; Windows knowledge is a plus
Proficiency with Infrastructure-as-Code: Terraform, Ansible
Experience with logging and monitoring: ELK (Elasticsearch, Logstash, Kibana), Prometheus
Hands-on experience with Docker, Kubernetes, Consul, Vault
Experience with messaging systems such as RabbitMQ
Experience with databases such as PostgreSQL, MongoDB, Elasticsearch
Good knowledge of backup and recovery systems
Strong verbal and written English skills
Empathetic and open-minded
Benefits
Dynamic and creative environment within international teams
Wide range of self-learning courses available on our e-learning platform
Opportunities to participate in local and international meetups and conferences