Being an SRE in VeepeeTech means to be a part of the transversal SRE community and at the same time integrate one of the agile product teams. You will ensure the system’s reliability and scalability by applying DevOps practices and share knowledge within the SRE community.
Implementing tools and processes for deployment and industrialization (CI/CD, blue/green, canary, rollback, etc.);
Automating provisioning of a resilient infrastructure that meets the needs of products;
Working with development teams to facilitate regular releases;
Maintaining services in operational conditions, analyze and resolve performance and scalability anomalies (load tests) of current and historical deployments;
Supervising the application portfolio in collaboration with the Network Operations Center (NOC), manage access and security;
Participating in the evolution of the IS (VMware migration to KVM and service offer) and the reduction of the technical debt;
Being the evangelist of DevOps’ good practices and participate in the construction of a true transversal SRE community within Veepee.
Requirements
At least 3 years of experience in a similar function;
Knowledge of industrialization processes, agile methods, gitflow flow and DevOps practices in general and understanding of a system side;
Familiar with Linux (good knowledge), knowledge of Windows would be a plus;
Proficiency with IaC: Packer, Terraform, Ansible, Puppet;
SUP: Icinga, ELK, Prometheus;
Hands on with Docker, Kubernetes, Nomad;
Proficiency with different types of DB such as, PostgreSQL, MongoDB, ElasticSearch;
You have strong verbal and written English language skills.
Benefits
Dynamic and creative environment within international teams
The variety of self-education courses on our e-learning platform
The participation in meetups and conferences locally and internationally
Network & Datacenter Deployment Engineer at Cloudflare focused on building and expanding their global network infrastructure with collaboration across multiple engineering teams and vendors.
Senior DevOps Engineer leading cloud - native solutions at Sparksoft Corporation. Driving automation and system reliability within a fast - paced Agile team.
Platform Engineer focusing on supporting CI/CD pipelines and Kubernetes at PCCW. Responsible for ensuring platform services' reliability and performance, with night - time support as needed.
Site Reliability Engineer at Bumble optimizing large - scale Linux environments and ensuring system stability. Focusing on troubleshooting, incident recovery, and performance tuning in complex infrastructures.
Senior DevOps Manager overseeing CI/CD processes for NVIDIA Networking products. Leading a team and collaborating with global teams to enhance R&D efficiency and infrastructure.
DevOps Manager overseeing engineering team developing scalable CI/CD processes for NVIDIA Networking products. Enhancing global R&D efficiency in a technology - focused company.
Join Operations Team as Senior Site Reliability Engineer driving operational excellence for cybersecurity solutions. Collaborate across teams to manage production platforms and optimize infrastructure.
Software Developer - DevOps System Administrator working within the SCMT team to enhance software application efficiency. Collaborating on tools and scripts for application lifecycle management.