Be part of a team that maintains thousands of servers around the globe
Work on infrastructure arrangement, capacity planning, performance optimization
Automate as much as possible to eliminate manual work
Upgrade systems with new releases and models
Work with PMs and the development team to find the best technical solutions
Implement new solutions to make our products better
New ideas and projects are always welcome
Requirements
Strong Linux skills, including troubleshooting and programming
Good understanding of proxies: you know what a reverse proxy and a forward proxy are, and have used tools like HAProxy, Envoy, Squid, or similar
You have a clue or two about how network technologies work (DNS, IPv4, IPv6, BGP, routing, bridging, bonding, etc.)
Hands-on experience with automation and server provisioning tools (preferably Ansible and Terraform)
A proactive attitude and a strong sense of responsibility and ownership
Nice to have:
Experience with critical web systems (you know what HA actually means)
Hands-on experience with Kubernetes/Helm/ArgoCD
Experience with big databases (SQL and NoSQL)
Familiarity with some of these technologies: microservices, Kafka, Nginx, MySQL, Redis, Prometheus
Experience with continuous integration and continuous delivery/continuous deployment (GitLab CI/CD)
Experience with cloud providers (AWS, GCP, etc.)
Benefits
To support your professional growth and make you feel taken care of, we’ve put together an expansive benefits package. It covers learning, well-being, celebration, and much more.