Site Reliability Engineer responsible for maintaining global server infrastructure in a web data solutions company. Collaborating with teams to optimize performance and drive innovation.
Responsibilities
Be a part of team which maintains thousands of servers around the globe
Work on infrastructure arrangement, capacity planning, performance optimization
Automate everything or as many things as possible to get rid of manual job
Upgrade systems with new releases and models
Work together with PMs and Development team to find the best technical solutions
Implement new solutions to make our products better
New ideas & projects are always welcomed
Requirements
Strong Linux skills, including troubleshooting and programming
Good understanding of what proxy is - know what is reverse proxy and forward proxy; have used tools like HAProxy, Envoy, Squid or similar
You have a clue or two about how network technologies work (DNS, IPv4, IPv6, BGP, routing, bridging, bonding, etc.)
Hands-on experience with automation and servers provisioning tools (preferably Ansible and Terraform)
Excellent proactive, responsibility and ownership skills
Nice to have:
Experience with critical web systems (you know what HA actually means)
Hands-on experience with Kubernetes/Helm/ArgoCD
Experience with big databases (SQL and NoSQL)
That you are familiar with some of these technologies: Microservices, Kafka, Nginx, MySQL, Redis, Prometheus
Experience with continuous integration and continuous delivery/continuous deployment (Gitlab CI/CD)
Experience with cloud providers (AWS, GCP, etc.)
Benefits
To support your professional growth and make you feel taken care of, we’ve put together an expansive benefit package. It covers learning, well-being, celebration, and much more — learn all about it here.
Join Boeing AvionX as a Software DevOps Engineer driving automation and CI/CD pipelines for cloud - native systems. Lead initiatives improving deployment pipelines and mentor engineering team.
Senior SRE responsible for ensuring system reliability and performance at Aggrandize. Collaborating with cross - functional teams and implementing SRE best practices.
Lead Oracle ERP Enterprise Architect focusing on DevSecOps and cloud - native modernization for a defense - related company. Transitioning monolithic applications to microservices and maintaining CI/CD pipelines.
Lead Oracle ERP Enterprise Architect supporting DevSecOps implementation and modernization initiatives at Credence. Overseeing CI/CD pipelines in cloud environments for defense and health organizations.
Reliability Engineer responsible for RCM program and maintenance initiatives in mining industry. Enhancing equipment reliability and collaborating with various teams.
Lead SRE for Data & Analytics platforms at Deloitte. Championing reliability, improving stability, and driving automation in a hybrid environment based in London.
RDS Engineer supporting enterprise - grade RDS environments for Wells Fargo. Building and tuning Windows Server RDS environments and collaborating with security and networking teams.
Senior DevSecOps Engineer managing Azure to AWS migration for AccuSourceHR. Leading cloud architecture, CI/CD implementation, and ensuring security and reliability in production systems.
Site Reliability Engineer ensuring infrastructure reliability and performance for Hornetsecurity. Collaborating across product, business, and infrastructure teams in a critical environment.
Senior DevOps Engineer developing core infrastructure supporting Shelf products. Focused on building reliable, secure, and scalable systems in hybrid work environment.