Site Reliability Engineer responsible for building and maintaining cloud infrastructure at Tricentis. The role involves collaborating with product engineers and improving operational processes so that the infrastructure scales seamlessly.
Responsibilities
Design, build, and maintain the product cloud infrastructure that enables seamless scaling
Develop advanced monitoring systems that proactively alert on symptoms
Use tools like Terraform, GitHub Actions, and Kubernetes to efficiently manage AWS or Azure infrastructure
Continuously enhance operational processes, making deployments, upgrades, and other tasks as boring and automated as possible
Collaborate with product engineers on a daily basis and influence product architecture designs
Be part of an on-call rotation to respond swiftly to incidents affecting availability
Act as a reliability champion for the teams you are assigned to as a stable counterpart
Propose innovative ideas and solutions within the SRE organization and engineering
Proactively identify opportunities to enhance system availability and performance
Share learnings with the wider community
Be the first responder during emergencies and on-call duties
Requirements
Proficiency in Terraform syntax and GitHub Actions configuration
Working knowledge of SaaS architecture concepts and designs
Understanding of Kubernetes, including CLI usage and service re-provisioning
Ability to provision and set up metrics along with managing alerts and silences
Ability to identify Service Level Indicators (SLIs) that align the team with availability and latency objectives
Experience with Linux operating system configuration, package management, and troubleshooting
Working experience with cloud environments like Azure or AWS, including provisioning infrastructure in them
Good cultural fit: clear communication, empathy, curiosity, continuous learning, and a supportive, no-blame attitude