SRE responsible for designing and maintaining cloud infrastructure to support scalable applications. Collaborating with product teams to enhance monitoring and response systems in the Czech Republic.
Responsibilities
Design, build, and maintain the product cloud infrastructure
Develop advanced monitoring systems that proactively alert on symptoms
Leverage tools like Terraform, GitHub actions, and Kubernetes to manage AWS or AZURE infrastructure
Collaborate with product engineers on a daily basis and influence product architectures designs
Be part of an on-call (PagerDuty) rotation to respond swiftly to incidents affecting availability
Proactively identify opportunities to enhance system availability and performance by applying insights from monitoring and observation
Requirements
Proficiency in Terraform syntax and GitHub Actions configuration
Working knowledge of SaaS architecture concepts and designs
Understanding of Kubernetes, including CLI usage and service re-provisioning
Ability to provision and set up metrics along with managing alerts and silences
Identify Service Level Indicators (SLIs) that align the team with availability and latency objectives
Experience with Linux operating system configuration, package management, and troubleshooting
Working experience with cloud environments like AZURE or AWS and provisioning infrastructure there
Benefits
Opportunity to propose innovative ideas and solutions within the SRE organization
Staff Software Engineer joining Site Reliability team ensuring performance and reliability of legal AI platform. Designing monitoring and alerting systems while managing operations across global regions.
Senior SRE Technical Lead responsible for reliability and scalability at Adobe's RealTime Customer Data Platform. Overseeing incident response and core datastore strategy in a high impact role.
Director of Site Reliability Engineering at Mastercard, overseeing resilience and operational excellence initiatives. Leading a high - performing team of technical leaders within CX Technology.
Vehicle Reliability Engineer identifying and resolving issues for Waabi, a leader in Physical AI for autonomous transportation. Collaborating across teams to enhance vehicle reliability and performance.
DevOps Engineer responsible for maintaining cloud infrastructure at the leading crypto brand in the Philippines. Collaborating with legal and compliance teams to ensure requirements are met while monitoring and troubleshooting systems.
Tech Lead SRE managing technology talent and connecting them to impactful projects in a healthy work environment. Seeking professionals with a solid technical foundation and product mindset.
Senior DevOps Engineer modernising environment landscapes through IaC and SRE principles while collaborating across teams for a global engineering firm.
DevOps Specialist at WayCarbon architecting and managing infrastructure for web applications. Focused on supporting a sustainable Net - Zero economy with a diverse tech team.
Intern assisting with cloud infrastructure automation for educational technology company UOL EdTech. Collaborating with teams on database operations and cloud deployment tasks.