Hybrid Lead Site Reliability Engineer

Posted 3 hours ago

About the role

  • As Lead SRE, you will design and maintain the cloud infrastructure that supports scalable applications, and collaborate with product teams to improve monitoring and incident response systems. The role is based in the Czech Republic.

Responsibilities

  • Design, build, and maintain the product cloud infrastructure
  • Develop advanced monitoring systems that proactively alert on symptoms
  • Use tools such as Terraform, GitHub Actions, and Kubernetes to manage AWS or Azure infrastructure
  • Collaborate daily with product engineers and influence product architecture designs
  • Be part of an on-call (PagerDuty) rotation to respond swiftly to incidents affecting availability
  • Proactively identify opportunities to improve system availability and performance by applying insights from monitoring and observability data

Requirements

  • Proficiency in Terraform syntax and GitHub Actions configuration
  • Working knowledge of SaaS architecture concepts and designs
  • Understanding of Kubernetes, including CLI usage and service re-provisioning
  • Ability to provision and configure metrics, and to manage alerts and silences
  • Ability to identify Service Level Indicators (SLIs) that align the team with availability and latency objectives
  • Experience with Linux operating system configuration, package management, and troubleshooting
  • Working experience with cloud environments such as Azure or AWS, including provisioning infrastructure in them

Benefits

  • Opportunity to propose innovative ideas and solutions within the SRE organization
  • Health insurance
  • Paid time off
  • Flexible working arrangements
  • Professional development opportunities

Job title

Lead Site Reliability Engineer

Job type

Experience level

Senior

Salary

Not specified

Degree requirement

No education requirement

Location requirements
