Lead Site Reliability Engineer at <Undefined> | Hybrid Hired

About the role

Lead Site Reliability Engineer for Personio, shaping the future of HR technology through reliable infrastructure and collaborative engineering.

Responsibilities

Engage in and improve the full service lifecycle from initial design through deployment, operation, and continuous improvement.
Prepare services for production by taking part in system design reviews, developing shared frameworks and platforms, planning capacity and conducting launch assessments.
Operate, monitor, and maintain live services, designing observability stacks and dashboards to track key metrics and improve operational insight.
Ensure sustainable scalability through automation, actively contributing to continuous improvement for reliability and delivery speed.
Collaborate with product and engineering teams to define SLOs, error budgets and ensure services are reliable, scalable and observable.
Support incident management processes, including on-call rotations, assisting with outage response, and contributing to post-mortems and root cause analysis.
Identify and reduce toil through process automation, creating playbooks and automated runbooks to reduce MTTR.
Support resilience strategies and help implement chaos testing to proactively uncover weaknesses and validate recovery strategies.
Mentor and train peers on reliability best practices and tooling, contributing to community growth.

Requirements

Bachelor’s degree in Computer Science, a related field, or equivalent practical experience.
6+ years of experience with SaaS software development in distributed systems using languages such as Kotlin/Java, Typescript, Python, and technologies like IaC, Docker, and Kubernetes.
2+ years’ experience as an SRE or similar role designing, operating, analyzing and troubleshooting distributed systems in agile environments.
Act as a Datadog subject matter expert, assisting with observability stack design, dashboard creation, and training peers in best practices.
Systematic problem solving and debugging skills with a strong sense of ownership and bias towards establishing mechanisms which can scale across the entire company.
Excellent written, verbal, and documentation skills.
Collaborative team player, able to communicate effectively across disciplines.

Benefits

Receive a competitive reward package – reevaluated each year – that includes salary, benefits, and pre-IPO equity.
Enjoy 28 days of paid vacation, plus an additional day after 2 and 4 years.
Make an impact on the environment and society with 1 (fully paid) Impact Day.
Receive generous family leave, child support, mental health support, and sabbatical opportunities.
We enjoy gathering for meals, cultural initiatives, and events like local Summer Sessions and year-end celebrations. There's also healthy snacks, drinks, and a weekly catered lunch.

Similar roles

Browse all Devops Engineer jobs

1 hour ago

BG

Cloud Engineering Specialist – SRE

BT Group

SRE role at BT Group focusing on cloud reliability and operational excellence across engineering teams. Collaborate with product owners to implement SRE principles for improved service performance.

Onsite Role

Manchester United Kingdom Devops Engineer

1 hour ago

UN

Staff Software Engineer – SRE, GO Programming

Uniphore

Senior Site Reliability Engineer at Uniphore developing cloud infrastructure and Go services. Collaborating with teams to ensure operational excellence and reliability.

Onsite Role

Bangalore India Devops Engineer

4 hours ago

BU

Senior DevOps Engineer – Managed Service

Burendo

Join Burendo as a Senior DevOps Engineer, maintaining critical services and improving operational efficiency in a cloud - first environment.

Hybrid Role

London United Kingdom Devops Engineer

5 hours ago

ST

Learning Content Engineer – Cloud, DevOps

StackFuel

As Learning Content Engineer, developing and enhancing training content for Cloud and DevOps. Engaging in creating practical learning materials from basics to advanced topics.

Hybrid Role

Berlin Germany Devops Engineer

7 hours ago

SO

AWS DevOps Engineer, Microservices

Solventum

AWS DevOps Microservices Engineer at Solventum designing secure and scalable AWS infrastructures. Collaborating with diverse teams for innovative healthcare solutions using cloud technology.

Hybrid Role

Heredia Costa Rica Devops Engineer

7 hours ago

GA

Manager, Dev Ops

GALE

Manager leading a team of DevOps engineers and shaping cloud infrastructure strategy at a technology company in India.

Hybrid Role

Bengaluru India Devops Engineer

9 hours ago

CA

DevOps Engineer

Catena

DevOps Engineer building and maintaining Catena’s scalable platform infrastructure. Collaborating with engineers to enhance CI/CD pipelines and support cloud - native workloads on AWS.

Hybrid Role

Gzira Malta Devops Engineer

10 hours ago

GV

SRE Platform Engineer

GE Vernova

Platform System Reliability Engineer focused on operations of EKS Kubernetes environment for GE Vernova's SaaS grid products. Responsible for the full lifecycle of production clusters from performance tuning to securing infrastructure.

Hybrid Role

United States Devops Engineer

10 hours ago

GV

SRE Observability SLO Engineer

GE Vernova

SRE Observability SLO Engineer for GE Vernova’s GridOS Platform Engineering team. Building telemetry stack in SaaS reliability for critical energy infrastructure.

Hybrid Role

Remote United States Devops Engineer

11 hours ago

RA

DevOps Engineer, Ansible Automation Platform

Rabobank

DevOps Engineer responsible for building and operating automation services using Ansible for Rabobank. Collaborating with teams to ensure stable, secure, and auditable infrastructure across multiple servers.

Hybrid Role

Utrecht Netherlands Devops Engineer

€4,024 - €5,747 per month