Senior Site Reliability Engineer at Heidi Health | Hybrid Hired

About the role

Senior SRE managing incident response and system reliability for a healthcare AI platform. Collaborating with engineering teams to improve production readiness and operational practices.

Responsibilities

Participate in on-call and incident response: Respond to production incidents, contribute to service restoration, and support clear communication during incidents. Over time, take increasing responsibility for leading incidents end-to-end.
Improve operational reliability: Identify recurring issues and reliability risks, and drive fixes through better alerting, automation, system changes, or process improvements.
Own parts of the production environment: Operate and improve Kubernetes clusters, cloud infrastructure, and core platform services, with growing ownership as familiarity increases.
Strengthen observability: Improve dashboards, alerts, logs, and traces so issues are detected earlier and diagnosed faster, with a strong focus on actionable signals.
Reduce operational toil: Automate repetitive tasks, simplify runbooks, and improve tooling to make on-call and day-to-day operations easier and safer.
Support safe change: Improve deployments, rollback mechanisms, and operational readiness to reduce the risk of incidents caused by change.
Contribute to operational practices: Write and maintain runbooks, participate in blameless post-mortems, and help improve incident response processes over time.
Collaborate closely with engineers: Work with product and feature teams to improve production readiness, service ownership, and reliability expectations.

Requirements

3–6+ years in SRE, DevOps, Platform, or operations-heavy engineering roles.
Experience supporting production systems and participating in on-call rotations.
Comfortable debugging live systems under pressure.
Experience operating cloud infrastructure (AWS preferred).
Working knowledge of Kubernetes and containerised workloads.
Infrastructure as Code experience (Terraform or similar).
Familiarity with monitoring and alerting tools (Datadog, Prometheus, etc.).
Scripting or automation experience (Python, Bash, or similar).

Benefits

Real product momentum.
Equity from day one.
Unmatched impact.
Work alongside world-class talent.
Global reach.
Growth and balance.
Flexibility that works.

Similar roles

2 hours ago

Senior Site Reliability Engineer

Broadridge

Sr. Site Reliability Engineer designing and automating robust technical infrastructure at Broadridge. Collaborating across teams for successful deployment and operational support of services.

Hybrid Role

London United Kingdom Devops Engineer

2 hours ago

Senior Fleet Reliability Engineer

General Motors

Senior Fleet Reliability Engineer maintaining high fleet uptime for autonomous vehicle technology. Collaborating with technical teams to ensure peak operational performance in data collection efforts.

Hybrid Role

Sunnyvale United States Devops Engineer

$106,600 - $192,700 per year

4 hours ago

DevOps Lead

Leidos

DevOps Lead at Leidos managing platform engineering, SRE, and application security functions. Driving operational excellence and ensuring scalability for federal government applications.

Hybrid Role

United States Devops Engineer

$107,900 - $195,050 per year

4 hours ago

SRE Lead

Leidos

SRE Lead developing scalable cloud-native solutions for mission-critical systems supporting USAF. Managing teams, collaborating with cross-functional units, and ensuring high service reliability standards.

Hybrid Role

United States Devops Engineer

$131,300 - $237,350 per year

5 hours ago

Junior DevOps, Platform Engineer

DieEnergiekoppler GmbH

Junior DevOps / Platform Engineer at DieEnergiekoppler GmbH managing AWS/EKS platform operations. Collaborating with team members to improve platform functionalities and security compliance.

Hybrid Role

Dresden Germany Devops Engineer

€50,000 - €60,000 per year

6 hours ago

DevOps Engineer – m/f/d

Allguth GmbH

DevOps Engineer responsible for AWS infrastructures and backend development at Allguth GmbH. Engaging in greenfield projects with modern solutions in a collaborative team.

Hybrid Role

Gräfelfing Germany Devops Engineer

8 hours ago

Cloud DevOps Specialist – Multicloud

SONDA

Cloud DevOps Specialist responsible for building scalable infrastructure solutions in AWS at SONDA. Focusing on automation, containerization, and data management in a collaborative environment.

Hybrid Role

Brazil Devops Engineer

9 hours ago

DevOps Engineer

Docebo

DevOps Engineer maintaining and evolving deployment pipelines for Docebo’s AI-powered learning platform. Collaborating with cross-functional teams to ensure efficient software releases and infrastructure management.

Hybrid Role

Biassono Italy Devops Engineer

9 hours ago

DevOps Engineer

Docebo

DevOps Engineer optimizing CI/CD pipelines for Docebo, an AI-powered learning platform. Involves managing multi-tenant infrastructure using AWS, Docker, and Kubernetes.

Hybrid Role

Toronto Canada Devops Engineer

CA$113,700 - CA$151,600 per year

11 hours ago

Senior DevOps Engineer

Nord Security

DevOps Engineer maintaining and automating infrastructure and CI/CD processes for cybersecurity solutions by NordLayer. Collaborating with teams to ensure performance and scalability of cloud services.

Hybrid Role

Krakow Poland Devops Engineer

PLN 23,300 - PLN 34,000 per month