Hybrid Senior Site Reliability Engineer

Posted 4 hours ago

Apply now

About the role

  • SRE leading reliability and operational excellence at a mortgage tech platform. Designing systems, tooling, and processes for managing Pylon's production systems in Palo Alto.

Responsibilities

  • You'll own reliability and operational excellence for Pylon's production systems.
  • Designing and implementing monitoring, alerting, and incident response processes that scale as we grow.
  • Building tooling that makes the entire engineering team more effective.
  • Establish on-call rotations and runbooks.
  • Ensure our platform can handle the demands of a regulated, high-stakes financial product.
  • Spend 50%+ of your time writing code: building infrastructure tooling, automating operational burden, making reliability improvements, and productivity tools.

Requirements

  • 4+ years experience in SRE, infrastructure, or platform engineering roles
  • Experience working on a team of SREs at a company with mature SRE practices (not solo SRE roles)
  • Real on-call experience at scale in a large production environment (you've carried the pager and lived through incidents)
  • Deep AWS expertise (ECS, RDS, networking, security)
  • Strong experience with declarative infrastructure (Terraform, CDK, or similar)
  • Nix experience (we use it and want to expand its adoption)
  • Track record of building reliability tooling and automation
  • Can design and implement monitoring, alerting, and observability systems from first principles
  • Comfortable working in a regulated environment where "breaking things" is not an option.

Benefits

  • Equity
  • Benefits

Job title

Senior Site Reliability Engineer

Job type

Experience level

Senior

Salary

$140,000 - $220,000 per year

Degree requirement

Bachelor's Degree

Tech skills

Location requirements

Report this job

See something inaccurate? Let us know and we'll update the listing.

Report job