About the role

  • Senior Site Reliability Engineer improving software performance and technical operations for a workflow builder startup. Collaborating with teams on infrastructure, scalability, and developer experience.

Responsibilities

  • Monitoring our core business-logic software, both via on-call and in non-urgent situations: describing its existing behavior and defining SLOs or SLAs that get us (you and we) to respond.
  • Extending and monitoring our infrastructure stack.
  • Maintaining both a mental and a reified model of our systems: from this model, estimating risk, planning projects, and debugging efficiently.
  • Collaborating with our engineering team and company leadership from your unique lens on site reliability.
  • Working on our core orchestration logic that determines how to efficiently run tens of thousands of workflows at the same time.
  • Advising many parallel major backend engineering projects, both early in planning and through release.
  • Optimizing our services for scalability, stability, and observability as our customer base grows and our product becomes more sophisticated.
  • Improving our developer experience in tactical ways, and improving our overall engineering processes and practices more broadly.

Requirements

  • 5+ years of SRE, DevOps, or Platform engineering experience
  • A proven record of building efficient, performant, and easy to extend systems.
  • Has maintained quantitative metrics of site reliability, while also demonstrating judgment about appropriate strictness for SLOs & SLAs.
  • Familiarity with containerization and orchestration tools (e.g., Docker, Kubernetes) and how they interact with backend services, and with Linux
  • Experience implementing and managing AWS infrastructure
  • You’re not afraid to ask for help, and you’re happy to give it, too.
  • You’re an enthusiastic communicator and you like working with a team that provides both mutual support and thoughtful critique.
  • You're excited to join a hybrid team and work out of our NYC or SF office ~3 days a week.

Benefits

  • equity
  • premium health and wellness benefits

Job title

Senior Software Engineer, Site Reliability

Job type

Experience level

Senior

Salary

$180,000 - $200,000 per year

Degree requirement

No Education Requirement

Location requirements

Report this job

See something inaccurate? Let us know and we'll update the listing.

Report job