About the role

  • Senior Site Reliability Engineer focused on building reliable, scalable infrastructure at a tech company. Driving best practices in observability, incident response, and engineering collaboration.

Responsibilities

  • Design, build, and maintain highly available, scalable, and fault-tolerant systems
  • Lead reliability improvements across production and non-production environments
  • Own and improve monitoring, alerting, and observability platforms
  • Drive incident response, root cause analysis, and post-incident reviews
  • Implement automation to reduce manual operational work
  • Partner with Engineering, Security, and Product to support platform needs
  • Establish and track SLIs, SLOs, and error budgets
  • Lead capacity planning and performance tuning efforts
  • Improve deployment, CI/CD, and infrastructure-as-code practices
  • Identify and mitigate reliability and scalability risks before they impact customers
  • Mentor and guide junior engineers and contribute to team technical standards
  • Participate in on-call rotation and help mature on-call processes

Requirements

  • 6+ years of experience in Site Reliability Engineering, DevOps, Platform Engineering, or related roles
  • Strong experience with cloud platforms (AWS, Azure, or GCP)
  • Proficiency with infrastructure as code (Terraform, CloudFormation, Pulumi, etc.)
  • Experience with containerization and orchestration (Docker, Kubernetes)
  • Strong Linux systems administration and networking fundamentals
  • Experience building and maintaining CI/CD pipelines
  • Hands-on experience with monitoring and observability tools (Datadog, Prometheus, Grafana, New Relic, etc.)
  • Strong troubleshooting and incident management skills
  • Experience with scripting and automation (Python, Bash, Go, or similar)

Benefits

  • Medical, dental, and vision benefits
  • Company-paid life insurance
  • Flexible schedules
  • Unlimited PTO
  • Volunteer Time Off
  • Sick leave
  • Parental leave
  • 9 company-paid holidays

Job title

Senior Site Reliability Engineer

Job type

Experience level

Senior

Salary

$110,000 per year

Degree requirement

Bachelor's Degree

Location requirements

Report this job

See something inaccurate? Let us know and we'll update the listing.

Report job