Hybrid Site Reliability Engineer

Posted 1 hour ago

Apply now

About the role

  • Site Reliability Engineer ensuring the availability and performance of services for autonomous vehicle operations. Collaborating on system design and automation in a robotics-focused environment.

Responsibilities

  • Design and implement highly scalable and reliable systems to support Zoox's autonomous vehicle platform.
  • Optimize system performance, reliability, and scalability.
  • Develop and maintain monitoring, alerting, and reporting systems to ensure proactive identification and resolution of issues.
  • Collaborate with software engineering teams to improve software architecture, deployment processes, and automation.
  • Conduct root cause analysis of production issues and implement corrective actions.
  • Implement disaster recovery and business continuity plans.

Requirements

  • 5+ years of experience in site reliability engineering or a similar role, with a strong background in working with large-scale distributed systems.
  • Proven experience with cloud platforms such as AWS, GCP, or Azure.
  • Expertise in container orchestration technologies like Kubernetes.
  • Deep understanding of networking, storage, and database technologies.
  • Strong programming skills in languages such as Python, Go, C/C++, or Java.
  • Experience with infrastructure as code tools such as Terraform, Ansible, Salt, or CloudFormation.

Benefits

  • paid time off (e.g. sick leave, vacation, bereavement)
  • unpaid time off
  • Zoox Stock Appreciation Rights
  • Amazon RSUs
  • health insurance
  • long-term care insurance
  • long-term and short-term disability insurance
  • life insurance

Job title

Site Reliability Engineer

Job type

Experience level

Mid levelSenior

Salary

$170,000 - $205,000 per year

Degree requirement

Bachelor's Degree

Location requirements

Report this job

See something inaccurate? Let us know and we'll update the listing.

Report job