Hybrid Senior Site Reliability Engineer, SRE

Posted 3 weeks ago

Apply now

About the role

  • Define, implement, and monitor SLIs/SLOs for availability, latency, and reliability.
  • Design and optimize CI/CD pipelines for microservices in high-availability environments.
  • Manage and evolve infrastructure on AWS (EC2, ECS/EKS, S3, RDS, CloudFront, VPC, IAM, CloudWatch, etc.).
  • Manage distributed databases and critical systems: Astra DB / Cassandra (DataStax), Redis, and RabbitMQ.
  • Automate provisioning, configuration, and scalability with Terraform, Ansible, or similar tools.
  • Develop and maintain observability practices (metrics, logs, tracing) using DataDog and related tools.
  • Lead investigations into critical incidents, proposing definitive solutions (blameless postmortems).
  • Work on cloud cost optimization, balancing reliability and budget.
  • Ensure infrastructure security and compliance, with access policies, backups, and continuous auditing.
  • Collaborate with engineering and product teams, bringing a reliability mindset to the development cycle.

Requirements

  • 6+ years of SRE/DevOps experience in high-scale, mission-critical environments.
  • Strong expertise in AWS and cloud-native architecture.
  • Advanced knowledge of Cassandra (Astra DB / DataStax), Redis, and RabbitMQ.
  • Experience with microservices and containerization (Docker, Kubernetes, ECS/EKS).
  • Strong automation experience (Terraform, Ansible, etc.).
  • Experience with observability and DataDog (metrics, logs, and tracing).
  • Solid understanding of networking, security, and protocols.
  • Experience with incident response and resolving complex problems.
  • Experience working in agile environments with a DevOps/SRE culture.
  • Nice to have: Experience in high-volume B2B SaaS environments.
  • Relevant certifications (AWS, Kubernetes, DevOps, SRE).
  • Specialized knowledge of GitHub Actions.
  • Experience with serverless architectures (AWS Lambda) and event-driven systems.
  • Track record in migration and optimization of distributed databases and cloud infrastructure.

Benefits

  • Remote work
  • Flexible working hours

Job title

Senior Site Reliability Engineer, SRE

Job type

Experience level

Senior

Salary

Not specified

Degree requirement

High School Diploma

Location requirements

Report this job

See something inaccurate? Let us know and we'll update the listing.

Report job