Hybrid Senior Cloud Ops

Posted yesterday

About the role

  • As a Site Reliability Engineer, you will design, deploy, and operate customer-facing SaaS platforms, collaborating with other teams to ensure high availability and performance in the cloud environment.

Responsibilities

  • Design, deploy, and operate SaaS platforms on AWS.
  • Work with Kubernetes, Terraform, Crossplane, and GitOps practices to automate infrastructure.
  • Develop and maintain ArgoCD pipelines and reusable automation assets.
  • Manage monitoring and observability using tools like Prometheus, Grafana, Loki, OpenTelemetry, and Datadog.
  • Investigate and resolve system, application, and network issues.
  • Ensure platforms adhere to security and compliance standards.

Requirements

  • 3–7+ years of experience in SRE, DevOps, CloudOps, or cloud engineering roles.
  • Strong background working with AWS services and SaaS architectures.
  • Experience managing reliability metrics and applying SRE principles in production environments.
  • Proficiency with AWS (networking, compute, storage, IAM, multi-account environments).
  • Strong understanding of containers and Kubernetes (EKS preferred).
  • Experience with Terraform, Git, CI/CD, ArgoCD, and Infrastructure-as-Code practices.
  • Scripting skills (Python, Bash/PowerShell, YAML) and experience with tools like Crossplane or Ansible.
  • Solid experience with observability stacks (Grafana, Prometheus, Loki, Datadog, OpenTelemetry).
  • Good knowledge of system design, troubleshooting, and performance analysis.
  • Clear communicator with strong organizational skills.

Benefits

  • Health insurance
  • Retirement plans
  • Paid time off
  • Flexible work arrangements
  • Professional development

Job title

Senior Cloud Ops

Job type

Experience level

Senior

Salary

Not specified

Degree requirement

Bachelor's Degree

Location requirements
