About the role

  • DevOps Manager overseeing scaling for Seekr's AI platform using Kubernetes, Terraform, and Ansible. Leading a hands-on team and collaborating with engineering for efficiency.

Responsibilities

  • Lead development of solutions to complex reliability, performance, and scaling challenges.
  • Design, architect, and implement systems, networks, and services powering Seekr’s platform.
  • Provide hands-on leadership and mentorship to the team.
  • Partner with software engineering teams to build scalable, efficient, and reliable services.
  • Identify and resolve operational inefficiencies through automation.
  • Troubleshoot and lead response to deployment and production incidents.
  • Implement and enforce security best practices, ensuring infrastructure, deployments, and data are protected at every stage.

Requirements

  • Technical Leadership: 12+ years experience, Proven ability to deliver results in a high-pressure/dynamic environment, Communication Skills, Roadmap & long-term strategy, mentoring senior engineers.
  • Kubernetes & Distributed Systems: Enterprise-scale K8s with custom operators/controllers, multi-platform clusters, hybrid fleet orchestration across cloud & edge, K8s control plane, k8s upgrades, Docker, containerd, CRI-O, Ingress Controllers (Istio, NGINIX, Traefik), K8s Databases, Helm charts.
  • Database Management: Postgres, ElasticSearch/OpenSearch, Kubernetes databases, Stateful sets.
  • Networking: L2/L3 protocols (BGP, OSPF, VLANs, IPSec), VPNs, firewalls, redundancy paths, bare-metal Linux networking, CoreDNS, Calico, K8s service mesh (Istio).
  • Infrastructure Automation: Ansible, Terraform, CI/CD Pipelines, GitLab, ArgoCD, MAAS, scripting (Python, Golang, Bash), AWS, Azure.
  • Observability: Grafana, Prometheus, Loki, Tempo, ELK, OTEL.
  • Security: Zero-trust architecture, PKI, mTLS, SPIFFE/SPIRE, certificate automation, CVE remediation, secrets management, IAM.
  • Incident Management & RCA: End-to-end incident lifecycle, root cause analysis, corrective action ownership.

Benefits

  • Meaningful Mission & Impact - Work with a deeply talented, collaborative team solving some of the toughest AI challenges that matter.
  • Equity Ownership – RSUs that let you share directly in Seekr’s long‑term success and growth.
  • Time Off That Respects Real Life – Unlimited PTO plus 14 paid company holidays to truly recharge.
  • Work Your Way – A flexible hybrid work environment with offices in Reston, VA and Austin, TX, plus remote options and flexible working hours.
  • Competitive Total Rewards – A role‑appropriate compensation structure that supports long‑term growth, including base salary, bonuses, or commission plans depending on role.
  • 401(k) with Company Match – Build your future with a retirement plan that includes employer matching.
  • Comprehensive Health & Wellness – Medical, dental, vision, and life insurance coverage starting day one—for you and your family.
  • Parental Leave – Paid parental leave to support employees as they welcome a new child through birth, adoption, or foster placement.

Job title

Manager, DevOps

Job type

Experience level

SeniorLead

Salary

Not specified

Degree requirement

No Education Requirement

Location requirements

Report this job

See something inaccurate? Let us know and we'll update the listing.

Report job