Hybrid Senior Manager, SRE – Networking

Posted 4 days ago

Apply now

About the role

  • Senior Manager leading SRE, Virtualization, Networking, and AI Infrastructure teams at F5. Overseeing mission-critical infrastructure and driving operational excellence across hybrid compute environments.

Responsibilities

  • Lead multi-team ownership: SRE, Networking, Virtualization, AI/GPU Infrastructure
  • Oversee hybrid data centers spanning routing, switching, firewalls, SDN/overlay, Kubernetes CNI, and service‑mesh/L4‑L7 traffic to drive network reliability, performance, security, and automation
  • Reliability strategy: SLO/SLI programs, incident management, automation, scaling Kubernetes platform operations across multiple distros
  • Provide executive oversight for OpenStack compute storage, and networking services
  • Ensure scalable VM lifecycle management, resource optimization, and operational maturity
  • Own end‑to‑end reliability and performance of AI compute platforms, including model training/inference pipelines, GPU scheduling and autoscaling, and high‑performance compute environments
  • Partner with ML, Data, and Product to build next-gen AI compute platforms
  • Drive adoption of automation-first operations, GitOps, and infrastructure-as-code
  • Own the multi‑year platform roadmap across hybrid compute, Kubernetes, virtualization, AI, and networking while driving cross‑org alignment and leading large‑scale modernization across CI/CD, observability, and infrastructure
  • Drive organizational strategy, prioritization, staffing plans, hiring, and budgeting
  • Build a high-performance, inclusive culture focused on ownership, excellence, and continuous improvement

Requirements

  • 10+ years infrastructure/SRE/platform engineering experience
  • 5+ years managing engineering teams (including managers or tech leads)
  • Deep experience with Kubernetes, virtualization, and cloud/networking
  • Strong leadership, communication, and cross-functional alignment
  • Proven record of accomplishment improving platform uptime, performance, and reliability
  • Proven leadership in: Kubernetes platforms (OpenShift, Titan-k8s, Robin, Vanilla K8s)
  • Virtualization (Proxmox, VMware, XCP-ng, KVM)
  • OpenStack (Nova, Neutron, Cinder, Keystone)
  • Data center/cloud networking and distributed systems
  • Strong executive communication skills and cross-org influencing ability
  • Demonstrated experience improving operational maturity and reliability for large-scale systems
  • Strong background in automation, CI/CD, observability, and infrastructure architecture
  • Experience running large-scale multi-cluster Kubernetes environments (preferred)
  • Experience with service mesh, ingress controllers, and network policy frameworks (preferred)
  • Familiarity with GPU scheduling, Ray, Kubeflow, MLflow, or Triton Inference Server (preferred)
  • Experience with storage backends (Ceph, vSAN, ZFS, CSI-based solutions) (preferred)
  • Experience driving multi-year infrastructure transformation programs (preferred)
  • Expertise in GitOps and IaC (Terraform, Ansible, Pulumi) (preferred)

Benefits

  • Incentive compensation
  • Bonuses
  • Restricted stock units
  • Comprehensive benefits package

Job title

Senior Manager, SRE – Networking

Job type

Experience level

Senior

Salary

$196,800 - $295,200 per year

Degree requirement

Bachelor's Degree

Location requirements

Report this job

See something inaccurate? Let us know and we'll update the listing.

Report job