About the role

Senior Manager leading the SRE, Virtualization, Networking, and AI Infrastructure teams at F5, overseeing mission-critical infrastructure and driving operational excellence across hybrid compute environments.

Responsibilities

Lead multi-team ownership: SRE, Networking, Virtualization, AI/GPU Infrastructure
Oversee hybrid data centers spanning routing, switching, firewalls, SDN/overlay, Kubernetes CNI, and service‑mesh/L4‑L7 traffic to drive network reliability, performance, security, and automation
Set reliability strategy: SLO/SLI programs, incident management, automation, and scaling of Kubernetes platform operations across multiple distributions
Provide executive oversight for OpenStack compute, storage, and networking services
Ensure scalable VM lifecycle management, resource optimization, and operational maturity
Own end‑to‑end reliability and performance of AI compute platforms, including model training/inference pipelines, GPU scheduling and autoscaling, and high‑performance compute environments
Partner with ML, Data, and Product to build next-gen AI compute platforms
Drive adoption of automation-first operations, GitOps, and infrastructure-as-code
Own the multi‑year platform roadmap across hybrid compute, Kubernetes, virtualization, AI, and networking while driving cross‑org alignment and leading large‑scale modernization across CI/CD, observability, and infrastructure
Drive organizational strategy, prioritization, staffing plans, hiring, and budgeting
Build a high-performance, inclusive culture focused on ownership, excellence, and continuous improvement

Qualifications

10+ years of infrastructure, SRE, or platform engineering experience
5+ years managing engineering teams (including managers or tech leads)
Deep experience with Kubernetes, virtualization, and cloud/networking
Strong leadership, communication, and cross-functional alignment
Proven track record of improving platform uptime, performance, and reliability
Proven leadership in Kubernetes platforms (OpenShift, Titan-k8s, Robin, vanilla K8s); virtualization (Proxmox, VMware, XCP-ng, KVM); OpenStack (Nova, Neutron, Cinder, Keystone); and data center/cloud networking and distributed systems
Strong executive communication skills and cross-org influencing ability
Demonstrated experience improving operational maturity and reliability for large-scale systems
Strong background in automation, CI/CD, observability, and infrastructure architecture
Experience running large-scale multi-cluster Kubernetes environments (preferred)
Experience with service mesh, ingress controllers, and network policy frameworks (preferred)
Familiarity with GPU scheduling, Ray, Kubeflow, MLflow, or Triton Inference Server (preferred)
Experience with storage backends (Ceph, vSAN, ZFS, CSI-based solutions) (preferred)
Experience driving multi-year infrastructure transformation programs (preferred)
Expertise in GitOps and IaC (Terraform, Ansible, Pulumi) (preferred)