Senior Manager leading SRE, Virtualization, Networking, and AI Infrastructure teams at F5. Overseeing mission-critical infrastructure and driving operational excellence across hybrid compute environments.
Responsibilities
Lead multi-team ownership: SRE, Networking, Virtualization, AI/GPU Infrastructure
Oversee hybrid data centers spanning routing, switching, firewalls, SDN/overlay, Kubernetes CNI, and service‑mesh/L4‑L7 traffic to drive network reliability, performance, security, and automation
Provide executive oversight for OpenStack compute storage, and networking services
Ensure scalable VM lifecycle management, resource optimization, and operational maturity
Own end‑to‑end reliability and performance of AI compute platforms, including model training/inference pipelines, GPU scheduling and autoscaling, and high‑performance compute environments
Partner with ML, Data, and Product to build next-gen AI compute platforms
Drive adoption of automation-first operations, GitOps, and infrastructure-as-code
Own the multi‑year platform roadmap across hybrid compute, Kubernetes, virtualization, AI, and networking while driving cross‑org alignment and leading large‑scale modernization across CI/CD, observability, and infrastructure
Drive organizational strategy, prioritization, staffing plans, hiring, and budgeting
Build a high-performance, inclusive culture focused on ownership, excellence, and continuous improvement
Requirements
10+ years infrastructure/SRE/platform engineering experience
5+ years managing engineering teams (including managers or tech leads)
Deep experience with Kubernetes, virtualization, and cloud/networking
Strong leadership, communication, and cross-functional alignment
Proven record of accomplishment improving platform uptime, performance, and reliability
Senior DevOps Engineer leading design and management of CI/CD pipelines at Neuron7.ai. Collaborating on cloud infrastructure for scalable applications in an innovative tech environment.
Backend Software Engineer responsible for building robust backend systems for AI and analytics products. Collaborating with various teams to enhance platform reliability and performance.
Senior DevOps Engineer responsible for cloud ecosystem architecture at health - tech startup. Building HIPAA/GDPR - compliant foundations and mentoring developers.
Senior Backend Engineer building product features and maintaining infrastructure for insurance platform. Employing tools like Terraform, Kafka, Datadog and Qovery with a strong DevOps focus.
DevOps Systems Engineer supporting customer operations in Annapolis Junction, MD. Responsible for creating, sustaining, and troubleshooting complex operational data flows.
OpenShift Fresher assisting Cloud team in managing containerized applications using Red Hat OpenShift. Supporting CI/CD, deployment automation, and cloud - native application environments.
Site Reliability Engineer for Leidos ensuring reliability, performance, and scalability of complex distributed systems for the Navy - Marine Corps Intranet. Collaborating with teams to maintain and optimize network operations and services.
DevOps Engineer evolving banking infrastructure for a fintech company. Focusing on observability, incident response, and platform automation in a hybrid work setup.
Lead Site Reliability Engineer managing critical IT systems for S&P Dow Jones Indices. Focused on service availability, incident management, and developer collaboration to enhance operational reliability.
Lead DevOps Engineer developing AI - powered supply chain intelligence solutions at S&P Global Mobility. Collaborate with data scientists and engineers to optimize operational infrastructure and continuous delivery processes.