Lead Director of Engineering responsible for ensuring the reliability and scalability of CVS Health retail pharmacy technology. Leading a global team in driving operational excellence and modernization efforts.
Responsibilities
Lead a global team of technical professionals, providing guidance, mentorship, and support to ensure their success and professional growth
Align SRE strategies with enterprise goals, delivering resilient technology that enables world-class customer and patient experiences
Execute on a multi-year roadmap for observability, automation, and reliability improvements across distributed store environments
Define and implement standardization and process improvements across the SRE organization
Define and maintain Service Level Indicators (SLIs), Service Level Objectives (SLOs), and business KPIs to measure and enhance system reliability for critical store applications
Build and optimize dashboards, visualizations, and alerting systems to enable real-time insights and rapid incident response for edge nodes and remote facilities
Provide weekly and monthly reporting on KPIs and other operational metrics to cross-functional teams and senior leadership
Develop and implement strategic communication plans to support organizational goals, ensuring alignment with business objectives and stakeholder needs
Lower operational costs and increase efficiency through automation and platform engineering that reduce toil and enable self-healing capabilities across thousands of locations
Lead major incident management, ensuring rapid detection, root-cause analysis, and resolution in collaboration with business and technology partners
Champion modern cloud, edge, and AI-driven monitoring solutions for store technology
Partner with architects, product engineering, and infrastructure teams to embed reliability practices throughout the software lifecycle
Represent CVS Health as a thought leader in SRE and operational resilience, both internally and externally
Mentor the SRE and technical teams on building, scaling, and operating highly available systems
Foster a culture of ownership, honesty, accountability, and continuous improvement within the organization
Contribute to long-term planning, technology adoption strategies, and innovation initiatives to drive digital modernization efforts
Requirements
10+ years of experience with cloud platform technologies such as AWS, Microsoft Azure, or Google Cloud
8+ years of experience in a technical leadership or people management role, with a proven ability to lead and grow technical teams, particularly within SRE or large-scale reliability organizations
8+ years of experience leading complex technical initiatives using Agile/continuous improvement methodologies
8+ years of experience managing distributed technology environments (retail, healthcare, or other multi-site operations strongly preferred)
5+ years of experience with container orchestration (Kubernetes) and monitoring tools (Dynatrace, AppDynamics, Prometheus, Splunk, Grafana, etc.)
Strong understanding of cloud infrastructure components (compute, storage, networking, security)
Strong knowledge of Point of Sale (POS), pharmacy systems, handheld devices, store servers, and network infrastructure
Exceptional communication, decision-making, and problem-solving skills, with demonstrated ability to influence senior executives and cross-functional teams (technical and non-technical)
Experience leading performance reviews, career development planning, and team capacity management
Mastery of incident management, observability, automation, and operational excellence practices
Adept at resource planning, program delivery, and change leadership at enterprise scale
Adept at collaboration, teamwork, and fostering an inclusive engineering culture
Benefits
Affordable medical plan options
401(k) plan (including matching company contributions)
Employee stock purchase plan
No-cost programs for all colleagues including wellness screenings, tobacco cessation and weight management programs, confidential counseling and financial coaching
Benefit solutions that address the different needs and preferences of our colleagues including paid time off, flexible work schedules, family leave, dependent care resources, colleague assistance programs, tuition assistance, retiree medical access and many other benefits depending on eligibility
Job title
Lead Director, Software Development Engineering – SRE, Retail Pharmacy