Critical role in capacity management for VMware-based infrastructure supporting BT's Private Cloud platform. Transforming capacity management into a proactive capability aligned with business demand.
Responsibilities
Own the Private Cloud “EC.3” Capacity Management Platform – act as the single accountable owner for capacity planning, forecasting, modelling, and optimisation across the VMware-based Enterprise Cloud v3 environment.
Define and Deliver the Capacity Roadmap – translate business demand and programme milestones into a prioritised backlog of features and automation, using Agile delivery practices.
Implement SRE Guardrails – establish SLIs, SLOs, and error budgets for infrastructure-related reliability; ensure proactive risk management.
Develop Forecasting Models – build accurate short-, medium-, and long-term capacity forecasts using telemetry and scenario analysis to prevent saturation and ensure headroom.
Automate Capacity Workflows – reduce manual toil by creating scripts, policies, and integrations for rightsizing, placement, and quota enforcement using PowerCLI, APIs, and IaC.
Maintain Real-Time Telemetry & Dashboards – provide a single source of truth for utilisation, trends, and optimisation opportunities through VMware Aria Operations (vROps) and reporting tools.
Optimise Cost and Efficiency – align with FinOps principles to deliver showback/chargeback reporting, identify waste, and implement cost-saving measures without compromising reliability.
Integrate with ITSM & Governance – ensure ServiceNow CMDB accuracy, automate request fulfilment, and maintain compliance with capacity policies and audit requirements.
Collaborate Across Teams – work closely with Architecture, Programme Delivery, Finance, and Operations to align capacity decisions with strategic objectives and risk appetite.
Continuously Improve – evolve the capacity management capability through iterative enhancements, stakeholder feedback, and adoption of emerging best practices.
Requirements
Deep VMware Expertise – hands-on experience with vSphere, vCenter, vSAN, NSX-T, and VMware Aria Operations (vROps) for capacity analytics and optimisation.
Capacity Planning & Forecasting – ability to model demand, headroom, and growth scenarios using telemetry and data-driven methods.
Automation & Scripting – proficiency in PowerCLI, Python, and API integrations to automate rightsizing, placement, and quota enforcement.
Agile Delivery Skills – experience managing backlogs, writing user stories, and delivering incremental improvements through sprints and ceremonies.
SRE Practices – strong understanding of SLIs, SLOs, error budgets, and reliability engineering principles applied to infrastructure capacity.
Observability & Analytics – ability to design dashboards and alerts for utilisation, saturation, and optimisation opportunities.
FinOps Awareness – knowledge of cost optimisation, showback/chargeback models, and unit economics for infrastructure services.
Governance & Compliance – familiarity with ITSM tools (e.g., ServiceNow), CMDB data integrity, and audit-ready processes.
Stakeholder Engagement – excellent communication and influencing skills to align capacity decisions with business priorities.
Continuous Improvement Mindset – proactive approach to evolving processes, reducing toil, and adopting emerging best practices.
Benefits
Equal family leave from January 2025: 18 weeks at full pay, 8 weeks at half pay and 26 weeks at the statutory rate. It’s for all parents, no matter how your family is made up.
Enhanced women’s health support: including help with menopause symptoms, cancer screenings, period care and more.
25 days annual leave (not including bank holidays), increasing with service
24/7 private virtual GP appointments for UK colleagues
2 weeks carer’s leave
World-class training and development opportunities