Cloud Infrastructure Manager in charge of cloud operations and Site Reliability Engineers, driving innovation with hands-on technical leadership. Ensures scalable, cost-effective infrastructure to support business growth.
Responsibilities
Lead day-to-day cloud operations (deployments, monitoring, incident resolution, change management, and system administration) across multi-cloud environments, with a strong focus on AWS, in a 24/7 multi-region SaaS setup.
Provide hands-on technical leadership for complex infrastructure projects, system upgrades, and major incident responses, serving as the escalation point for the team.
Build, lead, and develop a high-performing SRE team through coaching, mentoring, and feedback, while setting clear performance goals and career paths.
Collaborate with the central FinOps team to drive cloud cost optimisation, implementing cost monitoring frameworks to achieve budget targets.
Lead technical integration of newly acquired businesses into Ideagen’s infrastructure, ensuring smooth consolidation and standardisation with minimal disruption.
Implement robust observability strategies—monitoring, logging, alerting, and dashboards—for proactive issue detection and stakeholder visibility.
Partner with Security teams to ensure all infrastructure meets cybersecurity and compliance standards, including participation in audits.
Oversee vendor management, capacity planning, disaster recovery, and business continuity to maintain high availability, strong performance, and continuous improvement through automation and process optimisation.
Requirements
Proven experience managing 24/7 production environments in multi-region, multi-vendor cloud SaaS platforms.
Cloud expertise: Hands-on experience with AWS and/or Azure infrastructure, including IaaS, PaaS, and managed services.
Database management: Experience with both on-premises and cloud databases (e.g. MSSQL, MySQL, PostgreSQL, Aurora, SQL Azure).
Modern DevOps practices: Proficiency with containerisation, orchestration, and IaC tools (Docker, Kubernetes, Helm, Terraform).
Leadership experience: Track record of successfully leading and developing technical teams.
Communication skills: Excellent written and verbal communication, with the ability to engage technical and business stakeholders.
Problem-solving: Strong analytical and strategic thinking capabilities with a bias toward action.
Agile experience: Familiarity with working in agile development environments alongside cross-functional teams.
Desirable
Compliance frameworks: Experience with ISO27001, SOC2, FedRAMP, or similar compliance standards.
Service Management: Understanding of ITIL service management framework.
Cost optimisation: Experience with cloud FinOps practices and cost management tools.
Observability platforms: Experience with tools like New Relic, Datadog, Prometheus, or similar.
Software Development Lifecycle: Deep understanding of SDLC and CI/CD practices.
Scripting/automation: Proficiency in Python, Bash, PowerShell, or similar languages.
Acquisition integration: Experience integrating infrastructure from M&A activities.
Senior Cybersecurity Cloud Engineer focusing on strengthening cloud security posture and ensuring compliance. Tasks include maintaining secure cloud environments and assisting with incident response.
Enterprise Cloud Architect responsible for aligning cloud architecture with business objectives and managing technology trends. Works across multiple areas including design, risk management, and team collaboration.
Senior Product Manager designing and managing products for Private Cloud environments. Collaborating with cross-functional teams to deliver innovative solutions at Hewlett Packard Enterprise.
Principal Kubernetes / Hybrid Cloud Engineer at LSEG developing a cloud-native container platform for financial markets. Involves engineering, design, and consultancy on Kubernetes solutions.
Cloud Application Architect at Booz Allen revolutionizing cloud application development. Streamlining software development life cycle and incorporating modern technology solutions for national security.
Azure Cloud Operation and Support Engineer developing scalable cloud-native solutions. Collaborating with cross-functional teams to ensure best practices in cloud security and performance.
Cloud Engineer role at S&P Global focused on developing and maintaining cloud architecture. Collaborate with teams to innovate and enhance cloud services and processes.
Lead Infrastructure & CloudOps at Enclaive to build secure confidential computing platforms. Architect multi-cloud infrastructure while leading a high-impact engineering team in Hamburg.
Azure Engineer providing enterprise scale technology solutions for public sector clients while collaborating closely with technical leads and project managers.