Design, implement, and manage cloud-based infrastructure solutions on Microsoft Azure.
Oversee deployment, maintenance, and scaling of Azure services, ensuring high availability and reliability for applications.
Work closely with software development teams to understand application requirements and provide necessary support for continuous integration and continuous deployment (CI/CD) practices.
Foster a collaborative environment by sharing best practices, tools, and methodologies with development teams.
Develop and maintain automated CI/CD pipelines to streamline the deployment process for applications.
Integrate testing automation into deployment pipelines, ensuring high-quality deliverables.
Implement monitoring and alerting tools to track application performance, system health, and infrastructure usage.
Analyse performance metrics and logs to troubleshoot issues and optimize application and infrastructure performance.
Collaborate with security teams to implement best practices for infrastructure security, including access controls, network security, and data protection.
Ensure compliance with industry regulations and internal policies regarding data protection and system management.
Produce clear and comprehensive documentation on system architecture, deployment processes, and operational procedures.
Provide regular reports on system performance, deployment status, and incident management to technical leadership.
Requirements
Proven experience in a DevOps or Site Reliability Engineering role, preferably within cloud environments.
Strong hands-on experience with Microsoft Azure, including services such as Azure App Service, Azure Functions, Azure Kubernetes Service, and Azure DevOps.
Proficiency in scripting languages (e.g., PowerShell, Python, Bash) for automation tasks.
Familiarity with containerization technologies (e.g., Docker, Kubernetes) and orchestration tools.
Experience with CI/CD tools (e.g., Azure DevOps, Jenkins, GitLab CI/CD) and version control systems (e.g., Git).
Strong understanding of network protocols, security, and infrastructure as code concepts (e.g., Terraform, ARM templates).
Excellent problem-solving skills and the ability to troubleshoot complex issues in a collaborative environment.
Strong communication skills, with the ability to convey technical concepts to both technical and non-technical stakeholders.
Benefits
A flexible holiday plan of up to 40 days per year
£400 a year Wellbeing Allowance
Private Medical Insurance
Allowance for professional development books, E-books, podcasts
Contributory pension Scheme
Employee, friends and family discounts across 1200+ retail, hospitality and lifestyle brands
DevOps Product Manager working on complex platform and infrastructure projects. Consulting on DevOps best practices and ensuring scalable, efficient digital ecosystems for clients.
Site Reliability Engineer optimizing large - scale Linux environments at Bumble Inc. Troubleshooting incidents and driving performance improvements on platforms such as Kafka and Kubernetes.
Senior DevOps Engineer at mylo, managing multi - cloud infrastructure and CI/CD pipelines. Promoting DevOps culture while ensuring compliance and automating system maintenance.
Lead Site Reliability Engineer at S&P Global's Cloud Engineering team. Responsible for designing and maintaining cloud infrastructure and ensuring the performance of cloud - based systems.
Site Reliability Engineer responsible for monitoring and improving the reliability of satellite operations infrastructure. Collaborating with teams to automate processes in a dynamic environment.
DevOps Analyst providing high quality and reliable solutions within multifuncional teams at technology - focused financial organization. Automating build and deployment solutions in a hybrid work environment.
Network & Datacenter Deployment Engineer at Cloudflare focused on building and expanding their global network infrastructure with collaboration across multiple engineering teams and vendors.
Senior DevOps Engineer leading cloud - native solutions at Sparksoft Corporation. Driving automation and system reliability within a fast - paced Agile team.