DevOps Engineer focused on CI/CD pipeline management, infrastructure as code, and Azure services for Vistra's global operations. Collaborating with IT teams to ensure efficient deployment and compliance across cloud environments.
Responsibilities
Design, implement, and optimize CI/CD pipelines using tools like Azure DevOps
Develop and maintain IaC scripts using Terraform or similar tools to provision and configure Azure resources
Manage containerized environments using Docker and Kubernetes
Oversee database configuration, backups, and recovery strategies for Azure databases
Design and implement disaster recovery (DR) strategies, leveraging tools like Azure Site Recovery
Document and optimize Recovery Time Objectives (RTO) and Recovery Point Objectives (RPO)
Conduct and support Business Continuity Planning (BCP) testing to ensure operational resilience
Implement and manage monitoring solutions to track system health and performance
Create reusable templates/scripts to provision Azure VMs, containers, or other resources
Work closely with the IT infrastructure team to manage network configurations
Analyze resource usage and recommend cost-saving strategies
Ensure infrastructure and processes adhere to SOC 2 compliance
Automate repetitive tasks using Python, PowerShell, Bash, or Azure CLI
Resolve deployment, infrastructure, and production issues promptly
Requirements
Proven experience with CI/CD tools (Azure DevOps, GitHub Actions) and YAML-based pipeline creation
Strong proficiency in Infrastructure as Code (IaC) tools like Terraform
Hands-on experience with containerization (Docker) and orchestration (Kubernetes)
Deep knowledge of Azure services (e.g., Azure App Services, Azure VMs, VMSS, Azure SQL Database) and hybrid cloud/on-prem environments
Proficiency in scripting languages and tools (Python, PowerShell, Bash, Azure CLI)
Experience with Microsoft technologies (.NET Framework, .NET Core, IIS, SQL Server)
Familiarity with monitoring and logging tools (Azure Monitor, Site24x7, Application Insights, Prometheus, Grafana, Loki)
Knowledge of version control systems (Git, Azure Repos, GitHub)
Benefits
Flexible hybrid working arrangement
Birthday leave
Comprehensive medical insurance
Dental coverage
Wellness allowance
Competitive annual leave entitlement
Internal mentorship program
Reimbursement of professional membership fees for certifications
Cloud Site Reliability Engineer managing Solace Cloud services across leading cloud providers. Ensuring reliability, handling incidents, and collaborating with customers for operational excellence.
Senior Cloud Site Reliability Engineer ensuring reliability and health of Solace Cloud Services with hands-on cloud operations expertise. Leading incident management and customer support for high-impact environments.
DevOps Engineer designing and operating AWS infrastructure within industrial IoT environments. Working on systems that ensure security, resilience, and end-to-end observability.
Sr. Site Reliability Engineer (SRE) III providing technical solutions for the federal government. Collaborating in a high-performing team focused on reliability and application scalability.
Senior Linux System Engineer developing and maintaining Linux server infrastructure for Th. Geyer GmbH. Collaborating on ERP systems and CI/CD processes while ensuring system performance and security.
Platform Engineer leading the development of cloud application platforms for Allstate. Responsible for cloud infrastructure for ML experimentation and production deployments.
Cloud Platform Engineer (ML DevOps) developing and managing CI/CD pipelines for ML workflows in a leading insurance company. Collaborating with data scientists and ensuring infrastructure security and compliance.
DevOps Engineer developing and managing container platforms for client solutions at Booz Allen Hamilton. Utilizing cloud technologies to enhance capabilities and secure deployments.
Senior DevOps/Platform Engineer automating cloud infrastructure and optimizing delivery pipelines at S&P Global Mobility. Collaborating with teams to enhance product reliability and security.