Site Reliability Engineer at AIG applying software engineering principles to IT operations and building resilient IT infrastructure while ensuring system stability and speed.
Responsibilities
Apply software engineering principles to IT operations
Build resilient, efficient, and scalable IT infrastructure
Prioritize automation, monitoring, and incident management
Define and meet Service Level Objectives (SLOs)
Manage error budgets
Conduct blameless postmortems for continuous improvement
Act as a bridge between development and operations teams
Ensure the speed of software development and system stability
Requirements
Bachelor's degree in related field
3+ years of relevant technology experience
Solid grasp of core technical areas such as programming (Python, Go, Java)
System administration (Linux/Unix), networking, databases, and cloud computing platforms (like AWS, Azure, GCP)
Practical experience running production systems
Proficiency in scripting languages (e.g., Python, Bash)
Experience with Infrastructure as Code (IaC) tools (e.g., Terraform, Ansible)
Implementing comprehensive monitoring solutions (e.g., Prometheus, Grafana, or ELK Stack)
Ability to quickly diagnose and resolve system incidents
Excellent communication skills
Proactive in learning new technologies
Benefits
Volunteer Time Off
Matching Grants Programs
Comprehensive benefits package focused on health, wellbeing and financial security
Professional development opportunities
Job title
Service Reliability Engineer, GI Application Management
DevOps Engineer maintaining scalable infrastructure for VOX's telecom services. Implementing automation and CI/CD pipelines in a fast - paced environment with significant growth potential.
DevOps Engineer focused on designing and managing CI/CD pipelines using Azure DevOps. Collaborating with teams for application deployment and ensuring DevSecOps practices.
DevOps Engineer working closely with engineering and security teams to optimize CI/CD pipelines and manage infrastructure. Ensuring security and compliance for mission - critical financial applications.
Build and scale cloud infrastructure that powers Heidi's healthcare AI platform. Work with AWS and Azure while enhancing automation and reliability in an innovative healthtech startup.
Infrastructure - as - Code DevOps Engineer designing and managing cloud - native platforms at Vodafone. Collaborating with agile teams for digital transformation and business success.
Director of Data Engineering leading a strategic DevOps team within Enterprise AI. Balancing leadership with hands - on expertise to enable AI technology adoption.
Join a Data Engineering Team as a Senior DevOps to support multiple Data & AI initiatives. Utilize cloud technologies and enhance data pipelines in a collaborative environment.
Principal Site Reliability Engineer at Early Warning designing performance and resiliency patterns for applications and infrastructure. Collaborating with development teams to improve systems and data integrity.
DevOps Engineer contributing to CI/CD setup and Azure services management. Collaborates with teams to ensure efficient project delivery in a hybrid environment.
IT DevOps Specialist at BMW responsible for analyzing requirements and implementing software solutions in AWS cloud environments. Collaborating internationally within agile teams for digital transformation projects.