Hybrid Service Reliability Engineer, GI Application Management

Posted 2 weeks ago

Apply now

About the role

  • Site Reliability Engineer at AIG applying software engineering principles to IT operations and building resilient IT infrastructure while ensuring system stability and speed.

Responsibilities

  • Apply software engineering principles to IT operations
  • Build resilient, efficient, and scalable IT infrastructure
  • Prioritize automation, monitoring, and incident management
  • Define and meet Service Level Objectives (SLOs)
  • Manage error budgets
  • Conduct blameless postmortems for continuous improvement
  • Act as a bridge between development and operations teams
  • Ensure the speed of software development and system stability

Requirements

  • Bachelor's degree in related field
  • 3+ years of relevant technology experience
  • Solid grasp of core technical areas such as programming (Python, Go, Java)
  • System administration (Linux/Unix), networking, databases, and cloud computing platforms (like AWS, Azure, GCP)
  • Practical experience running production systems
  • Proficiency in scripting languages (e.g., Python, Bash)
  • Experience with Infrastructure as Code (IaC) tools (e.g., Terraform, Ansible)
  • Implementing comprehensive monitoring solutions (e.g., Prometheus, Grafana, or ELK Stack)
  • Ability to quickly diagnose and resolve system incidents
  • Excellent communication skills
  • Proactive in learning new technologies

Benefits

  • Volunteer Time Off
  • Matching Grants Programs
  • Comprehensive benefits package focused on health, wellbeing and financial security
  • Professional development opportunities

Job title

Service Reliability Engineer, GI Application Management

Job type

Experience level

Mid levelSenior

Salary

Not specified

Degree requirement

Bachelor's Degree

Location requirements

Report this job

See something inaccurate? Let us know and we'll update the listing.

Report job