Principal AI Site Reliability Engineer driving operational excellence for critical contact center applications at Fidelity. Leading automation and observability initiatives to improve reliability and efficiency.
Responsibilities
Drive operational excellence, observability, and intelligent automation for mission-critical contact center applications
Lead initiatives to advance observability, automation, and operational efficiency
Collaborate with engineering and business leaders to prioritize and resolve issues impacting associate experience
Implement automation and self-service capabilities to reduce manual intervention and improve reliability
Establish and track SLIs/SLOs to measure and optimize system performance
Communicate progress, outcomes, and technical concepts clearly to senior leadership and stakeholders
Requirements
10+ years in technology operations, systems engineering, or production support leadership
Deep expertise in IT Service Management (ITSM), incident/problem management, and operational process optimization
Advanced knowledge of observability and monitoring tools (OTEL, Splunk, DataDog, Prometheus, Grafana)
Experience leveraging AI and automation to drive efficiency and reliability
Proficiency in scripting and automation (Python, Bash, PowerShell, or similar)
Strong understanding of On-Prem and Public Cloud (AWS/Azure/GCP) environments
Familiarity with networking, load balancing, and security fundamentals
Agile and DevOps mindset with experience in CI/CD and operational automation
Data Transport Infrastructure DevOps Engineer at Leidos modernizing global - scale multi - cloud environments for USAF missions. Involves developing cloud - native solutions and ensuring security best practices.
DevOps Engineer responsible for building and optimizing AWS - based infrastructure and backend systems at Allguth GmbH. Part of a team focused on innovative mobility solutions in Munich region.
(Senior) DevOps Engineer specializing in ML solutions implementation and management in Germany. Focused on CI/CD pipelines, automation, and cloud services.
Specialist DevSecOps joining Periferia IT Group, a leader in digital transformation. Work in a dynamic environment with continuous learning and professional development opportunities.
Join Zinkworks as a Senior Platform Engineer designing scalable IaC - driven cloud platforms for a large - scale enterprise contact centre. Focused on automation, reliability, and platform ownership in a hybrid work environment.
Asset Reliability Engineer providing maintenance advice and service innovations. Join Sensorfact, the leading smart monitoring platform, to modernize the industrial sector.
Cloud Operations Engineer responsible for securing AWS infrastructure at Avalon Healthcare Solutions. Collaborating on SRE best practices and ensuring system reliability and performance.
Design Release Engineer designing, developing, and releasing seat systems for Ford vehicles. Ensuring engineering deliverables meet quality, cost, and timing targets while collaborating with cross - functional teams.
DevOps Engineer responsible for maintaining FME infrastructure and development pipelines at Safe Software. Collaborate in an agile team focused on constant improvement and automation.