Cloud Ops Engineer at Motive ensuring operational health and reliability of the platform. Managing incident response and improving system reliability across diverse tech stacks.
Responsibilities
Own and refine the incident management lifecycle: serve as incident commander, run communication and triage, and lead post-incident analysis and follow-ups to drive continuous service improvement.
Manage the central on-call solution and its integrations with monitoring and other platforms, used by over 100 teams, leveraging automation and self-serve tools such as Terraform.
Analyze operational statistics (MTTR, incident frequency, service-level data) to identify trends and prioritize reliability initiatives and teams’ focus.
Improve change management processes and automation to reduce both risk and friction.
Collaborate with engineering teams across the organization to standardize operational practices and develop automated workflows.
Leverage AI for incident analysis, alert and issue resolution, and automation.
Requirements
Experience managing and participating in a 24/7 on-call rotation and incident response process.
Experience with on-call systems such as Rootly, PagerDuty, or Opsgenie.
Experience with monitoring and observability tools (e.g., Datadog, New Relic, Grafana).
Ability to communicate clearly and to manage incidents, communications, and action items with stakeholders ranging from engineers to directors, including public-facing messaging.
Experience with IT Service Management tools (Jira/JSM) for ticket and change management.
3+ years of experience in an incident response role.
Benefits
Creating a diverse and inclusive workplace is one of Motive's core values.
We are an equal opportunity employer and welcome people of different backgrounds, experiences, abilities and perspectives.