Customer Reliability Engineer responsible for proactive monitoring and managing escalations for integration issues. Working with partners to enhance service quality on ClickBus's marketplace.
Responsibilities
Proactive Monitoring and "Golden Signals": You will not just wait for issues to occur. Your role will be to define, implement, and actively monitor the "Golden Signals" (key metrics such as API error rates or spikes in support tickets) to detect integration issues before the partner reports them.
Transparent Escalation Management: Act as the focal point for resolving complex issues, ensuring end-to-end follow-up. This includes keeping the partner informed of the status (e.g., "Under Analysis", "Awaiting Dev") and orchestrating the solution with internal teams.
Development of Resolution Playbooks: To scale our support, you will document and create playbooks (step-by-step guides) for the 5 to 10 most recurring problems. The goal is to standardize and speed up diagnosis and communication.
Partner Journey Mapping: Lead the mapping of critical paths that our partners take (from integration to post-sales), identifying friction points, involved systems, and opportunities for continuous improvement.
Critical Analysis and Insight Generation: Facilitate "Case of the Week" discussions (successes or failures), investigate root causes, gauge partner sentiment, and turn each incident into an actionable learning for the team.
Building the Knowledge Base (CRE Wiki): Be the steward of our internal documentation, centralizing playbooks, case learnings, and resolution guides in an accessible tool to ensure knowledge is shared and not lost.
Requirements
Experience with ITSM systems such as JIRA, ServiceNow, Zendesk, or similar
Basic knowledge of how APIs work
Basic knowledge of Git
Critical thinking and problem-solving
Communication skills (clear and concise)
Empathy and partner-focused mindset
Knowledge of monitoring and observability
Benefits
🥘 Meals/Refeição: R$ 1,000.00/month credited to a Flash card (Flexible Benefits)
💻 Home office allowance: R$ 141.16/month credited to a Flash card (Flexible Benefits)
💰 Flexible benefits: R$ 200.00/month credited to a Flash card (Flexible Benefits)
🚐Busonauta Traveler: Our exclusive benefit for Busonautas — R$ 2,000.00/year to use for purchasing bus tickets via the app or website
🚋 Commuter transit allowance (Vale Transporte)
🅿️ Parking
🏥 SulAmérica Health Insurance: no co-payment and no monthly fee
🦷 SulAmérica Dental Insurance
👶 Childcare assistance for parents
🤰 6-month maternity leave and 30-day paternity leave
🔒 Life insurance
🏋️♀️ Wellhub and totalPass
💸 Annual profit-sharing (PLR)
🏖️ Birthday day off
🐶 Petlove partnership
🩹 Pharmacy assistance
🧑🦽➡️ Support for children with disabilities (PCD)
📚 Partnerships with educational and leisure institutions
Senior Reliability Engineer at Sonova ensuring dependable performance of hearing solutions for millions of users globally. Involves engineering skills to improve product reliability across development stages.
Equipment and Reliability Engineer at Chobani responsible for improving asset efficiency, redesigning equipment. Collaborating with Operations to solve complex problems and lead projects in a team environment.
Reliability Engineer II focused on enhancing safety, efficiencies, and cost controls at Freeport - McMoRan mining operations. Collaborating with multiple teams and managing engineering projects.
Reliability Engineer I responsible for equipment failure analysis and improvement recommendations at Freeport - McMoRan's copper smelting operations. Ensuring uninterrupted production and managing equipment health through data analysis.
Designing, building, and maintaining the Kubernetes - based developer platform for Schwarz IT Barcelona. Collaborating with engineering teams to enhance services in Azure and Google Cloud.
Database Reliability Engineer managing MySQL database infrastructure at PointClickCare. Collaborating with Engineering and SRE teams for product development and reliable integration across the platform.
Teamleitung in der Gebäudereinigung in Grimma, verantwortliche Planung, Organisation und Führung des Reinigungsteams. Aktive Mitarbeit und Einhaltung von Hygiene - und Qualitätsstandards sind erforderlich.
Service Reliability Engineer providing technical support and managing incidents for BT International. Ensuring system availability and collaboration with global stakeholders to achieve objectives.
Studying Bachelor of Arts in Accounting, Taxation, and Economic Law while gaining practical experience in a dynamic team. Benefit from a diverse working day and continuous development opportunities.
Technical Trainer conducting workshops and training sessions on MERKUR Group's product content for diverse audiences. Engaging with employees and clients to ensure smooth product operation and understanding.