Senior DevOps overseeing installation and maintenance of Apigee runtime environments in multiple data centers. Ensuring high availability and performance while participating in on-call support rotation.
Responsibilities
Manage the installation and maintenance of Apigee runtime environments across multiple data centers, ensuring seamless operations and deployments.
Ensure high availability, scalability, and optimal performance of the Apigee platform to meet business and user demands.
Implement and maintain comprehensive monitoring, logging, and alerting solutions to proactively identify and address issues.
Automate infrastructure provisioning and management to enhance efficiency and reduce manual intervention using leading automation tools.
Troubleshoot and resolve runtime issues promptly to minimize downtime and maintain service reliability.
Participate in a 24/7 on-call support rotation to ensure continuous system operability and address critical issues as they arise.
Requirements
Strong experience in managing runtime environments, with specific expertise in Apigee hybrid.
Expertise in ensuring system availability, scalability, and performance, with a focus on delivering uninterrupted services.
Proficiency in using monitoring, logging, and alerting tools to maintain high visibility into system operations and preemptively identify potential issues.
Skills in infrastructure automation, including scripting and use of CI/CD pipelines to streamline operations.
Proven ability to troubleshoot and resolve technical issues efficiently, reducing mean time to recovery (MTTR).
Knowledge of Kubernetes (k8s) for container orchestration and GitHub action for automatic deployment.
For candidates located in Quebec, bilingualism is required considering the necessity to interact on a regular basis with English speaking colleagues across the country.
No Canadian work experience required however must be eligible to work in Canada.
Benefits
A financial rewards program that recognizes your success
An industry leading Employee Share Purchase Plan; we match 50% of net shares purchased
An extensive flex pension and benefits package, with access to virtual healthcare
Flexible work arrangements
Possibility to purchase up to 5 extra days off per year
An annual wellness account that promotes an active and healthy lifestyle
Access to tools and resources to support physical and mental health, embracing change and connecting with colleagues
A dynamic workplace learning ecosystem complete with learning journeys, interactive online content, and inspiring programs
Inclusive employee-led networks to educate, inspire, amplify voices, build relationships and provide development opportunities
Inspiring leaders and colleagues who will lift you up and help you grow
A Community Impact program, because what you care about is a part of what makes you different.
Senior Executive supporting technology initiatives in Pune, India. Collaborating globally to connect people and solve complex challenges in a sustainable manner.
DevOps Engineer leading the design, implementation, and optimisation of Kubernetes platforms for Vodafone. Collaborating with product teams to streamline operational processes and enhance developer experience.
Senior Site Reliability Engineer developing scalable systems and automation for high - scale projects at Euna Solutions. Collaborating closely with software developers and mentoring junior engineers.
Senior Site Reliability Engineer responsible for designing scalable systems at Euna Solutions. Collaborating with developers and mentoring juniors while driving automation and reliability.
Senior Site Reliability DevOps Specialist at Boeing overseeing GCP cloud environment and infrastructure. Ensuring reliability, scalability, and automation while collaborating with distributed teams.
Lead DevOps Engineer driving modernization and operational excellence for Enterprise Payments at American Family Insurance. Collaborate across teams and enhance payment processing capabilities.
Senior DevOps Engineer at Fidelity leading operational excellence of production reporting applications. Responsible for stability, reliability, and cloud migration initiatives in a hybrid work environment.
Senior Site Reliability DevOps Specialist for Boeing, focusing on cloud technology and automation in GCP environments. Collaborate globally to enhance system reliability and performance with a diverse tech stack.
SRE Team Lead in charge of reliability strategy and operational maturity for a cybersecurity SaaS platform. Leading a specialized team to enhance system performance and incident management.
Junior DevOps Engineer implementing continuous integration and deployment architecture for the Defense Logistics Agency. Debugging cluster - based computing while using various configuration management tools.