Senior DevOps role supporting AI and Data services on Cloud and on-prem infrastructure. Collaborating with AI Developers to enable advanced AI capabilities for Data Scientists.
Responsibilities
Administer multiple systems: Apply routine updates, especially security patches.
Respond to incidents and minimize service interruptions.
Investigate and fix complex defects.
Establish good monitoring and observability of system health and cost.
Support developers and users: Help resolve system issues and escalations that require complex technical expertise to troubleshoot.
Perform root cause analysis, interpret the results, and develop an action plan in the backlog (identifying short- and long-term solutions).
Maintain open and efficient communication with developers and users while troubleshooting issues.
Participate in proofs of concept for medium to large initiatives.
Facilitate development and design of technical solutions: Enhance existing products and propose new/future releases for better security, scalability, flexibility, and efficiency.
Assist in developing requirements, understanding the impact of architecture on the overall solution.
Assist development teams by providing guidance on operational best practices.
Requirements
Bachelor’s degree in computer science or computer engineering, or an equivalent combination of education and experience.
A self-starter who can drive initiatives and follow them through from beginning to end.
Excellent communication skills, spoken and written.
Bilingualism is required for candidates located in Quebec, given the need to interact regularly with English-speaking colleagues across the country.
Benefits
Opportunities and performance-led financial rewards