Site Reliability Engineer at Fidelity responsible for the reliability and observability strategy. Ensuring system availability through technical standardization and process refinement.
Responsibilities
Help define and execute a comprehensive reliability and observability strategy, ensuring that Fidelity’s systems are always available when our customers need them.
Bring together technical, procedural, and financial data to reduce toil and increase efficiency.
Execute plans for technical standardization and process refinement within the engineering organization, especially for Site Reliability Engineers.
Coach peer SREs and development teams on how to build highly available systems.
Requirements
Bachelor’s degree or higher in a technology-related field (e.g., Engineering, Computer Science) required; master’s degree a plus.
5+ years of hands-on experience deploying and/or supporting highly distributed multi-tiered systems at scale.
Strong experience in cloud development (preferably AWS) and cloud migration.
2-4 years of experience in software development with Python, Node.js, or Java, with a focus on SDLC and automation.
Hands-on experience with container orchestration, preferably with Kubernetes.
Solid understanding of cloud computing and DevOps concepts, including CI/CD pipelines.
Experience with instrumentation, plus the systems skills to build and operate monitoring, logging, and alerting services for distributed systems at scale.
Proven experience in maintaining the scalability and resiliency of complex environments.
Ability to triage incidents, perform root cause analysis, and be decisive under pressure.
Benefits
Most roles at Fidelity are hybrid, requiring associates to work onsite every other week (all business days, M-F) in a Fidelity office.