Site Reliability Engineer responsible for leading technology teams at SS&C. Delivering scalable and resilient infrastructure platforms in the financial services and healthcare technology sectors.
Responsibilities
Collaborate with Technology Infrastructure teams to build and operate reusable, cloud-native platforms
Work with business units and technical teams to improve application availability, observability, and reliability
Enhance platform reliability through automatic problem detection and self-healing systems
Use SLOs, SLIs, and KPIs to guide prioritization and measure impact
Eliminate toil using intelligent automation and agentic workflows
Conduct blameless retrospectives and share learnings across the organization
Foster a culture of ownership and continuous learning
Integrate DevSecOps, zero-trust principles, and policy-as-code into every pipeline
Produce and promote Architecture Decision Records (ADRs) and Cloud Well-Architected Frameworks
Requirements
5+ years of professional experience in an SRE role
3+ years in financial services or other regulated industries preferred
Minimum Bachelor’s degree in Computer Science, Engineering, or a related field
Proven expertise in architecting, designing and operating private cloud environments (e.g., VMware, OpenStack, OpenShift Virtualization) and Kubernetes clusters
Hands-on experience building, deploying, and operating infrastructure-as-code platforms
Experience with CI/CD pipelines and observability platforms (e.g., Prometheus, Splunk)
Strong understanding of modern systems reliability standards and practices
Familiarity with financial services regulatory frameworks and their impact on infrastructure design and operations
Familiarity with structured naming conventions and asset management for global infrastructure
Experience with financial-grade network segmentation, micro-segmentation, and zero-trust architecture
Certifications such as TOGAF, AWS Certified Solutions Architect, VMware VCP, or Red Hat Certified Architect are a plus
Familiarity with ISO 27001, NIST 800-53, and other security frameworks is a plus
Senior DevOps / Platform Engineer responsible for Voice AI product deployment and operational quality. Working with engineering teams to enhance deployment, reliability, and observability processes.
Cloud Engineer specializing in hybrid-cloud platform design and operation at Dun & Bradstreet. Collaborating closely with team members to enhance developer self-service and automation capabilities.
DevOps Engineer II evolving cloud infrastructure and CI/CD pipelines at HackerRank. Collaborating with teams to design, build, and optimize systems for developer productivity.
DevOps Engineer managing CI/CD pipelines and cloud infrastructure for mobile apps at Air Apps. Collaborating with teams to ensure app performance and reliability.
DevOps Engineer at Vodafone Romania delivering resilient infrastructure for the software development lifecycle. Collaborating with Digital Squads and optimizing CI/CD pipelines for efficient deployments.
Mechanical/Reliability Engineer responsible for mechanical installations in Bergen op Zoom. Analyzing maintenance strategies and leading projects to enhance reliability.
Senior DevOps Engineer responsible for cloud infrastructure and deployments. Optimizing AWS services and ensuring system security and reliability for Verizon.
Senior DevOps Engineer responsible for automating infrastructure and building CI/CD pipelines for a collaborative robotics company. Collaborating with global engineering teams from the Bangalore office.
Site Reliability Engineer Intern at Tencent working on gaming services and cloud native solutions. Collaborating with global teams to eliminate toil and enhance reliability.
Cloud/DevOps Specialist at N5X managing and optimizing critical cloud infrastructures for Brazilian energy trading. Collaborating with a multidisciplinary team to ensure high availability and performance.