Site Reliability Engineer ensuring the reliability and performance of cloud-native infrastructure at Sanlam Fintech. Collaborating with teams to deliver innovative solutions across the African continent.
Responsibilities
Ensure the reliability, scalability, and performance of cloud-native infrastructure and services
Build and maintain resilient systems on AWS
Implement comprehensive observability solutions
Drive automation across the infrastructure lifecycle
Lead incident response and root cause analysis
Build self-healing systems with automated fixes for common failures
Set up metrics, logs and traces for full system visibility
Write and maintain Infrastructure as Code using Terraform and CloudFormation
Design and optimise serverless solutions (Lambda, API Gateway, Step Functions)
Build clean, well-structured automation tools and scripts
Requirements
5+ years of experience in systems engineering, DevOps, or site reliability engineering roles
3+ years of hands-on experience with AWS cloud services in production environments
2+ years of experience with Infrastructure as Code (Terraform and/or CloudFormation)
Demonstrated experience in incident management and on-call responsibilities
Track record of implementing automation that reduced operational toil
Bachelor's degree in Computer Science, Information Technology, Engineering or related field; or equivalent practical experience
Relevant professional certifications are advantageous but not required.
DevOps Engineer designing and operating AWS infrastructure within industrial IoT environments. Working on systems that ensure security, resilience, and end - to - end observability.
Sr. Site Reliability Engineer (SRE) III providing technical solutions for the federal government. Collaborating in a high - performing team focused on reliability and application scalability.
Senior Linux System Engineer developing and maintaining Linux server infrastructure for Th. Geyer GmbH. Collaborating on ERP systems and CI/CD processes while ensuring system performance and security.
Platform Engineer leading the development of cloud application platforms for Allstate. Responsible for cloud infrastructure for ML experimentation and production deployments.
Cloud Platform Engineer (ML DevOps) developing and managing CI/CD pipelines for ML workflows in a leading insurance company. Collaborating with data scientists and ensuring infrastructure security and compliance.
DevOps Engineer developing and managing container platforms for client solutions at Booz Allen Hamilton. Utilizing cloud technologies to enhance capabilities and secure deployments.
Senior DevOps/Platform Engineer automating cloud infrastructure and optimizing delivery pipelines at S&P Global Mobility. Collaborating with teams to enhance product reliability and security.
DevOps Engineer responsible for maintaining and enhancing AWS/EKS platform for energy transition products. Ensuring platform stability, security compliance, and streamlined deployment processes.
Suspension Design and Release Engineer for Ford, impacting vehicle ride, handling, and NVH. Collaborating with cross - functional teams to deliver quality systems and components.
DevOps Engineer at TeamViewer driving DevOps excellence by building CI/CD pipelines and managing Kubernetes. Collaborate within a diverse team to optimize digital processes with cloud infrastructure.