Senior Site Reliability Engineer at Entrust ensuring reliability and performance of SaaS platform with extensive DevOps tooling experience. Collaborating on micro-services and cloud solutions for identity-centric security.
Responsibilities
Own SLOs/SLIs for availability (99.9%), latency, error rate, and quality of service across microservices
Build health probes and SLA monitors for critical transactions and cross-service dependencies
Monitor system issues using various metrics, such as uptime, latency, error rate, throughput, and availability
Lead incident response (triage, comms, coordination, real-time mitigation) and conduct blameless postmortems with actionable follow-ups
Validate scaling strategies (horizontal vs. vertical) and implement auto-scaling where supported
Requirements
Bachelor’s degree in computer science, Software Engineering, or equivalent combination of education and experience
5+ years of related experience as a Software Engineer, DevOps Engineer, Site Reliability Engineer or a role in similar capacity
Extensive experience working with enterprise level micro-services applications, including deployment and maintenance of the applications in distributed environments
Demonstrated hands-on experience and expertise with DevOps tooling (Ansible, Terraform, Jenkins, Octopus deploy, etc.)
In-Depth hands-on experience with on-prem and cloud compute, storage and networking solutions (vmWare, NetApp, Azure, AWS, etc)
Benefits
comprehensive health and well-being programs including medical, vision, and dental
Associate DevSecOps Engineer in Bologna supporting deployment and maintenance of DevSecOps tools in R&D infrastructure. Collaborating on automation and logging initiatives in a dynamic environment.
DevSecOps Engineer at Helpshift, designing and implementing secure infrastructure while collaborating with InfoSec teams. Responsibilities include automation and mentoring within hybrid work structure.
Forward Deployment Engineer at Adobe addressing engineering challenges while working onsite with customers and delivering innovative solutions. Contributing to Adobe's Digital Experience and joining a newly formed team.
Reliability Engineer providing support to maintenance and operations teams for critical gold processing assets. Ensuring equipment reliability and leading improvement initiatives at Gruyere Gold Mine.
Site Reliability Engineer responsible for monitoring and improving production systems at ING. Leading teams to ensure high reliability and performance of business - critical applications.
Staff Reliability Engineer at insurance company enhancing stability and performance of systems. Collaborating across teams to implement best practices and mentor others in reliability engineering.
Reliability Engineer at Mosaic Company providing in - depth analysis on mechanical systems to reduce risk. Supporting operations in reliability improvement initiatives across refinery and minefield.
DevOps Engineer at MYOB enhancing core business management systems for small to medium enterprises in Australia and New Zealand. Focused on operational excellence and stability.