Site Reliability Engineer ensuring reliability and availability of critical gaming platforms at Flutter Entertainment. Collaborating with teams to implement monitoring and incident response procedures.
Responsibilities
Ensure the reliability, availability, and performance of critical gaming and betting platforms across global operations
Maintain 24/7/365 service availability for millions of customers worldwide
Implement automation, monitoring, and incident response procedures
Design and implement monitoring, alerting, and observability solutions using tools such as Grafana, Splunk & CloudWatch
Conduct capacity planning and performance optimization
Establish and maintain Service Level Objectives (SLOs) and Service Level Indicators (SLIs)
Support ProdOps and Service Management teams during P1/P2 incident response
Collaborate on post-incident reviews and contribute technical insights
Assist in developing and maintaining comprehensive runbooks and incident response procedures
Design, deploy, and maintain Grafana dashboards for real-time system visibility
Create custom Grafana panels and dashboards for business metrics
Requirements
Advanced experience with AWS, Azure, or Google Cloud Platform services and architecture
Proficiency with Docker and Kubernetes for container orchestration and management
Strong scripting abilities in Python, Go, Bash, or PowerShell; familiarity with Java or .NET advantageous
Hands-on experience with Prometheus, Grafana, ELK stack, or similar monitoring solutions
Proficiency with Jenkins, GitLab CI, Azure DevOps, or similar continuous integration tools
Working knowledge of SQL databases (PostgreSQL, MySQL) and NoSQL solutions
Understanding of load balancers, CDNs, DNS, and network security principles
DevOps Engineer maintaining and evolving deployment pipelines for Docebo’s AI - powered learning platform. Collaborating with cross - functional teams to ensure efficient software releases and infrastructure management.
DevOps Engineer optimizing CI/CD pipelines for Docebo, an AI - powered learning platform. Involves managing multi - tenant infrastructure using AWS, Docker, and Kubernetes.
DevOps Engineer maintaining and automating infrastructure and CI/CD processes for cybersecurity solutions by NordLayer. Collaborating with teams to ensure performance and scalability of cloud services.
DevOps Engineer maintaining and improving infrastructure and CI/CD processes for cybersecurity solutions provider. Collaborating with cross - functional teams for reliable and scalable cloud solutions.
DevOps Engineer maintaining and automating infrastructure and CI/CD processes at NordLayer. Collaborating with Senior Engineers to implement best practices in a dynamic cybersecurity environment.
Secure DevOps Engineer responsible for integrating security into CI/CD pipelines and strengthening AWS infrastructure. Key expertise in AWS security and container management.
DevOps Engineer responsible for CI/CD pipeline development and automation for urban software solutions. Collaborating with teams to enhance efficiency in software deployment and infrastructure.
DevOps Engineer managing cloud and on - premise platforms for a public sector infrastructure project. Collaboration primarily remote, with occasional on - site meetings.
DevSecOps Engineer architecting CI/CD framework services for Truist, enhancing the flow of business value through DevSecOps practices. Building and maintaining automation for software delivery and operations.
Application Security Manager at Evertec, handling security strategy and implementation in financial tech. Leading efforts in Application Security, DevSecOps, and compliance with financial regulations.