Site Reliability Engineer at ING enhancing BTP platform services with a focus on reliability and scalability. Collaborating with cross-functional teams to drive continuous improvement and implement effective monitoring solutions.
Responsibilities
Ensure Service Level Objective (SLO) levels are set and met
Optimize our Observability tooling like Grafana dashboards
Report on GSRE targets and KPIs
Drive Always Available mindset and behaviour
Define and enhance standards for logging monitoring and alerting
Improve incident response practices
Participate in Root Cause Analysis
Drive Continuous improvement on all services in the R&BT Platforms
Roll out new resilience features through the organization
Requirements
4+ years of experience working using Agile DevOps principles
Solid understanding how technology setup and ITSM processes relate to service level objectives like Availability (time based, successful call rate, response times), MTTR, and MTBF
Good understanding of microservices architecture and related high availability / resilience patterns
Proven experience: working as a Site Reliability Engineer or DevOps engineer
Scripting in at least one of the following: Ruby, Python, Bash, PowerShell
Set up Build and Deployment pipelines in Azure DevOps (ADO)
Eliminate toil through automation and process optimization
Able to coordinate/lead incident response and Post mortem / root cause analysis activities
Benefits
A collaborative, communicative environment
Flexible way of working
Opportunities for coaching and training
Job title
Site Reliability Engineer – Retail & Banking Technology
DevOps Engineer ensuring the stability and scalability of the justtrack platform. Collaborate with development teams managing the cloud infrastructure for a SaaS solution.
Senior DevOps Engineer implementing CI/CD solutions for software projects. Requires expertise in Docker, Azure, and IAC tools in a hybrid work environment.
Site Reliability Intern ensuring smooth operation of Compute services and collaborating on tooling development. Participate in teams for system performance and reliability improvements in a global tech company.
DevOps role at Vodafone responsible for designing and maintaining decisioning workflows for automated credit vetting using DataView360 platform. Collaborate with analysts to translate requirements into technical solutions.
SRE Lead responsible for driving reliability and performance across Platform Engineering ecosystem at Birlasoft. Leading capacity planning, incident management, and mentoring SRE engineers.
Senior Director of Engineering leading the DevSecOps Platform team. Championing developer experiences and integrated practices to enhance security and effectiveness at FIS.
DevOps Engineer responsible for designing AWS infrastructure at AI - driven legal conveyancing startup. Collaborating with teams on CI/CD, monitoring, and security compliance.
Co - op role in Configuration Management at Pratt & Whitney, engaging in engineering projects within aerospace. Collaborating with various teams and gaining practical experience in the industry.
Lead System Engineer maintaining Oracle EBS/ERP applications at AT&T. Providing production support and troubleshooting for supply chain processes and integrations.
DevOps Engineer at WiseStamp designing and automating cloud infrastructure. Bridging development and operations to enhance system reliability and performance.