SRE Lead responsible for driving reliability and performance across Platform Engineering ecosystem at Birlasoft. Leading capacity planning, incident management, and mentoring SRE engineers.
Responsibilities
Responsible for driving reliability, resiliency, and performance across Platform Engineering ecosystem
Define & maintain SLOs, SLIs, and error budgets for platform services
Lead capacity planning, performance tuning, autoscaling strategies, and resilience testing
Own monitoring stack across Azure Monitor, App Insights, Log Analytics, OpenTelemetry, and AKS
Lead incident response, on-call processes, and blameless postmortems
Implement automation-first operations for remediation, self-healing, and repetitive tasks
Partner with Platform Engineering pods to embed reliability by design
Mentor SRE engineers and lead the maturity of SRE practices
Requirements
Bachelor’s degree in Computer Science, Engineering, or related field
8–14 years of experience in SRE, DevOps, Cloud, or Platform Engineering roles
DevOps Engineer ensuring the stability and scalability of the justtrack platform. Collaborate with development teams managing the cloud infrastructure for a SaaS solution.
Senior DevOps Engineer implementing CI/CD solutions for software projects. Requires expertise in Docker, Azure, and IAC tools in a hybrid work environment.
Site Reliability Intern ensuring smooth operation of Compute services and collaborating on tooling development. Participate in teams for system performance and reliability improvements in a global tech company.
Site Reliability Engineer at ING enhancing BTP platform services with a focus on reliability and scalability. Collaborating with cross - functional teams to drive continuous improvement and implement effective monitoring solutions.
DevOps role at Vodafone responsible for designing and maintaining decisioning workflows for automated credit vetting using DataView360 platform. Collaborate with analysts to translate requirements into technical solutions.
Senior Director of Engineering leading the DevSecOps Platform team. Championing developer experiences and integrated practices to enhance security and effectiveness at FIS.
DevOps Engineer responsible for designing AWS infrastructure at AI - driven legal conveyancing startup. Collaborating with teams on CI/CD, monitoring, and security compliance.
Co - op role in Configuration Management at Pratt & Whitney, engaging in engineering projects within aerospace. Collaborating with various teams and gaining practical experience in the industry.
Lead System Engineer maintaining Oracle EBS/ERP applications at AT&T. Providing production support and troubleshooting for supply chain processes and integrations.
DevOps Engineer at WiseStamp designing and automating cloud infrastructure. Bridging development and operations to enhance system reliability and performance.