Lead Site Reliability Engineer at S&P Global's Cloud Engineering team. Responsible for designing and maintaining cloud infrastructure and ensuring the performance of cloud-based systems.
Responsibilities
Develop monitoring solutions to provide complete system coverage
Strategically plan and organize team deliverables
Practice and document disaster recovery scenarios
Streamline our software build and release pipeline
Engage in post-incident reviews to learn from failures and address their root causes
Be available for incident escalations
Requirements
Bachelor’s degree in Computer Science, Engineering, or a related field
10+ years of experience in related roles
AWS Certified Solutions Architect – Professional certification
Expertise in scripting languages such as Python, Bash, and PowerShell
Advanced experience with infrastructure-as-code tools, including Terraform and CloudFormation
Strong time-management skills with the ability to prioritize tasks and meet deadlines
Excellent troubleshooting abilities to diagnose and resolve complex issues efficiently
Clear and effective communication skills, especially when explaining complex technical concepts
Proven leadership experience with the ability to guide and mentor a team of engineers
Benefits
Health & Wellness: Health care coverage designed for the mind and body.
Flexible Downtime: Generous time off helps keep you energized for your time on.
Continuous Learning: Access a wealth of resources to grow your career and learn valuable new skills.
Invest in Your Future: Secure your financial future through competitive pay, retirement planning, a continuing education program with a company-matched student loan contribution, and financial wellness programs.
Family Friendly Perks: It’s not just about you. S&P Global has perks for your partners and little ones, too, with some best-in-class benefits for families.
Beyond the Basics: From retail discounts to referral incentive awards—small perks can make a big difference.
Software Deployment Engineer responsible for deployment, configuration, and testing of applications across multiple environments. Collaborating with engineers and teams to ensure seamless deployments.
Senior Software Deployment Engineer responsible for delivery, configuration, and testing in complex environments. Overseeing deployments and mentoring engineers with a focus on automation and reliability.
DevOps Engineer responsible for deploying AWS cloud infrastructure and building CI/CD pipelines. Collaborating with engineering teams to enhance platform reliability and observability in a hybrid work environment.
DevOps Team Leader focusing on improving technology and leading a small team of remote engineers for iGaming solutions in Lisbon. Striving for innovation in sports betting and casino experiences.
Site Reliability Engineer responsible for system reliability and performance at a leading financial services technology company. Collaborating with infrastructure, engineering, and security teams to build robust systems.
Principal Release Engineer leading and orchestrating end-to-end release management at F5. Driving cross-platform coordination and ensuring seamless releases across enterprise transformation programs.
Sr DevOps Manager leading the way in Cloud infrastructure, DevOps, and SRE practices at F5. Empowering engineers and fostering a culture of collaboration and improvement.
Site Reliability Engineer focused on developing and improving Kubernetes configurations for F5's infrastructure. Collaborating with product teams and ensuring operational delivery processes are efficient and reliable.
Senior Site Reliability Engineer developing IT infrastructure and automation solutions for Coinbase. Collaborating with infrastructure, security, and compliance teams to enhance operational efficiency.
DevOps Engineer joining the AI and Innovation team to ensure scalable, secure, and resilient systems at a global media agency. Collaborating with UX and AI engineers on next-generation media experiences.