Lead Site Reliability Engineer at S&P Global's Cloud Engineering team. Responsible for designing and maintaining cloud infrastructure and ensuring the performance of cloud-based systems.
Responsibilities
Develop monitoring solutions to provide complete system coverage
Strategically plan and organize team deliverables
Practice and document disaster recovery scenarios
Streamline our software build and release pipeline
Engage in post-incident reviews to learn from failures and solve the root cause
Able to be available for incident escalations
Requirements
Bachelor’s degree in Computer Science, Engineering, or a related field.
+10 years of relevant experience in related roles.
AWS Certified Solutions Architect – Professional certification
Expertise in scripting languages such as Python, Bash, and PowerShell
Advanced experience with infrastructure-as-code tools, including Terraform and CloudFormation
Strong time-management skills with the ability to prioritize tasks and meet deadlines
Excellent troubleshooting abilities to diagnose and resolve complex issues efficiently
Clear and effective communication skills, especially when explaining complex technical concepts
Proven leadership experience with the ability to guide and mentor a team of engineers.
Benefits
Health & Wellness: Health care coverage designed for the mind and body.
Flexible Downtime: Generous time off helps keep you energized for your time on.
Continuous Learning: Access a wealth of resources to grow your career and learn valuable new skills.
Invest in Your Future: Secure your financial future through competitive pay, retirement planning, a continuing education program with a company-matched student loan contribution, and financial wellness programs.
Family Friendly Perks: It’s not just about you. S&P Global has perks for your partners and little ones, too, with some best-in class benefits for families.
Beyond the Basics: From retail discounts to referral incentive awards—small perks can make a big difference.
Site Reliability Engineer ensuring the availability and performance of services for autonomous vehicle operations. Collaborating on system design and automation in a robotics - focused environment.
DevOps Engineer automating continuous deployment and monitoring on AWS for Crown Equipment Corporation. Bridging developers, IT, and external providers for operational efficiency.
Senior DevOps Engineer responsible for leading CI/CD pipeline design and optimization. Collaborating with teams to drive DevOps maturity across the enterprise while managing infrastructure automation.
Cloud Operations Engineer ensuring reliable performance of cloud systems at 2Innovate. Focused on automation, incident management, cloud security, and infrastructure monitoring in cloud environments.
AWS DevOps Engineer responsible for delivering scalable digital experiences for EXL's MarTech ecosystem. Engaging in development, maintenance, and collaboration across stakeholders and services.
Senior Site Reliability Engineer managing critical infrastructure at Hornetsecurity. Collaborating with product teams to ensure performance and reliability across services.
Site Reliability Engineer enhancing platform reliability for AI workflows at WRITER. Overseeing automated solutions and cloud infrastructure supporting high - trafficked AI systems.
Site reliability engineer ensuring 24/7 availability of AI - powered workflows at WRITER. Developing and automating robust platforms for high - traffic AI demands.