Site Reliability Engineer Lead Analyst at Citi overseeing application systems analysis and reliability. Leading monitoring, automation, and collaborative initiatives to enhance system performance.
Responsibilities
Monitor, Measure and analyze the system's performance and availability
Resolve variety of high impact problems/projects through in-depth evaluation of complex business processes, system processes, and industry standards
Provide expertise in area and advanced knowledge of applications programming and ensure application design adheres to the overall architecture blueprint
Utilize advanced knowledge of system flow and develop standards for coding, testing, debugging, and implementation
Develop comprehensive knowledge of how areas of business, such as architecture and infrastructure, integrate to accomplish business goals
Provide in-depth analysis with interpretive thinking to define issues and develop innovative solutions
Serve as advisor or coach to junior SRE engineers, allocating work as necessary
Develop and maintain automated tools and systems to manage and monitor the infrastructure
Reduce manual intervention, human errors and the time it takes to perform routine tasks
Periodically assess the capacity of needs of services and work on scaling them to handle the increased usage
Plan for resource allocation, manage load balancing and ensure the system can handle demand fluctuations
Work to detect, diagnose and resolve issues quickly to minimize the impact on users and business
Conduct post-incident reviews to learn and improve system's reliability
Work with different development teams, product owners and other stakeholders to ensure seamless deliveries and aligning to a common goal
Requirements
6+ years of relevant experience in Apps Development or systems analysis role
Extensive experience system analysis and in programming of software applications
Extensive experience in automated pipelines, automated testing and automated security controls
Extensive experience in the use of logging tools/systems (splunk, appDynamics, etc...)
Experience in managing and implementing successful projects
Subject Matter Expert (SME) in at least one area of Applications Development
Ability to adjust priorities quickly as circumstances dictate
Demonstrated leadership and project management skills
Consistently demonstrates clear and concise written and verbal communication
Benefits
medical, dental & vision coverage
401(k)
life, accident, and disability insurance
wellness programs
paid time off packages, including planned time off (vacation), unplanned time off (sick leave), and paid holidays
Job title
Site Reliability Engineer, Lead Analyst, Vice President
DevOps Engineer designing and implementing solutions to optimize operations in media technology at Mediagenix. Collaborating with cross - functional teams to enhance user experiences.
Senior DevOps Engineer at SimCorp managing cloud environments and automating builds using Azure. Collaborating with cross - functional teams to ensure high service availability and compliance.
DevOps Senior Software Engineer at SimCorp developing high - quality software solutions for financial technology. Responsible for mentoring junior engineers and solving complex technical challenges.
DevOps Engineer designing, building, and operating software development infrastructure for CodeMettle. Leading automation and best practices to enhance value delivery across teams.
DevOps Engineer maintaining scalable infrastructure for VOX's telecom services. Implementing automation and CI/CD pipelines in a fast - paced environment with significant growth potential.
DevOps Engineer focused on designing and managing CI/CD pipelines using Azure DevOps. Collaborating with teams for application deployment and ensuring DevSecOps practices.
DevOps Engineer working closely with engineering and security teams to optimize CI/CD pipelines and manage infrastructure. Ensuring security and compliance for mission - critical financial applications.
Build and scale cloud infrastructure that powers Heidi's healthcare AI platform. Work with AWS and Azure while enhancing automation and reliability in an innovative healthtech startup.
Infrastructure - as - Code DevOps Engineer designing and managing cloud - native platforms at Vodafone. Collaborating with agile teams for digital transformation and business success.
Director of Data Engineering leading a strategic DevOps team within Enterprise AI. Balancing leadership with hands - on expertise to enable AI technology adoption.