Site Reliability Engineer Lead Analyst at Citi overseeing application systems analysis and reliability. Leading monitoring, automation, and collaborative initiatives to enhance system performance.
Responsibilities
Monitor, Measure and analyze the system's performance and availability
Resolve variety of high impact problems/projects through in-depth evaluation of complex business processes, system processes, and industry standards
Provide expertise in area and advanced knowledge of applications programming and ensure application design adheres to the overall architecture blueprint
Utilize advanced knowledge of system flow and develop standards for coding, testing, debugging, and implementation
Develop comprehensive knowledge of how areas of business, such as architecture and infrastructure, integrate to accomplish business goals
Provide in-depth analysis with interpretive thinking to define issues and develop innovative solutions
Serve as advisor or coach to junior SRE engineers, allocating work as necessary
Develop and maintain automated tools and systems to manage and monitor the infrastructure
Reduce manual intervention, human errors and the time it takes to perform routine tasks
Periodically assess the capacity of needs of services and work on scaling them to handle the increased usage
Plan for resource allocation, manage load balancing and ensure the system can handle demand fluctuations
Work to detect, diagnose and resolve issues quickly to minimize the impact on users and business
Conduct post-incident reviews to learn and improve system's reliability
Work with different development teams, product owners and other stakeholders to ensure seamless deliveries and aligning to a common goal
Requirements
6+ years of relevant experience in Apps Development or systems analysis role
Extensive experience system analysis and in programming of software applications
Extensive experience in automated pipelines, automated testing and automated security controls
Extensive experience in the use of logging tools/systems (splunk, appDynamics, etc...)
Experience in managing and implementing successful projects
Subject Matter Expert (SME) in at least one area of Applications Development
Ability to adjust priorities quickly as circumstances dictate
Demonstrated leadership and project management skills
Consistently demonstrates clear and concise written and verbal communication
Benefits
medical, dental & vision coverage
401(k)
life, accident, and disability insurance
wellness programs
paid time off packages, including planned time off (vacation), unplanned time off (sick leave), and paid holidays
Job title
Site Reliability Engineer, Lead Analyst, Vice President
Site Reliability Engineer enhancing system reliability and deployment practices at OpenLoop. Collaborating with cross - functional teams for incident management and performance tuning.
Senior DevOps Engineer enhancing Azure application reliability for a healthcare fintech platform. Collaborating closely with engineering teams to ensure deploy safety and observability.
DevOps Engineer contributing to tooling changes and leading a community of practice at Totara. Focused on collaboration, development, and support for internal teams.
Site Reliability Engineer responsible for infrastructure supporting AI platform. Safeguarding US customer data and ensuring compliance in the Aerospace and Defense sector.
Senior Infrastructure Engineer managing Azure platform for a SaaS product at Rillion. Focused on automation, security, reliability, and scalability in a hybrid work environment.
Statistician/Reliability Engineer applying statistical analysis for satellite systems at Aerospace Corporation. Leading projects on system reliability and working closely with interdisciplinary teams in a full - time on - site role.
DevOps Engineer designing and implementing solutions to optimize operations in media technology at Mediagenix. Collaborating with cross - functional teams to enhance user experiences.
DevOps Senior Software Engineer at SimCorp developing high - quality software solutions for financial technology. Responsible for mentoring junior engineers and solving complex technical challenges.
Senior DevOps Engineer at SimCorp managing cloud environments and automating builds using Azure. Collaborating with cross - functional teams to ensure high service availability and compliance.
DevOps Engineer designing, building, and operating software development infrastructure for CodeMettle. Leading automation and best practices to enhance value delivery across teams.