Site Reliability Engineer focused on application infrastructure, reliability, and scalability. Working at Early Warning, a leader in financial technology solutions for secure transactions.
Responsibilities
Implement software and tools to improve the performance - availability, scalability, and latency, while delivering end products to customer with the highest efficiency and meeting all security standards.
Build automation and tooling around application management, such as deployments, configuration changes and disaster recovery scenarios.
Implement and evangelize Observability and monitoring systems to proactively detect problems and identify cause.
Evaluate capacity of the application on a continuous basis to provide stats to the Product/Business teams and recommend an efficient path to scale for future needs.
Identify performance bottlenecks and work with cross-functional teams to troubleshoot and resolve issues.
Implement standards across multiple disciplines, systems and practices to improve the overall application delivery.
Work directly with application development teams to provide feedback and technical requirements to the software development lifecycle, implementing best-practice microservice design patterns and other modern software development approaches.
Serve as a technical liaison for the application and provide documents and runbooks to Level 1 and Level 2 teams.
Participate in 24 X 7 on-call rotation.
Requirements
Education and experience typically obtained through completion of a Bachelor’s Degree in Business and/or Computer Science or related field.
3+ years of related experience managing large complex projects in a technical or software development environment inclusive of post-graduate degree
Demonstrated experience in effective Incident and Problem Management
Proven related work experience in a medium to large scale enterprise.
Strong understanding of scripting languages
Hands on experience implementing and using modern Observability solutions.
Linux systems administration
Good knowledge of Git
Experienced with security and encryption protocols.
Comfortable with facilitating collaboration, open communication and reaching across functional borders.
Benefits
Healthcare Coverage – Competitive medical (PPO/HDHP), dental, and vision plans as well as company contributions to your Health Savings Account (HSA) or pre-tax savings through flexible spending accounts (FSA) for commuting, health & dependent care expenses.
401(k) Retirement Plan – Featuring a 100% Company Safe Harbor Match on your first 6% deferral immediately upon eligibility.
Paid Time Off – Flexible Time Off for Exempt (salaried) employees, as well as generous PTO for Non-Exempt (hourly) employees, plus 11 paid company holidays and a paid volunteer day.
12 weeks of Paid Parental Leave
Maven Family Planning – provides support through your Parenting journey including egg freezing, fertility, adoption, surrogacy, pregnancy, postpartum, early pediatrics, and returning to work.
DevOps Manager leading a distributed team managing L3 support for vision AI solutions. Overseeing operations for Edge/on - prem and cloud platforms at Everseen.
Senior Site Reliability Engineer ensuring reliability of applications across AWS infrastructure at Onit. Collaborating with teams to troubleshoot and optimize system performance.
Chassis Engineer leading Brake system design for Ford Racing. Focused on delivering performance vehicle solutions through innovative design and collaboration with teams.
Site Reliability Engineer at Coinbase optimizing cloud deployments and enhancing system reliability. Working with engineering teams to improve software reliability and performance across the organization.
Senior Site Reliability Engineer designing and implementing high - reliability platforms for Broadridge. Collaborating with teams across hybrid environments and driving automation and efficiency in service delivery.
Staff Engineer for GM's Hybrid Services & Reliability team. Driving reliability architecture and maintenance for hybrid cloud services with a focus on SRE principles.
Senior Engineering Manager for Hybrid Services & Reliability within AV Core Infrastructure at GM. Leading a team for the measurable availability of hybrid cloud systems for autonomous vehicle development.
Reliability Engineer for PGD Wind Reliability team at NextEra Energy. Collaborating on optimizing wind turbine performance, increasing reliability, and reducing costs while managing complex technical issues.
Maintenance Reliability Engineer focusing on operational excellence at JLL. Driving reliability through advanced maintenance strategies and technologies in building systems.
Senior Data Platform DevOps Engineer for Expleo focusing on AWS infrastructure solutions. Responsibilities include designing, implementing, and maintaining data platform solutions with a collaborative team.