Principal Reliability Engineer leading Reliability Engineering team for an insurance company. Responsible for automation capabilities, incident management, and service reliability enhancements.
Responsibilities
Responsible for managing and directing all critical incidents, inclusive of defining root cause, developing and implementing remediation plans
Responsible for building reliability engineering, automation, and quality capabilities across 20+ applications and systems
Accountable for Operations, RE, DevSecOps, Quality, and Middleware technologies.
Build tools and capabilities needed by our software engineering teams to optimize development by providing a level of advancement in technology and achievement of efficiency for application teams.
Building an engineering culture with automation across our technology stack and application footprints across traditional and modern architectures resulting in overall IT Productivity improvements.
Support enterprise needs with improvements in Performance, Scalability, Resiliency, Reliability, Stability, Observability, Security, etc.. continuously evolving and modernizing available services to improve productivity, automation, quality, and optimize operational cost.
Market research on emerging trends for technology enablers in the field.
Information protection and secure development practices.
Lead transformational change management by championing the adoption of automation capabilities built and foster a culture of continuous learning and improvement mindset for the organization.
Requirements
8 + years IT professional experience in financial services or insurance in a large corporation.
8+ years of having assumed leadership, engineering, application management and operations roles with a demonstrated track record of technical innovation and experience influencing technically diverse teams.
Strong track record of production support, incident management and problem solving.
Strong cloud engineering mindset with cloud experience across public cloud providers and the technologies most frequently used in engineering and managing highly reliable and automated technology environments.
Demonstrated ability to own, transform, mature, and deliver reliability engineering tools and capabilities.
Strong knowledge and experience with cloud product management, cloud engineering, and Agile principles.
Experience with Performance and Observability tools such as Dynatrace, Splunk, CloudWatch, Cloud Trail, and related tools.
Experience with continuous integration and DevOps methodologies, preferred tools such as GitHub, Rally, SonarQube etc..
Strong solution engineering orientation to enable expedient troubleshooting, issue-resolution and root-cause removal.
Proven execution/delivery running and maintaining cloud-based and on prem automation tools and services across various service delivery models.
Quality Engineer leadership experience along with Test Data Management, Test Automation including unit, functional, regression, and integration testing, and Defect Management.
Demonstrated ability to act as a strategic thought leader and be seen as a credible business partner by peers.
Highly collaborative and team oriented
Exceptional critical thinking and problem-solving skills.
Able to influence diverse teams and build strong business relationships.
Bachelor of Science in Computer Science or equivalent preferred.
Candidate must be authorized to work in the US without company sponsorship.
Benefits
Other rewards may include short-term or annual bonuses
long-term incentives
on-the-spot recognition
Job title
Principal Reliability Engineer – Information Security
Senior Reliability Engineer at Sonova ensuring dependable performance of hearing solutions for millions of users globally. Involves engineering skills to improve product reliability across development stages.
Equipment and Reliability Engineer at Chobani responsible for improving asset efficiency, redesigning equipment. Collaborating with Operations to solve complex problems and lead projects in a team environment.
Reliability Engineer II focused on enhancing safety, efficiencies, and cost controls at Freeport - McMoRan mining operations. Collaborating with multiple teams and managing engineering projects.
Reliability Engineer I responsible for equipment failure analysis and improvement recommendations at Freeport - McMoRan's copper smelting operations. Ensuring uninterrupted production and managing equipment health through data analysis.
Designing, building, and maintaining the Kubernetes - based developer platform for Schwarz IT Barcelona. Collaborating with engineering teams to enhance services in Azure and Google Cloud.
Database Reliability Engineer managing MySQL database infrastructure at PointClickCare. Collaborating with Engineering and SRE teams for product development and reliable integration across the platform.
Teamleitung in der Gebäudereinigung in Grimma, verantwortliche Planung, Organisation und Führung des Reinigungsteams. Aktive Mitarbeit und Einhaltung von Hygiene - und Qualitätsstandards sind erforderlich.
Service Reliability Engineer providing technical support and managing incidents for BT International. Ensuring system availability and collaboration with global stakeholders to achieve objectives.
Studying Bachelor of Arts in Accounting, Taxation, and Economic Law while gaining practical experience in a dynamic team. Benefit from a diverse working day and continuous development opportunities.
Technical Trainer conducting workshops and training sessions on MERKUR Group's product content for diverse audiences. Engaging with employees and clients to ensure smooth product operation and understanding.