Software Reliability Engineer at Logix Federal Credit Union ensuring reliability of large-scale systems. Leading troubleshooting efforts, monitoring performance, and optimizing infrastructure for operational efficiency.
Responsibilities
Ensure services are reliable, optimized, and secure, and constantly work to improve these aspects
Lead technical management and troubleshooting to handle incidents, diagnose problems, and implement solutions to prevent future occurrences
Integrate, build, and maintain monitoring systems to proactively detect issues and alert appropriate teams
Monitor system capacity and plan for future needs to ensure services can handle expected traffic
Manage changes to the system, ensuring they are implemented safely and efficiently
Automate operational tasks to reduce manual work, improve efficiency, and reduce chances of human errors
Work closely with other IT teams, including product development and infrastructure teams, to ensure service reliability
Recommend and lead improvements to optimize existing systems or integrate new systems to enhance application/system functionality.
Requirements
4 Year / Bachelors Degree
Minimum 6 years of experience
Experience in a structured technical development environment
Experience with Application Performance Monitoring (APM) tools such as: Dynatrace, DataDog, Grafana/Prometheus
Recent and relevant experience with .NET (Framework/Core), WebAPI, HTML/CSS/JavaScript, microservices, secure coding practices
Solid understanding of networking infrastructure, application frameworks, containerization, and secure dataflow (ex. OpenShift, Kubernetes, Docker)
Strong problem-solving and analytical skills
Development with SQL Server databases, views, triggers, and stored procedures
Experience with large-scale distributed systems is highly desired.
Senior Reliability Engineer at Sonova ensuring dependable performance of hearing solutions for millions of users globally. Involves engineering skills to improve product reliability across development stages.
Equipment and Reliability Engineer at Chobani responsible for improving asset efficiency, redesigning equipment. Collaborating with Operations to solve complex problems and lead projects in a team environment.
Reliability Engineer II focused on enhancing safety, efficiencies, and cost controls at Freeport - McMoRan mining operations. Collaborating with multiple teams and managing engineering projects.
Reliability Engineer I responsible for equipment failure analysis and improvement recommendations at Freeport - McMoRan's copper smelting operations. Ensuring uninterrupted production and managing equipment health through data analysis.
Designing, building, and maintaining the Kubernetes - based developer platform for Schwarz IT Barcelona. Collaborating with engineering teams to enhance services in Azure and Google Cloud.
Database Reliability Engineer managing MySQL database infrastructure at PointClickCare. Collaborating with Engineering and SRE teams for product development and reliable integration across the platform.
Teamleitung in der Gebäudereinigung in Grimma, verantwortliche Planung, Organisation und Führung des Reinigungsteams. Aktive Mitarbeit und Einhaltung von Hygiene - und Qualitätsstandards sind erforderlich.
Service Reliability Engineer providing technical support and managing incidents for BT International. Ensuring system availability and collaboration with global stakeholders to achieve objectives.
Studying Bachelor of Arts in Accounting, Taxation, and Economic Law while gaining practical experience in a dynamic team. Benefit from a diverse working day and continuous development opportunities.
Technical Trainer conducting workshops and training sessions on MERKUR Group's product content for diverse audiences. Engaging with employees and clients to ensure smooth product operation and understanding.