Site Reliability Engineer supporting Red Hat's hybrid cloud infrastructure by ensuring service reliability and automation. Collaborating with teams to improve performance and respond to service incidents.
Responsibilities
Support Red Hat’s software manufacturing services on our hybrid cloud infrastructure
Create/Maintain service monitoring
Improve automation and uphold security best practices
Respond to service situations
Participate in communities of practice
Define Service Level Indicators (SLIs) and Service Level Objectives (SLOs)
Execute remediation plans if SLOs are not met
Respond during service outages and identify improvements
Requirements
Experience with OpenShift administration
Linux administration expertise
General knowledge of AWS technologies
Experience with CI/CD platforms like Tekton and Pipelines as a code, optionally GitHub Actions or Jenkins
Experience with automation services like Ansible or Terraform
Knowledge of open source monitoring technologies (Grafana, Prometheus, OpenTelemetry)
Excellent written and verbal communication skills in English
Senior DevOps Engineer managing DevOps processes and tooling for customer - facing platforms at Luminor. Building CI/CD pipelines and providing production support with a focus on mentoring and collaboration.
Building and maintaining DevOps processes and CI/CD pipelines for Luminor's banking champion. Collaborating in a flexible work environment with international teams.
Senior DevOps Engineer at Luminor, a leading bank in the Baltics, managing customer - facing platforms and infrastructure. Building CI/CD pipelines and mentoring junior engineers.
Sr. Site Reliability Engineer designing and automating robust technical infrastructure at Broadridge. Collaborating across teams for successful deployment and operational support of services.
Senior Fleet Reliability Engineer maintaining high fleet uptime for autonomous vehicle technology. Collaborating with technical teams to ensure peak operational performance in data collection efforts.
DevOps Lead at Leidos managing platform engineering, SRE, and application security functions. Driving operational excellence and ensuring scalability for federal government applications.
SRE Lead developing scalable cloud - native solutions for mission - critical systems supporting USAF. Managing teams, collaborating with cross - functional units, and ensuring high service reliability standards.
Junior DevOps / Platform Engineer at DieEnergiekoppler GmbH managing AWS/EKS platform operations. Collaborating with team members to improve platform functionalities and security compliance.
DevOps Engineer responsible for AWS infrastructures and backend development at Allguth GmbH. Engaging in greenfield projects with modern solutions in a collaborative team.
Cloud DevOps Specialist responsible for building scalable infrastructure solutions in AWS at SONDA. Focusing on automation, containerization, and data management in a collaborative environment.