Site Reliability Engineer ensuring system availability and performance for ADEO’s tech operations. Collaborating with teams on SRE practices and implementing monitoring solutions.
Responsibilities
Drive data quality related to operations on our product repositories
Manage and evolve SLI/SLOs for the entire GTDP
Implement and manage the Error Budget Policy process for the GTDP
Implement and manage CUJs (Critical User Journeys) for the GTDP
Coordinate decommissioning of obsolete products, servers, and APIs
Anticipate and manage technical debt (OS versions, DBMS, etc.)
Coordinate implementation of patches and security updates for systems
Support teams on monitoring, observability, and infrastructure-as-code topics
Ensure platform logs are accessible and analyzable
Implement AI for Ops use cases (e.g., predictive incident analysis)
Requirements
Proven experience in IT operations, DevOps, or SRE (as an SRE, Ops Engineer, or in a similar role), ideally in a technical environment
Bachelor's or Master's degree (Bac+3 to Bac+5) in Computer Science, Information Systems, or equivalent
Strong understanding of SRE concepts: SLI/SLO, Error Budget Policy, CUJ, Toil Management, etc.
Experience with monitoring solutions such as Prometheus, Grafana, or Datadog
Proficient with automation and CI/CD tools (Ansible, Terraform, etc.)
Able to apply (and challenge) architecture, security, and performance standards
Committed to service quality and system reliability
Enjoy working cross-functionally with multiple teams and stakeholders
Comfortable collaborating in an international environment; technical English is not a barrier
Benefits
A stimulating environment that encourages initiative and an entrepreneurial mindset
Role-specific training to develop your skills
Career growth and internal mobility opportunities within an international group
Quarterly team bonuses and the opportunity to become a shareholder
Flexible remote work policy
Support for sustainable commuting: contributions toward purchasing bikes and e-scooters, plus a carpooling allowance