Senior SRE managing reliability of 300+ servers powering client Odoo ERP systems. Lead incident response and guide a team in building reliable systems.
Responsibilities
Define and track SLOs/SLIs and guard error budgets
Build a complete observability stack (metrics, logs, tracing, alerting)
Lead incident response, run blameless postmortems and raise the bar on operational excellence
Set standards for deployments, rollbacks and change management
Guide and mentor a team of three engineers
Improve CI/CD pipelines, Docker environments, Harbor registry and automation processes
Drive infrastructure automation: provisioning, backups, DR, security hardening, self-healing systems
Support our BI infrastructure and contribute to our self-service client platform
Requirements
4+ years in SRE/DevOps/infrastructure engineering
Strong Linux fundamentals and container expertise (Docker)
Deep experience with observability stacks (Prometheus, Grafana, etc.)
CI/CD experience (Jenkins, GitHub Actions, or similar)
Solid scripting skills (Python, Bash)
Experience with on-call rotations, incident management and root cause analysis
Ability to work across bare-metal and cloud environments (Hetzner, AWS, or similar)
A mindset that prioritizes reliability and sustainable operations
Ready to grow into people leadership
Fluent in English
Benefits
A real leadership path: Step into your first technical leadership role with direct mentorship
Real production scale: Own the reliability of 300+ servers running client Odoo ERP instances
Strategic impact: Assume a critical role in our self-service platform
Impact from day one: you shape monitoring, alerting, incident response, and rollout standards
Flexible working: We value results over hours — structure your work around your life
Competitive salary: Total expected compensation of €50.000 to €65.000 / year, with potential for growth as you take on leadership responsibilities
Plenty of other benefits: variable compensation scheme, learning budget, excellent private health insurance, state-of-the-art equipment - we go beyond fruit baskets & free drinks
Join CI&T as a DevOps Master in technology transformation involving a corporate developer platform. Collaborate closely with teams to enhance scalability and operational efficiency.
DevOps Engineer responsible for developing and operating CI/CD pipelines in hybrid environments. Join K - tronik to work on innovative software and hardware projects within a dedicated team.
Manage Mechanical Integrity Program for North America plants at Arkema, improving reliability and safety of operations with extensive engineering expertise and collaboration.
Senior Engineer responsible for building and scaling infrastructure at SaaS company Personio. Focused on improving reliability and performance of services for HR tech industry.
DevSecOps / Platform Engineer at Obviant designing secure cloud - native infrastructure. Collaborating with teams to build high - reliability shipping across various platforms.
Senior Business Analyst managing technical initiatives and acting as liaison for stakeholders in a tech organization. Supporting Agile frameworks and handling multiple priorities in a dynamic setting.
Junior DevOps Engineer supporting IT security improvements using an in - house developed platform. Working within a cross - functional team to enhance IT and OT environments focused on cybersecurity.
DevOps Engineer developing and managing cloud and on - prem infrastructure for AI - powered cyber - risk platform. Automating deployments and collaborating with data engineers to enhance cyber - security.
DevOps Engineer working with Deloitte's investment banking clients on data protection automation. Involves hands - on engineering, integration with third - party APIs, and Agile collaboration.
Senior DevOps Specialist ensuring reliability, scalability, and efficiency of SaaS platforms at Experlogix. Collaborating with development and operations teams to optimize infrastructure performance and deployment processes.