DevOps Engineer managing complex incidents and automations in L3 support for Everseen. Driving best practices and collaborating across teams on cutting-edge AI solutions.
Responsibilities
You will be part of the L3 support team for Operations across Edge/on‑prem and cloud, owning complex incidents end‑to‑end: triage, deep‑dive debugging, root‑cause analysis, remediation, and follow‑ups.
To reduce Ops toil, you will build targeted automations (Python, Bash, Ansible) and automate new and existing SOPs used by Operations.
You will execute safe deployments and upgrades via GitOps and IaC pipelines (Flux, Ansible, Terraform) on AKS and GKE, coordinating validation and rollback plans, and you will help maintain existing GitLab CI/CD pipelines together with the DevOps engineering teams.
You will design and continuously refine Alertmanager rules and standardize actionable Grafana dashboards with Operations, ensuring effective use of Prometheus metrics and logs (Grafana Alloy, Thanos).
Beyond day‑to‑day operations, you will apply deep DevOps, CI/CD, and infrastructure automation expertise, drive best practices, share knowledge through workshops and mentoring, write and maintain documentation and Standard Operating Procedures (SOPs), test infrastructure, and collaborate across teams to optimize systems and workflows.
Requirements
4+ years in DevOps-related roles with a strong focus on automation.
Proficient in DNS, routing, container communication, firewalls, reverse proxying, load balancing, and edge-to-cloud communication, including troubleshooting in each of these areas.
Strong system administration skills for troubleshooting OS-level outages and for deploying and troubleshooting Everseen’s containerized Edge application in customer networks.
Extensive experience with Azure (or GCP), including fully automated infrastructure and deployment.
Experience with monitoring and optimizing cloud costs.
Proven experience in implementing and managing CI/CD pipelines (GitLab CI/CD preferred) and excellent knowledge of Git and associated workflows (e.g., Gitflow).
Proven experience with monitoring, logging, and alerting tools and stacks.
Excellent scripting skills in Bash and Python.
Advanced knowledge of Kubernetes and OpenShift, including cluster management, orchestration and auto-scaling, and deployments using Helm charts and GitOps.
Proven experience with microservices architecture and related deployment strategies.
Expertise with Terraform modules.
Deep experience with Ansible, including writing complex playbooks, roles, and using Ansible Vault for secrets management.
Strong understanding of DevSecOps principles and experience implementing security best practices within CI/CD pipelines.
Excellent presentation, oral, and written communication skills. Fluent business English is a requirement.
A passionate advocate for determining and delivering solutions with a high level of customer satisfaction.
Demonstrated interest in learning and a strong desire to expand knowledge in your field.
Capable of engaging in technical discussions with stakeholders, leading DevOps projects, and mentoring and coaching team members.
Benefits
Everseen is committed to creating a safe environment for all employees and has a zero-tolerance policy for bias and discrimination of any kind.
Our work environment is one without offensive, hostile, or intimidating conduct, whether verbal, written, or physical in nature.
Everseen will not tolerate prejudice or discrimination of any kind, including, without limitation, where based on aspects such as race, colour, sex, gender, religion, age, family status, disability of any kind, or sexual orientation.