Join the DNS and Observability team at IONOS, leading the design and development of high-available services on Linux. Collaborate with groups to automate operational tasks and improve infrastructure performance.
Responsibilities
Collaborate with our team to design, develop, and operate high-available services on Linux, ensuring seamless integration with our operational requirements and product requirements;
Take ownership of the full stack, from hardware to application, including configuration management and monitoring, and drive continuous improvement and development;
Contribute to our open-source projects, such as DIM (github.com/ionos-cloud/dim) and monzero (github.com/ionos-cloud/monzero), and help shape the future of our infrastructure;
Support our system administrators in automating operational tasks and rollouts, and provide interfaces for integration with our hosting products;
Participate in an on-call service rotation, ensuring the smooth operation of our complex infrastructure.
Requirements
A completed computer science degree or equivalent qualification;
5+ years of practical experience in administering 100+ Linux systems (Debian/CentOS) in 3+ data centers;
Experience with monitoring, configuration, VCS and visualization tools (Icinga2, Puppet, Ansible, git, grafana);
Understanding of modern software architectures and their deployment (REST, Microservices, Docker, CI/CD);
Knowledge of network and infrastructure services (DNS, BGP, VLANs, Firewalling, IPv6)
Excellent communication skills in English (German is a plus).
Benefits
Access to local/international trainings, development and growth opportunities, including access to e-learning platforms, covering both technical and soft skills areas;
Modern technologies, product responsibility;
Flexible work schedule;
Hybrid work option;
Medical services package from one of two private providers;
25 vacation days per year;
Substitute days off for public holidays that occur on the weekend;
Meal tickets;
Internal referral program;
Employee Anniversary Program;
Internal anniversary rewards;
Team events, networking events organized to promote a passionate, creative and diverse culture;
Summerfest and Winterfest parties;
Of course, coffee, soft drinks and fresh fruits are on us in the office.
Cloud Site Reliability Engineer managing Solace Cloud services across leading cloud providers. Ensuring reliability, handling incidents, and collaborating with customers for operational excellence.
Senior Cloud Site Reliability Engineer ensuring reliability and health of Solace Cloud Services with hands - on cloud operations expertise. Lead incident management and customer support for high - impact environments.
DevOps Engineer designing and operating AWS infrastructure within industrial IoT environments. Working on systems that ensure security, resilience, and end - to - end observability.
Sr. Site Reliability Engineer (SRE) III providing technical solutions for the federal government. Collaborating in a high - performing team focused on reliability and application scalability.
Senior Linux System Engineer developing and maintaining Linux server infrastructure for Th. Geyer GmbH. Collaborating on ERP systems and CI/CD processes while ensuring system performance and security.
Platform Engineer leading the development of cloud application platforms for Allstate. Responsible for cloud infrastructure for ML experimentation and production deployments.
Cloud Platform Engineer (ML DevOps) developing and managing CI/CD pipelines for ML workflows in a leading insurance company. Collaborating with data scientists and ensuring infrastructure security and compliance.
DevOps Engineer developing and managing container platforms for client solutions at Booz Allen Hamilton. Utilizing cloud technologies to enhance capabilities and secure deployments.
Senior DevOps/Platform Engineer automating cloud infrastructure and optimizing delivery pipelines at S&P Global Mobility. Collaborating with teams to enhance product reliability and security.