DevOps/Platform Engineer implementing on-prem infrastructure and deployment at C4 Energy GmbH. Building PoC and ensuring operational processes are followed in the project.
Responsibilities
Implementation engagement to build the first on‑prem installation based on the refined architecture and stepwise plan defined by the Solution Architect.
Build a running single‑node PoC on an on‑prem server using containerised workloads, deployed and managed via infrastructure as code (e.g. using tools such as Docker and Ansible), with the option to introduce lightweight orchestration tooling (e.g. Nomad or a similar scheduler) as appropriate.
Establish a reproducible, automated infrastructure and deployment model (IaC) for the backend application(s) and data stores, aligned with the architecture and on‑prem constraints described in the use‑case documentation.
Implement secure secrets handling and TLS setups for external and internal traffic using tools such as Ansible Vault, in line with the platform’s security requirements.
Integrate and configure the observability stack (e.g. Prometheus, Grafana, Alertmanager and a logging stack) so that the backend application, platform components and databases can be monitored and operated reliably.
Support the evolution from the initial single‑node PoC towards a small multi‑node setup for the backend application and databases, keeping the system stable, maintainable and ready for future extension.
Take overall responsibility for the implementation of the on‑prem infrastructure and deployment model (IaC) for the backend application, ensuring that all provisioning and deployments are automated and repeatable.
Design and evolve the Ansible‑based infrastructure as code (roles, inventories, group variables) for OS base, container runtime, orchestration tooling, backend application(s) and databases across environments (DEV, TEST and PROD).
Define and implement the setup for running the backend application and additional infrastructure components in a small on‑prem cluster, including job definitions, basic health checks and placement concepts supporting 1 Hz device read/write requirements.
Design and operate secure configuration and secrets management using tools such as Ansible Vault, TLS and PKI concepts, ensuring credentials and keys for services and databases are handled safely.
Set up and operate core data stores (e.g. InfluxDB v3, Postgres) for the lab/PoC and follow‑on environments, including backup/restore procedures and disaster recovery readiness.
Integrate and operate the observability stack (e.g. Prometheus, Grafana, Alertmanager), ensuring visibility into the backend application, infrastructure components and 1 Hz read/write behavior.
Collaborate with the Full‑Stack Developer and Solution Architect to align health endpoints, logging formats, metrics and deployment patterns.
Contribute to the definition and implementation of CI/CD pipelines and operational processes with a clear governance model based on version-controlled automation.
Produce and maintain infrastructure-level documentation (deployment guides, operations runbooks, backup/restore procedures, incident handling guidelines).
Requirements
At least 3 years of experience in DevOps, platform engineering or infrastructure automation roles.
Strong hands‑on experience as a DevOps/Platform Engineer in on‑prem environments, ideally with data platforms or industrial/OT integrations.
Solid skills in Ansible (or similar) for infrastructure as code: roles, inventories, group variables and automated provisioning.
Practical experience with container orchestration tooling such as Nomad (or similar) for on‑prem deployments, including job specifications and rolling updates.
Experience building and running containerised services using Docker, including multi‑stage builds and non‑root containers.
Experience integrating observability stacks: Prometheus, Grafana, Alertmanager and centralized logging.
Familiarity with time‑series and relational databases: e.g. InfluxDB and Postgres, including configuration and backup/restore.
Experience implementing security and secrets management: Ansible Vault, TLS configuration and PKI concepts; familiarity with Keycloak or LDAP is a plus.
Comfortable working in isolated on‑prem environments with limited or no direct internet connectivity.
Ability to write clear technical documentation and runbooks.
DevOps Engineer working on Linux - based infrastructure focusing on automation with tools like Ansible and Terraform. Engaging in international projects and ensuring optimal system operations.
DevOps Manager leading a distributed team managing L3 support for vision AI solutions. Overseeing operations for Edge/on - prem and cloud platforms at Everseen.
Senior Site Reliability Engineer ensuring reliability of applications across AWS infrastructure at Onit. Collaborating with teams to troubleshoot and optimize system performance.
Chassis Engineer leading Brake system design for Ford Racing. Focused on delivering performance vehicle solutions through innovative design and collaboration with teams.
Site Reliability Engineer at Coinbase optimizing cloud deployments and enhancing system reliability. Working with engineering teams to improve software reliability and performance across the organization.
Senior Site Reliability Engineer designing and implementing high - reliability platforms for Broadridge. Collaborating with teams across hybrid environments and driving automation and efficiency in service delivery.
Staff Engineer for GM's Hybrid Services & Reliability team. Driving reliability architecture and maintenance for hybrid cloud services with a focus on SRE principles.
Senior Engineering Manager for Hybrid Services & Reliability within AV Core Infrastructure at GM. Leading a team for the measurable availability of hybrid cloud systems for autonomous vehicle development.
Reliability Engineer for PGD Wind Reliability team at NextEra Energy. Collaborating on optimizing wind turbine performance, increasing reliability, and reducing costs while managing complex technical issues.
Maintenance Reliability Engineer focusing on operational excellence at JLL. Driving reliability through advanced maintenance strategies and technologies in building systems.