Platform Engineer responsible for developing and managing Kubernetes environments for AI solutions in healthcare. Collaborating with teams to enhance core infrastructure and streamline deployments.
Responsibilities
Design, deployment, and management of scalable and secure Kubernetes clusters on OVHcloud.
Ownership and advancement of our CI/CD pipelines for automated, reliable application and infrastructure deployments.
Implementation and management of our GitOps workflows using tools like ArgoCD or Flux.
Management and scaling of GPU workloads in Kubernetes, ensuring optimal performance and resource utilization for our ML teams.
Development and maintenance of our observability stack (VictoriaMetrics, VictoriaLogs, Grafana, Tracing) to ensure deep visibility into system health.
Management of our cloud infrastructure on OVHcloud, focusing on automation (Infrastructure as Code), cost optimization, and security.
Lifecycle management of core platform services, including message brokers (RabbitMQ), databases (PostgreSQL, Redis), and authentication systems (Okta, OIDC, OAuth2).
Acting as a key responder for infrastructure incidents; debugging and troubleshooting complex production issues across distributed systems.
Supporting and empowering development teams by providing robust self-service tools, clear documentation, and collaborative support.
Requirements
3-5+ years of professional experience in a Platform Engineering, DevOps, or SRE role
Deep, hands-on experience with Kubernetes in a production environment (cluster management, networking, security, scheduling)
Proven experience managing infrastructure on a cloud provider (OVHcloud is a strong plus; AWS, GCP, or Azure experience is also valued)
Strong practical knowledge of CI/CD systems (e.g. GitHub Actions) and GitOps principles (ArgoCD, Flux)
Proficiency with Infrastructure as Code (IaC) tools like Terraform or Pulumi
Solid understanding of observability principles and tools (e.g. VictoriaMetrics, VictoriaLogs, OpenTelemetry/Tracing, Grafana)
Experience managing stateful services in production (e.g. PostgreSQL, Redis, RabbitMQ)
Solid scripting skills in Python
Benefits
Full ownership of a mission-critical platform
A team that values curiosity, learning, and experimentation
Remote-first setup with the option to work in our Berlin office
Intern assisting in modernization initiatives for agentic AI workflows and data platforms. Supporting the development and maintenance of data pipelines and prototyping AI use cases.
Senior Research and Development Engineer for transformer mechanical design at Hitachi Energy. Leading software development for innovative projects and collaborating within a global team.
Platform Engineer leading lifecycle management of MOM and AMHS systems across Kubernetes clusters in semiconductor industry. Collaborating with internal teams to ensure operational reliability in manufacturing.
Own product platform and release - quality systems for AI SaaS startup. Implement analytics, build dashboards, and ensure safe releases while maintaining high quality standards.
Principal Cloud Security Design Engineer defining and engineering cloud security architecture. Leading technical initiatives in Azure and AWS environments for financial services company.
Mid - level Platform Engineer for FAA modernization project at OCH Technologies. Responsible for designing, implementing, and managing secure automated platform environments supporting aviation systems.
Hands - on engineer designing, building, and maintaining core backend systems at MyFunded Futures. Leading technical architecture and mentoring the engineering team in a fintech environment.
Software Engineer developing advanced trading applications for professional derivatives traders at TT. Collaborate with the team to enhance the award - winning trading platform.
Senior Platform Engineer helping design, scale, and harden Pivotal’s AI - driven platform. Collaborating closely with engineering teams to improve reliability, security, and scalability.
Senior technical authority at Smarsh managing large - scale distributed data platforms. Leading architectural design, influencing engineering standards, and mentoring engineers across the organization.