DevSecOps role focused on building secure, scalable services at a dual-use technology firm. The role has direct impact on production infrastructure and security through innovative solutions.
Responsibilities
Develop and implement a comprehensive observability strategy for self-hosted deployments, including infrastructure and tooling for monitoring, alerting, and troubleshooting.
Engineer the Acra platform for high availability and fault tolerance.
Guarantee 99.9% uptime for the platform's control plane and deployment management.
Architect and implement a highly available deployment setup for applications within the Acra platform.
Create and maintain robust backup and recovery strategies for all Valarian products, ensuring data integrity and minimal downtime in the event of a failure.
Integrate and manage an incident detection and paging solution to ensure rapid response to critical issues and minimize service disruptions.
Scale the Acra platform and applications to support large concurrent user bases (25+ users) and sustained daily usage.
Collaborate closely with the product engineering team to influence the design and implementation of new products and features, ensuring they meet our reliability and scalability standards from the outset.
Requirements
Bachelor’s degree (or foreign equivalent) in Computer Science or a related field is desired; relevant practical experience will also be considered.
Proficiency with programming languages such as Go, Bash, and Python.
Deep experience with Kubernetes security: RBAC, PodSecurityPolicies (or their replacements, such as Pod Security Admission), admission controllers, and network policies.
Proficiency in secure networking practices, including TLS, mutual TLS (mTLS), ingress/egress controls and VPN tunneling configurations.
Proven experience operating and securing service mesh technologies (e.g. Istio, Linkerd, or Consul Connect).
Hands-on experience with HashiCorp Vault in production, including dynamic secrets engines, auth backends, and policy design.
Practical knowledge of HAProxy or equivalent reverse proxies/load balancers, with experience configuring L4/L7 security protections.
Familiarity with CVE triage workflows and integrating vulnerability scanners into CI/CD and registry workflows.
Exposure to runtime security tooling (e.g. Falco, eBPF-based monitoring) and familiarity with basic incident response workflows.
Comfort representing engineering in external calls with auditors, pentesters and security vendors; able to explain infrastructure decisions in security terms.
Familiarity with compliance standards (SOC 2, ISO 27001, etc.) and cloud security postures in AWS, Azure, or GCP is preferred but not essential.
Benefits
Competitive salary and equity grants
Employer pension contributions
Platinum healthcare benefit
Basic Life / AD&D and long-term disability insurance 100% covered by Valarian
Hybrid work arrangements are managed at team level
Generous holiday calendar and PTO
Relocation assistance (depending on role eligibility)