Senior MLOps Engineer building and operating the platform for identity-verification products at Entrust. Focused on bridging ML research and production environments with an emphasis on developer experience.
Responsibilities
Run and evolve our ML compute layer on Kubernetes/EKS (CPU/GPU) for multi-tenant workloads, and make workloads portable across regions (region-aware scheduling, cross-region data access, and artifact portability).
Operate Argo Workflows and Dask Gateway as reliable, self-serve services used by engineers and researchers to orchestrate data prep, training, evaluation, and large-scale batch compute (installation, upgrades, security, quotas, autoscaling).
Build GitOps-native delivery for ML jobs and platform components (GitLab CI, Helm, FluxCD) with fast rollouts and safe rollbacks.
Design and maintain our data platform built on LakeFS to enable experiment reproducibility, data lineage tracking, and automated governance processes.
Own developer experience and enablement by creating clear APIs/CLIs and minimal UIs, and by maintaining comprehensive templates and documentation.
Requirements
You will have some MLOps experience, as well as:
You value developer experience and enjoy talking to users (engineers/scientists), removing friction, and treating the platform like a product.
Production experience with AWS and Kubernetes (EKS), including GPU workloads.
Proficiency in Python (e.g., FastAPI/Django) and solid CS fundamentals (performance, concurrency, data structures).
Experience building/operating data pipelines (idempotency, retries, backfills, reproducibility).
Working knowledge of Terraform, Helm, Docker, Git, and GitLab CI/CD.
Observability experience with Prometheus/Grafana and logs (e.g., Loki/Promtail or Splunk/Sentry) with sensible alerting.
Good grasp of networking and security concepts and Linux systems administration.
Benefits
25 days annual leave + RTT + 1 day off for your birthday
Two paid volunteering days per year*
Meal vouchers provided by Swile: 50% is covered and 50% is deducted from your payroll.
Health Insurance (Mutuelle) provided by ALAN
Disability & Life insurance (Prévoyance) provided by ALAN (3x Base Salary)
Commuter reimbursement up to €40 per month
Life enrichment allowance of up to €95 per month to use for services including gym, yoga, fitness classes, massages, childcare, and therapy
Dedicated learning opportunities, including tools like LinkedIn Learning and access to learning resources such as books, coaches, conferences, courses, podcasts, and more
Our open and transparent culture is reflected in our “Better Together” motto
Expense up to £300 (or local equivalent) to purchase workstation setup equipment
The opportunity to become a member of Entrust’s resource groups in order to learn different skills in our belonging groups