AI Engineer – LLM Ops, Evaluation at Auxilius.ai | Hybrid Hired

About the role

AI Engineer for LLMOps & Evaluation at Auxilius.ai building AI solutions for Governance, Risk and Compliance. Own LLMOps pipeline and drive prompt engineering in a hybrid environment.

Responsibilities

Own the LLMOps pipeline: Evaluate infrastructure, prompt optimization loop, and the production integration that turns experiments into reliable customer-facing features
Design evaluation strategy per output type: Decide when to use deterministic evals (exact match, schema validation, embeddings) vs. LLM-as-judge, and build the rubrics, test datasets, and human-review loops that make the system trustworthy
Drive prompt engineering and optimization across all LLM operations in the product: Moving from hand-tuned prompts to a measurable, iterative process
Pick the right tool for each problem: Some things are LLM problems, some are embedding + classical NLP problems, some are deterministic logic
Run the production side of AI features: Observability (Langfuse /LangSmith / similar), cost and latency engineering, incident response when an LLM feature degrades
Build human-in-the-loop workflows: Review queues, feedback ingestion, labeling; so production signal feeds back into evals and prompt iteration
Mentor our AI & Analytics Intern and contribute to how we build the AI team over time

Requirements

3+ years of hands-on experience building and shipping ML/AI systems in production (we care more about what you've shipped than years on a CV)
Have shipped an LLM evaluation or prompt optimization pipeline, not just used LLMs in a project, but owned the loop
Strong hands-on experience with LLM-as-judge, including its variance problems and concrete techniques for controlling them
Solid foundation in classical NLP and ML ops: Embeddings, semantic similarity, entity matching, classification, fuzzy matching
Informed opinions on deterministic vs. LLM-based evals, from experience
Production judgment: You've owned cost and latency tradeoffs, observability, and incident response for an LLM-powered feature. You're familiar with prompt regression and have strategies for managing it
Strong Python
Excellent English communication, written and verbal: We discuss nuanced technical tradeoffs daily with the founding team and customers
Comfort with ambiguity: You can run experiments on real data, build intuition for this domain, and know when to stop iterating

Benefits

Hands-on ownership of a real AI product used by enterprise customers
Work directly alongside the founding team from day one
Hybrid work model: Munich North, minimum one day per week in the office, otherwise flexible (open to strong candidates elsewhere in the EU for the right fit); onboarding will take in-office
A steep learning curve at the intersection of LLM engineering, enterprise GRC, and startup operations
The chance to shape the AI team as we grow

Similar roles

Browse all Ai Engineer jobs

1 hour ago

NI

AI Engineer

NetBrain Technologies Inc.

Senior AI Engineer developing production - grade AI and automation systems for NetBrain's network automation platform. Responsible for architecture, evaluation, scalability, and reliability in production.

Hybrid Role

Burlington United States Ai Engineer

$150,000 - $180,000 per year

1 hour ago

NI

AI Engineer

NetBrain Technologies Inc.

Senior AI Engineer designing and building agent and RAG systems for NetBrain's network automation platform. Combining engineering with system - level thinking to enhance reliability and scalability.

Hybrid Role

Toronto Canada Ai Engineer

CA$130,000 - CA$165,000 per year

5 hours ago

IS

AI Engineer – Agentic AI, Business Engagement

IDBC Creative Solutions

AI Engineer developing production - ready AI solutions for business engagements. Collaborating with stakeholders to define high - value use cases on the Agentic AI platform.

Onsite Role

Budapest Hungary Ai Engineer

6 hours ago

XE

Senior AI Engineer

Xelix

Senior AI Engineer designing and delivering AI systems for Xelix, transforming financial controls through automation. Leading AI solutions combining machine learning and engineering practices in a fast - paced environment.

Hybrid Role

London United Kingdom Ai Engineer

£80,000 - £90,000 per year

12 hours ago

ZS

AI Engineer

Zenith Insurance Company (United States)

AI Engineer working on LLM - powered applications and AI systems at Zenith Insurance Company. Collaborating with teams to innovate and improve AI solutions in insurance and financial services.

Hybrid Role

Woodland Hills United States Ai Engineer

$99,454 - $124,317 per year

15 hours ago

HY

Senior AI Engineer

HyperFi

Senior AI Engineer at HyperFi managing AI program end - to - end. Leading GPU architecture decisions and integrating AI into user - facing flows.

Hybrid Role

San Francisco United States Ai Engineer

yesterday

TL

Associate AI Engineer

Thinkahead Consultant Psychologist Pty Ltd

Associate AI Engineer in LAUNCH program developing AI solutions. Collaborating with teams and working post - training on client projects.

Hybrid Role

Chicago United States Ai Engineer

$88,000 per year

yesterday

EX

AI Engineer

EXL

AI Engineer responsible for developing AI - first products and designing agentic workflows. Collaborate across the tech stack to move ideas from concept to production.

Hybrid Role

India Ai Engineer

yesterday

FI

AI Engineer I

Finaira

Entry - level AI Engineer responsible for developing, testing, and deploying ML and LLM - based solutions. Supporting foundational engineering skills across AI spectrum in a hybrid work environment.

Hybrid Role

Cairo Egypt Ai Engineer

yesterday

PC

Senior Consultant, Artificial Intelligence Engineer

Pioneer Management Consulting

Senior Consultant delivering enterprise - grade AI solutions for business challenges. Collaborating with clients and teams at Pioneer Management Consulting to drive measurable outcomes.

Hybrid Role

Minneapolis United States Ai Engineer

$110,000 - $165,000 per year