Lead ML Engineer developing GenAI infrastructure at Zendesk. Building robust, production-grade ML platforms and collaborating with various teams on AI-driven products.
Responsibilities
Help build benchmarking frameworks for LLMs, including A/B testing and offline evaluation capabilities, to assess quality, latency, and cost trade-offs.
Contribute to the design and implementation of Zendesk’s LLM Proxy to enable safe, observable, and cost-optimized access to multiple foundation models.
Partner with applied ML, product, and platform teams to ensure GenAI infrastructure meets the needs of diverse product use cases.
Implement best practices for monitoring, observability, rate-limiting, and cost attribution for LLM services.
Establish strong engineering practices around observability, reliability, security, and cost monitoring.
Work on orchestration tooling to enable multi-step, tool-using AI agents that integrate with Zendesk’s products.
Requirements
5+ years of experience developing and deploying ML systems in production, with hands-on experience in scaling infrastructure and ensuring service reliability.
Familiarity with core ML infrastructure components such as model registries, feature stores, orchestration tools, and inference serving systems.
Understanding of LLM systems, GenAI applications, or ML/AI platform components such as vector databases, serving layers, and orchestration tools.
Experience with GCP, AWS, or Azure; Kubernetes; Docker; and distributed systems.
Proficiency in at least one server-side language (Python, Java, Scala, Golang, or Ruby) and solid grounding in testing and CI/CD workflows.
Understanding of architecture principles and patterns for building scalable, resilient backend services.
Experience taking projects from design to production deployment, with a focus on maintainability and performance.
Preferred Qualifications
Experience using AI technologies to automate processes and to develop agentic solutions and frameworks.
Experience building tools that improve developer productivity and platform adoption across multiple teams.
Benefits
Full ownership of the projects you work on.
Exciting projects and the freedom to implement your own ideas and improvements.
Opportunity to learn and grow.
Flexible working hours.
Professional development funds.
A comfortable office and a remote setup.
Choice of your laptop and other equipment.
Premium medical insurance and private life assurance.