Hybrid ML Inference Router Engineer

Posted 2 months ago

Apply now

About the role

  • ML Inference Router Engineer designing scalable inference systems at eBay. Aiming to support billions of daily requests with a focus on reliability and efficiency.

Responsibilities

  • Design and build an LLM inference gateway that scales to billions of daily requests with millisecond-level latency.
  • Develop intelligent request routing, load balancing, and fallback mechanisms across heterogeneous LLM backends (internal and external).
  • Optimize throughput, cost, and reliability of inference workloads in multi-tenant environments.
  • Collaborate with platform, research, and product teams to integrate new models and agentic capabilities into the gateway.
  • Implement observability, tracing, and autoscaling for inference traffic across Kubernetes-based clusters.
  • Conduct design and code reviews to ensure high standards in distributed systems architecture.
  • Stay current with advances in LLM serving, inference acceleration, and model APIs to continuously evolve the platform.

Requirements

  • 10+ years of experience building large-scale, fault-tolerant, high-performance distributed systems.
  • Strong programming skills in one or more of Java, Go, Rust, or C++ (Java preferred for gateway services).
  • Deep understanding of networking, concurrency, memory management, and performance tuning in production systems.
  • Proven experience designing and operating low-latency APIs at very large scale (10M+ QPS).
  • Hands-on experience with Kubernetes, service meshes, and container orchestration at scale.
  • Strong background in cloud infrastructure (AWS, GCP, Azure) and distributed system design.

Benefits

  • full range of medical benefits
  • financial benefits
  • various paid time off benefits, such as PTO and parental leave

Job title

ML Inference Router Engineer

Job type

Experience level

SeniorLead

Salary

$132,000 - $222,100 per year

Degree requirement

Bachelor's Degree

Location requirements

Report this job

See something inaccurate? Let us know and we'll update the listing.

Report job