AI Systems Engineer – Inference Frameworks (Hybrid)

About the role

  • As an AI Systems Engineer, you'll design and build the inference and optimization systems behind our core product, working directly with the founders and thriving in a zero-to-one environment.

Responsibilities

  • Work directly with our founders to design and build the inference and optimization systems that power our core product.
  • Bridge research and production, combining deep exploration of inference techniques with hands-on ownership of scalable, high-performance serving infrastructure.
  • Own the full lifecycle of LLM inference, from experimentation and performance analysis to deployment and iteration in production, helping define the technical foundations of our inference stack.
  • Design and build our LLM inference stack from zero to one, exploring and implementing advanced techniques for low-latency, high-throughput serving of language and multimodal models.
  • Develop and optimize inference using modern frameworks (e.g., vLLM, SGLang, TensorRT-LLM), experimenting with batching strategies, KV-cache management, parallelism, and GPU utilization to push performance and cost efficiency.
  • Collaborate closely with founders and model developers to analyze bottlenecks across the stack, co-optimizing model execution, infrastructure, and deployment pipelines.

Requirements

  • Strong experience building and optimizing LLM inference systems in production or research environments
  • Hands-on expertise with inference frameworks such as vLLM, SGLang, TensorRT-LLM, or similar
  • Deep performance mindset with experience in GPU-backed systems, latency/throughput optimization, and resource efficiency
  • Solid understanding of transformer inference, serving architectures, and KV-cache–based execution
  • Strong programming skills in Python; experience with CUDA, Triton, or C++ a plus
  • Comfort working in ambiguous, zero-to-one environments and driving research ideas into production systems
  • Nice to have: experience with model quantization or pruning, speculative decoding, multimodal inference, open-source contributions, or prior work in systems or ML research labs

Benefits

  • Flexible work: In-person collaboration in the Bay Area, a distributed global-first team, and quarterly offsites.
  • Adaption Passport: Annual travel stipend to explore a country you've never visited. We're building intelligence that evolves alongside you, so we encourage you to keep expanding your horizons.
  • Lunch Stipend: Weekly meal allowance for take-out or grocery delivery.
  • Well-Being: Comprehensive medical benefits and generous paid time off.

Experience level

Mid-level, Senior

Salary

Not specified

Degree requirement

No Education Requirement
