Hybrid Machine Learning Intern – Dynamic KV-Cache Modeling for Efficient LLM Inference

About the role

  • As a Machine Learning Intern at D-Matrix, you will develop a dynamic KV-Cache for efficient LLM inference, working with advanced memory-management techniques.

Responsibilities

  • Research and analyze existing KV-Cache implementations used in LLM inference, particularly those that store past key/value states as lists of PyTorch tensors (see the first sketch after this list).
  • Investigate “Paged Attention” mechanisms that use dedicated CUDA data structures to manage memory efficiently for variable sequence lengths (a toy illustration of the block-table idea follows this list).
  • Design and implement a torch-native dynamic KV-Cache that integrates seamlessly with existing PyTorch models (one possible shape is shown in the third sketch below).
  • Model KV-Cache behavior within the PyTorch compute graph to improve compatibility with torch.compile and to facilitate graph export.
  • Conduct experiments to evaluate memory utilization and inference efficiency on D-Matrix hardware.
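
For context, here is a minimal sketch of the list-of-tensors pattern referenced above (illustrative only; the names and shapes are assumptions, not D-Matrix code). Past keys and values are held as a Python list of per-layer tensor pairs and grown by concatenation at every decode step:

```python
import torch

num_layers, batch, heads, head_dim = 2, 1, 4, 8

# past_key_values[layer] == (keys, values), each of shape
# (batch, heads, seq_len_so_far, head_dim)
past_key_values = [
    (torch.empty(batch, heads, 0, head_dim),
     torch.empty(batch, heads, 0, head_dim))
    for _ in range(num_layers)
]

def append_step(layer: int, k_new: torch.Tensor, v_new: torch.Tensor) -> None:
    """Grow one layer's cache by a single decode step via concatenation.

    Every call reallocates the key/value tensors; this is exactly the
    memory behavior a dynamic, torch-native cache aims to avoid."""
    k, v = past_key_values[layer]
    past_key_values[layer] = (torch.cat([k, k_new], dim=2),
                              torch.cat([v, v_new], dim=2))

for step in range(3):  # simulate three decode steps
    for layer in range(num_layers):
        append_step(layer,
                    torch.randn(batch, heads, 1, head_dim),
                    torch.randn(batch, heads, 1, head_dim))

print(past_key_values[0][0].shape)  # torch.Size([1, 4, 3, 8])
```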
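
Next, a toy illustration of the block-table idea behind Paged Attention (assumptions throughout: fixed block size, a shared physical pool, no eviction; production systems implement this with dedicated CUDA kernels and data structures). The logical KV sequence is split into fixed-size pages, and a per-sequence block table maps logical page indices to physical blocks in the pool:

```python
import torch

BLOCK, HEADS, HEAD_DIM, NUM_BLOCKS = 4, 4, 8, 32

# Shared physical pool: NUM_BLOCKS pages, each holding K and V for BLOCK tokens.
kv_pool = torch.zeros(NUM_BLOCKS, 2, HEADS, BLOCK, HEAD_DIM)
free_blocks = list(range(NUM_BLOCKS))
block_table: list[int] = []  # logical page index -> physical block index

def write_token(pos: int, k: torch.Tensor, v: torch.Tensor) -> None:
    """Store one token's K/V at logical position pos, allocating a fresh
    physical block the first time a logical page is touched."""
    page, offset = divmod(pos, BLOCK)
    if page == len(block_table):      # first token of a new logical page
        block_table.append(free_blocks.pop())
    phys = block_table[page]
    kv_pool[phys, 0, :, offset] = k   # keys
    kv_pool[phys, 1, :, offset] = v   # values

for pos in range(6):  # six tokens span two 4-token pages
    write_token(pos, torch.randn(HEADS, HEAD_DIM), torch.randn(HEADS, HEAD_DIM))

print(len(block_table))  # 2 physical blocks allocated for 6 tokens
```

Because pages are allocated on demand, memory scales with the tokens actually generated rather than with a padded maximum length, which is what makes the scheme attractive for variable sequence lengths.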
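
Finally, one possible shape of a torch-native, graph-friendly cache (an assumption for illustration, not the role's actual design): storage is preallocated to a maximum length and updated in place with index_copy_, so the update is an ordinary tensor op that torch.compile can trace, and shapes stay fixed across decode steps:

```python
import torch

class StaticKVCache(torch.nn.Module):
    """Preallocated K/V storage for one layer, updated in place."""

    def __init__(self, batch: int, heads: int, max_seq_len: int, head_dim: int):
        super().__init__()
        self.register_buffer("k", torch.zeros(batch, heads, max_seq_len, head_dim))
        self.register_buffer("v", torch.zeros(batch, heads, max_seq_len, head_dim))

    def forward(self, pos: torch.Tensor, k_new: torch.Tensor, v_new: torch.Tensor):
        # In-place scatter along the sequence dimension: no reallocation and
        # no shape change, so the compute graph is identical every step.
        self.k.index_copy_(2, pos, k_new)
        self.v.index_copy_(2, pos, v_new)
        return self.k, self.v

cache = StaticKVCache(batch=1, heads=4, max_seq_len=16, head_dim=8)
step = torch.compile(cache)  # the cache update compiles as part of the graph

for t in range(3):
    k, v = step(torch.tensor([t]),
                torch.randn(1, 4, 1, 8),
                torch.randn(1, 4, 1, 8))

print(k.shape)  # torch.Size([1, 4, 16, 8])
```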

Requirements

  • Currently pursuing a degree in Computer Science, Electrical Engineering, Machine Learning, or a related field.
  • Familiarity with PyTorch and deep learning concepts, particularly regarding model optimization and memory management.
  • Understanding of hardware-accelerated computation; hands-on CUDA programming experience is a plus.
  • Strong Python programming skills.
  • Analytical mindset with the ability to approach problems creatively.

Benefits

  • Medical/Dental/Vision/401k
  • Inclusive rewards plan
  • Professional development opportunities

Job title

Machine Learning Intern – Dynamic KV-Cache Modeling for Efficient LLM Inference

Job type

Internship

Experience level

Entry level

Salary

$30 - $59 per hour

Degree requirement

Bachelor's Degree

Tech skills

Python, PyTorch, CUDA

Location requirements

Hybrid
