Hybrid AI Inference Engineer

Posted last month

About the role

  • As an AI Inference Engineer, you will develop AI model optimizations for Quadric's GPNPU platforms, porting and benchmarking AI models to improve performance on edge devices.

Responsibilities

  • Quantize, prune, and convert models for deployment
  • Port models to the Quadric platform using the Quadric toolchain
  • Optimize inference deployments for latency and throughput
  • Benchmark and profile model performance and accuracy
  • Develop tooling to scale and accelerate deployment
  • Improve the SDK and runtime
  • Provide technical support and documentation to customers and the developer community
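To illustrate the quantization step above, here is a minimal sketch of symmetric per-tensor int8 post-training quantization (PTQ) in plain NumPy. The function names and the toy weight matrix are hypothetical examples for illustration only; this is not Quadric's toolchain or any specific framework's API.

```python
import numpy as np

def quantize_int8(w):
    """Symmetric per-tensor PTQ: map float weights onto int8 in [-127, 127]."""
    scale = np.abs(w).max() / 127.0
    q = np.clip(np.round(w / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize(q, scale):
    """Recover an approximation of the original float weights."""
    return q.astype(np.float32) * scale

# Toy weight matrix standing in for one layer of a real model.
rng = np.random.default_rng(0)
w = rng.standard_normal((64, 64)).astype(np.float32)

q, scale = quantize_int8(w)
w_hat = dequantize(q, scale)

# Round-to-nearest bounds the error by half a quantization step.
max_err = float(np.abs(w - w_hat).max())
```

Production toolchains add per-channel scales, zero points for asymmetric ranges, and calibration data, but the core trade-off — a 4x smaller weight tensor at the cost of bounded rounding error — is the same.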

Requirements

  • Bachelor’s or Master’s degree in Computer Science or Electrical Engineering
  • 5+ years of experience with AI/LLM model inference and deployment frameworks and tools
  • Experience with model quantization (PTQ, QAT) and related tools
  • Experience with model accuracy metrics
  • Experience with model inference performance profiling
  • Experience with at least one of the following: ONNX Runtime, PyTorch, vLLM, Hugging Face Transformers, Neural Compressor, llama.cpp
  • Proficiency in C/C++ and Python
  • Strong problem-solving, debugging, and communication skills

Benefits

  • Health Care Plan (Medical, Dental & Vision)
  • Retirement Plan (401k, IRA)
  • Life Insurance (Basic, Voluntary & AD&D)
  • Paid Time Off (Vacation, Sick & Public Holidays)
  • Family Leave (Maternity, Paternity)
  • Short Term & Long Term Disability
  • Training & Development
  • Work From Home
  • Free Food & Snacks
  • Stock Option Plan

Job title

AI Inference Engineer

Experience level

Mid level, Senior

Salary

Not specified

Degree requirement

Bachelor's Degree
