AI Inference Engineer developing AI model optimizations for Quadric's GPNPU platforms. Porting and benchmarking AI models to enhance performance in edge devices.

Quadric has built a unified hardware/software architecture optimized for on-device machine learning inference.
Only the Quadric GPNPU (general purpose neural processing unit) delivers high ML inference performance while also running C++ code without forcing the developer to artificially partition application code between two or three different kinds of processors.
Quadric's GPNPU is a licensable processor IP core that scales from 1 to 64 TOPs and seamlessly intermixes scalar, vector and matrix code.
Browse and apply for open jobs at quadric.io.
AI Inference Engineer developing AI model optimizations for Quadric's GPNPU platforms. Porting and benchmarking AI models to enhance performance in edge devices.
AI Kernel Engineer developing efficient AI kernels/operators for Quadric's neural processing unit. Analyze and optimize kernel performance across various hardware configurations.
Program Manager managing AI acceleration chips and embedded systems product lifecycle. Leading technical execution and customer engagement in a hybrid work environment.