AI Inference Engineer developing AI model optimizations for Quadric's GPNPU platforms. Porting and benchmarking AI models to enhance performance in edge devices.
Responsibilities
Quantize, prune and convert models for deployment
Port models to Quadric platform using Quadric toolchain
Optimize inference deployment for latency, speed
Benchmark and profile model performance and accuracy
Develop tools to scale and speed up the deployment
Make Improvement to SDK and runtime
Provide technical support and documents to customers and developer community
Requirements
Bachelor’s or Master’s in Computer Science and/or Electric Engineering.
5+ years of experience in AI/LLM model inference and deployment frameworks/tools
experience with model quantization (PTQ, QAT) and tools
experience with model accuracy measures
experience with model inference performance profiling
experience with at least one of the following frameworks: onnxruntime, Pytorch, vLLM, huggingface-transformer, neural-compressor, llamacpp
Proficiency in C/C++ and Python
Demonstrate good capability in problem solving, debug and communication
Full - time role focused on data preparation, AI applications, and deep learning methods in Burgbrohl, Germany. Collaborating on projects and prototype development with a strong emphasis on programming in Python.
Internship in computer vision and AI at IUNA AI Systems GmbH focusing on data preparation for AI models and collaboration with ML engineers. Join a deep tech startup specializing in AI - based image processing.
Product Director leading AI Knowledge Graph strategy for Splunk’s Data Fabric initiatives. Driving product definition and data processing strategies in AI - driven environments.
IT Process Manager using AI to improve process landscape for packaging machinery. Collaborating on AI use cases and managing their implementation in projects.
Business Configuration Analyst implementing AI - powered solutions across insurance platforms. Collaborating with engineers and customers while configuring and optimizing AI technologies.
Sr. Manager leading enterprise - wide AI transformation initiatives at Solstice Advanced Materials. Collaborating with stakeholders to redesign business processes and drive adoption of AI solutions.
Technical Product Marketing Manager focused on product marketing strategy for HPE Private Cloud AI. Responsible for technical content execution and collaborative efforts with product management.
AI Prompt Engineer focusing on developing conversational AI experiences for healthcare professionals at Elsevier. Join a team creating innovative solutions powered by generative AI.
Junior AI Videographer creating engaging AI - driven video and visual content for a multi - asset broker. Collaborating on marketing campaigns and digital storytelling.
Technology Consultant role with Avanade focusing on IT and digital solutions after completing a foundational training program. Join a community passionate about technology and innovation.