Edge Inference Engineer at Liquid AI optimizing machine code for resource-constrained devices. Implementing inference kernels and collaborating with ML researchers on new model architectures.
Responsibilities
Implement and optimize inference kernels for CPU, NPU, and GPU architectures across diverse edge hardware
Develop quantization strategies (INT4, INT8, FP8) that maximize compression while preserving model quality under strict memory budgets
Contribute to llama.cpp and other open-source inference frameworks, including new model architectures (audio, vision)
Profile and optimize end-to-end inference pipelines to achieve sub-100ms time-to-first-token on target devices
Collaborate with ML researchers to understand model architectures and identify optimization opportunities specific to Liquid Foundation Models
Requirements
5+ years of experience in systems programming with strong C++ proficiency
Embedded software engineering experience or work on resource-constrained systems
Understanding of ML fundamentals at the linear algebra level (how matrix operations, attention, and quantization work)
Experience with hardware architecture concepts: cache hierarchies, memory bandwidth, SIMD/vectorization
Contributions to llama.cpp, ExecuTorch, or similar inference frameworks (nice-to-have)
Experience with Rust for systems programming (nice-to-have)
Background in custom accelerator development (TPU, NPU) or work at companies like SambaNova, Cerebras, Groq, or Google/Amazon accelerator teams (nice-to-have)
Quantitative degree (mathematics, physics, or similar) combined with engineering experience (nice-to-have)
Benefits
Competitive base salary with equity in a unicorn-stage company
We pay 100% of medical, dental, and vision premiums for employees and dependents
401(k) matching up to 4% of base pay
Unlimited PTO plus company-wide Refill Days throughout the year
Engineering Technologist III providing technical expertise to solve complex issues at Duke Energy. Overseeing work in a specific area with limited supervision.
Engineering Technologist II at Duke Energy applying technical expertise to solve complex problems and providing oversight in specific areas. Building upon technical skills and working independently with general supervision.
Lead Internal Audit Developer crafting AI solutions for the innovation team in financial services. Collaborating with audit teams to drive practical analytics and automation solutions.
Software Senior Expert II at BBVA focusing on secure software development. Mobilizing and influencing security objectives in development teams and safeguarding software across the organization.
Internship/Co - Op position at BW Design Group, an architecture, engineering, and consulting firm. Engage in teamwork and client projects in the manufacturing industry.
Project Lead managing Engineering Digital Delivery projects at Rolls - Royce. Ensuring successful project delivery, adherence to quality standards, and integration with overall strategy.
Director of Technology Software Development at AT&T overseeing innovative software solutions for wireline networks. Leading recruitment and AI integration efforts while ensuring quality - first delivery in engineering leadership roles.
Strategic Intelligence Lead managing the analytical work and relationships with collaborators at Flutter. Involved in scenario planning, risk modeling, and leading workshops with senior executives.
Clinical Study Build Programmer at Lilly programming and testing clinical trial data collection databases. Collaborating with teams to ensure data accuracy for global clinical trials.
Senior Developer at Sharesies contributing to high quality, customer - centred product in an award - winning wealth app. Building financial empowerment for Australians and New Zealanders.