AI Software Development Engineer optimizing AI inference workloads, including Large Language Models, on Intel GPUs. The role involves graph compilation, runtime execution, and kernel optimization.
Responsibilities
Optimize emerging AI inference workloads such as Large Language Models (LLMs) and Diffusion models on GPUs
Develop and optimize graph-based compilation flows (e.g., MLIR/LLVM) for neural network workloads
Write and tune performance-critical GPU kernels and runtime code in C++ or parallel programming languages
Identify and resolve bottlenecks across compiler, runtime, and kernel layers
Profile, benchmark, and characterize AI workloads to validate performance gains
Collaborate with hardware, driver, and framework teams on hardware/software co-optimization
Requirements
Bachelor's degree with 4+ years of relevant experience, OR Master's degree with 2+ years of relevant experience in Computer Science or a related field
Strong C++ development and debugging skills
Solid understanding of GPU architectures or AI accelerators
Hands-on experience with modern neural network architectures for inference on hardware accelerators
Preferred: PhD and 1+ years of relevant experience
Familiarity with OpenVINO or other AI inference frameworks
Knowledge of neural network optimization techniques and performance tradeoffs
Experience across multiple layers of the AI software stack, including AI inference engines or runtimes, graph compilers (e.g., MLIR/LLVM), and GPU kernels or performance-critical compute code
Tech Lead - AI Solutions at Exacaster impacting millions by building AI-powered applications. Leading a cross-functional team to deliver scalable, production-ready AI systems.
Principal Engineer at Frost Bank leading architectural evolution and mentoring engineering teams. Collaborating on technical solutions to enhance banking systems and ensure security practices.
Principal Engineer responsible for technical and architectural vision for Wells Fargo's Enterprise Product and Pricing Management. Leading complex initiatives and providing expert advice throughout the organization.
Senior Fullstack Software Engineer developing scalable systems for Read AI, enhancing AI-powered collaboration tools. Leading the technical roadmap and mentoring engineers while ensuring high-performance user experiences.
Experienced Software Engineers developing and delivering complex software solutions at Boeing Precision Engagement Systems. Collaborating with a multi-discipline team in an Agile environment.
Associate Software Engineer developing and integrating embedded software solutions for Boeing’s precision engagement systems. Delivering real-time applications to support defense initiatives across the globe.
Enterprise Full-Stack Developer leading software development life-cycle for Middlebury College. Collaborating with IT and partners to enhance operational capabilities and applications.
Full Stack Developer responsible for developing innovative web and mobile applications at Interad Software GmbH. Collaborating in an agile team with a focus on frontend technologies like React and .NET.
Director of Engineering leading Digital Investing platforms, ensuring secure and resilient investment experiences. Fostering engineering excellence and collaborating across organizational structures.