AI Software Development Engineer optimizing AI inference workloads including Large Language Models on Intel GPUs. Involves graph compilation, runtime execution, and kernel optimization.
Responsibilities
Optimize emerging AI inference workloads such as Large Language Models (LLMs) and Diffusion models on GPUs
Develop and optimize graph-based compilation flows (e.g., MLIR/LLVM) for neural network workloads
Write and tune performance-critical GPU kernels and runtime code in C++ or parallel programming languages
Identify and resolve bottlenecks across compiler, runtime, and kernel layers
Profile, benchmark, and characterize AI workloads to validate performance gains
Collaborate with hardware, driver, and framework teams on hardware/software co-optimization
Requirements
Bachelor's degree with 4+ years of relevant experience, OR Master's degree with 2+ years of relevant experience in Computer Science or a related field
Strong C++ development and debugging skills
Solid understanding of GPU architectures or AI accelerators
Hands-on experience with modern neural network architecture for inference on hardware accelerators
Preferred: PhD and 1+ years of relevant experience
Familiarity with OpenVINO or other AI inference frameworks
Knowledge of neural network optimization techniques and performance tradeoffs
Experience across multiple layers of the AI software stack, including AI inference engines or runtimes, graph compilers (e.g., MLIR/LLVM), GPU kernels or performance critical compute code
Software Engineer Lead developing ETL solutions for PNC's regulatory compliance needs. Leading design and development of data solutions with compliance emphasis.
Senior Software Engineer focusing on backend development at CVS Health. Building software components using a cloud - native platform on Google Cloud Platform.
Software Engineer developing high quality products for OPENLANE in web, iOS, and Android environments. Collaborating in an agile team to build solutions with backend microservices on AWS cloud.
Software Engineer supporting BlueCard claims processing by enhancing applications and modernizing legacy systems. Requires experience in COBOL, C#, and SQL Server with remote work options.
Fullstack Developer skilled in HTML, CSS, JavaScript, and Node.js at tech company. Involves frontend and backend development along with CI/CD practices in Chennai.
Software Engineer responsible for full stack development at U.S. Bank. Collaborating within teams to design, develop, and maintain innovative software solutions.
Ground Software Engineer Intern at Millennium Space Systems focusing on software development, integration, and testing for satellite systems. Collaborating with engineers and technicians to ensure software quality and functionality.
Principal Engineer leading architectural evolution of Xero's global platform and influencing global engineering standards. Collaborating with executive stakeholders and driving innovation in fintech.
Senior Engineer solving engineering problems at scale and influencing architecture at Xero. Focusing on refactoring systems and enhancing developer experiences with scalable software.