Hybrid Senior Performance Software Engineer, Deep Learning Libraries

Posted 2 months ago

About the role

  • As a Senior Performance Software Engineer, you will develop optimized code for deep learning libraries on NVIDIA GPUs, using C++ CUDA for core deep learning operations and collaborating across teams.

Responsibilities

  • Writing highly tuned compute kernels, mostly in C++ CUDA, to perform core deep learning operations (e.g. matrix multiplies, convolutions, normalizations)
  • Following general software engineering best practices including support for regression testing and CI/CD flows
  • Collaborating with teams across NVIDIA: the CUDA compiler team on generating optimal assembly code, the deep learning training and inference performance teams on which layers require optimization, and the hardware and architecture teams on the programming model for new deep learning hardware features

Requirements

  • Master's or PhD degree, or equivalent experience, in Computer Science, Computer Engineering, Applied Math, or a related field
  • 6+ years of relevant industry experience
  • Demonstrated strong C++ programming and software design skills, including debugging, performance analysis, and test design
  • Experience with performance-oriented parallel programming, even if it’s not on GPUs (e.g. with OpenMP or pthreads)
  • Solid understanding of computer architecture and some experience with assembly programming

Benefits

  • equity
  • benefits

Job title

Senior Performance Software Engineer, Deep Learning Libraries

Experience level

Senior

Salary

$184,000 - $356,500 per year

Degree requirement

Postgraduate Degree
