Senior Performance Software Engineer developing optimized code for deep learning libraries on NVIDIA GPUs, using C++ CUDA for core deep learning operations and collaborating across teams.
Responsibilities
Writing highly tuned compute kernels, mostly in C++ CUDA, to perform core deep learning operations (e.g. matrix multiplies, convolutions, normalizations)
Following general software engineering best practices including support for regression testing and CI/CD flows
Collaborating with teams across NVIDIA: the CUDA compiler team on generating optimal assembly code; the deep learning training and inference performance teams on which layers require optimization; and the hardware and architecture teams on the programming model for new deep learning hardware features
Requirements
Master's or PhD degree, or equivalent experience, in Computer Science, Computer Engineering, Applied Math, or a related field
6+ years of relevant industry experience
Demonstrated strong C++ programming and software design skills, including debugging, performance analysis, and test design
Experience with performance-oriented parallel programming, even if it’s not on GPUs (e.g. with OpenMP or pthreads)
Solid understanding of computer architecture and some experience with assembly programming
Benefits
Equity and benefits
Job title
Senior Performance Software Engineer, Deep Learning Libraries