Hybrid Senior Software Architect – Deep Learning, HPC Communications

Posted 6 hours ago

Apply now

About the role

  • Senior Software Architect leading the design of scalable communications software at NVIDIA. Focusing on AI and HPC performance improvements for cutting-edge technology solutions.

Responsibilities

  • Investigate opportunities to improve communication performance by identifying bottlenecks in today's systems
  • Design and implement new communication technologies to accelerate AI and HPC workloads
  • Explore innovative solutions in HW and SW for our next generation platforms as part of co-design efforts involving GPU, Networking, and SW architects
  • Build proofs-of-concept, conduct experiments, and perform quantitive modeling to evaluate and drive new innovations
  • Use simulation to explore performance of large GPU clusters (think scales of 100s of 1000s of GPUs)

Requirements

  • M.S./Ph.D. degree in CS/CE or equivalent experience
  • 5+ years of relevant experience
  • Excellent C/C++ programming and debugging skills
  • Experience with parallel programming models (MPI, SHMEM) and at least one communication runtime (MPI, NCCL, NVSHMEM, OpenSHMEM, UCX, UCC)
  • Deep understanding of operating systems, computer and system architecture
  • Solid in fundamentals of network architecture, topology, algorithms, and communication scaling relevant to AI and HPC workloads
  • Strong experience with Linux
  • Ability and flexibility to work and communicate effectively in a multi-national, multi-time-zone corporate environment

Benefits

  • equity
  • benefits

Job title

Senior Software Architect – Deep Learning, HPC Communications

Job type

Experience level

Senior

Salary

$184,000 - $287,500 per year

Degree requirement

Postgraduate Degree

Tech skills

Location requirements

Report this job

See something inaccurate? Let us know and we'll update the listing.

Report job