Onsite AI Computing Performance Architect Intern, Performance Analysis, Kernel Development

About the role

  • As a Performance Architect Intern, you will optimize major LLM layers and kernels for NVIDIA architectures and collaborate with cross-functional teams on in-depth performance analysis and resource optimization.

Responsibilities

  • Design, develop, and optimize major LLM layers (e.g., attention, GEMM, inter-GPU communication) for NVIDIA's new architectures.
  • Implement and fine-tune kernels to achieve optimal performance on NVIDIA GPUs.
  • Conduct in-depth performance analysis of GPU kernels, including Attention and other critical operations.
  • Identify bottlenecks, optimize resource utilization, and improve throughput and power efficiency.
  • Create and maintain workloads and micro-benchmark suites to evaluate kernel performance across various hardware and software configurations.
  • Generate performance projections, comparisons, and detailed analysis reports for internal and external stakeholders.
  • Collaborate with architecture, software, and product teams to guide the development of next-generation deep learning hardware and software.

Requirements

  • Pursuing a BS, MS, or PhD in a relevant discipline (CS, EE, or CE).
  • Strong software skills in C/C++, Python, MPI, OpenMP, etc.
  • Solid background in computer science fundamentals and software/hardware architecture.
  • Experience with deep learning workload and operator performance is a plus.
  • Familiarity with GPU computing and parallel programming models is a plus.
  • Excellent oral and written communication skills.
  • Good organizational, time management, and task prioritization skills.

Job title

AI Computing Performance Architect Intern, Performance Analysis, Kernel Development

Job type

Internship

Experience level

Entry level

Salary

Not specified

Degree requirement

Bachelor's Degree

Tech skills

C/C++, Python, MPI, OpenMP, GPU computing

Location requirements

Onsite