HPC and AI Cluster Engineer at NVIDIA | Hybrid Hired

About the role

HPC and AI Cluster Engineer maintaining large-scale HPC/AI clusters for NVIDIA's networking cluster solutions. The role involves working with researchers and developers to optimize workflows and deliver solutions.

Responsibilities

Deploy, manage, and maintain large-scale HPC/AI clusters
Manage Linux job/workload schedulers and orchestration tools
Support and maintain continuous integration and delivery pipelines
Troubleshoot and fix issues from the bottom up: bare metal, operating system, software stack, and application level
Support Research & Development activities and engage in POCs for future improvements

Requirements

Bachelor's Degree in Computer Science, Engineering, or a related field; or equivalent experience
3+ years of experience
Knowledge of HPC and AI solution technologies, from CPUs and GPUs to high-speed interconnects and supporting software
Experience with job scheduling and orchestration tools such as Slurm and K8s
Excellent knowledge of Windows and Linux (Red Hat/CentOS and Ubuntu) internals and networking (sockets, firewalls, iptables, Wireshark, etc.), ACLs and OS-level security protection, and common protocols such as TCP, DHCP, and DNS
Python programming and Bash scripting experience, plus experience with automation and configuration management tools such as Jenkins, Ansible, and GitOps
Experience with virtual systems (for example VMware, Hyper-V, KVM)

Benefits

Competitive salaries
Extensive benefits package
Work environment that promotes diversity, inclusion, and flexibility
