HPC and AI Cluster Engineer maintaining large scale HPC/AI clusters for NVIDIA's Networking clusters solutions. Engaging with researchers and developers to optimize workflows and deliver solutions.
Responsibilities
Deploy, manage and maintain large scale HPC/AI clusters
Managing Linux job/workload schedules and orchestration tools
Support and maintain continuous integration and delivery pipelines
Troubleshooting and fixing, bottom up from bare metal, operating system, software stack and application level
Supporting Research & Development activities and engaging in POCs for future improvements
Requirements
Bachelor's Degree in Computer Science, Engineering, or a related field; or equivalent experience
3+ years of experience
Knowledge of HPC and AI solution technologies from CPU’s and GPU’s to high speed interconnects and supporting software
Experience with job scheduling workloads and orchestration tools such as Slurm, K8s
Excellent knowledge of Windows and Linux (Redhat/CentOS and Ubuntu) networking (sockets, firewalls, iptables, wireshark, etc.) and internals, ACLs and OS level security protection and common protocols e.g. TCP, DHCP, DNS, etc.
Python programming and bash scripting experience, automation and configuration management tools such as Jenkins, Ansible, Gitops
Experience with virtual systems (for example VMware, Hyper-V, KVM)
Benefits
Competitive salaries
Extensive benefits package
Work environment that promotes diversity, inclusion, and flexibility
Innovation Manager improving legal workflows by integrating AI and emerging technologies with attorneys and innovation teams in a supportive environment.
Werkstudent:in Softwareentwicklung mit Low - Code & AI bei Axians in Köln. Technische Evaluierung, Entwicklung von Erweiterungen und Showcases für die Mendix - Plattform.
Quality Analyst responsible for testing product features and ensuring user experience for AI startup in Indonesia. Engaging in manual QA testing and reporting product issues fully.
AI Governance & Enablement Intern supporting the Enterprise AI Program at Sally Beauty Holdings. Responsible for organizing and operationalizing AI use across the company with cross - team collaboration.
Join a free intensive AI bootcamp aimed at advanced students or recent graduates in systems or digital business. Opportunities for mentorship with Teamcubation after completion.
Evaluate research funding applications and assess R&D projects at VDI Technologies in a supportive mentoring environment. Flexible hours and emphasis on innovation in technology development.
Referent*in Forschungsförderung im Bereich Informatik oder Künstliche Intelligenz bei VDI Technologienzentrum. Übernehmen von Anträgen zur Forschungsförderung sowie Bewertung von Forschungsprojekten.
Prompt Engineer optimizing AI prompts and applications for Zurich's Bot family. Collaborate with tech teams to enhance context strategies and model evaluations.