Principal Engineer at NVIDIA architecting next-generation diagnostic systems for Cloud Service Providers. Leading technical strategy and mentoring engineering teams for scalable infrastructures.
Responsibilities
Define technical strategy and development of NVIDIA’s Data Center diagnostic systems, orchestrating large-scale stress testing for CPUs, GPUs, networking, memory, and high-speed interconnects.
Mentor and grow engineering teams, providing technical leadership and encouraging a culture of innovation and excellence.
Drive the root-cause analysis of systemic failures that intersect multiple hardware and software domains.
Partner with CSPs to diagnose and address scalability challenges within their unique data center infrastructures.
Requirements
Bachelor's degree in Computer Science/Engineering, Electrical Engineering, or a related field (or equivalent experience).
15+ years of system software experience working on highly resilient distributed systems with programming experience in C++ or Python.
Deep systems knowledge of x86/ARM architectures, Linux OS internals, firmware (UEFI/BIOS), Redfish, HMC, BMC protocols and platform security.
Consistent track record demonstrating technical leadership leading project teams and setting technical direction.
Expertise in software testing methodologies with an automation-led, AI-first approach to ensuring software quality.
Benefits
equity
benefits
Job title
Principal System Software Engineer – Data Center MODS
Senior System Software Engineer developing Microcontroller Firmware for GPU Server platforms at NVIDIA. Focusing on building and maintaining server manageability and embedded solutions.
Senior Engineering Technician focusing on electrical design for substation projects at Black & Veatch. Collaborating with multidisciplinary teams to create high - quality engineering deliverables.
Full - Stack Software Developer at Mycolever, developing a biocompound discovery platform with cloud technologies. Collaborating with scientists to integrate biological insights and ensuring platform scalability.
Lead Engineer developing and implementing HSE and ESG management systems at Honeywell. Collaborating with teams to ensure compliance and foster a culture of safety and responsibility.
Software Architect at IT - Strat improving end - to - end transaction posting logic and ensuring compliance. Collaborate on solutions and support new technologies in an Agile environment.
Full Stack Software Developer leading software design and development at VSolvit. Collaborating with cross - functional teams to deliver high - quality software solutions.
Technical Lead managing document lifecycle solutions at Luminor Group. Leading technical teams and ensuring architectural integrity in a hybrid environment.
Technical Lead responsible for document management and workflow systems at Luminor. Leading development of solutions for document lifecycle in an agile environment.
Technical Lead developing innovative document management solutions for Luminor, the leading bank in the Baltics. Leading projects to enhance the full document lifecycle in a collaborative team environment.
Softwareentwickler developing cloud - based inventory management solutions for retail and service businesses. Engaging in innovative software architecture and collaborating with teams.