Principal Engineer at NVIDIA architecting next-generation diagnostic systems for Cloud Service Providers. Leading technical strategy and mentoring engineering teams that build scalable infrastructure.
Responsibilities
Define the technical strategy for, and lead the development of, NVIDIA’s Data Center diagnostic systems, orchestrating large-scale stress testing for CPUs, GPUs, networking, memory, and high-speed interconnects.
Mentor and grow engineering teams, providing technical leadership and encouraging a culture of innovation and excellence.
Drive root-cause analysis of systemic failures that span multiple hardware and software domains.
Partner with CSPs to diagnose and address scalability challenges within their unique data center infrastructures.
Requirements
Bachelor's degree in Computer Science/Engineering, Electrical Engineering, or a related field (or equivalent experience).
15+ years of system software experience working on highly resilient distributed systems with programming experience in C++ or Python.
Deep systems knowledge of x86/ARM architectures, Linux OS internals, firmware (UEFI/BIOS), Redfish, HMC, BMC protocols and platform security.
Consistent track record of technical leadership, including leading project teams and setting technical direction.
Expertise in software testing methodologies with an automation-led, AI-first approach to ensuring software quality.
Benefits
equity
benefits
Job title
Principal System Software Engineer – Data Center MODS
Senior Software Developer building and scaling Nasdaq's big data pipeline infrastructure. Collaborating with teams to design, implement, and optimize data lake solutions for global markets.
Salesforce Application Developer developing software solutions and supporting business processes at CDW. Collaborating on large-scale projects involving Salesforce CRM and Azure Cloud-based solutions.
Software Engineer at Notion, developing AI Meeting Notes and data capture features. Focused on building innovative tools for efficient team collaboration and information management.
Member of Technical Staff building internal data and agent infrastructure for Liquid AI. Design and build the unified company data graph and agent layer for operational efficiency.
Full Stack Developer supporting design and delivery of large-scale digital platforms within a multidisciplinary team. Leading technical development across frontend and backend components with a focus on scalable and secure solutions.
Join a cyber security scale-up as a Senior Engineer, leading feature development in a collaborative environment. Work closely with teams and contribute to decision-making in a complex domain.
Simulation & Virtual Engineering Lead at Airbus UpNext shaping UAS operation technologies. Designing simulation ecosystems and overseeing high-fidelity models for aerospace applications.
GTM Engineer at Baseten designing AI-powered workflows to enhance sales, marketing, and support. Driving CRM strategy and ensuring data quality for better performance.
Software Developer focused on full stack development and frontend solutions for Addvolt's cloud platform. Joining a team committed to innovative, eco-friendly technology for refrigerated transport.
Software Engineer designing and developing Python applications for Northrop Grumman. Collaborating with cross-functional teams and adhering to DoD security standards.