Senior Systems Engineer at NVIDIA focused on improving AI cluster resiliency and delivering AIOps solutions. Collaborating with team members to debug complex issues and enhance customer satisfaction.
Responsibilities
Bring together and understand internal and external customer requirements to improve AI cluster resiliency and design AIOps-based solutions that address these needs
Develop automated workflows for issue detection and root cause analysis and closely collaborate with operators to debug sophisticated, full-stack AI cluster problems
Deliver compelling technical presentations and lead hands-on demos or training
Handle evaluation deployments (POC/POV) and ensure smooth, reliable installations by staying engaged throughout the customer journey
Requirements
Bachelor of Science or equivalent experience
8+ years of networking experience in enterprise or service provider environments, with strong hands-on expertise in routing and switching
Proficient in scripting and automation using Python or similar languages, with strong Linux expertise
Proven experience working directly with customers to resolve issues and ensure success in Systems Engineer or SRE roles
Exceptional oral, written, and presentation skills for clearly communicating complex technical topics
Demonstrated ability to collaborate effectively across teams, partnering with operations, engineering, and product development
Benefits
Equity
Benefits
Job title
Senior Systems Engineer, Artificial Intelligence Operations
Lead Finance Systems Analyst driving financial systems integration and sustainability at Boeing. Collaborating across teams to enhance and support finance platforms and processes.
Senior Software Systems Engineer leading Full Stack Java development team at Boeing's Space Mission Systems. Focusing on Mission Management and Mission Planning software development.
Senior Applied AI Engineer designing and building AI workflows for mechanical design at Backflip. Shaping the future of how users interact with groundbreaking AI tools.
Systems Analyst at Deloitte focused on innovation within cloud technology projects. Collaborating with stakeholders to translate business needs into technical specifications and solutions.
Systems Engineer focused on Microsoft technologies at a global fintech leader. Leading design, implementation, and optimization of Windows and Azure infrastructure.
Staff Audio Systems Engineer responsible for audio systems architecture and delivery in a music tech startup. Collaborating with teams to ensure high - performance audio systems for on - demand vinyl records.
System Engineer responsible for Contact Center process development and collaboration with Genesys partner. Ensuring compliance and optimizing services in a hybrid work environment.
Business Systems Analyst providing systems support to develop and enhance business systems for IQVIA. Collaborating with teams to formulate systems scope and objectives to improve client delivery.
Senior Cloud Systems Engineer providing Databricks administration for government/financial services. Responsible for platform management, security, and automation in a hybrid setup.