Senior AI Software Engineer developing open-source AI frameworks for Large Language Models at NVIDIA. Collaborating on optimizing model training and providing innovative solutions in AI applications.
Responsibilities
Design and develop the GenAI open source Megatron Core and NeMo Framework
Solve large-scale, end-to-end AI training and inference challenges, spanning the full model lifecycle from initial orchestration, data pre-processing, and running of model training and tuning, to model deployment.
Work at the intersection of AI applications, libraries, frameworks, and the entire software stack.
Innovate and improve model architectures, distributed training algorithms, and model parallel paradigms.
Accelerate foundation model training and finetuning with mixed precision recipes and next-gen NVIDIA GPU architectures.
Performance tuning and optimizations of deep learning framework and software components.
Research, prototype, and develop robust and scalable AI tools and pipelines.
Requirements
MS, PhD or equivalent experience in Computer Science, AI, Applied Math, or related fields and 5+ years of industry experience.
Experience with AI Frameworks (e.g. PyTorch, JAX), and/or inference and deployment environments (e.g. TRTLLM, vLLM, SGLang).
Proficient in Python programming, software design, debugging, performance analysis, test design and documentation.
Consistent record of working effectively across multiple engineering initiatives and improving AI libraries with new innovations.
Strong understanding of AI/Deep-Learning fundamentals and their practical applications.
Field Software Engineer configuring and troubleshooting maritime sensor technologies for a global intelligence network. Collaborating remotely with field technicians and backend engineers.
Sr. Software Engineer developing Azure/Cloud applications. Overseeing architectural design and contributing to robust RESTful services creation in a cloud environment.
Full - Stack Software Engineer at GovWell building AI - powered solutions for government services. Working across the stack to deliver features that improve public service efficiency.
Senior Manager managing CNB integration initiatives at RBC, focusing on engineering delivery and program governance. Engaging with technology teams to ensure successful project execution and reporting.
Software Engineer working on scalable LLM and AI systems at Carelon Global Solutions. Responsibilities include building LLM model pipelines, collaborating with various teams, and mentoring junior engineers.
Senior Software Engineer designing and developing scalable data solutions using Snowflake and Microsoft Data Fabric at Carelon. Collaborating on healthcare data projects with technical data solutions.
Software Engineer II at Carelon optimizing large - scale healthcare data solutions using Snowflake and Microsoft Data Fabric. Collaborating with stakeholders to develop impactful data solutions.
Dashboard Product Engineer overseeing the AIX Dashboard product at Applied Materials. Driving roadmap clarity and stakeholder alignment while ensuring adoption and collaboration across teams.
Senior Software Engineer driving AI innovation for Fortune 500 energy leader and AI Fund. Building systems to optimize the operation and management of critical assets in energy supply.