Onsite AI Research Intern, Multi-Modal Model Development

Posted 2 days ago

Apply now

About the role

  • Develop and fine-tune multi-modal AI models using NVIDIA’s TAO Toolkit and deep learning frameworks.
  • Contribute to the design and implementation of vision-language models (VLMs) and universal segmentation systems.
  • Conduct experiments and benchmarking to evaluate model accuracy, robustness, and scalability.
  • Collaborate with cross-functional teams to integrate your research into production-level pipelines and NVIDIA SDKs.
  • Participate in research discussions, code reviews, and technical documentation to share insights and improve methodologies.

Requirements

  • Currently pursuing a degree in Computer Science, Computer Engineering, or a related field.
  • Proven experience with machine learning, deep learning, or computer vision model development.
  • Strong Python programming skills and proficiency with PyTorch or similar frameworks.
  • Solid understanding of neural network architectures, transformers, and multi-modal learning techniques.
  • Excellent problem-solving abilities, attention to detail, and a collaborative mindset.
  • Familiarity with vision-language models, image segmentation, or large-scale pretraining is a strong plus.

Benefits

  • Comprehensive benefits package

Job title

AI Research Intern, Multi-Modal Model Development

Job type

Experience level

Entry level

Salary

Not specified

Degree requirement

Bachelor's Degree

Tech skills

Location requirements

Report this job

See something inaccurate? Let us know and we'll update the listing.

Report job