Lead research on Foundational Multimodal Models for Conversational Avatars — systems that can perceive, reason, and generate across video, audio, and language.
Build and train models using Autoregressive, Predictive (e.g., V-JEPA), and Diffusion-based architectures with a deep focus on temporal and sequential data (not static frames).
Design and execute experiments to predict and control the visual, auditory, and linguistic responses of avatars.
Partner with the Applied ML team to bring research into real-world use cases.
Mentor other researchers and drive excellence across the team.
Requirements
A PhD plus 2–3+ years working hands-on with LLMs, VLMs, or multimodal systems.
Previous experience leading research efforts or mentoring teams.
Expertise in sequence modeling across video, audio, and text — with strong understanding of autoregressive, predictive, and diffusion frameworks.
Experience with large-scale model training and optimization for performance and real-time generation.
Proven ability to translate research ideas into production-grade systems.
Publications in top-tier venues (CVPR, ICCV, NeurIPS, ECCV, ACMMM).
Strong PyTorch skills and comfort moving fluidly between research and engineering.
Benefits
flexible work schedule
unlimited PTO
competitive healthcare
gear stipends
Job title
Senior AI Researcher, Multimodal Perception Models
AI Research Intern at NVIDIA focusing on multi - modal AI and vision - language model development. Collaborating with engineers to advance cutting - edge machine learning research in Vietnam.
AI Research Intern at NVIDIA developing generative models for biotechnology and computational science. Collaborating with a team to enhance AI model performance in drug discovery.
Applied AI Scientist enhancing healthcare AI products with a focus on responsible AI and collaborative development. Utilizing advanced AI techniques to support healthcare providers' needs and streamline processes in patient care.
AI Research Engineer developing innovative learning systems at Campus. Leading research and collaboration to enhance student engagement through AI solutions.
Senior Data Scientist leading design and execution of evaluation frameworks for generative AI systems at Resaro. Focusing on large language models, applying scientific methods to ensure AI safety and effectiveness.
Senior AI Scientist at Resaro, developing AI evaluation frameworks for generative AI systems. Engaging in deep learning applications, mentoring, and enhancing model performance metrics.
AI Research Scientist at Toyota Research Institute developing generative AI models for understanding human behavior. Focused on machine learning and behavioral science for carbon neutrality research.
Senior Applied AI Scientist at Messagepoint developing generative AI solutions using large language models. Focused on transforming research into production systems for customer communications.
Lead Data & AI Scientist at Causeway building ML models and data platforms for construction software. Drive insights, implement data architectures, and collaborate with product, architecture, and engineering teams.