Scholar
De-An Huang
Google Scholar ID: HEY3UzgAAAAJ
Stanford University
Computer Vision
Robotics
Machine Learning
Bioinformatics
Citations & Impact
All-time
Citations: 6,563
H-index: 37
i10-index: 45
Publications: 20
Co-authors: 152
Publications
11 items
VideoITG: Multimodal Video Understanding with Instructed Temporal Grounding (2025), cited 0
Argus: Vision-Centric Reasoning with Grounded Chain-of-Thought (2025), cited 0
FRAG: Frame Selection Augmented Generation for Long Video and Long Document Understanding (2025), cited 0
Eagle 2.5: Boosting Long-Context Post-Training for Frontier Vision-Language Models (2025), cited 0
Token-Efficient Long Video Understanding for Multimodal LLMs (2025), cited 0
QLIP: Text-Aligned Visual Tokenization Unifies Auto-Regressive Multimodal Understanding and Generation (2025), cited 0
Eagle 2: Building Post-Training Data Strategies from Scratch for Frontier Vision-Language Models (2025), cited 0
Omni-RGPT: Unifying Image and Video Region-level Understanding via Token Marks (2025), cited 0
Resume (English only)
Academic Achievements
Authored multiple papers at top venues including CVPR 2025 and ICLR 2025, as well as arXiv preprints, such as:
- FRAG: Frame Selection Augmented Generation for Long Video and Long Document Understanding
- Eagle series (Eagle 2, Eagle 2.5): Post-training data strategies for frontier vision-language models
- QLIP: Text-Aligned Visual Tokenization
- Omni-RGPT: Unified region-level image and video understanding
- NVILA: Efficient frontier visual language models
- T-Stitch: Accelerating sampling in pre-trained diffusion models
- X-VILA: Cross-modality alignment for large language models
- ARDuP: Active region video diffusion for universal policies
Research Experience
Research Scientist at NVIDIA
Summer internships at leading research labs:
- NVIDIA Seattle Robotics Lab (with Dieter Fox)
- Facebook Applied Machine Learning (with Vignesh Ramanathan and Dhruv Mahajan)
- Microsoft Research Redmond (with Zicheng Liu)
- Disney Research Pittsburgh (with Leonid Sigal)
Co-authors
152 total
Anima Anandkumar
California Institute of Technology and NVIDIA
Li Fei-Fei
Professor of Computer Science, Stanford University
Zhiding Yu
Principal Research Scientist & Research Lead, NVIDIA Research
Juan Carlos Niebles
Research Director (Salesforce) & Adjunct Professor (Stanford University)
Yuke Zhu
The University of Texas at Austin, NVIDIA Research
Weili Nie
NVIDIA Research
Linxi "Jim" Fan
NVIDIA, https://jimfan.me
Chaowei Xiao
University of Wisconsin - Madison/NVIDIA