Scholar
Qidong Huang
Google Scholar ID: F-OzLhQAAAAJ
Qwen Team, Alibaba Cloud
vision and language
Citations & Impact (all-time)
Citations: 1,039
H-index: 13
i10-index: 13
Publications: 20
Co-authors: 9 (list available)
Publications
7 items
CapRL++: Unified Reinforcement Learning with Verifiable Rewards for Dense Image and Video Captioning
2026 · Cited: 0
Qwen3-VL Technical Report
2025 · Cited: 0
CapRL: Stimulating Dense Image Caption Capabilities via Reinforcement Learning
2025 · Cited: 0
ScaleCap: Inference-Time Scalable Image Captioning via Dual-Modality Debiasing
2025 · Cited: 0
MMRC: A Large-Scale Benchmark for Understanding Multimodal Large Language Model in Real-World Conversation
2025 · Cited: 0
Light-A-Video: Training-free Video Relighting via Progressive Light Fusion
2025 · Cited: 0
PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy Reduction
arXiv.org · 2024 · Cited: 6
Co-authors
9 total
Nenghai Yu
University of Science and Technology of China
Weiming Zhang
University of Science and Technology of China
Xiaoyi Dong
Microsoft GenAI
Dongdong Chen
Principal Research Manager, GenAI, Microsoft
Jiaqi Wang
Shanghai AI Laboratory
Gang Hua
Director of Applied Science, AI, Amazon.com, Inc., IEEE & IAPR Fellow
Dahua Lin
The Chinese University of Hong Kong
Liao Jing
CityU Hong Kong, Associate Professor