Scholar
Sibo Song
Google Scholar ID: kwWyE2cAAAAJ
Alibaba
computer vision
deep learning
multimodal learning
Citations & Impact
Citations (all-time): 2,415
H-index: 13
i10-index: 13
Publications: 16
Co-authors: 4 (list available)
Publications
11 items
Learning Transferable Temporal Primitives for Video Reasoning via Synthetic Videos
2026 · Cited: 0
Qwen3-VL-Embedding and Qwen3-VL-Reranker: A Unified Framework for State-of-the-Art Multimodal Retrieval and Ranking
arXiv.org · 2026 · Cited: 9
Qwen3-VL Technical Report
2025 · Cited: 0
Revisiting Multimodal Positional Encoding in Vision-Language Models
2025 · Cited: 0
Hulu-Med: A Transparent Generalist Model towards Holistic Medical Vision-Language Understanding
2025 · Cited: 0
Knowing or Guessing? Robust Medical Visual Question Answering via Joint Consistency and Contrastive Learning
2025 · Cited: 0
CAPO: Reinforcing Consistent Reasoning in Medical Decision-Making
2025 · Cited: 0
OmniV-Med: Scaling Medical Vision-Language Model for Universal Visual Understanding
2025 · Cited: 0
Co-authors
4 total
Ngai-Man Cheung
Associate Professor, Singapore University of Technology and Design
Zhibo Yang
Alibaba Group; Tsinghua University
Cong Yao
Alibaba DAMO Academy
Co-author 4