Scholar
Song Han
Google Scholar ID: E0iCaa4AAAAJ
Massachusetts Institute of Technology
Computer Architecture
Deep Learning
Computer Vision
Follow
Homepage
↗
Google Scholar
↗
Citations & Impact
All-time
Citations
69,193
H-index
75
i10-index
153
Publications
20
Co-authors
9
list available
Contact
Twitter
Open ↗
GitHub
Open ↗
LinkedIn
Open ↗
Publications
7 items
Fast-dVLM: Efficient Block-Diffusion VLM via Direct Conversion from Autoregressive VLM
2026
Cited
0
TriAttention: Efficient Long Reasoning with Trigonometric KV Compression
2026
Cited
0
Adaptive Block-Scaled Data Types
2026
Cited
0
Attend Before Attention: Efficient and Scalable Video Understanding via Autoregressive Gazing
2026
Cited
0
Stable Asynchrony: Variance-Controlled Off-Policy RL for LLMs
2026
Cited
0
ForeAct: Steering Your VLA with Efficient Visual Foresight Planning
2026
Cited
0
Jet-RL: Enabling On-Policy FP8 Reinforcement Learning with Unified Training and Rollout Precision Flow
2026
Cited
0
Resume (English only)
Academic Achievements
The 'Deep Compression' paper is among the top-5 most cited in the 50-year history of ISCA (1953–2023)
Best paper awards at ICLR'16, FPGA'17, and MLSys'24
MLSys'24 award-winning work AWQ enables 4-bit quantization with over 19 million downloads on HuggingFace
Recipient of NSF CAREER Award and Sloan Research Fellowship
Named to MIT Technology Review’s '35 Innovators Under 35'
Selected as one of IEEE’s 'AI’s 10 to Watch'
Research Experience
Led a series of works on LLM quantization and acceleration, including SmoothQuant, AWQ, and StreamingLLM
Proposed TinyML and Once-for-All Network (hardware-aware neural architecture search)
Heads the HAN Lab, researching efficient generative AI, visual generation, and vision-language models
Developed efficient visual generation models such as HART, SANA series, and DC-VideoGen
Designed multiple sparse attention mechanisms including SpAtten, Radial Attention, and XAttention
Built the VILA family of efficient vision-language models: VILA, VILA-U, and LongVILA
Co-authors
9 total
Co-author 1
Zhijian Liu
Research Scientist at NVIDIA, Assistant Professor at UC San Diego
Co-author 3
Hanrui Wang
MIT
Han Cai
NVIDIA
Haotian Tang
Massachusetts Institute of Technology
Yu Wang (汪玉)
Department of Electronic Engineering, Tsinghua University, China
Co-author 8
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up