AgoraResearch hub
ExploreLibraryProfile
Account
Yuqian Fu
Scholar

Yuqian Fu

Google Scholar ID: oRcXbE0AAAAJ
Institute of Automation,Chinese Academy of Sciences
Reinforcement LearningLarge Language Model
Google Scholar↗
Citations & Impact
All-time
Citations
55
 
H-index
5
 
i10-index
1
 
Publications
15
 
Co-authors
8
list available
Contact
No contact links provided.
Publications
3 items
When RL Fails after SFT: Rejuvenating Model Plasticity for Robust SFT-to-RL Handoff
2026
Cited
0
Are Full Rollouts Necessary for On-Policy Distillation?
2026
Cited
0
Revisiting On-Policy Distillation: Empirical Failure Modes and Simple Fixes
2026
Cited
0
Resume (English only)
Co-authors
8 total
Dongbin Zhao
Dongbin Zhao
Institute of Automation, Chinese Academy of Sciences
Yuanheng Zhu
Yuanheng Zhu
Institute of Automation, Chinese Academy of Sciences
Jiajun Chai
Jiajun Chai
Meituan Inc.
Guojun Yin
Guojun Yin
Meituan, University of Science and Technology of China
Xihuai Wang
Xihuai Wang
Shanghai Jiao Tong University
Jian Zhao
Jian Zhao
Zhongguancun Institute of Artificial Intelligence
Guohao Li
Guohao Li
University of Oxford, CAMEL-AI.org
Bernard Ghanem
Bernard Ghanem
Professor, King Abdullah University of Science and Technology

Welcome back

Sign in to Agora

Welcome back! Please sign in to continue.

Do not have an account?