Scholar

Yuqian Fu

Google Scholar ID: oRcXbE0AAAAJ

Institute of Automation，Chinese Academy of Sciences

Reinforcement LearningLarge Language Model

Google Scholar↗

Citations & Impact

All-time

Citations

55

H-index

5

i10-index

1

Publications

15

Co-authors

8

list available

Contact

No contact links provided.

Publications

3 items

When RL Fails after SFT: Rejuvenating Model Plasticity for Robust SFT-to-RL Handoff

2026

Cited

0

Are Full Rollouts Necessary for On-Policy Distillation?

2026

Cited

0

Revisiting On-Policy Distillation: Empirical Failure Modes and Simple Fixes

2026

Cited

0

Resume (English only)

Co-authors

8 total

Institute of Automation, Chinese Academy of Sciences

Institute of Automation, Chinese Academy of Sciences

Meituan, University of Science and Technology of China

Shanghai Jiao Tong University

Zhongguancun Institute of Artificial Intelligence

University of Oxford, CAMEL-AI.org

Professor, King Abdullah University of Science and Technology