Scholar
Qingnan Ren
Google Scholar ID: Ih8-1Y0AAAAJ
USTC
LLM training
RL
Agent
Follow
Google Scholar
↗
Citations & Impact
All-time
Citations
178
H-index
1
i10-index
1
Publications
1
Co-authors
6
list available
Contact
No contact links provided.
Publications
7 items
ADORA: Training Reasoning Models with Dynamic Advantage Estimation on Reinforcement Learning
2026
Cited
2
Internalizing Meta-Experience into Memory for Guided Reinforcement Learning in Large Language Models
2026
Cited
0
Implicit Strategic Optimization: Rethinking Long-Horizon Decision-Making in Adversarial Poker Environments
2026
Cited
0
ECHO-2: A Large-Scale Distributed Rollout Framework for Cost-Efficient Reinforcement Learning
2026
Cited
0
Controlled LLM Training on Spectral Sphere
2026
Cited
5
Echo: Decoupling Inference and Training for Large-Scale RL Alignment on Heterogeneous Swarms
2025
Cited
0
Logic-RL: Unleashing LLM Reasoning with Rule-Based Reinforcement Learning
2025
Cited
0
Resume (English only)
Co-authors
6 total
Tian Xie
Microsoft Research
Haoming Luo
Renmin University of China / University of Science and Technology of China
Zitian Gao
Ubiquant
Kai Qiu
Microsoft Research
Chong Luo
Microsoft Research
Yuqian Hong
University of Science and Technology of China
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up