Scholar
Penghui Qi
Google Scholar ID: CLRsGEMAAAAJ
Sea AI Lab & PhD student of NUS
Machine Learning
Reinforcement Learning
MLSys
Follow
Google Scholar
↗
Citations & Impact
All-time
Citations
526
H-index
6
i10-index
4
Publications
9
Co-authors
8
list available
Contact
No contact links provided.
Publications
10 items
Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models
2026
Cited
0
Rethinking the Divergence Regularization in LLM RL
2026
Cited
0
Rethinking the Trust Region in LLM Reinforcement Learning
2026
Cited
1
Revisiting Parameter Server in LLM Post-Training
2026
Cited
1
Defeating the Training-Inference Mismatch via FP16
2025
Cited
0
SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning
2025
Cited
0
Optimizing Anytime Reasoning via Budget Relative Policy Optimization
2025
Cited
0
Understanding R1-Zero-Like Training: A Critical Perspective
2025
Cited
0
Load more
Resume (English only)
Co-authors
8 total
Min Lin
Principal Research Scientist, Sea AI Lab
Zichen Liu
Sea AI Lab; National University of Singapore
Tianyu Pang
Senior Research Scientist, Sea AI Lab
Chao Du
Senior Research Scientist, Sea AI Lab
Wee Sun Lee
Professor, Department of Computer Science, National University of Singapore
Xinyi Wan
Sea AI Lab
Junxiao Song
DeepSeek AI
Co-author 8
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up