Guan Tongkun
Scholar

Guan Tongkun

Google Scholar ID: rTG1yTQAAAAJ
Shanghai Jiao Tong University
MLLMComputer VisionText Spotting
Citations & Impact
All-time
Citations
168
 
H-index
6
 
i10-index
4
 
Publications
11
 
Co-authors
3
list available
Publications
11 items
Browse publications on Google Scholar (top-right) ↗
Resume (English only)
Academic Achievements
  • Publications: One research paper about Multimodal Large Language Model is accepted by ICCV2025; one research paper about Multimodal Large Language Model is accepted by CVPR'25; one research paper about Self-supervised Text Recognition is accepted by TPAMI'25; two research papers about text detection and formula recognition are accepted by ECCV'24; one research paper about Self-supervised Text Recognition is accepted by ICCV'23; one research paper about Scene Text Recognition is accepted by CVPR'23; one research paper about Industrial Text Detection is accepted by TCSVT'22. Awards: National Scholarship for PhD, 2025-9; Wu Honor Class, 2024-3; Outstanding Graduate Student in Shanghai Jiao Tong University, 2022-12; National Scholarship for graduate student, 2022-9; Outstanding Undergraduate Student of Hunan Province, 2020-6; National Scholarship for Undergraduate Student, 2019-9 (1/259); National Scholarship for Undergraduate Student, 2018-9 (1/259); Pacemaker to Merit Student, 2018-9; Top 10 Outstanding Students of the College, 2019; Provincial Second Prize of National Electronic Design Competition, 2019; Second Prize of Advanced Mathematics Competition, 2018; National Encouragement Scholarship for Undergraduate Student, 2017-9.
Research Experience
  • 2024/09 - 2025/05, internship at Meituan, obtained technology application certification about MLLM; 2025/09 - Present, working with QwenVL Team, Tongyi Lab, Alibaba Group.
Education
  • 2023/04 - Present, pursuing a PhD degree at the Artificial Intelligence Institute, Department of Computer Science and Engineering, Shanghai Jiao Tong University, supervised by Prof. Xiaokang Yang and Prof. Wei Shen; 2020/09 - 2023/03, received an M.S. degree in the Department of Automation from Shanghai Jiao Tong University, with the National Scholarship; 2016/09 - 2020/06, received a B.S. degree in Electrical Engineering and Automation (double first-class discipline) from Hunan University, ranked first (1/259) with a GPA of 4.25/4.5 in all core courses.
Background
  • Research interests include Multimodal Large Language Model, Text to Image, Representation Learning, and End-to-end Text Spotting.
Miscellany
  • Journal Services: IEEE Transactions on Pattern Analysis and Machine Intelligence (IEEE TPAMI), IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), Pattern Recognition (PR). Conference Services: NeurIPS, CVPR, ICCV, ICLR, ECCV, WACV, PRCV, ACCV, ICDAR.