Scholar
Yufeng Zhong
Google Scholar ID: BUJNSJYAAAAJ
Meituan
Multimodal LLM
Computer Vision
Citations & Impact (all-time)
Citations: 58
H-index: 4
i10-index: 2
Publications: 13
Co-authors: 2 (list available)
Contact
No contact links provided.
Publications
12 items
OCRVerse: Towards Holistic OCR in End-to-End Vision-Language Models (2026). Cited: 1
MobileDreamer: Generative Sketch World Model for GUI Agent (arXiv.org, 2026). Cited: 2
VinciCoder: Unifying Multimodal Code Generation via Coarse-to-fine Visual Reinforcement Learning (2025). Cited: 0
OmniActor: A Generalist GUI and Embodied Agent for 2D&3D Worlds (2025). Cited: 0
UItron: Foundational GUI Agent with Advanced Perception and Planning (2025). Cited: 0
Breaking the SFT Plateau: Multimodal Structured Reinforcement Learning for Chart-to-Code Generation (2025). Cited: 0
DocTron-Formula: Generalized Formula Recognition in Complex and Structured Scenarios (2025). Cited: 0
Chart-R1: Chain-of-Thought Supervision and Reinforcement for Advanced Chart Reasoner (2025). Cited: 0
Co-authors
2 total
Lin Ma (Meituan)
Long Xu (Ningbo University, Peng Cheng Laboratory)