JUNTAO ZHAO
Scholar

JUNTAO ZHAO

Google Scholar ID: QyxnGj0AAAAJ
Ph.D. of Computer Science, University of Hong Kong
Machine Learning SystemQuantizationBlockchain
Citations & Impact
All-time
Citations
79
 
H-index
3
 
i10-index
2
 
Publications
12
 
Co-authors
0
 
Publications
12 items
Browse publications on Google Scholar (top-right) ↗
Resume (English only)
Academic Achievements
  • 1. Efficient LLM Serving on Hybrid Real-time and Best-effort Requests (2025).
  • 2. OVERLORD: Ultimate Scaling of DataLoader for Multi-Source Large Foundation Model Training (2025).
  • 3. Cdmpp: A device-model agnostic framework for latency prediction of tensor programs (2024).
  • 4. Llm-pq: Serving llm on heterogeneous clusters with phase-aware partition and adaptive quantization (2024).
  • 5. POSTER: LLM-PQ: Serving LLM on Heterogeneous Clusters with Phase-Aware Partition and Adaptive Quantization (2024).
Research Experience
  • Open for full-time roles, especially in LLM inference optimization (kernel development, custom chip design, quantization, distributed inference). Also interested in game development and blockchain, with hands-on project experience in V/AR and game projects, and papers on blockchain.
Education
  • PhD in Machine Learning System, University of Hong Kong; BSc in Computer Science and Technology, The Chinese University of Hong Kong (Shenzhen)
Background
  • Research Interests: Machine Learning System, Games, Blockchain. Professional Field: Focuses on efficient inference and training of large foundation models, including vision-language models and large language models, as well as quantization techniques. Bio: PhD Candidate at the University of Hong Kong.
Miscellany
  • Personal Interests: Games and Blockchain
Co-authors
0 total
Co-authors: 0 (list not available)