Scholar
Peiwen Sun
Google Scholar ID: z7qS03sAAAAJ
Multimedia lab, The Chinese University of Hong Kong
multimodal learning
Follow
Google Scholar
↗
Citations & Impact
All-time
Citations
162
H-index
7
i10-index
7
Publications
11
Co-authors
0
Contact
No contact links provided.
Publications
11 items
Which Speech Representation Better Matches Text-Native Reasoning? A Study of Speech-Text Alignment on Frame Rate and Representation
2026
Cited
0
LongSpace: Exploring Long-Horizon Spatial Memory from Perception to Recall in Video
2026
Cited
0
X-Stream: Exploring MLLMs as Multiplexers for Multi-Stream Understanding
2026
Cited
0
Talker-T2AV: Joint Talking Audio-Video Generation with Autoregressive Diffusion Modeling
2026
Cited
0
AURA: Always-On Understanding and Real-Time Assistance via Video Streams
2026
Cited
0
PhoStream: Benchmarking Real-World Streaming for Omnimodal Assistants in Mobile Scenarios
2026
Cited
0
OneThinker: All-in-one Reasoning Model for Image and Video
2025
Cited
0
PrismAudio: Decomposed Chain-of-Thoughts and Multi-dimensional Rewards for Video-to-Audio Generation
2025
Cited
0
Load more
Resume (English only)
Co-authors
0 total
Co-authors: 0 (list not available)
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up