Scholar

Peiwen Sun

Google Scholar ID: z7qS03sAAAAJ

Multimedia Lab, The Chinese University of Hong Kong

multimodal learning

Citations & Impact

All-time

Citations: 162
H-index: 7
i10-index: 7
Publications: 11
Co-authors: 0

Publications

11 items

Which Speech Representation Better Matches Text-Native Reasoning? A Study of Speech-Text Alignment on Frame Rate and Representation (2026). Cited: 0

LongSpace: Exploring Long-Horizon Spatial Memory from Perception to Recall in Video (2026). Cited: 0

X-Stream: Exploring MLLMs as Multiplexers for Multi-Stream Understanding (2026). Cited: 0

Talker-T2AV: Joint Talking Audio-Video Generation with Autoregressive Diffusion Modeling (2026). Cited: 0

AURA: Always-On Understanding and Real-Time Assistance via Video Streams (2026). Cited: 0

PhoStream: Benchmarking Real-World Streaming for Omnimodal Assistants in Mobile Scenarios (2026). Cited: 0

OneThinker: All-in-one Reasoning Model for Image and Video (2025). Cited: 0

PrismAudio: Decomposed Chain-of-Thoughts and Multi-dimensional Rewards for Video-to-Audio Generation (2025). Cited: 0

Co-authors: 0 (list not available)