Scholar
Ziyang Ma
Google Scholar ID: 4RZnXGMAAAAJ
Shanghai Jiao Tong University
Speech and Language Processing
Textless NLP
Self-supervised Learning
Multimedia
Citations & Impact
All-time
Citations: 1,894
H-index: 24
i10-index: 36
Publications: 20
Co-authors: 30 (list available)
Contact
No contact links provided.
Publications
11 items
Speech Meets ELF: Audio Conditional Continuous-Target Diffusion for Speech Recognition and Translation (2026). Cited: 0
MMAE: A Massive Multitask Audio Editing Benchmark (2026). Cited: 0
Audio-Oscar: A Multi-Agent System for Complex Audio Scene Generation, Orchestration, and Refinement (2026). Cited: 0
WavTTS: Towards High-Quality Zero-Shot TTS via Direct Raw Waveform Modeling (2026). Cited: 0
WavCube: Unifying Speech Representation for Understanding and Generation via Semantic-Acoustic Joint Modeling (2026). Cited: 0
NVBench: A Benchmark for Speech Synthesis with Non-Verbal Vocalizations (2026). Cited: 0
FineLAP: Taming Heterogeneous Supervision for Fine-grained Language-Audio Pretraining (2026). Cited: 0
Resonate: Reinforcing Text-to-Audio Generation via Online Feedback from Large Audio Language Models (2026). Cited: 0
Co-authors
30 total
Xie Chen
Shanghai Jiao Tong University <- Microsoft <- Cambridge University
ShiLiang Zhang
Unknown affiliation
Kai Yu(俞凯)
Shanghai Jiao Tong University
Zhisheng Zheng
The University of Texas at Austin
Yifan Yang
Shanghai Jiao Tong University, Tencent, Microsoft, Xiaomi
gao zhifu
Tongyi Lab, Alibaba Group
Guanrou Yang
Shanghai Jiao Tong University
Zhihao Du
Alibaba