Resume (English only)
Academic Achievements
- Published over 30 papers in top international AI journals and conferences, including TASLP, TMM, ACM MM, ICASSP, and INTERSPEECH
- 2025, Best Paper Award at APSIPA for 'Voice Conversion Augmentation for Speaker Recognition on Defective Datasets'
- Other notable publications include 'Interpolating Speaker Identities in Embedding Space for Data Expansion' and 'Audio-Visual Target Speaker Extraction with Reverse Selective Auditory Attention'
Research Experience
- 2024.12 - Present, Audio AI Engineer, Zoom, Singapore
- 2023.08 - 2024.11, Research Fellow, National University of Singapore (NUS), Singapore
Education
- 2019.08 - 2023.09, Ph.D. in Speech Processing and Computer Vision, National University of Singapore (NUS), supervised by Prof. Li Haizhou
- 2018.08 - 2019.06, M.Sc. in Electronic and Computer Engineering, National University of Singapore (NUS)
- 2014.09 - 2018.06, B.Eng. in Electronic Engineering, Soochow University, Suzhou, China
Background
- Research Interests: Speech processing (enhancement, extraction, and separation); speaker processing (recognition, diarization, active speaker detection, and anti-spoofing); multi-modal speech processing (active speaker detection and cross-modal speaker recognition); self-supervised learning
- Professional Field: Speech Processing and Computer Vision
- Brief Introduction: Currently an Audio AI Engineer at Zoom, Singapore. Previously a Research Fellow at the National University of Singapore (NUS) from 2023 to 2024.