TQCodec: Towards neural audio codec for high-fidelity music streaming

πŸ“… 2026-03-02
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF

career value

222K/year
πŸ€– AI Summary
This work addresses the limitations of existing neural audio codecs, which primarily target ultra-low bitrates (≀16 kbps) and fail to meet the high-fidelity requirements of music streaming at 32–128 kbps. To bridge this gap, we propose TQCodec, a lightweight, asymmetric codec based on the SEANet architecture that supports 44.1 kHz sampling. TQCodec incorporates SimVQ vector quantization to preserve mid-frequency details, a phase-aware waveform loss function to enhance reconstruction accuracy, and an auditory-perception-driven bit allocation strategy that prioritizes perceptually critical low-frequency components. Experimental results across multiple music datasets demonstrate that TQCodec significantly outperforms current methods within the target bitrate range, achieving high-quality audio reconstruction suitable for high-fidelity streaming applications.

Technology Category

Application Category

πŸ“ Abstract
We propose TQCodec, a neural audio codec designed for high-bitrate, high-fidelity music streaming. Unlike existing neural codecs that primarily target ultra-low bitrates (<= 16kbps), TQCodec operates at 44.1 kHz and supports bitrates from 32 kbps to 128 kbps, aligning with the standard quality of modern music streaming platforms. The model adopts an encoder-decoder architecture based on SEANet for efficient on-device computation and introduces several enhancements: an imbalanced network design for improved quality with low overhead, SimVQ for mid-frequency detail preservation, and a phase-aware waveform loss. Additionally, we introduce a perception-driven band-wise bit allocation strategy to prioritize perceptually critical lower frequencies. Evaluations on diverse music datasets demonstrate that TQCodec achieves superior audio quality at target bitrates, making it well-suited for high-quality audio applications.
Problem

Research questions and friction points this paper is trying to address.

neural audio codec
high-fidelity music streaming
bitrate
audio quality
music compression
Innovation

Methods, ideas, or system contributions that make the work stand out.

neural audio codec
high-fidelity music streaming
perception-driven bit allocation
SimVQ
phase-aware waveform loss
πŸ”Ž Similar Papers
2024-09-26IEEE International Conference on Acoustics, Speech, and Signal ProcessingCitations: 0
L
Lixing He
Tencent Music Entertainment, The Chinese University of Hong Kong
Z
Zhouxuan Chen
Tencent Music Entertainment
M
Mingshuai Liu
Tencent Music Entertainment
X
Xinran Sun
Tencent Music Entertainment, Southeast University
W
Wucheng Wang
Tencent Music Entertainment
M
Minfu Li
Tencent Music Entertainment, Tsinghua University
L
Lingcheng Kong
Tencent Music Entertainment
W
Weifeng Zhao
Tencent Music Entertainment
Wenjiang Zhou
Wenjiang Zhou
Peking University, HUST
AI for scienceAtomistic simulationsSuper-Planckian far-field