STCC: A Unified Source-Channel Semantic Token Coding Framework for Semantic Communications

📅 2026-06-10
🤖 AI Summary
This work addresses the incompatibility of existing deep joint source-channel coding methods with discrete semantic tokens from foundation models and the susceptibility of fixed constellations to catastrophic errors under channel noise. To overcome these limitations, the paper proposes Semantic Token Channel Coding (STCC), a unified joint source-channel coding framework tailored for discrete semantic tokens. STCC employs a residual MLP encoder and a triple loss function to align the semantic embedding space with channel topology, thereby learning geometrically structured constellations that transform noise perturbations into semantically or structurally plausible deviations. Experimental results demonstrate that STCC significantly outperforms conventional approaches at low signal-to-noise ratios while enhancing semantic robustness without requiring modifications to the receiver.
📝 Abstract
Deep Joint Source-Channel Coding (JSCC) has emerged as a promising paradigm for overcoming the "cliff effect" in wireless communications. However, existing Deep JSCC frameworks operate directly on raw analog data such as image pixels rather than the discrete semantic tokens that foundation models require. Moreover, traditional systems employ fixed, hand-designed constellations that treat all tokens equally, leading to catastrophic random errors under channel noise. In this paper, the Semantic Token Codebook Communication (STCC) is proposed as a unified source-channel semantic token coding framework designed to transmit the discrete semantic tokens of foundation models over noisy channels. The core of STCC is the Semantic Token Codec (STC). It accepts discrete tokens as input, which maintains compatibility with foundation models, and employs a residual multilayer perceptron (MLP)-based encoder that learns geometrically structured constellations optimized with a triple-loss objective. This learned mapping forces the channel topology to align with the semantic embedding space, ensuring that channel noise results in topological errors rather than random corruption. This phenomenon is theoretically and empirically characterized, identifying "Semantic Drift" in symbolic modalities and "Structural Distortion" in perceptual modalities, where errors shift predictions to semantically or structurally similar tokens. Extensive experiments demonstrate that STCC significantly outperforms traditional systems in low-SNR regimes, effectively converting channel noise into semantic variations without requiring receiver-side modification.
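The idea that an embedding-aligned constellation turns channel noise into "topological" errors can be illustrated with a small toy simulation. The sketch below is not the paper's STC: it replaces the learned residual-MLP encoder and triple-loss training with a hand-built constellation, and every name in it (`awgn`, `nearest_token`, `mean_semantic_error`, the 8x8 grid of 2-D "embeddings", the 5 dB SNR) is a hypothetical choice made for illustration. It compares a constellation that copies the embedding geometry against the same points with tokens assigned arbitrarily, and measures how far decoded tokens land from the sent ones in embedding space.

```python
import numpy as np

def awgn(symbols, snr_db, rng):
    """Add white Gaussian noise at the given SNR (dB) relative to mean symbol power."""
    power = np.mean(np.sum(symbols ** 2, axis=-1))
    noise_var = power / (10 ** (snr_db / 10))
    # Split the total noise variance evenly across the signal dimensions.
    sigma = np.sqrt(noise_var / symbols.shape[-1])
    return symbols + rng.normal(0.0, sigma, size=symbols.shape)

def nearest_token(received, constellation):
    """Decode each received point to the index of the closest constellation point."""
    dists = np.linalg.norm(received[:, None, :] - constellation[None, :, :], axis=-1)
    return np.argmin(dists, axis=1)

def mean_semantic_error(sent, decoded, embeddings):
    """Average embedding-space distance between sent and decoded tokens."""
    return float(np.mean(np.linalg.norm(embeddings[sent] - embeddings[decoded], axis=-1)))

rng = np.random.default_rng(0)
num_tokens = 64

# Assumption: toy 2-D token embeddings laid out on a centered 8x8 grid.
axes = np.meshgrid(np.arange(8), np.arange(8), indexing="ij")
grid = np.stack(axes, axis=-1).reshape(-1, 2).astype(float)
embeddings = grid - grid.mean(axis=0)

# "Aligned": neighbouring constellation points carry neighbouring tokens.
aligned = embeddings.copy()
# "Shuffled": identical points, but tokens are assigned to them arbitrarily.
shuffled = embeddings[rng.permutation(num_tokens)]

sent = rng.integers(0, num_tokens, size=5000)
results = {}
for name, constellation in [("aligned", aligned), ("shuffled", shuffled)]:
    received = awgn(constellation[sent], snr_db=5.0, rng=rng)
    decoded = nearest_token(received, constellation)
    results[name] = mean_semantic_error(sent, decoded, embeddings)

print(results)
```

Both constellations use the same set of points, so their raw token error rates are statistically similar; what differs is which token a noisy symbol is mistaken for. With the aligned mapping a wrong decision usually lands on an embedding-space neighbour, whereas the shuffled mapping sends it to an unrelated token.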
Problem

Research questions and friction points this paper is trying to address.

Semantic Communications
Deep Joint Source-Channel Coding
Discrete Semantic Tokens
Channel Noise
Foundation Models
Innovation

Methods, ideas, or system contributions that make the work stand out.

Semantic Token Coding
Deep Joint Source-Channel Coding
Learned Constellations
Semantic Drift
Foundation Models
Zhicheng Bao
State Key Laboratory of Networking and Switching Technology, Beijing University of Posts and Telecommunications, Beijing 100876, China; Department of Broadband Communication, Peng Cheng Laboratory, Shenzhen 518000, China
Chen Dong
Beijing University of Posts and Telecommunications
wireless communications, semantic, applied math
Sen Wang
China Mobile Research Institute, Beijing, China
Long Liu
Professor, School of Biotechnology, Jiangnan University
Metabolic engineering, Synthetic Biology, Systems Biology
Nan Ma
Beijing University of Posts and Telecommunications
Hao Chen
Department of Broadband Communication, Peng Cheng Laboratory, Shenzhen 518000, China
Xiaodong Xu
Department of Physics, Department of MSE, University of Washington Seattle
Condensed matter physics, nanoelectronics, nano photonics, nano optoelectronics
Yinqiu Liu
College of Computing and Data Science, Nanyang Technological University, Singapore
Ping Zhang
Beijing University of Posts and Telecommunications
next-generation mobile networks, semantic communications, intellicise communication system