Adaptive Semantic Token Communication for Transformer-based Edge Inference

📅 2025-05-23
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Target-oriented semantic communication for resource-constrained edge devices demands efficient, robust, task-aware transmission under dynamic bandwidth and adverse channel conditions. To address this, we propose a dynamically configurable Transformer-driven deep joint source-channel coding (DJSCC) framework. Our approach introduces two key innovations: (i) a semantic token adaptive selection mechanism that identifies task-critical tokens, and (ii) a Lyapunov-based stochastic optimization framework enabling dual-dimensional resource control—jointly optimizing token count and embedding dimension. By performing end-to-end semantic modeling and compression, the framework significantly enhances downstream task performance. Specifically, in low-bandwidth (≤0.1 bits per pixel) and high-noise (SNR ≤ 5 dB) regimes, it achieves an average 12.6% improvement in mean Average Precision (mAP) over state-of-the-art methods for object detection, while reducing inference latency by 37%.

Technology Category

Application Category

📝 Abstract
This paper presents an adaptive framework for edge inference based on a dynamically configurable transformer-powered deep joint source channel coding (DJSCC) architecture. Motivated by a practical scenario where a resource constrained edge device engages in goal oriented semantic communication, such as selectively transmitting essential features for object detection to an edge server, our approach enables efficient task aware data transmission under varying bandwidth and channel conditions. To achieve this, input data is tokenized into compact high level semantic representations, refined by a transformer, and transmitted over noisy wireless channels. As part of the DJSCC pipeline, we employ a semantic token selection mechanism that adaptively compresses informative features into a user specified number of tokens per sample. These tokens are then further compressed through the JSCC module, enabling a flexible token communication strategy that adjusts both the number of transmitted tokens and their embedding dimensions. We incorporate a resource allocation algorithm based on Lyapunov stochastic optimization to enhance robustness under dynamic network conditions, effectively balancing compression efficiency and task performance. Experimental results demonstrate that our system consistently outperforms existing baselines, highlighting its potential as a strong foundation for AI native semantic communication in edge intelligence applications.
Problem

Research questions and friction points this paper is trying to address.

Efficient task-aware data transmission for edge inference
Adaptive semantic token compression under varying bandwidth
Robust resource allocation for dynamic network conditions
Innovation

Methods, ideas, or system contributions that make the work stand out.

Dynamic transformer-powered DJSCC architecture
Adaptive semantic token selection mechanism
Lyapunov-based resource allocation algorithm
🔎 Similar Papers
No similar papers found.
Alessio Devoto
Alessio Devoto
PhD Student, La Sapienza, University of Rome
Efficient LLMsInterpretable AIGraph Neural Networks
M
M. Merluzzi
CEA-Leti, Université Grenoble Alpes, Grenoble, France
P
P. Lorenzo
Department of Information Engineering, Electronics, and Telecommunications (DIET), Sapienza University of Rome, Italy
S
S. Scardapane
Department of Information Engineering, Electronics, and Telecommunications (DIET), Sapienza University of Rome, Italy