Enhancing Wireless Networks for IoT with Large Vision Models: Foundations and Applications

📅 2025-08-01
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the weak generalization capability and poor task adaptability of vision-assisted wireless optimization in Low-Altitude Economic Networks (LAENets), this paper proposes a Large Vision Model (LVM)-integrated framework tailored for IoT wireless systems. Methodologically, we design a progressive multi-task fine-tuning mechanism enabling cross-layer co-optimization—spanning the physical layer (joint beamforming and localization), network layer, and application layer—while unifying classification, segmentation, generation, and multimodal understanding capabilities. Our key contribution lies in overcoming the limitations of conventional CNNs by enabling continual adaptation and lightweight deployment of LVMs under resource-constrained wireless conditions. Experiments in UAV-enabled IoT scenarios demonstrate that our approach significantly outperforms baseline CNNs in both joint optimization performance and cross-scenario generalization, thereby validating the technical feasibility and practical advantages of leveraging LVMs for intelligent wireless systems.

Technology Category

Application Category

📝 Abstract
Large vision models (LVMs) have emerged as a foundational paradigm in visual intelligence, achieving state-of-the-art performance across diverse visual tasks. Recent advances in LVMs have facilitated their integration into Internet of Things (IoT) scenarios, offering superior generalization and adaptability for vision-assisted network optimization. In this paper, we first investigate the functionalities and core architectures of LVMs, highlighting their capabilities across classification, segmentation, generation, and multimodal visual processing. We then explore a variety of LVM applications in wireless communications, covering representative tasks across the physical layer, network layer, and application layer. Furthermore, given the substantial model size of LVMs and the challenges of model retraining in wireless domains, we propose a progressive fine-tuning framework that incrementally adapts pretrained LVMs for joint optimization of multiple IoT tasks. A case study in low-altitude economy networks (LAENets) demonstrates the effectiveness of the proposed framework over conventional CNNs in joint beamforming and positioning tasks for Internet of drones, underscoring a promising direction for integrating LVMs into intelligent wireless systems.
Problem

Research questions and friction points this paper is trying to address.

Integrating large vision models into IoT for network optimization
Adapting pretrained LVMs for multiple IoT tasks efficiently
Improving joint beamforming and positioning in drone networks
Innovation

Methods, ideas, or system contributions that make the work stand out.

Integrating LVMs for IoT network optimization
Progressive fine-tuning for multi-task adaptation
Joint beamforming and positioning in drone networks
🔎 Similar Papers
No similar papers found.
Y
Yunting Xu
College of Computing and Data Science, Nanyang Technological University, Singapore, 639798
Jiacheng Wang
Jiacheng Wang
Nanyang Technological University
ISACGenAILow-altitude wireless networkSemantic Communications
Ruichen Zhang
Ruichen Zhang
Nanyang Technological University
Next-generation NetworkingEdge IntelligenceAgentic AIReinforcement learningLLM
D
Dusit Niyato
College of Computing and Data Science, Nanyang Technological University, Singapore, 639798
Deepu Rajan
Deepu Rajan
Nanyang Technological University
Image ProcessingComputer Vision
L
Liang Yu
Alibaba Cloud, China
H
Haibo Zhou
School of Electronic Science and Engineering, Nanjing University, Nanjing, China, 210023
A
Abbas Jamalipour
The University of Sydney, Sydney NSW 2006, Australia
X
Xianbin Wang
Department of Electrical and Computer Engineering, Western University, Canada