🤖 AI Summary
Traditional cloud architectures struggle to process the massive, high-velocity data generated by AI workloads: storage, computation, and data-migration performance all suffer, and the architectures adapt poorly to heterogeneous devices and continually evolving AI models. To address this, we propose an active-storage-enhanced AI task-offloading architecture for the compute continuum, which integrates active storage into an edge–cloud collaborative framework so that AI inference tasks can execute close to where their data resides and be scheduled dynamically. By combining active storage, computational offloading, and distributed AI inference, the approach moves computation to the data, sidestepping the fundamental bottleneck of the conventional "move data to compute" paradigm. Experimental results show that the architecture significantly reduces inference latency and network-bandwidth consumption while improving system throughput and resource utilization, jointly optimizing energy efficiency and performance across diverse application scenarios.
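To make the proximity-based scheduling idea concrete, here is a minimal sketch that is not taken from the paper: the names (`Node`, `InferenceTask`, `estimated_latency`, `schedule`) and all numbers are illustrative assumptions. It models the core trade-off behind "move compute to data": for each candidate node, estimate end-to-end latency as data-transfer time plus compute time, and keep the task at the active storage node whenever shipping the data would cost more than a remote node's extra compute saves.

```python
from dataclasses import dataclass

@dataclass
class Node:
    name: str
    compute_tops: float   # available compute (TOPS)
    link_mbps: float      # bandwidth from the data's storage node (Mb/s)
    holds_data: bool      # True if this node's active storage already holds the input

@dataclass
class InferenceTask:
    input_mb: float       # input data to move if the task is offloaded (MB)
    compute_gops: float   # estimated cost of one inference (GOPs)

def estimated_latency(task: InferenceTask, node: Node) -> float:
    """Rough end-to-end latency: transfer time (skipped when compute runs
    where the data already lives) plus compute time on the node."""
    transfer_s = 0.0 if node.holds_data else (task.input_mb * 8) / node.link_mbps
    compute_s = task.compute_gops / (node.compute_tops * 1000)  # TOPS -> GOPS
    return transfer_s + compute_s

def schedule(task: InferenceTask, nodes: list[Node]) -> Node:
    """Proximity-aware placement: pick the lowest estimated latency, so
    in-storage execution wins unless offloading's compute gain outweighs
    the cost of moving the data."""
    return min(nodes, key=lambda n: estimated_latency(task, n))

if __name__ == "__main__":
    nodes = [
        Node("active-storage", compute_tops=2.0,   link_mbps=0.0,    holds_data=True),
        Node("edge-server",    compute_tops=30.0,  link_mbps=1000.0, holds_data=False),
        Node("cloud",          compute_tops=200.0, link_mbps=100.0,  holds_data=False),
    ]
    data_heavy = InferenceTask(input_mb=500.0, compute_gops=50.0)
    compute_heavy = InferenceTask(input_mb=10.0, compute_gops=5000.0)
    print(schedule(data_heavy, nodes).name)     # active-storage: moving 500 MB costs more than local compute
    print(schedule(compute_heavy, nodes).name)  # edge-server: compute dominates, so offloading pays off
```

Under these assumed numbers, the data-heavy task stays at the active storage node while the compute-heavy one is offloaded to the edge, which is the qualitative behavior the summary attributes to the architecture; a real scheduler would also fold in energy, queueing, and model-placement costs.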