Modular Architecture for High-Performance and Low Overhead Data Transfers

📅 2025-08-07
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address low efficiency and poor stability in large-scale data transfers within geo-distributed environments, this paper proposes AutoMDT, a modular data transfer architecture. AutoMDT decouples I/O and network tasks and introduces, for the first time, a Proximal Policy Optimization (PPO)-based deep reinforcement learning agent to jointly optimize concurrency levels across read, network, and write stages. A lightweight network-system co-simulator enables offline training, ensuring rapid convergence and dynamic adaptability to heterogeneous and time-varying network conditions. Evaluated on a production-grade platform, AutoMDT reduces average transfer completion time by 68% compared to state-of-the-art baselines, accelerates convergence by up to 8×, and significantly improves resource utilization and transfer robustness under network fluctuations and workload variability.

Technology Category

Application Category

📝 Abstract
High-performance applications necessitate rapid and dependable transfer of massive datasets across geographically dispersed locations. Traditional file transfer tools often suffer from resource underutilization and instability because of fixed configurations or monolithic optimization methods. We propose AutoMDT, a novel modular data transfer architecture that employs a deep reinforcement learning based agent to simultaneously optimize concurrency levels for read, network, and write operations. Our solution incorporates a lightweight network-system simulator, enabling offline training of a Proximal Policy Optimization (PPO) agent in approximately 45 minutes on average, thereby overcoming the impracticality of lengthy online training in production networks. AutoMDT's modular design decouples I/O and network tasks, allowing the agent to capture complex buffer dynamics precisely and to adapt quickly to changing system and network conditions. Evaluations on production-grade testbeds show that AutoMDT achieves up to 8x faster convergence and a 68% reduction in transfer completion times compared with state-of-the-art solutions.
Problem

Research questions and friction points this paper is trying to address.

Optimizing high-performance data transfers for large datasets
Overcoming resource underutilization in traditional transfer tools
Adapting dynamically to changing system and network conditions
Innovation

Methods, ideas, or system contributions that make the work stand out.

Modular architecture decouples I/O and network tasks
Deep reinforcement learning optimizes transfer operations
Lightweight simulator enables fast offline PPO training
🔎 Similar Papers
No similar papers found.