🤖 AI Summary
This paper addresses the online dynamic replication problem for data objects in distributed storage systems, aiming to minimize the total cost of storage and network transmission while serving real-time access requests across geographically distributed servers. To address flaws in prior competitive-ratio analyses, we construct a counterexample invalidating existing theoretical guarantees and rigorously prove that the competitive ratio lower bound for any deterministic online algorithm is 2. Building on this, we propose a novel online algorithm achieving a tight competitive ratio of max{2, min{γ, 3}}, where γ captures the relative cost ratio between storage and transmission. Our approach integrates online algorithm design, competitive analysis theory, and empirical evaluation on real-world access traces. Both theoretical optimality—attaining the proven lower bound—and extensive experiments validate the algorithm’s effectiveness and practicality.
📝 Abstract
This paper studies an online cost optimization problem for distributed storage and access. The goal is to dynamically create and delete copies of data objects over time at geo-distributed servers to serve access requests and minimize the total storage and network cost. We revisit a recent algorithm in the literature and show that it does not have a competitive ratio of $2$ as claimed by constructing a counterexample. We further prove that no deterministic online algorithm can achieve a competitive ratio bounded by $2$ for the general cost optimization problem. We develop an online algorithm and prove that it achieves a competitive ratio of $max{2, min{γ, 3}}$, where $γ$ is the max/min storage cost ratio among all servers. Examples are given to confirm the tightness of competitive analysis. We also empirically evaluate algorithms using real object access traces.