🤖 AI Summary
Link failures in torus networks cause control-plane synchronization packet loss, degrading communication reliability in large-scale distributed systems. Method: This paper proposes an asynchronous local forwarding mechanism leveraging topological rotation and reflection symmetry—establishing, for the first time, a theoretical link between topological symmetry and communication resilience. The approach requires no protocol modifications, header overhead, or global coordination, relying solely on local geometric information to guarantee deterministic reachability. Using topological potential field gradient modeling and percolation theory, we design two strategies—Reverse-Flow with Central Forwarding (RF-CF) and Reverse-Flow with Lateral Forwarding (RF-LF)—and introduce a backflow-induction mechanism to enhance robustness under failures. Contribution/Results: Experiments show a 17.5% reduction in end-to-end packet loss under 1% link failure rate; RF-LF alone accounts for 28% of successful transmissions. The method significantly improves communication reliability over unreliable links in systems such as satellite constellations and HPC clusters.
📝 Abstract
The proliferation of large-scale distributed systems, such as satellite constellations and high-performance computing clusters, demands robust communication primitives that maintain coordination under unreliable links. The torus topology, with its inherent rotational and reflection symmetries, is a prevalent architecture in these domains. However, conventional routing schemes suffer from substantial packet loss during control-plane synchronization after link failures. This paper introduces a symmetry-driven asynchronous forwarding mechanism that leverages the torus's geometric properties to achieve reliable packet delivery without control-plane coordination. We model packet flow using a topological potential gradient and demonstrate that symmetry-breaking failures naturally induce a reverse flow, which we harness for fault circumvention. We propose two local forwarding strategies, Reverse Flow with Counter-facing Priority (RF-CF) and Lateral-facing Priority (RF-LF), that guarantee reachability to the destination via forward-flow phase transition points, without protocol modifications or additional in-packet overhead. Through percolation analysis and packet-level simulations on a 16 x 16 torus, we show that our mechanism reduces packet loss by up to 17.5% under a 1% link failure rate, with the RF-LF strategy contributing to 28% of successfully delivered packets. This work establishes a foundational link between topological symmetry and communication resilience, providing a lightweight, protocol-agnostic substrate for enhancing distributed systems.