🤖 AI Summary
To address inaccurate latent variable inversion in diffusion-based steganography—leading to failed secret information extraction—this paper proposes RF-Stego, a high-fidelity steganographic framework leveraging path consistency and Rectified Flow (RF). Methodologically, we introduce PCLI (Path-Consistent Linear Inversion), a linear inversion process that rigorously aligns forward generation and backward reconstruction trajectories in latent space. We theoretically prove that Rectified Flow guarantees invertibility and numerical stability in the latent space, and design a lightweight RF sampler for efficient embedding and extraction. Experiments demonstrate that RF-Stego consistently outperforms state-of-the-art methods across all key metrics: extraction accuracy, image fidelity (FID reduced by up to 28%), robustness against noise, security against steganalysis, and generation efficiency (50% fewer sampling steps).
📝 Abstract
Steganography based on diffusion models has attracted increasing attention due to its ability to generate high-quality images and exhibit strong robustness. In such approaches, the secret message is first embedded into the initial latent variable, and then the stego image is generated through the forward process. To extract the message, an inversion process is required to reconstruct the latent variables from the received image. However, inaccurate latent inversion leads to significant discrepancies between the reconstructed and original latent variables, rendering message extraction infeasible. To address this issue, we propose extbf{RF-Stego}, a novel generative image steganography method that enables accurate latent inversion and significantly improves extraction accuracy. First, we develop the extbf{P}ath extbf{C}onsistency extbf{L}inear extbf{I}nversion ( extbf{PCLI}), which imposes formal constraints on the inversion process. By explicitly aligning it with the forward generation path and modeling both directions along a shared linear path, PCLI eliminates path mismatch and ensures path consistency throughout the steganographic process. Second, through rigorous theoretical proof, we demonstrate that extbf{R}ectified extbf{F}low extbf{(RF)} offers both theoretical reversibility and numerical stability in the inversion process. Based on this, we replace traditional unstable samplers with RF sampler which effectively improves the numerical precision of the inversion process. Experimental results show RF-Stego outperforms state-of-the-art methods in terms of extraction accuracy, image quality, robustness, security and generation efficiency.