Selection Integrity for LLM Graph Memory: An Accumulability Criterion for Information-Flow-Blind Retrieval

📅 2026-06-10

📈 Citations: 0

✨ Influential: 0

🤖 AI Summary

This work addresses a critical blind spot in existing graph-based memory defense mechanisms during the structured selection phase, where malicious structural writes can indirectly corrupt trusted facts by reallocating retrieval results. We formally define, for the first time, the “structural reallocation vulnerability” and introduce a “reallocatability” criterion to characterize this threat, revealing security risks in methods such as personalized PageRank while proving that certain distance metrics are inherently immune. Building on these insights, we propose AuthSelect, a defense framework integrating information-flow control, graph neural network–guided selection, and subgraph recomputation. Experiments demonstrate that AuthSelect completely blocks all 28 source-free structural attacks in real-world multi-session agent memory settings, incurs only 2%–3% latency overhead, and exhibits no over-blocking behavior.

📝 Abstract

Agent memory is moving to graphs, and the provenance defenses now being built for it all check one thing: the provenance of the records an agent retrieves. We show that this entire class of defense is blind by construction. A long-term graph memory runs a global selection step over writable graph structure, so structure that an untrusted principal writes changes \emph{which} authenticated facts are selected while the cited evidence stays fully authenticated; faithful information-flow control (IFC), checking the provenance of what the reader uses (all of it authenticated), makes the byte-identical decision to no defense at all, across document-QA substrates and real multi-session agent memory. In the most consequential instance, a no-source structural write silently misdirects $28$ irreversible ledger transfers over $499$ live actions: faithful IFC permits every one, and \authselect\ prevents every one. We then characterize exactly which memories are exposed: a selector admits the channel when its structural term can reallocate an $Ω(1)$ share of top-$k$ membership past a selected fact's margin. Personalized PageRank can, since a sourceless write reroutes conserved random-walk mass; a content-fixed reranker cannot, and Graphiti's node-distance, which leans on structure \emph{more} than PageRank does, stays immune. Reallocatability, not reliance, is the predictor. We prove the immune case in general and the open case under a chokepoint condition we verify. Closing the channel forces any provenance defense to recompute selection on the authenticated subgraph, which is what \authselect\ does, at zero over-block and $2$--$3\%$ latency.

Problem

Research questions and friction points this paper is trying to address.

Selection Integrity

Graph Memory

Information-Flow Control

Provenance Defense

Structural Manipulation

Innovation

Methods, ideas, or system contributions that make the work stand out.

graph memory

information-flow control

provenance defense