AnyEdit++: Adaptive Long-Form Knowledge Editing via Bayesian Surprise

πŸ“… 2026-05-31
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF

career value

171K/year
πŸ€– AI Summary
This work addresses the challenge that large language models often produce incoherent or factually inconsistent edits when modifying complex long-form texts, primarily due to fixed-window chunking that disregards logical structure. To overcome this limitation, the authors propose AnyEdit++, which introduces Bayes-Chunkβ€”a Bayesian surprise-driven adaptive segmentation mechanism that dynamically identifies semantic boundaries to enable structure-aware editing. Grounded in two theoretical principles, structural independence and causal locality, the method incorporates geometrically orthogonal anchor points to minimize cross-chunk interference and enhance edit controllability. Experimental results demonstrate that AnyEdit++ significantly outperforms existing approaches across diverse tasks, including mathematical reasoning, code generation, and narrative editing, thereby validating the efficacy and robustness of structure-aware mechanisms for knowledge editing in long texts.
πŸ“ Abstract
Editing complex, long-form knowledge in Large Language Models remains a significant challenge due to the difficulty of maintaining generation coherence. Existing autoregressive methods like AnyEdit alleviate length constraints but rely on Fixed-window Chunking, which disregards logical structure and compromises consistency. To address this, we present AnyEdit++, a structure-aware framework incorporating Bayes-Chunk, an adaptive segmentation mechanism that dynamically identifies semantic boundaries based on Bayesian Surprise. We underpin this approach with a theoretical framework establishing two key principles: (1) Structural Independence: we prove that cross-segment interference is minimized when anchor keys are geometrically orthogonal (a condition naturally satisfied by our surprisal-based boundaries but violated by fixed windows), and (2) Causal Locality: we demonstrate that updates injected at these semantic peaks yield strictly superior control compared to arbitrary split points. Extensive experiments across mathematical reasoning, code generation, and narrative tasks demonstrate that AnyEdit++ achieves superior performance and robustness compared to state-of-the-art baselines, validating that structural awareness is critical for effective long-form knowledge editing.
Problem

Research questions and friction points this paper is trying to address.

knowledge editing
long-form generation
coherence
consistency
Large Language Models
Innovation

Methods, ideas, or system contributions that make the work stand out.

Bayesian Surprise
adaptive segmentation
structure-aware editing
long-form knowledge editing
causal locality
πŸ”Ž Similar Papers
2024-05-06Conference on Empirical Methods in Natural Language ProcessingCitations: 9