Toward Ultra-Long-Horizon Sequential Model Editing

📅 2026-01-30
📈 Citations: 0
Influential: 0
📄 PDF

career value

174K/year
🤖 AI Summary
This work addresses the critical issue of model collapse in existing model editing methods during long-sequence edits, which stems from the exponential growth of MLP weight norms and severely limits continual knowledge updating. For the first time, this study theoretically establishes the connection between editing-induced collapse and weight norm explosion. To mitigate this, the authors propose Norm-Anchor Scaling (NAS), a plug-and-play mechanism requiring only a single line of code and negligible computational overhead. NAS effectively curbs MLP weight growth through lightweight norm constraints. Integrated within the Locate-and-Edit framework, NAS significantly enhances editing stability across mainstream large language models, delaying the collapse point of representative algorithms by over fourfold and improving average editing performance by 72.2%, thereby demonstrating its efficiency and broad applicability.

Technology Category

Application Category

📝 Abstract
Model editing has emerged as a practical approach for mitigating factual errors and outdated knowledge in large language models (LLMs). Among existing methods, the Locate-and-Edit (L&E) paradigm is the dominant framework: it locates MLP parameters implicated in expressing a target fact, and then performs a localized update to rewrite that fact. However, long sequences of edits often trigger abrupt model collapse in L&E beyond a critical point. We empirically identify a strong correlation between collapse and explosive growth of edited MLP weight norms, and formally prove that commonly used L&E update rules can induce exponential norm growth across sequential edits in the absence of explicit norm control. To address this issue, we propose Norm-Anchor Scaling NAS, a plug-and-play norm-constrained strategy. Across extensive experiments, NAS delays the collapse point of representative L&E algorithms by more than 4 times and yields a 72.2% average relative gain in editing performance, requiring only a single additional line of code and incurring negligible computational overhead.
Problem

Research questions and friction points this paper is trying to address.

model editing
Locate-and-Edit
sequential edits
model collapse
weight norm explosion
Innovation

Methods, ideas, or system contributions that make the work stand out.

model editing
Locate-and-Edit
norm explosion
Norm-Anchor Scaling
LLM updating
🔎 Similar Papers
No similar papers found.