Categorical Prior Lock-in: Why In-Context Learning Fails for Structured Data

📅 2026-06-10

📈 Citations: 0

✨ Influential: 0

🤖 AI Summary

This work investigates the limitations of large language models (LLMs) in generating structured data under distribution shift via in-context learning (ICL). It identifies a previously unreported phenomenon—“category prior locking”—where LLMs fail to adjust the category priors established during pretraining, leading to complete failure on rare categories. The study systematically compares ICL with LoRA fine-tuning on two open-source 7B models: while ICL improves numerical fidelity, it cannot alleviate prior locking; in contrast, LoRA fine-tuning overcomes this limitation but introduces memory leakage and generation instability. These findings reveal a fundamental trade-off between adaptation efficacy through fine-tuning and privacy-preserving generation, highlighting structural constraints in ICL’s ability to adapt to shifted category distributions.

📝 Abstract

Large language models (LLMs) are increasingly used as conditional generators for structured data, relying on in-context learning (ICL) to adapt to new distributions without parameter updates. We investigate the limits of ICL for structured generation under distribution mismatch, using high-cardinality tabular data as a controlled test case, and identify a structural failure mode we term \textit{categorical prior lock-in}: the inability of ICL to update the model's prior over token distributions inherited from pre-training. Across two 7B-parameter open-weight models, ICL improves numerical fidelity with additional examples but exhibits a sharp ceiling on categorical distributions, failing to reproduce rare classes entirely. Parameter-efficient fine-tuning (LoRA) overcomes these limitations but introduces measurable memorization risk and, in some cases, destabilizes structured output generation, highlighting a fundamental trade-off between adaptability and privacy.

Problem

Research questions and friction points this paper is trying to address.

in-context learning

structured data

categorical prior lock-in

distribution mismatch

large language models

Innovation

Methods, ideas, or system contributions that make the work stand out.

categorical prior lock-in

in-context learning

structured data generation