🤖 AI Summary
While exchangeability and independent and identically distributed (IID) assumptions are equivalent for infinite sequences, they fundamentally diverge in finite-sample settings—a distinction long overlooked in statistical learning, leading to widespread conflation (“exchangeable ≈ IID”). Method: We systematically characterize this finite-sample gap using rigorous probability theory, the theory of exchangeable sequences, and nonparametric statistical modeling, supported by explicit counterexample construction and asymptotic analysis. Contribution/Results: We establish precise, necessary and sufficient criteria under which exchangeability and IID are non-interchangeable for fixed-length sequences. Our results formally refute the erroneous equivalence assumption, providing critical theoretical boundaries for confidence set construction, conformal prediction, and Bayesian nonparametric inference. The work delivers foundational constraints on the validity and applicability of key statistical methodologies, correcting a persistent conceptual misconception in modern statistics and machine learning.
📝 Abstract
Randomness (in the sense of being generated in an IID fashion) and exchangeability are standard assumptions in nonparametric statistics and machine learning, and relations between them have been a popular topic of research. This note draws the reader's attention to the fact that, while for infinite sequences of observations the two assumptions are almost indistinguishable, the difference between them becomes very significant for finite sequences of a given length.