🤖 AI Summary
This work addresses the limited predictive performance of small tabular foundation models on high-dimensional, low-sample-size (HDLSS) tabular data. The authors propose the Graph-guided Ordering with Local Refinement (GO-LR) algorithm, which arranges features in a meaningful sequence and establishes its equivalence to the weighted minimum linear arrangement problem. Building upon this, they introduce a Neuro-inspired Subcell Compression (NSC) module that aggregates neighboring features to generate compact meta-features. For the first time, the study integrates compact tokenization into TabPFN-like architectures, substantially improving prediction accuracy and stability under strict token constraints—without requiring retraining of large backbone models.
📝 Abstract
We investigate how to make small tabular foundation models effective for High-Dimensional, Low-Sample Size (HDLSS) tabular prediction without retraining large backbones. We introduce Graph-guided Ordering with Local Refinement (GO-LR), show its equivalence to weighted Minimum Linear Arrangement, and interpret the practical solver as a TSP-path-style surrogate. We propose GOTabPFN,which builds on GO-LR, and a Neuro-Inspired Subunit Compression (NSC) unit to pool locally adjacent ordered features into meta-features, yielding a compact representation that makes TabPFN-style prediction practical in HDLSS regimes. Across tabular benchmarks, GOTabPFN improves stability and accuracy under tight token budgets.