đ€ AI Summary
Art style classification faces two key challenges: scarcity of expert annotations and difficulty in modeling nonlinear interactions among stylistic features. To address these, we propose a spline-activated dual-teacher self-supervised framework. It replaces conventional MLPs with KolmogorovâArnold Networks (KANs) to explicitly capture complex nonlinear dependencies between global composition and local brushstrokes. A dual-path teacher architecture is designedâone pathway encodes local texture patterns, the other hierarchically represents global stylistic attributesâjointly optimized via knowledge distillation. Evaluated on WikiArt and Pandora18k, our method achieves significant Top-1 accuracy gains over state-of-the-art baselines. Linear probe experiments further demonstrate superior discriminability of the learned style representations. This work establishes a new paradigm for culturally diverse art style understandingâoffering both interpretability through spline-based feature decomposition and strong generalization across artistic domains.
đ Abstract
Art style classification remains a formidable challenge in computational aesthetics due to the scarcity of expertly labeled datasets and the intricate, often nonlinear interplay of stylistic elements. While recent dual-teacher self-supervised frameworks reduce reliance on labeled data, their linear projection layers and localized focus struggle to model global compositional context and complex style-feature interactions. We enhance the dual-teacher knowledge distillation framework to address these limitations by replacing conventional MLP projection and prediction heads with Kolmogorov-Arnold Networks (KANs). Our approach retains complementary guidance from two teacher networks, one emphasizing localized texture and brushstroke patterns, the other capturing broader stylistic hierarchies while leveraging KANs' spline-based activations to model nonlinear feature correlations with mathematical precision. Experiments on WikiArt and Pandora18k demonstrate that our approach outperforms the base dual teacher architecture in Top-1 accuracy. Our findings highlight the importance of KANs in disentangling complex style manifolds, leading to better linear probe accuracy than MLP projections.