SLIM-LLMs: Modeling of Style-Sensory Language Relationships Through Low-Dimensional Representations

Date: 2025-08-04
AI Summary
This work investigates how sensory language relates to stylistic features and proposes the SLIM-LLMs framework, which feeds low-dimensional stylistic representations into a nonlinear model for sensory language prediction. Methodologically, it uses Reduced-Rank Ridge Regression (R4) to extract compact, interpretable, low-rank stylistic representations from LIWC-derived features (r = 24 rather than the full 74), then integrates them into a lightweight nonlinear predictive architecture. Experiments across five text genres show that SLIM-LLMs match the performance of full-scale language models with up to 80% fewer parameters, improving computational efficiency and interpretability. The low-dimensional stylistic encoding holds up across genres, which makes the approach a candidate for efficient, interpretable sensory language analysis under resource constraints.

Abstract
Sensorial language (the language connected to our senses, including vision, sound, touch, taste, smell, and interoception) plays a fundamental role in how we communicate experiences and perceptions. We explore the relationship between sensorial language and traditional stylistic features, like those measured by LIWC, using a novel Reduced-Rank Ridge Regression (R4) approach. We demonstrate that low-dimensional latent representations of LIWC features (r = 24) capture stylistic information for sensorial language prediction as effectively as the full feature set (r = 74). We introduce Stylometrically Lean Interpretable Models (SLIM-LLMs), which model non-linear relationships between these style dimensions. Evaluated across five genres, SLIM-LLMs with low-rank LIWC features match the performance of full-scale language models while reducing parameters by up to 80%.
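For readers unfamiliar with the technique, reduced-rank ridge regression is usually written as a ridge-penalized multivariate regression with a rank constraint on the coefficient matrix. The form below is the standard textbook formulation and may differ from the exact objective used in the paper. The symbols are assumed notation, not taken from the source: X is an n x p matrix of LIWC features, Y is an n x q matrix of sensorial language responses, B is the p x q coefficient matrix, and lambda is the ridge penalty.

```latex
\hat{B}_r \;=\; \arg\min_{B \in \mathbb{R}^{p \times q}}
  \; \lVert Y - XB \rVert_F^2 \;+\; \lambda \lVert B \rVert_F^2
\quad \text{subject to} \quad \operatorname{rank}(B) \le r

% Any rank-r solution factors through an r-dimensional latent space:
\hat{B}_r = A C^{\top}, \qquad
A \in \mathbb{R}^{p \times r}, \quad C \in \mathbb{R}^{q \times r}
```

Under this reading, the product XA gives r latent style scores per text, and the factored form stores r(p + q) coefficients instead of pq.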
Problem

Research questions and friction points this paper is trying to address.

Modeling relationships between sensorial language and stylistic features
Predicting sensorial language using low-dimensional LIWC representations
Reducing model parameters while maintaining performance in language modeling
Innovation

Methods, ideas, or system contributions that make the work stand out.

Uses Reduced-Rank Ridge Regression (R4)
Employs low-dimensional latent representations
Introduces Stylometrically Lean Interpretable Models (SLIM-LLMs)
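
The first two items can be illustrated with a minimal NumPy sketch of reduced-rank ridge regression. It uses the standard closed-form solution (ridge fit, then projection onto the leading right singular vectors of the fitted values) and is not confirmed to match the paper's R4 implementation. The function name `reduced_rank_ridge`, the synthetic data, and the values of `n`, `q`, and `lam` are assumptions for illustration; only p = 74 and r = 24 echo the abstract.

```python
import numpy as np


def reduced_rank_ridge(X, Y, rank, lam=1.0):
    """Rank-constrained ridge regression (illustrative sketch).

    X: (n, p) centered predictors, e.g. LIWC-style features.
    Y: (n, q) centered responses.
    Returns B (p, q) with rank <= `rank`, an encoder A (p, rank),
    and a decoder basis V (q, rank), where B = A @ V.T.
    """
    p = X.shape[1]
    # Ordinary ridge solution: (X'X + lam * I)^{-1} X'Y
    B_ridge = np.linalg.solve(X.T @ X + lam * np.eye(p), X.T @ Y)
    # Fitted values on the ridge-augmented design [X; sqrt(lam) * I]
    X_aug = np.vstack([X, np.sqrt(lam) * np.eye(p)])
    Y_fit = X_aug @ B_ridge
    # Leading right singular vectors span the best rank-r response subspace
    _, _, Vt = np.linalg.svd(Y_fit, full_matrices=False)
    V = Vt[:rank].T          # (q, rank)
    A = B_ridge @ V          # (p, rank): maps features to latent scores
    return A @ V.T, A, V


# Synthetic data only: no real LIWC features or sensorial scores are used here.
rng = np.random.default_rng(0)
n, p, q, r = 500, 74, 30, 24
X = rng.standard_normal((n, p))
B_true = rng.standard_normal((p, r)) @ rng.standard_normal((r, q))
Y = X @ B_true + 0.1 * rng.standard_normal((n, q))
X -= X.mean(axis=0)
Y -= Y.mean(axis=0)

B, A, V = reduced_rank_ridge(X, Y, rank=r, lam=1.0)
Z = X @ A  # (n, r) low-dimensional latent representation of each row of X
```

Here `Z` plays the role of the low-dimensional latent representation: a downstream nonlinear model would take these r columns as input in place of the original p features.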