An Algebraic View of the Expressivity of Recurrent Language Models

📅 2026-06-01
📈 Citations: 0
Influential: 0
📄 PDF

career value

201K/year
🤖 AI Summary
This work resolves theoretical discrepancies regarding the capacity of recurrent neural language models to recognize formal languages under different arithmetic models, offering a unified characterization of their expressive power. By constructing a framework grounded in algebraic automata theory, the study reformulates the question of language expressivity as an algebraic problem: whether the syntactic monoid of a language divides a specific wreath product. The analysis reveals that the choice of quantization critically affects computational capability—under floating-point recurrence, diagonal state-space models cannot implement even-modulo counters, whereas with unsigned integer quantization, they can realize all such counters. This result provides a novel algebraic perspective on the theoretical limits of recurrent architectures.
📝 Abstract
What formal languages can a recurrent neural language model recognize? Formal results in the literature conflict: some authors report Turing-completeness, while others show equivalence to regular languages. The reason for this discrepancy is that the underlying arithmetic model differs. The paper develops a unified algebraic account of the expressivity of recurrent neural networks, starting with a formal account of various arithmetic models. This account reduces expressivity to an algebraic question, e.g., whether a network's syntactic monoid divides a certain wreath product. As a case study, the paper revisits diagonal state-space models: the same architecture cannot implement an even-modulus counter once floating-point recurrences are enforced, yet realizes every even-modulus counter under unsigned-integer quantization.
Problem

Research questions and friction points this paper is trying to address.

recurrent language models
formal languages
expressivity
arithmetic models
Turing-completeness
Innovation

Methods, ideas, or system contributions that make the work stand out.

algebraic expressivity
recurrent neural networks
arithmetic models
syntactic monoid
wreath product
🔎 Similar Papers
2024-05-27Neural Information Processing SystemsCitations: 24