🤖 AI Summary
Selective rationalization's select-then-predict architecture suffers from interlocking, a mutual dependency between the generator and predictor during joint training that leads to suboptimal equilibria. Existing approaches only mitigate this issue heuristically, via sampling or ad hoc regularization, without eliminating its root cause. This paper introduces GenSPP, the first interlocking-free selective rationalization framework: it decouples the generator and predictor and trains them disjointly via a genetic global search, thereby circumventing equilibrium traps entirely. The framework incurs no additional learning overhead. Experiments on a synthetic and a real-world benchmark demonstrate that the method significantly outperforms multiple state-of-the-art baselines in both explanation quality and prediction accuracy, validating the effectiveness of the interlocking-free design.
📝 Abstract
A popular end-to-end architecture for selective rationalization is the select-then-predict pipeline, in which a generator extracts highlights that are fed to a predictor. Such a cooperative system suffers from suboptimal equilibria due to the dominance of one of the two modules, a phenomenon known as interlocking. While several contributions have aimed to address interlocking, they only mitigate its effect, often by introducing feature-based heuristics, sampling, and ad hoc regularizations. We present GenSPP, the first interlocking-free architecture for selective rationalization, which requires none of the learning overhead mentioned above. GenSPP avoids interlocking by training the generator and predictor disjointly via a genetic global search. Experiments on a synthetic and a real-world benchmark show that our model outperforms several state-of-the-art competitors.
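To make the select-then-predict idea and the genetic global search concrete, here is a minimal toy sketch. It is not GenSPP's actual implementation: the dataset, the trivial one-feature predictor, the fitness function, and all hyperparameters (population size, mutation rate, sparsity penalty) are illustrative assumptions. The "generator" is reduced to a binary selection mask evolved by a genetic algorithm, while the "predictor" is refit from scratch on whatever the mask selects, so the two modules never interlock through shared gradients.

```python
import random

random.seed(0)

D = 8    # features per example (assumption: feature 0 is the true rationale)
N = 200  # dataset size

# Toy data: the label equals feature 0; features 1..7 are pure noise.
X = [[random.randint(0, 1) for _ in range(D)] for _ in range(N)]
y = [x[0] for x in X]

def fitness(mask):
    """Fit a trivial predictor on the selected features and score the mask.
    The predictor simply picks the selected feature that agrees with the
    labels most often; a small sparsity penalty rewards short rationales."""
    sel = [i for i, m in enumerate(mask) if m]
    if not sel:
        return 0.0
    best_acc = max(
        sum(1 for x, t in zip(X, y) if x[i] == t) / N for i in sel
    )
    return best_acc - 0.01 * len(sel)

def mutate(mask, rate=0.1):
    # Flip each selection bit independently with probability `rate`.
    return [1 - b if random.random() < rate else b for b in mask]

def crossover(a, b):
    # Single-point crossover of two parent masks.
    cut = random.randrange(1, D)
    return a[:cut] + b[cut:]

# Genetic global search over generator masks: elitism, crossover, mutation.
pop = [[random.randint(0, 1) for _ in range(D)] for _ in range(30)]
for gen in range(40):
    pop.sort(key=fitness, reverse=True)
    elite = pop[:10]
    children = [mutate(crossover(*random.sample(elite, 2))) for _ in range(20)]
    pop = elite + children

best = max(pop, key=fitness)
print(best)  # typically keeps feature 0 and drops most noise features
```

Because the predictor is retrained per candidate mask rather than co-adapted by gradient descent, no degenerate equilibrium between the two modules can form; the population-based search explores masks globally instead of following a single local trajectory.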