Decider: A Dual-System Rule-Controllable Decoding Framework for Language Generation

Date: 2024-03-04
Venue: IEEE Transactions on Knowledge and Data Engineering
Citations: 0
Influential: 0
AI Summary
This work addresses the lack of fine-grained logical control in large language model (LLM) generation. It proposes a dual-system decoding framework that integrates an LLM with a first-order logic (FOL) reasoner, with a differentiable decision function jointly guiding generation. Inspired by cognitive dual-process theory (described here as the first application of this theory to decoding design), the framework shifts the paradigm from encouraging individual target tokens to guiding generation toward the whole set of tokens that satisfy the rules, harmonizing human intuition with formal logic. The framework is end-to-end trainable and enables programmable, rule-aware generation control. Experiments on CommonGen and PersonaChat report a +12.7% rule adherence rate and +23% naturalness in human evaluation, supporting the claim that logical rigor and linguistic fluency can improve together.

Abstract
Constrained decoding approaches aim to control the meaning or style of text generated by pre-trained large language models (LLMs, also called PLMs) for various tasks at inference time. However, these methods often guide plausible continuations by greedily and explicitly selecting targets. Although this fulfills the task requirements, it may overlook general and natural logic that humans would implicitly follow when working towards such targets. Inspired by cognitive dual-process theory, in this work we propose a novel decoding framework, DECIDER, in which the base LLM is equipped with a First-Order Logic (FOL) reasoner to express and evaluate the rules, along with a decision function that merges the outputs of both systems to guide the generation. Unlike previous constrained decoding methods, DECIDER shifts the encouragement from target-specific words to all words that satisfy several high-level rules, enabling us to programmatically integrate our logic into LLMs. Experiments on CommonGen and PersonaChat demonstrate that DECIDER effectively follows given FOL rules to guide LLMs in a more human-like and logic-controlled manner.
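To make the idea of a decision function that merges the two systems more concrete, here is a minimal sketch of one decoding step. It is not the formulation used by DECIDER: the function `merge_step`, the weight `alpha`, the additive combination of logits and rule scores, and the toy vocabulary are all assumptions chosen for illustration.

```python
import math

def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def merge_step(lm_logits, rule_scores, alpha=2.0):
    """Hypothetical decision function (an assumption, not the paper's):
    add alpha * rule score to each LM logit, then renormalize."""
    shifted = [l + alpha * r for l, r in zip(lm_logits, rule_scores)]
    return softmax(shifted)

# Toy next-token candidates and made-up LM logits.
vocab = ["dog", "frisbee", "car", "the"]
lm_logits = [1.0, 0.2, 0.8, 1.5]
# Output of a (hypothetical) rule evaluator: 1.0 = token satisfies the rule.
rule_scores = [1.0, 1.0, 0.0, 0.0]

base = softmax(lm_logits)                     # LM alone
guided = merge_step(lm_logits, rule_scores)   # LM + rule signal

best_base = vocab[base.index(max(base))]
best_guided = vocab[guided.index(max(guided))]
```

In this toy setup, every rule-satisfying token gains probability mass relative to the LM-only distribution, rather than a single hand-picked target word being boosted.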
Problem

Research questions and friction points this paper is trying to address.

Control text meaning/style in LLMs via constrained decoding
Balance task requirements with human-like natural logic
Integrate First-Order Logic rules into LLM generation
Innovation

Methods, ideas, or system contributions that make the work stand out.

Dual-system framework combines an LLM with an FOL reasoner
Decision function merges the outputs of both systems to guide generation
Encourages all words that satisfy high-level rules instead of only target-specific words
Authors

Chen Xu, Beijing Institute of Technology, Beijing, China
Tian Lan, Beijing Institute of Technology, Beijing, China
Changlong Yu
Wei Wang
Jun Gao
Yu Ji, Beijing Institute of Technology, Beijing, China
Qunxi Dong, Scholar of BIT (Computational Neuroscience)
Kun Qian, Beijing Institute of Technology, Beijing, China
Piji Li, Nanjing University of Aeronautics and Astronautics, Nanjing, China
Wei Bi, HKUST (NLG, Dialog System, NLP, Machine Learning, Data Mining)
Bin Hu, Beijing Institute of Technology, Beijing, China