Agentic Search for Counterfactual Recourse under Fixed LLM Budgets

📅 2026-06-07
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the challenge of efficiently generating diverse and validated counterfactual explanations under a limited large language model (LLM) query budget. The problem is formulated as a fixed-budget agent-based search task, and the authors propose Comp-MCTS, a training-free framework that integrates LLM-generated intervention proposals, external validator feedback, compression-guided tree pruning, and Monte Carlo tree search strategies. Relying solely on an external validator, Comp-MCTS enables efficient exploration of high-quality, diverse counterfactuals. Evaluated on four real-world tabular datasets, the method significantly outperforms single-candidate baselines, achieving comparable or higher yields of valid counterfactuals at equal or lower validation costs across three datasets while maintaining strong proximity, sparsity, and novelty.
📝 Abstract
Counterfactual recourse aims to provide actionable feature changes that would alter an unfavorable decision made by a predictive model. In practice, affected individuals often benefit from multiple feasible alternatives rather than a single optimal explanation. A natural way to produce such alternatives is to prompt large language models (LLMs). However, prompting incurs a practical constraint: the number of LLM calls is often the dominant computational and economic cost. Together, the need for multiple alternatives and this cost constraint shift the problem from finding a single high-quality counterfactual to efficiently generating a set of oracle-validated counterfactuals under a fixed LLM-call budget. In this work, we study counterfactual recourse generation in the LLM-agentic setting as a fixed-budget search problem and propose Comp-MCTS, an agentic tree-search framework that maximizes the yield of unique, oracle-validated counterfactuals under this budget while maintaining favorable quantity--quality trade-offs. Comp-MCTS allocates the budget toward novel intervention directions via LLM-based proposal generation, oracle validation, and compression-guided pruning, in a training-free, oracle-only setting. Experiments on four real-world tabular datasets show that Comp-MCTS substantially outperforms single-candidate LATS-style baselines in the yield of unique, oracle-validated counterfactuals, and offers favorable quantity--quality--efficiency trade-offs against stronger multi-candidate variants: comparable or higher yield at similar or lower oracle-evaluation cost on three of four datasets, plus competitive proximity, sparsity, and novelty.
Problem

Research questions and friction points this paper is trying to address.

counterfactual recourse
large language models
fixed budget
oracle validation
actionable explanations
Innovation

Methods, ideas, or system contributions that make the work stand out.

counterfactual recourse
LLM-agentic search
fixed-budget optimization
Comp-MCTS
oracle-validated explanations