Agentic Search for Counterfactual Recourse under Fixed LLM Budgets

📅 2026-06-07

📈 Citations: 0

✨ Influential: 0

🤖 AI Summary

This work addresses the challenge of efficiently generating diverse and validated counterfactual explanations under a limited large language model (LLM) query budget. The problem is formulated as a fixed-budget agent-based search task, and the authors propose Comp-MCTS, a training-free framework that integrates LLM-generated intervention proposals, external validator feedback, compression-guided tree pruning, and Monte Carlo tree search strategies. Relying solely on an external validator, Comp-MCTS enables efficient exploration of high-quality, diverse counterfactuals. Evaluated on four real-world tabular datasets, the method significantly outperforms single-candidate baselines, achieving comparable or higher yields of valid counterfactuals at equal or lower validation costs across three datasets while maintaining strong proximity, sparsity, and novelty.

📝 Abstract

Counterfactual recourse aims to provide actionable feature changes that would alter an unfavorable decision made by a predictive model. In practice, affected individuals often benefit from multiple feasible alternatives rather than a single optimal explanation. A natural way to produce such alternatives is to prompt large language models (LLMs). However, prompting incurs a practical constraint: the number of LLM calls is often the dominant computational and economic cost. Together, the need for multiple alternatives and this cost constraint shift the problem from finding a single high-quality counterfactual to efficiently generating a set of oracle-validated counterfactuals under a fixed LLM-call budget. In this work, we study counterfactual recourse generation in the LLM-agentic setting as a fixed-budget search problem and propose Comp-MCTS, an agentic tree-search framework that maximizes the yield of unique, oracle-validated counterfactuals under this budget while maintaining favorable quantity--quality trade-offs. Comp-MCTS allocates the budget toward novel intervention directions via LLM-based proposal generation, oracle validation, and compression-guided pruning, in a training-free, oracle-only setting. Experiments on four real-world tabular datasets show that Comp-MCTS substantially outperforms single-candidate LATS-style baselines in the yield of unique, oracle-validated counterfactuals, and offers favorable quantity--quality--efficiency trade-offs against stronger multi-candidate variants: comparable or higher yield at similar or lower oracle-evaluation cost on three of four datasets, plus competitive proximity, sparsity, and novelty.

Problem

Research questions and friction points this paper is trying to address.

counterfactual recourse

large language models

fixed budget

oracle validation

actionable explanations

Innovation

Methods, ideas, or system contributions that make the work stand out.

counterfactual recourse

LLM-agentic search

fixed-budget optimization