ACT: Agentic Classification Tree

📅 2025-09-30
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
In high-stakes domains, AI decision-making requires transparency, explainability, and auditability—yet conventional decision trees are limited to structured data, while large language models (LLMs), though effective on unstructured text, lack verifiable and traceable reasoning. Method: This paper introduces the first method for constructing Natural Language Decision Trees (NL-Trees) over unstructured text. It reformulates decision-tree splitting as an optimizable natural language problem, integrating LLM-generated feedback, TextGrad-based gradient optimization, and information-entropy-driven impurity evaluation to iteratively induce interpretable, auditable classification paths. Contribution/Results: NL-Trees achieve classification accuracy comparable to or exceeding state-of-the-art prompt-engineering approaches on benchmark text classification tasks. Crucially, they produce explicit, hierarchical, and human-verifiable reasoning chains—thereby bridging the long-standing gap between interpretability and robust reasoning over unstructured textual data.

Technology Category

Application Category

📝 Abstract
When used in high-stakes settings, AI systems are expected to produce decisions that are transparent, interpretable, and auditable, a requirement increasingly expected by regulations. Decision trees such as CART provide clear and verifiable rules, but they are restricted to structured tabular data and cannot operate directly on unstructured inputs such as text. In practice, large language models (LLMs) are widely used for such data, yet prompting strategies such as chain-of-thought or prompt optimization still rely on free-form reasoning, limiting their ability to ensure trustworthy behaviors. We present the Agentic Classification Tree (ACT), which extends decision-tree methodology to unstructured inputs by formulating each split as a natural-language question, refined through impurity-based evaluation and LLM feedback via TextGrad. Experiments on text benchmarks show that ACT matches or surpasses prompting-based baselines while producing transparent and interpretable decision paths.
Problem

Research questions and friction points this paper is trying to address.

Extends decision trees to handle unstructured text inputs
Ensures transparent and interpretable AI decision paths
Replaces free-form reasoning with verifiable natural-language questions
Innovation

Methods, ideas, or system contributions that make the work stand out.

Extends decision trees to unstructured data via natural-language splits
Refines splits using impurity evaluation and LLM feedback
Produces transparent interpretable decision paths matching baseline performance
🔎 Similar Papers
No similar papers found.
V
Vincent Grari
AXA Group Operations, TRAIL, Sorbonne Université, Paris, France
T
Tim Arni
EPFL, Lausanne, Switzerland
Thibault Laugel
Thibault Laugel
Researcher @AXA, Associate Researcher @Sorbonne Université/LIP6
Machine LearningXAIAI FairnessTrustworthy ML
S
Sylvain Lamprier
LERIA, Université d’Angers, France
James Zou
James Zou
Stanford University
Machine learningcomputational biologycomputational healthstatisticsbiotech
M
Marcin Detyniecki
AXA Group Operations, TRAIL, Sorbonne Université, Paris, France, Polish Academy of Science, IBS PAN, Warsaw, Poland