SAGE: An LLM-driven Self Reflective Agentic Framework for Fraud Detection

📅 2026-06-06

📈 Citations: 0

✨ Influential: 0

🤖 AI Summary

This work addresses key challenges in fraud detection within payment, e-commerce, and communication systems—namely insufficient individual accuracy, extreme class imbalance, and limited interpretability—by proposing the first end-to-end large language model (LLM)-driven multi-agent framework. The approach innovatively integrates specialized agents, a six-layer data diagnostic tree, and a natural language gradient-guided Markov decision process to enable semantic-aware collaborative reasoning and optimization, complemented by a fraud-specific reward mechanism. Extensive experiments across five real-world datasets and five LLM backbones demonstrate that the proposed method outperforms baselines in 96% of comparisons, achieving an average F1-score improvement of 40.86% while simultaneously delivering high recall, high precision, and interpretable decision-making.

📝 Abstract

Fraud detection in payment, e-commerce, and telecommunications systems requires accuracy at the individual level, robustness under severe class imbalance, and ease of understanding for risk managers. Existing methods fall at least one of these requirements: automated machine learning systems search a fixed numerical space without semantic awareness of the dataset; graph neural network-based methods require pre-defined relational graphs and remain opaque at the individual-decision level; and the design of general-purpose large language model (LLM) agents does not consider the recall and precision constraints specific to real-world fraud detection. In this paper, we propose SAGE, the first end-to-end LLM-driven multi-agent framework for fraud detection. SAGE coordinates three dedicated agents that make decisions based on a six-layer Data Diagnostic Tree (DDT) and a Markov decision process guided by natural-language gradients, automatically optimizing the model under a fraud-specific reward. On five fraud datasets and five LLM backbones, SAGE wins $96.00\%$ of method--dataset comparisons and improves F1 by an average of $40.86\%$ over baselines. The code is available at https://github.com/yichenC1c/SAGE.

Problem

Research questions and friction points this paper is trying to address.

fraud detection

class imbalance

interpretability

large language models

multi-agent systems

Innovation

Methods, ideas, or system contributions that make the work stand out.

LLM-driven multi-agent

Data Diagnostic Tree

natural-language gradients