DatawiseAgent: A Notebook-Centric LLM Agent Framework for Automated Data Science

📅 2025-03-10
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Existing LLM-based approaches for data science focus narrowly on isolated subtasks, neglecting inter-stage dependencies and thus failing to support end-to-end workflows. Method: This paper proposes a Jupyter Notebook–centric LLM agent framework that unifies human-agent-environment interaction via Markdown cells and executable code cells. It introduces a novel finite-state transducer (FST)–based four-phase mechanism: depth-first search (DFS)–guided task planning, incremental code execution, self-diagnostic debugging, and post-hoc result filtering—enabling dynamic closed-loop feedback and autonomous error recovery. Contribution/Results: The framework achieves performance on par with or surpassing state-of-the-art methods across diverse data science tasks—including data analysis, visualization, and modeling—demonstrating strong cross-scenario generalization and the feasibility of fully automated, end-to-end data science workflows.

Technology Category

Application Category

📝 Abstract
Data Science tasks are multifaceted, dynamic, and often domain-specific. Existing LLM-based approaches largely concentrate on isolated phases, neglecting the interdependent nature of many data science tasks and limiting their capacity for comprehensive end-to-end support. We propose DatawiseAgent, a notebook-centric LLM agent framework that unifies interactions among user, agent and the computational environment through markdown and executable code cells, supporting flexible and adaptive automated data science. Built on a Finite State Transducer(FST), DatawiseAgent orchestrates four stages, including DSF-like planning, incremental execution, self-debugging, and post-filtering. Specifically, the DFS-like planning stage systematically explores the solution space, while incremental execution harnesses real-time feedback and accommodates LLM's limited capabilities to progressively complete tasks. The self-debugging and post-filtering modules further enhance reliability by diagnosing and correcting errors and pruning extraneous information. Extensive experiments on diverse tasks, including data analysis, visualization, and data modeling, show that DatawiseAgent consistently outperforms or matches state-of-the-art methods across multiple model settings. These results highlight its potential to generalize across data science scenarios and lay the groundwork for more efficient, fully automated workflows.
Problem

Research questions and friction points this paper is trying to address.

Unifies interactions for automated data science tasks
Supports end-to-end data science workflow automation
Enhances reliability with self-debugging and post-filtering
Innovation

Methods, ideas, or system contributions that make the work stand out.

Notebook-centric LLM framework for data science
Finite State Transducer orchestrates four stages
Self-debugging and post-filtering enhance reliability
🔎 Similar Papers
No similar papers found.
Z
Ziming You
National Engineering Research Center for Software Engineering, Peking University
Y
Yumiao Zhang
School of Software & Microelectronics, Peking University
D
Dexuan Xu
School of Computer Science, Peking University
Y
Yiwei Lou
School of Computer Science, Peking University
Y
Yandong Yan
School of Computer Science, Peking University
W
Wei Wang
Xi’an Jiaotong University
Huaming Zhang
Huaming Zhang
University of Alabama in Huntsville
Y
Yu Huang
National Engineering Research Center for Software Engineering, Peking University