π€ AI Summary
Existing LLM-based database interaction approaches suffer from poor usability, uncontrolled access permissions, weak security guarantees, and inefficient data transfer. This paper introduces DBAgentβa database-agnostic, modular toolkit that enables context-aware SQL generation and ACID-compliant execution via three core mechanisms: fine-grained decomposition of SQL operations into specialized tools, policy-driven co-governance of permissions and database states, and an LLM-free cross-tool data proxy that bypasses intermediate LLM processing. Its key innovation lies in the first unified architectural framework that jointly optimizes both efficiency and security. Evaluations on a newly constructed benchmark demonstrate that DBAgent reduces LLM token consumption by up to 80%, significantly improves success rates on complex workflow tasks, and excels particularly in high-data-density scenarios. A fully open-sourced implementation is publicly available.
π Abstract
As large language models (LLMs) demonstrate increasingly powerful reasoning and orchestration capabilities, LLM-based agents are rapidly proliferating for complex data-related tasks. Despite this progress, the current design of how LLMs interact with databases exhibits critical limitations in usability, security, privilege management, and data transmission efficiency. To resolve these challenges, we introduce BridgeScope, a universal toolkit bridging LLMs and databases through three key innovations. First, it modularizes SQL operations into fine-grained tools for context retrieval, CRUD execution, and ACID-compliant transaction management, enabling more precise and LLM-friendly functionality controls. Second, it aligns tool implementations with both database privileges and user security policies to steer LLMs away from unsafe or unauthorized operations, improving task execution efficiency while safeguarding database security. Third, it introduces a proxy mechanism for seamless inter-tool data transfer, bypassing LLM transmission bottlenecks. All of these designs are database-agnostic and can be transparently integrated with existing agent architectures. We also release an open-source implementation of BridgeScope for PostgreSQL. Evaluations on two novel benchmarks demonstrate that BridgeScope enables LLM agents to operate databases more effectively, reduces token usage by up to 80% through improved security awareness, and uniquely supports data-intensive workflows beyond existing toolkits, establishing BridgeScope as a robust foundation for next-generation intelligent data automation.