A Flexible Multi-Agent LLM-Human Framework for Fast Human Validated Tool Building

📅 2025-12-01
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the challenges of poor human-intent alignment, slow domain adaptation, and high human-feedback costs in tool construction, this paper proposes an expert-in-the-loop multi-agent LLM collaboration framework. It introduces four specialized agents—Coach, Coder, Critic, and Reuser—alongside a dynamic reinforcement prompting mechanism to enable system-level in-context learning, real-time intent calibration, and reusable tool accumulation. By integrating multi-agent collaborative reasoning, human feedback reinforcement learning (HFRL), and dynamic prompt engineering, the framework significantly reduces domain adaptation time (by 62% on average) and human feedback iterations (by 57%) for complex tasks such as scientific document generation. Its core innovation lies in deeply embedding expert knowledge into a closed-loop multi-agent collaboration, thereby enabling efficient, scalable, and intent-driven tool evolution.

Technology Category

Application Category

📝 Abstract
We introduce CollabToolBuilder, a flexible multiagent LLM framework with expert-in-the-loop (HITL) guidance that iteratively learns to create tools for a target goal, aligning with human intent and process, while minimizing time for task/domain adaptation effort and human feedback capture. The architecture generates and validates tools via four specialized agents (Coach, Coder, Critic, Capitalizer) using a reinforced dynamic prompt and systematic human feedback integration to reinforce each agent's role toward goals and constraints. This work is best viewed as a system-level integration and methodology combining multi-agent in-context learning, HITL controls, and reusable tool capitalization for complex iterative problems such as scientific document generation. We illustrate it with preliminary experiments (e.g., generating state-of-the-art research papers or patents given an abstract) and discuss its applicability to other iterative problem-solving.
Problem

Research questions and friction points this paper is trying to address.

Develops a multi-agent LLM framework for tool creation
Integrates human feedback to align tools with user intent
Accelerates adaptation for complex iterative tasks like document generation
Innovation

Methods, ideas, or system contributions that make the work stand out.

Multi-agent LLM framework with human-in-the-loop guidance
Four specialized agents generate and validate tools iteratively
Reinforced dynamic prompts and systematic human feedback integration
🔎 Similar Papers
No similar papers found.
X
Xavier Daull
Toulon Univ, Aix Marseille Univ, CNRS, LIS
Patrice Bellot
Patrice Bellot
Aix-Marseille Université - CNRS (LIS)
Information RetrievalNatural Language ProcessingArtificial IntelligenceMachine LearningText and Data Mining
E
Emmanuel Bruno
Toulon Univ, Aix Marseille Univ, CNRS, LIS
V
Vincent Martin
Naval Group
E
Elisabeth Murisasco
Toulon Univ, Aix Marseille Univ, CNRS, LIS