ContraGen: A Multi-Agent Generation Framework for Enterprise Contradictions Detection

📅 2025-10-03
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Enterprise RAG systems frequently produce untrustworthy outputs due to intrinsic or cross-document contradictions in external documents—yet existing contradiction detection benchmarks operate only at the sentence level and fail to capture the structural and semantic complexity of high-stakes enterprise documents (e.g., contracts, financial reports). Method: We propose the first enterprise-oriented contradiction detection benchmark framework. It introduces a fine-grained, enterprise-specific contradiction taxonomy supporting controllable synthesis of self-consistency and mutual-exclusivity contradictions; integrates multi-agent generation, domain-knowledge injection, human-in-the-loop reinforcement learning, and retrieval consistency evaluation for high-fidelity document synthesis; and establishes a human–AI collaborative verification pipeline for end-to-end contradiction-aware evaluation. Contribution/Results: Experiments demonstrate that our framework effectively detects multi-granularity contradictions in RAG outputs, significantly enhancing system trustworthiness and explainability in high-risk domains such as regulatory compliance and corporate governance.

Technology Category

Application Category

📝 Abstract
Retrieval-Augmented Generation (RAG) integrates LLMs with external sources, offering advanced capabilities for information access and decision-making. However, contradictions in retrieved evidence can result in inconsistent or untrustworthy outputs, which is especially problematic in enterprise settings where compliance, governance, and accountability are critical. Existing benchmarks for contradiction detection are limited to sentence-level analysis and do not capture the complexity of enterprise documents such as contracts, financial filings, compliance reports, or policy manuals. To address this limitation, we propose ContraGen, a contradiction-aware benchmark framework tailored to enterprise domain. The framework generates synthetic enterprise-style documents with embedded contradictions, enabling systematic evaluation of both intra-document and cross-document consistency. Automated contradiction mining is combined with human-in-the-loop validation to ensure high accuracy. Our contributions include generating realistic enterprise documents, modeling a taxonomy of contradiction types common in business processes, enabling controlled creation of self- and pairwise contradictions, developing a contradiction-aware retrieval evaluation pipeline and embedding human oversight to reflect domain-specific judgment complexity. This work establishes a foundation for more trustworthy and accountable RAG systems in enterprise information-seeking applications, where detecting and resolving contradictions is essential for reducing risk and ensuring compliance.
Problem

Research questions and friction points this paper is trying to address.

Detecting contradictions in enterprise documents for compliance
Addressing limitations of sentence-level contradiction detection benchmarks
Generating synthetic enterprise documents with embedded contradictions
Innovation

Methods, ideas, or system contributions that make the work stand out.

Generates synthetic enterprise documents with contradictions
Models taxonomy of business process contradiction types
Combines automated mining with human validation
🔎 Similar Papers
No similar papers found.
Ananya Mantravadi
Ananya Mantravadi
North Carolina State University
Data ScienceDeep LearningMachine LearningComputer Vision
S
Shivali Dalmia
Centific, USA
A
Abhishek Mukherji
Centific, USA
N
Nand Dave
Centific, USA
A
Anudha Mittal
Centific, USA