Document Retrieval Augmented Fine-Tuning (DRAFT) for safety-critical software assessments

πŸ“… 2025-05-02
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
In safety-critical software compliance assessment, manual review suffers from low efficiency, inaccurate evidence citation, and weak reasoning robustness. To address these challenges, this paper proposes DRAFTβ€”a novel framework featuring a dual-path collaborative retrieval-augmented paradigm that jointly retrieves software documentation and regulatory standards. We introduce a semi-automated data generation method incorporating distractors to realistically model expert cognitive load during evaluation. DRAFT integrates retrieval-augmented generation (RAG), supervised fine-tuning, and lightweight adaptation of GPT-4o-mini. Evaluated in highly regulated settings, DRAFT improves assessment accuracy by 7%, significantly enhancing evidence traceability, response structuring, and domain-specific reasoning stability. It establishes a reproducible, verifiable pathway for high-assurance AI-assisted compliance review.

Technology Category

Application Category

πŸ“ Abstract
Safety critical software assessment requires robust assessment against complex regulatory frameworks, a process traditionally limited by manual evaluation. This paper presents Document Retrieval-Augmented Fine-Tuning (DRAFT), a novel approach that enhances the capabilities of a large language model (LLM) for safety-critical compliance assessment. DRAFT builds upon existing Retrieval-Augmented Generation (RAG) techniques by introducing a novel fine-tuning framework that accommodates our dual-retrieval architecture, which simultaneously accesses both software documentation and applicable reference standards. To fine-tune DRAFT, we develop a semi-automated dataset generation methodology that incorporates variable numbers of relevant documents with meaningful distractors, closely mirroring real-world assessment scenarios. Experiments with GPT-4o-mini demonstrate a 7% improvement in correctness over the baseline model, with qualitative improvements in evidence handling, response structure, and domain-specific reasoning. DRAFT represents a practical approach to improving compliance assessment systems while maintaining the transparency and evidence-based reasoning essential in regulatory domains.
Problem

Research questions and friction points this paper is trying to address.

Enhancing LLM capabilities for safety-critical compliance assessment
Automating evaluation against complex regulatory frameworks
Improving evidence handling and domain-specific reasoning accuracy
Innovation

Methods, ideas, or system contributions that make the work stand out.

DRAFT combines dual-retrieval with fine-tuning for compliance.
Semi-automated dataset generation enhances real-world assessment accuracy.
Improves correctness and reasoning in regulatory compliance assessments.
πŸ”Ž Similar Papers
No similar papers found.
R
Regan Bolton
Digital Transit Limited, 3M Buckley Innovation Centre, UK, HD1 3BD
M
Mohammadreza Sheikhfathollahi
Department of Computer Science, University of Huddersfield, UK, HD1 3DH
Simon Parkinson
Simon Parkinson
Professor, University of Huddersfield & UK Government Cyber Security Advisory Board Member
Cyber SecurityArtificial IntelligenceAutomated Planning
V
Vanessa Vulovic
Digital Transit Limited, 3M Buckley Innovation Centre, UK, HD1 3BD
G
Gary Bamford
Digital Transit Limited, 3M Buckley Innovation Centre, UK, HD1 3BD
D
D. Basher
Digital Transit Limited, 3M Buckley Innovation Centre, UK, HD1 3BD
H
Howard Parkinson
Digital Transit Limited, 3M Buckley Innovation Centre, UK, HD1 3BD