AIANO: Enhancing Information Retrieval with AI-Augmented Annotation

📅 2026-02-04
📈 Citations: 0
Influential: 0
🤖 AI Summary
This work addresses the inefficiency of traditional information retrieval dataset annotation, which relies on generic tools and struggles to meet the growing demand for high-quality question-answering data driven by large language models and retrieval-augmented generation (RAG). To overcome this limitation, the authors propose AIANO, a human-AI collaborative annotation tool that integrates large language model suggestions, an interactive interface, and a RAG-oriented workflow. While preserving full annotator control, AIANO significantly enhances both annotation efficiency and quality. User studies demonstrate that AIANO nearly doubles annotation speed compared to baseline tools, offers superior usability, and effectively improves downstream retrieval accuracy.

📝 Abstract
The rise of Large Language Models (LLMs) and Retrieval-Augmented Generation (RAG) has rapidly increased the need for high-quality, curated information retrieval datasets. These datasets, however, are currently created with off-the-shelf annotation tools that make the annotation process complex and inefficient. To streamline this process, we developed AIANO, a specialized annotation tool. By adopting an AI-augmented annotation workflow that tightly integrates human expertise with LLM assistance, AIANO enables annotators to leverage AI suggestions while retaining full control over annotation decisions. In a within-subject user study ($n = 15$), participants created question-answering datasets using both a baseline tool and AIANO. AIANO nearly doubled annotation speed compared to the baseline while being easier to use and improving retrieval accuracy. These results demonstrate that AIANO's AI-augmented approach accelerates and enhances dataset creation for information retrieval tasks, advancing annotation capabilities in retrieval-intensive domains.
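The abstract describes a workflow in which the LLM proposes and the annotator decides. A minimal Python sketch of that pattern follows. It is not AIANO's implementation or API: every name here (`annotate`, `stub_suggest`, the reviewer functions, the `source` labels) is a hypothetical stand-in, and the LLM call is replaced by a stub.

```python
from dataclasses import dataclass
from typing import Callable, Optional, Tuple

QAPair = Tuple[str, str]  # (question, answer)


@dataclass
class Annotation:
    passage: str
    question: str
    answer: str
    source: str  # "ai" if accepted unchanged, "ai_edited" if the human changed it


def annotate(
    passage: str,
    suggest: Callable[[str], QAPair],
    review: Callable[[str, QAPair], Optional[QAPair]],
) -> Optional[Annotation]:
    """Ask the model for a suggestion, then let the human reviewer decide.

    The reviewer returns the final (question, answer) pair, or None to reject.
    Nothing is stored without passing through the reviewer.
    """
    suggestion = suggest(passage)
    decision = review(passage, suggestion)
    if decision is None:
        return None  # rejected: no annotation is created
    source = "ai" if decision == suggestion else "ai_edited"
    return Annotation(passage, decision[0], decision[1], source)


def stub_suggest(passage: str) -> QAPair:
    # Hypothetical placeholder for an LLM call; returns a fixed-form suggestion.
    return ("What does the passage state?", passage)


# Three hypothetical reviewer behaviours: accept, edit, reject.
def accept(passage: str, suggestion: QAPair) -> Optional[QAPair]:
    return suggestion


def edit(passage: str, suggestion: QAPair) -> Optional[QAPair]:
    return ("What is stated about aspirin?", suggestion[1])


def reject(passage: str, suggestion: QAPair) -> Optional[QAPair]:
    return None
```

Calling `annotate("Aspirin inhibits COX enzymes.", stub_suggest, edit)` yields an annotation whose `source` is `"ai_edited"`, so the provenance of each item stays visible in the resulting dataset. In an interactive tool the reviewer would be a UI callback rather than a function like these.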
Problem

Research questions and friction points this paper is trying to address.

information retrieval
dataset annotation
annotation efficiency
retrieval-augmented generation
large language models
Innovation

Methods, ideas, or system contributions that make the work stand out.

AI-augmented annotation
information retrieval
large language models
annotation tool
retrieval-augmented generation
Sameh Khattab
Institute for Artificial Intelligence in Medicine (IKIM), University Hospital Essen (AöR), Essen, Germany
Marie Bauer
Software Developer, Institute for AI in Medicine, Essen
NLP, ML, Computational Linguistics, Linguistics
Lukas Heine
Institute for AI in Medicine (IKIM), University Hospital Essen (AöR), Essen, Germany
Till Rostalski
Institute for Artificial Intelligence in Medicine (IKIM), University Hospital Essen (AöR), Essen, Germany
J. Kleesiek
Institute for Artificial Intelligence in Medicine (IKIM), University Hospital Essen (AöR), Essen, Germany
Julian Friedrich
Institute for Artificial Intelligence in Medicine (IKIM), University Hospital Essen (AöR), Essen, Germany