🤖 AI Summary
This study addresses the limited semantic understanding and reasoning capabilities of large language models (LLMs) in Islamic knowledge question answering. We propose a three-stage hybrid retrieval-augmented generation (RAG) framework: (1) initial keyword-based retrieval using BM25; (2) fine-grained semantic matching via dense embedding models; and (3) candidate paragraph re-ranking using a cross-encoder. By synergistically integrating sparse and dense retrieval paradigms, the framework significantly improves semantic matching accuracy and answer generation quality for religious texts. Experimental results on two Islamic knowledge subtasks demonstrate up to a 25% absolute improvement in accuracy over baseline methods. Specifically, the Fanar model achieves 45% accuracy on Subtask 1 and 80% on Subtask 2, validating the framework’s effectiveness and generalizability for domain-specific knowledge QA.
📝 Abstract
This paper presents our submission to the QIAS 2025 shared task on Islamic knowledge understanding and reasoning. We developed a hybrid retrieval-augmented generation (RAG) system that combines sparse and dense retrieval methods with cross-encoder reranking to improve large language model (LLM) performance. Our three-stage pipeline incorporates BM25 for initial retrieval, a dense embedding retrieval model for semantic matching, and cross-encoder reranking for precise content retrieval. We evaluate our approach on both subtasks using two LLMs, Fanar and Mistral, demonstrating that the proposed RAG pipeline enhances performance across both, with accuracy improvements up to 25%, depending on the task and model configuration. Our best configuration is achieved with Fanar, yielding accuracy scores of 45% in Subtask 1 and 80% in Subtask 2.