Context-Aware Search and Retrieval Over Erasure Channels

πŸ“… 2025-07-16
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
This paper addresses semantic retrieval errors in remote document retrieval over symbol-erasure channels, caused by the loss of query features. To tackle this, we propose a context-aware semantic communication framework. Methodologically, we design an adaptive repetition coding strategy guided by contextual importance: query feature vectors are extracted via term-frequency weighting, and redundancy is dynamically allocated according to semantic significance; a closed-form upper bound on retrieval error probability is derived using Gaussian approximation; and at the decoder, semantic recovery is performed via context-similarity-based decision making. Experiments on synthetic data and the Google Natural Questions dataset demonstrate that our approach significantly reduces retrieval error rates induced by critical feature erasures. Theoretical analysis aligns closely with empirical results, establishing a novel paradigm for robust, semantics-driven retrieval under unreliable channels.

Technology Category

Application Category

πŸ“ Abstract
This paper introduces and analyzes a search and retrieval model that adopts key semantic communication principles from retrieval-augmented generation. We specifically present an information-theoretic analysis of a remote document retrieval system operating over a symbol erasure channel. The proposed model encodes the feature vector of a query, derived from term-frequency weights of a language corpus by using a repetition code with an adaptive rate dependent on the contextual importance of the terms. At the decoder, we select between two documents based on the contextual closeness of the recovered query. By leveraging a jointly Gaussian approximation for both the true and reconstructed similarity scores, we derive an explicit expression for the retrieval error probability, i.e., the probability under which the less similar document is selected. Numerical simulations on synthetic and real-world data (Google NQ) confirm the validity of the analysis. They further demonstrate that assigning greater redundancy to critical features effectively reduces the error rate, highlighting the effectiveness of semantic-aware feature encoding in error-prone communication settings.
Problem

Research questions and friction points this paper is trying to address.

Analyzes document retrieval over erasure channels using semantic principles
Derives retrieval error probability for context-aware query encoding
Demonstrates redundancy in critical features reduces error rates
Innovation

Methods, ideas, or system contributions that make the work stand out.

Uses repetition code for adaptive query encoding
Leverages Gaussian approximation for error analysis
Assigns redundancy to critical features effectively
πŸ”Ž Similar Papers
No similar papers found.
S
Sara Ghasvarianjahromi
Department of Electrical and Computer Engineering, New Jersey Institute of Technology, Newark, New Jersey, 07102, USA
Yauhen Yakimenka
Yauhen Yakimenka
Postdoctoral Research Associate, New Jersey Institute of Technology
coding theoryinformation theoryprivate information retrievalcompressed sensing
J
JΓΆrg Kliewer
Department of Electrical and Computer Engineering, New Jersey Institute of Technology, Newark, New Jersey, 07102, USA