Keyword Queries over the Deep Web

📅 2016-11-14
🏛️ International Conference on Conceptual Modeling
📈 Citations: 6
Influential: 1
📄 PDF
🤖 AI Summary
To address the challenge of keyword search over structured deep web data—particularly restricted tables—that are inherently inaccessible to conventional keyword-based retrieval, this paper proposes a keyword query modeling framework tailored for the deep web. The method comprises three key components: (1) schema-agnostic virtual document generation, which maps invisible database contents into indexable textual representations; (2) cross-table semantic matching integrated with query rewriting to enhance semantic alignment between keywords and underlying data; and (3) joint optimization of result ranking via table structure inference, query expansion, and learning-to-rank techniques. Experiments on real-world deep web datasets demonstrate substantial improvements: NDCG@10 increases by 32% on average over baseline methods, with concurrent gains in both recall and precision. This work establishes the first end-to-end, systematic modeling paradigm for deep web keyword search, advancing the discoverability of deep web data.

Technology Category

Application Category

Problem

Research questions and friction points this paper is trying to address.

Deep Web
Search Engine Optimization
Data Retrieval
Innovation

Methods, ideas, or system contributions that make the work stand out.

Deep Web Search
Dynamic Page Information Retrieval
Optimized Search Strategy
🔎 Similar Papers
No similar papers found.
A
A. Calí
Birkbeck, University of London, UK
D
D. Martinenghi
Politecnico di Milano, Italy
Riccardo Torlone
Riccardo Torlone
Università Roma Tre
Data exchange and integrationdata analysisdata qualitydata-centric AIethical data