Effective XML content and structure retrieval with relevance ranking. In: Proceeding of the 18th ACM conference on Information and knowledge management (CIKM '09), Hong Kong, China, November 2-6, 2009. ACM Press.
Authors: Liu, Xiping; Wan, Changxuan; Chen, Lei
- Book ID: 127252708
- Publisher: ACM Press
- Year: 2009
- Language: English
- File size: 848 KB
- Category: Article
- ISBN: 1605585122
Synopsis
XML documents can be retrieved by means of not only content-only (CO) queries, but also content-and-structure (CAS) queries. Though promising better retrieval precision, CAS queries introduce several new challenges. To address these challenges, we propose a novel approach for XML CAS retrieval. The distinctive feature of the approach is that it adopts a content-oriented point of view. Specifically, the approach first decomposes a CAS query into several fragments, then retrieves results for each query fragment in a content-centric way, and finally scores each answer node. The approach is adaptive to versatile homogeneous and heterogeneous data environments. To assess the relevance of retrieval results to a query fragment, we present a scoring strategy that measures relevance from both content and structure perspectives. In addition, an effective approach is proposed to infer answer nodes based on the CAS query and document structure. An efficient algorithm is also presented for CAS retrieval. Finally, we demonstrate the effectiveness of the proposed methods through comprehensive experimental studies.
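The three steps named in the synopsis (decompose the query, retrieve per fragment by content, score by content and structure) can be illustrated with a small sketch. This is not the authors' algorithm: the synopsis gives no formulas, so the NEXI-style query syntax, the keyword-overlap content score, the greedy tag-subsequence structure score, and every function name below are assumptions chosen only to make the idea concrete.

```python
import re
import xml.etree.ElementTree as ET


def decompose(query):
    """Split a NEXI-style CAS query into (path, keywords) fragments (assumed syntax)."""
    fragments, path = [], []
    # Each step looks like //tag or //tag[about(., kw1 kw2)]
    for tag, kws in re.findall(r"//(\w+)(?:\[about\(\.,\s*([^)]*)\)\])?", query):
        path.append(tag)
        if kws:
            fragments.append((tuple(path), kws.lower().split()))
    return fragments


def walk(elem, path=()):
    """Yield every element together with its root-to-node tag path."""
    path = path + (elem.tag,)
    yield elem, path
    for child in elem:
        yield from walk(child, path)


def content_score(elem, keywords):
    """Fraction of query keywords found in the element's text (hypothetical measure)."""
    words = set(re.findall(r"\w+", "".join(elem.itertext()).lower()))
    return sum(k in words for k in keywords) / len(keywords)


def structure_score(query_path, node_path):
    """Fraction of query tags matched in order along the node path (greedy, hypothetical)."""
    remaining = iter(node_path)
    matched = sum(1 for tag in query_path if tag in remaining)
    return matched / len(query_path)


def retrieve(root, query):
    """Content first: keep nodes whose text matches, then weight them by structure."""
    results = []
    for q_path, keywords in decompose(query):
        for elem, n_path in walk(root):
            c = content_score(elem, keywords)
            if c > 0:
                s = structure_score(q_path, n_path)
                results.append((round(c * s, 3), "/".join(n_path), q_path[-1]))
    return sorted(results, reverse=True)


if __name__ == "__main__":
    doc = ET.fromstring(
        "<article><title>XML retrieval</title>"
        "<sec><p>Relevance ranking for retrieval</p></sec>"
        "<chapter><p>ranking only</p></chapter></article>"
    )
    for row in retrieve(doc, "//article[about(., XML)]//sec[about(., ranking retrieval)]"):
        print(row)
```

Because candidates are selected by content and structure only scales the score, a node under a differently named parent (here `chapter` instead of `sec`) still appears in the ranking with a lower score. That is one plausible reading of how a content-oriented approach could tolerate heterogeneous document structures.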