๐”– Scriptorium
โœฆ   LIBER   โœฆ

๐Ÿ“

XML Retrieval (Synthesis Lectures on Information Concepts, Retrieval, and Services)

โœ Scribed by Mounia Lalmas


Publisher
Morgan and Claypool Publishers
Year
2009
Tongue
English
Leaves
112
Series
Synthesis Lectures on Information Concepts, Retrieval, and Services
Category
Library

โฌ‡  Acquire This Volume

No coin nor oath required. For personal study only.

โœฆ Synopsis


Documents usually have a content and a structure. The content refers to the text of the document, whereas the structure refers to how a document is logically organized. An increasingly common way to encode the structure is through the use of a mark-up language. Nowadays, the most widely used mark-up language for representing structure is the eXtensible Mark-up Language (XML). XML can be used to provide a focused access to documents, i.e. returning XML elements, such as sections and paragraphs, instead of whole documents in response to a query. Such focused strategies are of particular benefit for information repositories containing long documents, or documents covering a wide variety of topics, where users are directed to the most relevant content within a document. The increased adoption of XML to represent a document structure requires the development of tools to effectively access documents marked-up in XML. This book provides a detailed description of query languages, indexing strategies, ranking algorithms, presentation scenarios developed to access XML documents. Major advances in XML retrival were seen from 2002 as a result of INEX, the Initiative for Evaluation of XML Retrieval. INEX, also described in this book, provided test sets for evaluating XML retrieval effectiveness. Many of the developments and results described in this book were investigated within INEX. Table of Contents: Introduction / Basic XML Concepts / Historical Perspectives / Query Languages / Indexing Strategies / Ranking Strategies / Presentation strategies / Evaluating XML Retrieval Effectiveness / Conclusions

โœฆ Table of Contents


Acknowledgments......Page 12
Introduction......Page 14
Element......Page 16
Well-Formed XML Document......Page 17
Document Type Declaration......Page 19
XML Schema......Page 20
XML Documents as Trees......Page 21
Structured Document Retrieval......Page 24
Structured Text Retrieval......Page 25
Data- vs Document-Centric XML Documents......Page 26
Content-Oriented XML Retrieval......Page 28
Focused Retrieval......Page 29
Structural Constraints......Page 30
Content-and-Structure......Page 32
XPath......Page 34
NEXI......Page 36
XQuery......Page 37
XQuery Full-Text......Page 38
Discussion......Page 40
Indexing strategies......Page 42
Element-Based Indexing......Page 43
Leaf-Only Indexing......Page 44
Selective Indexing......Page 45
Distributed Indexing......Page 46
Structure Indexing......Page 47
Discussion......Page 48
Element Scoring......Page 50
Contextualization......Page 51
Propagation......Page 52
Aggregation......Page 54
Merging......Page 55
Processing Structural Constraints......Page 56
Discussion......Page 58
Presentation strategies......Page 60
Dealing with Overlaps......Page 61
Presenting Elements in Context......Page 63
Entry Points......Page 64
Discussion......Page 66
Document Collections......Page 68
Topics......Page 69
Relevance Assessments......Page 72
Retrieval Tasks......Page 77
Measures......Page 81
Discussion......Page 84
XML element'' Retrieval......Page 86<br>Beyond XMLelement'' Retrieval......Page 89
Beyond XML Retrieval......Page 91
Bibliography......Page 94
Biography......Page 112


๐Ÿ“œ SIMILAR VOLUMES


Estimating the Query Difficulty for Info
โœ David Carmel, Elad Yom-Tov ๐Ÿ“‚ Library ๐Ÿ“… 2010 ๐ŸŒ English

Many information retrieval (IR) systems suffer from a radical variance in performance when responding to users' queries. Even for systems that succeed very well on average, the quality of results returned for some of the queries is poor. Thus, it is desirable that IR systems will be able to identify

Online Multiplayer Games (Synthesis Lect
โœ William Sims Bainbridge ๐Ÿ“‚ Library ๐Ÿ“… 2010 ๐ŸŒ English

This lecture introduces fundamental principles of online multiplayer games, primarily massively multiplayer online role-playing games (MMORPGs), suitable for students and faculty interested both in designing games and in doing research on them. The general focus is human-centered computing, which in

Reading and Writing the Electronic Book
โœ Catherine C. Marshall ๐Ÿ“‚ Library ๐Ÿ“… 2009 ๐Ÿ› Morgan and Claypool Publishers ๐ŸŒ English

Developments over the last twenty years have fueled considerable speculation about the future of the book and of reading itself. This book begins with a brief historical overview the history of electronic books, including the social and technical forces that have shaped their development. The focus

Information Architecture: The Design and
โœ Wei Ding, Xia Lin ๐Ÿ“‚ Library ๐Ÿ“… 2009 ๐ŸŒ English

Information Architecture is about organizing and simplifying information, designing and integrating information spaces/systems, and creating ways for people to find and interact with information content. Its goal is to help people understand and manage information and make right decisions accordingl

Hypermedia Genes: An Evolutionary Perspe
โœ Nuno Guimaraes, Luis Carrico? ๐Ÿ“‚ Library ๐Ÿ“… 2009 ๐ŸŒ English

The design space of information services evolved from seminal works through a set of prototypical hypermedia systems and matured in open and widely accessible web-based systems. The original concepts of hypermedia systems are now expressed in different forms and shapes. The first works on hypertext