𝔖 Scriptorium
✦   LIBER   ✦

πŸ“

Query Processing Over Incomplete Databases

✍ Scribed by H. V. Jagadish (editor), Yunjun Gao, Xiaoye Miao


Publisher
MORGAN & CLAYPOOL
Year
2018
Tongue
English
Leaves
124
Series
Synthesis Lectures on Data Management
Category
Library

⬇  Acquire This Volume

No coin nor oath required. For personal study only.

✦ Synopsis


Incomplete data is part of life and almost all areas of scientific studies. Users tend to skip certain fields when they fill out online forms; participants choose to ignore sensitive questions on surveys; sensors fail, resulting in the loss of certain readings; publicly viewable satellite map services have missing data in many mobile applications; and in privacy-preserving applications, the data is incomplete deliberately in order to preserve the sensitivity of some attribute values.

Query processing is a fundamental problem in computer science, and is useful in a variety of applications. In this book, we mostly focus on the query processing over incomplete databases, which involves finding a set of qualified objects from a specified incomplete dataset in order to support a wide spectrum of real-life applications. We first elaborate the three general kinds of methods of handling incomplete data, including (i) discarding the data with missing values, (ii) imputation for the missing values, and (iii) just depending on the observed data values. For the third method type, we introduce the semantics of k-nearest neighbor (kNN) search, skyline query, and top-k dominating query on incomplete data, respectively. In terms of the three representative queries over incomplete data, we investigate some advanced techniques to process incomplete data queries, including indexing, pruning as well as crowdsourcing techniques.

✦ Table of Contents


Preface
Acknowledgments
Introduction
Applications of Incomplete Data Management
Overview of Incomplete Databases
Indexing Incomplete Databases
Querying Incomplete Databases
Incomplete Database Management Systems
Challenges of Querying Incomplete Databases
Organization
Handling Incomplete Data Methods
Method Taxonomy
Overview of Imputation Methods
Statistical Imputation
Machine Learning-Based Imputation
Modern Imputation Methods
Query Semantics on Incomplete Data
k-Nearest Neighbor Search on Incomplete Data
Background
Problem Definition
Skyline Queries on Incomplete Data
Background
Problem Definition
Top-k Dominating Queries on Incomplete Data
Background
Problem Definition
Advanced Techniques
Index Structures
LB Index for k-Nearest Neighbor Search on Incomplete Data
Histogram Index for k-Nearest Neighbor Search on Incomplete Data
Bitmap Index for Top-k Dominating Queries on Incomplete Data
Pruning Heuristics
Alpha Value Pruning for k-Nearest Neighbor Search on Incomplete Data
Histogram-Based Pruning for k-Nearest Neighbor Search on Incomplete Data
Local Skyband Pruning for Top-k Dominating Queries on Incomplete Data
Upper Bound Score Pruning for Top-k Dominating Queries on Incomplete Data
Bitmap Pruning for Top-k Dominating Queries on Incomplete Data
Crowdsourcing Techniques
Crowdsourcing Framework for Skyline Queries on Incomplete Data
C-Table Construction
Probability Computation
Crowd Task Selection
Conclusions
Bibliography
Authors' Biographies
Blank Page


πŸ“œ SIMILAR VOLUMES


Query Processing in Database Systems
✍ Matthias Jarke, JΓΌrgen Koch, Joachim W. Schmidt (auth.), Dr. Won Kim, Dr. David πŸ“‚ Library πŸ“… 1985 πŸ› Springer-Verlag Berlin Heidelberg 🌐 English

<p>This book is an anthology of the results of research and development in database query processing during the past decade. The relational model of data provided tremendous impetus for research into query processing. Since a relational query does not specify access paths to the stored data, the dat

Adaptive Query Processing (Foundations a
✍ Amol Deshpande, Zachary Ives, Vijayshankar Raman πŸ“‚ Library πŸ“… 2007 🌐 English

Adaptive Query Processing surveys the fundamental issues, techniques, costs, and benefits of adaptive query processing. It begins with a broad overview of the field, identifying the dimensions of adaptive techniques. It then looks at the spectrum of approaches available to adapt query execution at r

Data Management and Query Processing in
✍ Sven Groppe (auth.) πŸ“‚ Library πŸ“… 2011 πŸ› Springer-Verlag Berlin Heidelberg 🌐 English

<p><p>The Semantic Web, which is intended to establish a machine-understandable Web, is currently changing from being an emerging trend to a technology used in complex real-world applications. A number of standards and techniques have been developed by the World Wide Web Consortium (W3C), e.g., the

Data Management and Query Processing in
✍ Sven Groppe (auth.) πŸ“‚ Library πŸ“… 2011 πŸ› Springer-Verlag Berlin Heidelberg 🌐 English

<p><p>The Semantic Web, which is intended to establish a machine-understandable Web, is currently changing from being an emerging trend to a technology used in complex real-world applications. A number of standards and techniques have been developed by the World Wide Web Consortium (W3C), e.g., the

RDF Database Systems: Triples Storage an
✍ Olivier CurΓ©, Guillaume Blin πŸ“‚ Library πŸ“… 2014 πŸ› Morgan Kaufmann 🌐 English

<i>RDF Database Systems</i> is a cutting-edge guide that distills everything you need to know to effectively use or design an RDF database. This book starts with the basics of linked open data and covers the most recent research, practice, and technologies to help you leverage semantic technology. W

Peer-to-peer query processing over multi
✍ Akrivi Vlachou, Christos Doulkeridis, Kjetil NΓΈrvΓ₯g, Yannis Kotidis (auth.) πŸ“‚ Library πŸ“… 2012 πŸ› Springer-Verlag New York 🌐 English

<p>Applications that require a high degree of distribution and loosely-coupled connectivity are ubiquitous in various domains, including scientific databases, bioinformatics, and multimedia retrieval. In all these applications, data is typically voluminous and multidimensional, and support for advan