This book focuses on new and emerging data mining solutions that offer a greater level of transparency than existing solutions. Transparent data mining solutions with desirable properties (e.g. effective, fully automatic, scalable) are covered in the book. Experimental findings of transparent soluti
Small Summaries for Big Data
β Scribed by Graham Cormode, Ke Yi
- Publisher
- Cambridge University Press
- Year
- 2020
- Tongue
- English
- Leaves
- 279
- Edition
- 1
- Category
- Library
No coin nor oath required. For personal study only.
β¦ Synopsis
The massive volume of data generated in modern applications can overwhelm our ability to conveniently transmit, store, and index it. For many scenarios, building a compact summary of a dataset that is vastly smaller enables flexibility and efficiency in a range of queries over the data, in exchange for some approximation. This comprehensive introduction to data summarization, aimed at practitioners and students, showcases the algorithms, their behavior, and the mathematical underpinnings of their operation. The coverage starts with simple sums and approximate counts, building to more advanced probabilistic structures such as the Bloom Filter, distinct value summaries, sketches, and quantile summaries. Summaries are described for specific types of data, such as geometric data, graphs, and vectors and matrices. The authors offer detailed descriptions of and pseudocode for key algorithms that have been incorporated in systems from companies such as Google, Apple, Microsoft, Netflix and Twitter.
π SIMILAR VOLUMES
<p>This book focuses on new and emerging data mining solutions that offer a greater level of transparency than existing solutions. Transparent data mining solutions with desirable properties (e.g. effective, fully automatic, scalable) are covered in the book. Experimental findings of transparent sol
This book offers an engaging and accessible introduction to data visualization for communicators, covering everything from data collection and analysis to the creation of effective data visuals. γ Straying from the typical "how to visualize data" genre often written for technical audiences, Big D
<p><p>Owing to continuous advances in the computational power of handheld devices like smartphones and tablet computers, it has become possible to perform <b><i>Big Data</i></b> operations including modern data mining processes onboard these small devices. A decade of research has proved the feasibi
<p><p>This book aims to provide some insights into recently developed bio-inspired algorithms within recent emerging trends of fog computing, sentiment analysis, and data streaming as well as to provide a more comprehensive approach to the big data management from pre-processing to analytics to visu
"This book is designed to help practitioners and students in a wide range of construction project management professions understand what BIM and big data could mean for them, and how they should prepare to work successfully on BIM-compliant projects and maintain their competencies in this essential