𝔖 Scriptorium
✦   LIBER   ✦

πŸ“

Disk-based algorithms for big data

✍ Scribed by Healey, Christopher Graham


Publisher
CRC Press
Year
2017
Tongue
English
Leaves
205
Category
Library

⬇  Acquire This Volume

No coin nor oath required. For personal study only.

✦ Synopsis


Disk-Based Algorithms for Big Data is a product of recent advances in the areas of big data, data analytics, and the underlying file systems and data management algorithms used to support the storage and analysis of massive data collections. The book discusses hard disks and their impact on data management, since Hard Disk Drives continue to be common in large data clusters. It also explores ways to store and retrieve data though primary and secondary indices. This includes a review of different in-memory sorting and searching algorithms that build a foundation for more sophisticated on-disk approaches like mergesort, B-trees, and extendible hashing.

Following this introduction, the book transitions to more recent topics, including advanced storage technologies like solid-state drives and holographic storage; peer-to-peer (P2P) communication; large file systems and query languages like Hadoop/HDFS, Hive, Cassandra, and Presto; and NoSQL databases like Neo4j for graph structures and MongoDB for unstructured document data.

Designed for senior undergraduate and graduate students, as well as professionals, this book is useful for anyone interested in understanding the foundations and advances in big data storage and management, and big data analytics.


About the Author


Dr. Christopher G. Healey is a tenured Professor in the Department of Computer Science and the Goodnight Distinguished Professor of Analytics in the Institute for Advanced Analytics, both at North Carolina State University in Raleigh, North Carolina. He has published over 50 articles in major journals and conferences in the areas of visualization, visual and data analytics, computer graphics, and artificial intelligence. He is a recipient of the National Science Foundation’s CAREER Early Faculty Development Award and the North Carolina State University Outstanding Instructor Award. He is a Senior Member of the Association for Computing Machinery (ACM) and the Institute of Electrical and Electronics Engineers (IEEE), and an Associate Editor of ACM Transaction on Applied Perception, the leading worldwide journal on the application of human perception to issues in computer science.

✦ Table of Contents


Content: Chapter 1. Physical disk storage --
Chapter 2. File management --
Chapter 3. Sorting --
Chapter 4. Searching --
Chapter 5. Disk-based sorting --
Chapter 6. Disk-based searching --
Chapter 7. Storage technology --
Chapter 8. Distributed hast tables --
Chapter 9. Large file systems --
Chapter 10. NoSQL storage.

✦ Subjects


Big data;Disk access (Computer science);COMPUTERS;Data Processing


πŸ“œ SIMILAR VOLUMES


Disk-based algorithms for big data
✍ Healey, Christopher Graham πŸ“‚ Library πŸ“… 2017 πŸ› CRC Press Taylor & Francis Group 🌐 English

<P>Disk-Based Algorithms for Big Data is a product of recent advances in the areas of big data, data analytics, and the underlying file systems and data management algorithms used to support the storage and analysis of massive data collections. The book discusses hard disks and their impact on data

Algorithms For Big Data
✍ Feldman, Moran πŸ“‚ Library πŸ“… 2021 πŸ› World Scientific Publishing Co. Pte. Ltd. 🌐 English

This unique volume is an introduction for computer scientists, including a formal study of theoretical algorithms for Big Data applications, which allows them to work on such algorithms in the future. It also serves as a useful reference guide for the general computer science population, providing a

Algorithms for Big Data
✍ Moran Feldman πŸ“‚ Library πŸ“… 2020 πŸ› WSPC 🌐 English

<p>This unique volume is an introduction for computer scientists, including a formal study of theoretical algorithms for Big Data applications, which allows them to work on such algorithms in the future. It also serves as a useful reference guide for the general computer science population, providin

Bio-inspired Algorithms for Data Streami
✍ Simon James Fong, Richard C. Millham πŸ“‚ Library πŸ“… 2021 πŸ› Springer Singapore;Springer 🌐 English

<p><p>This book aims to provide some insights into recently developed bio-inspired algorithms within recent emerging trends of fog computing, sentiment analysis, and data streaming as well as to provide a more comprehensive approach to the big data management from pre-processing to analytics to visu

Sublinear Algorithms for Big Data Applic
✍ Dan Wang, Zhu Han πŸ“‚ Library πŸ“… 2015 πŸ› Springer International Publishing 🌐 English

The brief focuses on applying sublinear algorithms to manage critical big data challenges. The text offers an essential introduction to sublinear algorithms, explaining why they are vital to large scale data systems. It also demonstrates how to apply sublinear algorithms to three familiar big data a