Data Deduplication for High Performance Storage System

✍ Scribed by Dan Feng

Publisher: Springer Nature
Year: 2022
Tongue: English
Leaves: 170
Category: Library

No coin nor oath required. For personal study only.

✦ Synopsis

This book comprehensively introduces data deduplication technologies for storage systems. It first presents the overview of data deduplication including its theoretical basis, basic workflow, application scenarios and its key technologies, and then the book focuses on each key technology of the deduplication to provide an insight into the evolution of the technology over the years including chunking algorithms, indexing schemes, fragmentation reduced schemes, rewriting algorithm and security solution. In particular, the state-of-the-art solutions and the newly proposed solutions are both elaborated. At the end of the book, the author discusses the fundamental trade-offs in each of deduplication design choices and propose an open-source deduplication prototype. The book with its fundamental theories and complete survey can guide the beginners, students and practitioners working on data deduplication in storage system. It also provides a compact reference in the perspective of key data deduplication technologies for those researchers in developing high performance storage solutions.

📜 SIMILAR VOLUMES

Data Deduplication for High Performance

📁 Data Deduplication for High Performance Storage System

✍ Dan Feng 📂 Library 📅 2022 🏛 Springer 🌐 English

This book comprehensively introduces data deduplication technologies for storage systems. It first presents the overview of data deduplication including its theoretical basis, basic workflow, application scenarios and its key technologies, and then the book focuses on each key technology of

Data Deduplication for Data Optimization

📁 Data Deduplication for Data Optimization for Storage and Network Systems

✍ Daehee Kim, Sejun Song, Baek-Young Choi (auth.) 📂 Library 📅 2017 🏛 Springer International Publishing 🌐 English

This book introduces fundamentals and trade-offs of data de-duplication techniques. It describes novel emerging de-duplication techniques that remove duplicate data both in storage and network in an efficient and effective manner. It explains places where duplicate data are originated, and pro

Client Data Caching: A Foundation for Hi

📁 Client Data Caching: A Foundation for High Performance Object Database Systems

✍ Michael J. Franklin (auth.) 📂 Library 📅 1996 🏛 Springer US 🌐 English

Despite the significant ongoing work in the development of new database systems, many of the basic architectural and performance tradeoffs involved in their design have not previously been explored in a systematic manner. The designers of the various systems have adopted a wide range of strategie

Storage Systems: Organization, Performan

📁 Storage Systems: Organization, Performance, Coding, Reliability, and Their Data Processing

✍ Alexander Thomasian 📂 Library 📅 2021 🏛 Morgan Kaufmann 🌐 English

Storage Systems: Organization, Performance, Coding, Reliability and Their Data Processing covers the coding, reliability and performance of popular RAID organizations: RAID1 mirrored disks, RAID5/6/7 1/2/3-disk failure tolerant - 1/2/3DFT arrays. Readers will learn about the storage of fil

Storage Systems: Organization, Performan

📁 Storage Systems: Organization, Performance, Coding, Reliability, and Their Data Processing

✍ Alexander Thomasian 📂 Library 📅 2021 🏛 Morgan Kaufmann 🌐 English

Storage Systems: Organization, Performance, Coding, Reliability and Their Data Processing was motivated by the 1988 Redundant Array of Inexpensive/Independent Disks proposal to replace large form factor mainframe disks with an array of commodity disks. Disk loads are balanced by striping data into s

Advanced Error Control Techniques for Da

📁 Advanced Error Control Techniques for Data Storage Systems

✍ Erozan M. Kurtas, Bane Vasic 📂 Library 📅 2005 🌐 English

With the massive amount of data produced and stored each year, reliable storage and retrieval of information is more crucial than ever. Robust coding and decoding techniques are critical for correcting errors and maintaining data integrity. Comprising chapters thoughtfully selected from the highly p