𝔖 Scriptorium
✦   LIBER   ✦

πŸ“

Data simplification : taming information with open source tools

✍ Scribed by Berman, Jules J


Publisher
Morgan Kaufmann is an imprint of Elsevier
Year
2016
Tongue
English
Leaves
386
Edition
1
Category
Library

⬇  Acquire This Volume

No coin nor oath required. For personal study only.

✦ Synopsis


Data Simplification: Taming Information With Open Source Tools addressesthe simple fact that modern data is too big and complex to analyze in its native form. Data simplification is the process whereby large and complex data is rendered usable. Complex data must be simplified before it can be analyzed, but the process of data simplification is anything but simple, requiring a specialized set of skills and tools.

This book provides data scientists from every scientific discipline with the methods and tools to simplify their data for immediate analysis or long-term storage in a form that can be readily repurposed or integrated with other data.

Drawing upon years of practical experience, and using numerous examples and use cases, Jules Berman discusses the principles, methods, and tools that must be studied and mastered to achieve data simplification, open source tools, free utilities and snippets of code that can be reused and repurposed to simplify data, natural language processing and machine translation as a tool to simplify data, and data summarization and visualization and the role they play in making data useful for the end user.

  • Discusses data simplification principles, methods, and tools that must be studied and mastered
  • Provides open source tools, free utilities, and snippets of code that can be reused and repurposed to simplify data
  • Explains how to best utilize indexes to search, retrieve, and analyze textual data
  • Shows the data scientist how to apply ontologies, classifications, classes, properties, and instances to data using tried and true methods

✦ Table of Contents


Content:
Front Matter,Copyright,Dedication,Foreword,Preface,Author BiographyEntitled to full textChapter 1 - The Simple Life, Pages 1-44
Chapter 2 - Structuring Text, Pages 45-89
Chapter 3 - Indexing Text, Pages 91-133
Chapter 4 - Understanding Your Data, Pages 135-187
Chapter 5 - Identifying and Deidentifying Data, Pages 189-231
Chapter 6 - Giving Meaning to Data, Pages 233-284
Chapter 7 - Object-Oriented Data, Pages 285-319
Chapter 8 - Problem Simplification, Pages 321-360
Index, Pages 361-366


πŸ“œ SIMILAR VOLUMES


Data Analysis with Open Source Tools
✍ Philipp K. Janert πŸ“‚ Library πŸ“… 2010 πŸ› O'Reilly Media 🌐 English

These days it seems like everyone is collecting data. But all of that data is just raw information -- to make that information meaningful, it has to be organized, filtered, and analyzed. Anyone can apply data analysis tools and get results, but without the right approach those results may be useless

Bioinformatics data skills: [reproducibl
✍ O'Reilly Media.;Buffalo, Vince πŸ“‚ Library πŸ“… 2015 πŸ› O'Reilly Media, Inc. 🌐 English

Learn the data skills necessary for turning large sequencing datasets into reproducible and robust biological findings. With this practical guide, youll learn how to use freely available open source tools to extract meaning from large complex biological data sets.

Bioinformatics Data Skills: Reproducible
✍ Vince Buffalo πŸ“‚ Library πŸ“… 2015 πŸ› O'Reilly Media 🌐 English

<div>This practical book teaches the skills that scientists need for turning large sequencing datasets into reproducible and robust biological findings. Many biologists begin their bioinformatics training by learning scripting languages like Python and R alongside the Unix command line. But there's