A key task that any aspiring data-driven organization needs to learn is data wrangling, the process of converting raw data into something truly useful. This practical guide provides business analysts with an overview of various data wrangling techniques and tools, and puts the practice of data wrang
Principles of Data Wrangling: Practical Techniques for Data Preparation
โ Scribed by Tye Rattenbury, Joseph M. Hellerstein, Jeffrey Heer, Sean Kandel, Connor Carreras
- Publisher
- OโReilly Media
- Year
- 2017
- Tongue
- English
- Leaves
- 94
- Edition
- 1
- Category
- Library
No coin nor oath required. For personal study only.
โฆ Synopsis
A key task that any aspiring data-driven organization needs to learn is data wrangling, the process of converting raw data into something truly useful. This practical guide provides business analysts with an overview of various data wrangling techniques and tools, and puts the practice of data wrangling into context by asking, "What are you trying to do and why?"
Wrangling data consumes roughly 50-80% of an analystโs time before any kind of analysis is possible. Written by key executives at Trifacta, this book walks you through the wrangling process by exploring several factorsโtime, granularity, scope, and structureโthat you need to consider as you begin to work with data. Youโll learn a shared language and a comprehensive understanding of data wrangling, with an emphasis on recent agile analytic processes used by many of todayโs data-driven organizations.
Appreciate the importanceโand the satisfactionโof wrangling data the right way.
- Understand what kind of data is available
- Choose which data to use and at what level of detail
- Meaningfully combine multiple sources of data
- Decide how to distill the results to a size and shape that can drive downstream analysis
โฆ Subjects
Education & Reference;CPA Test;GMAT Test;Statistics;Business & Money;Research & Development;Processes & Infrastructure;Business & Money;Data Warehousing;Databases & Big Data;Computers & Technology;Data Processing;Databases & Big Data;Computers & Technology;Mathematical Analysis;Mathematics;Science & Math;Research;Mathematics;Science & Math;Business & Finance;Accounting;Banking;Business Communication;Business Development;Business Ethics;Business Law;Economics;Entrepreneurship;Finance;Human Resour
๐ SIMILAR VOLUMES
Written in lucid language, this valuable textbook brings together fundamental concepts of data mining and data warehousing in a single volume. Important topics including information theory, decision tree, Naรฏve Bayes classifier, distance metrics, partitioning clustering, associate mining, data marts
Written in lucid language, this valuable textbook brings together fundamental concepts of data mining and data warehousing in a single volume. Important topics including information theory, decision tree, Naรฏve Bayes classifier, distance metrics, partitioning clustering, associate mining, data marts
<h4>Key Features</h4><ul><li>This easy-to-follow guide takes you through every step of the data wrangling process in the best possible way</li><li>Work with different types of datasets, and reshape the layout of your data to make it easier for analysis</li><li>Get simple examples and real-life data
<div><p>There are awesome discoveries to be made and valuable stories to be told in datasets--and this book will help you uncover them. Whether you already work with data or just want to understand its possibilities, the techniques and advice in this practical book will help you learn how to better
<h4>Key Features</h4><ul><li>This easy-to-follow guide takes you through every step of the data wrangling process in the best possible way</li><li>Work with different types of datasets, and reshape the layout of your data to make it easier for analysis</li><li>Get simple examples and real-life data