𝔖 Scriptorium
✦   LIBER   ✦

πŸ“

Data Mining and Exploration: From Traditional Statistics to Modern Data Science

✍ Scribed by Chong Ho Alex Yu


Publisher
CRC Press
Year
2022
Tongue
English
Leaves
290
Edition
1
Category
Library

⬇  Acquire This Volume

No coin nor oath required. For personal study only.

✦ Synopsis


This book introduces both conceptual and procedural aspects of cutting-edge data science methods, such as dynamic data visualization, artificial neural networks, ensemble methods, and text mining. There are at least two unique elements that can set the book apart from its rivals.

First, most students in social sciences, engineering, and business took at least one class in introductory statistics before learning data science. However, usually these courses do not discuss the similarities and differences between traditional statistics and modern data science; as a result learners are disoriented by this seemingly drastic paradigm shift. In reaction, some traditionalists reject data science altogether while some beginning data analysts employ data mining tools as a β€œblack box”, without a comprehensive view of the foundational differences between traditional and modern methods (e.g., dichotomous thinking vs. pattern recognition, confirmation vs. exploration, single method vs. triangulation, single sample vs. cross-validation etc.). This book delineates the transition between classical methods and data science (e.g. from p value to Log Worth, from resampling to ensemble methods, from content analysis to text mining etc.). Second, this book aims to widen the learner's horizon by covering a plethora of software tools. When a technician has a hammer, every problem seems to be a nail. By the same token, many textbooks focus on a single software package only, and consequently the learner tends to fit the problem with the tool, but not the other way around. To rectify the situation, a competent analyst should be equipped with a tool set, rather than a single tool. For example, when the analyst works with crucial data in a highly regulated industry, such as pharmaceutical and banking, commercial software modules (e.g., SAS) are indispensable. For a mid-size and small company, open-source packages such as Python would come in handy. If the research goal is to create an executive summary quickly, the logical choice is rapid model comparison. If the analyst would like to explore the data by asking what-if questions, then dynamic graphing in JMP Pro is a better option. This book uses concrete examples to explain the pros and cons of various software applications.

✦ Table of Contents


Cover
Half Title
Title Page
Dedication
Preface
Table of Contents
Chapter 1: Re-examination of Traditional Statistics
Chapter 2: Why Data Science?
Chapter 3: Cutting Edge Data Analytical Tools
Chapter 4: Exploratory Data Analysis and Data Visualization: Pattern Seeking
Chapter 5: Generalized Regression: Penalty against Complexity
Chapter 6: Classification and Model Screening
Chapter 7: Ensemble Methods: The Wisdom of the Crowd
Chapter 8: Dimension Reduction: Breaking the Curse of Dimensionality
Chapter 9: Clustering: Divide and Conquer
Chapter 10: Neural Networks: Machines Mimic Human Intelligence
Chapter 11: Text Mining: Structure the Unstructured
Index


πŸ“œ SIMILAR VOLUMES


Modern Statistics with R: From Wrangling
✍ MΓ₯ns Thulin πŸ“‚ Library πŸ“… 2024 πŸ› CRC Press 🌐 English

The past decades have transformed the world of statistical data analysis, with new methods, new types of data, and new computational tools. Modern Statistics with R introduces you to key parts of this modern statistical toolkit. It teaches you: β€’ Data wrangling - importing, formatting, reshaping, me

Modern Statistics with R: From Wrangling
✍ MΓ₯ns Thulin πŸ“‚ Library πŸ“… 2024 πŸ› Chapman and Hall/CRC 🌐 English

<p><span>The past decades have transformed the world of statistical data analysis, with new methods, new types of data, and new computational tools. </span><span>Modern Statistics with R</span><span> introduces you to key parts of this modern statistical toolkit. It teaches you:</span></p><ul><li><s

Data Mining: Exploring the Data
✍ Inmon W.H. πŸ“‚ Library πŸ“… 1997 🌐 English

One of the most important uses of the data warehouse is that of data mining. Data mining is the process of using raw data to infer important business relationships. Once the business relationships have been discovered, they can then be used for business advantage. Certainly a data warehouse has othe

Statistical Mining and Data Visualizatio
✍ Timothy J. Brown, Paul W. Mielke Jr. (auth.), Timothy J. Brown, Paul W. Mielke J πŸ“‚ Library πŸ“… 2000 πŸ› Springer US 🌐 English

<p><em>Statistical Mining and Data Visualization in Atmospheric Sciences</em> brings together in one place important contributions and up-to-date research results in this fast moving area. <br/><em>Statistical Mining and Data Visualization in Atmospheric Sciences</em> serves as an excellent referenc

Intelligent Data Warehousing: From Data
✍ Zhengxin Chen (Author) πŸ“‚ Library πŸ“… 2001 πŸ› CRC Press

<p>Effective decision support systems (DSS) are quickly becoming key to businesses gaining a competitive advantage, and the effectiveness of these systems depends on the ability to construct, maintain, and extract information from data warehouses. While many still perceive data warehousing as a subd