<p><i>Data Preprocessing for Data Mining</i> addresses one of the most important issues within the well-known Knowledge Discovery from Data process. Data directly taken from the source will likely have inconsistencies, errors or most importantly, it is not ready to be considered for a data mining pr
Data Preprocessing in Data Mining
✍ Scribed by Salvador García, Julián Luengo, Francisco Herrera (auth.)
- Publisher
- Springer International Publishing
- Year
- 2015
- Tongue
- English
- Leaves
- 327
- Series
- Intelligent Systems Reference Library 72
- Edition
- 1
- Category
- Library
No coin nor oath required. For personal study only.
✦ Synopsis
Data Preprocessing for Data Mining addresses one of the most important issues within the well-known Knowledge Discovery from Data process. Data directly taken from the source will likely have inconsistencies, errors or most importantly, it is not ready to be considered for a data mining process. Furthermore, the increasing amount of data in recent science, industry and business applications, calls to the requirement of more complex tools to analyze it. Thanks to data preprocessing, it is possible to convert the impossible into possible, adapting the data to fulfill the input demands of each data mining algorithm. Data preprocessing includes the data reduction techniques, which aim at reducing the complexity of the data, detecting or removing irrelevant and noisy elements from the data.
This book is intended to review the tasks that fill the gap between the data acquisition from the source and the data mining process. A comprehensive look from a practical point of view, including basic concepts and surveying the techniques proposed in the specialized literature, is given.Each chapter is a stand-alone guide to a particular data preprocessing topic, from basic concepts and detailed descriptions of classical algorithms, to an incursion of an exhaustive catalog of recent developments. The in-depth technical descriptions make this book suitable for technical professionals, researchers, senior undergraduate and graduate students in data science, computer science and engineering.
✦ Table of Contents
Front Matter....Pages i-xv
Introduction....Pages 1-17
Data Sets and Proper Statistical Analysis of Data Mining Techniques....Pages 19-38
Data Preparation Basic Models....Pages 39-57
Dealing with Missing Values....Pages 59-105
Dealing with Noisy Data....Pages 107-145
Data Reduction....Pages 147-162
Feature Selection....Pages 163-193
Instance Selection....Pages 195-243
Discretization....Pages 245-283
A Data Mining Software Package Including Data Preparation and Reduction: KEEL....Pages 285-313
Back Matter....Pages 315-320
✦ Subjects
Computational Intelligence; Image Processing and Computer Vision; Data Mining and Knowledge Discovery
📜 SIMILAR VOLUMES
<p><span>Handbook of Mobility Data Mining, Volume One: Data Preprocessing and Visualization</span><span> introduces the fundamental technologies of mobile big data mining (MDM), advanced AI methods, and upper-level applications, helping readers comprehensively understand MDM with a bottom-up approac
<p><span>Handbook of Mobility Data Mining, Volume One: Data Preprocessing and Visualization</span><span> introduces the fundamental technologies of mobile big data mining (MDM), advanced AI methods, and upper-level applications, helping readers comprehensively understand MDM with a bottom-up approac
This book offers a comprehensible overview of Big Data Preprocessing, which includes a formal description of each problem. It also focuses on the most relevant proposed solutions. This book illustrates actual implementations of algorithms that helps the reader deal with these problems.&n
Trakia Journal of Sciences, Vol. 8, Suppl. 3, 2010, pp. 41-48.<br/>Abstract. This work presents a system for primary processing and analysis of experimental data acquired from measuring devices. The main parameters of time series are defined. Data operations to extract parameters from the sequences,
<p><b>The first comprehensive overview of preprocessing, mining, and postprocessing of biological data</b></p> <p>Molecular biology is undergoing exponential growth in both the volume and complexity of biological data—and knowledge discovery offers the capacity to automate complex search and data an