With the unprecedented growth-rate at which data is being collected and stored electronically today in almost all fields of human endeavor, the efficient extraction of useful information from the data available is becoming an increasing scientific challenge and a massive economic need. This book pre
Evolutionary Decision Trees in Large-Scale Data Mining
โ Scribed by Marek Kretowski
- Publisher
- Springer International Publishing
- Year
- 2019
- Tongue
- English
- Leaves
- 184
- Series
- Studies in Big Data 59
- Edition
- 1st ed.
- Category
- Library
No coin nor oath required. For personal study only.
โฆ Synopsis
This book presents a unified framework, based on specialized evolutionary algorithms, for the global induction of various types of classification and regression trees from data. The resulting univariate or oblique trees are significantly smaller than those produced by standard top-down methods, an aspect that is critical for the interpretation of mined patterns by domain analysts. The approach presented here is extremely flexible and can easily be adapted to specific data mining applications, e.g. cost-sensitive model trees for financial data or multi-test trees for gene expression data. The global induction can be efficiently applied to large-scale data without the need for extraordinary resources. With a simple GPU-based acceleration, datasets composed of millions of instances can be mined in minutes. In the event that the size of the datasets makes the fastest memory computing impossible, the Spark-based implementation on computer clusters, which offers impressive fault tolerance and scalability potential, can be applied.
โฆ Table of Contents
Front Matter ....Pages i-xi
Front Matter ....Pages 1-1
Evolutionary Computation (Marek Kretowski)....Pages 3-20
Decision Trees in Data Mining (Marek Kretowski)....Pages 21-48
Parallel and Distributed Computation (Marek Kretowski)....Pages 49-68
Front Matter ....Pages 69-69
Global Induction of Univariate Trees (Marek Kretowski)....Pages 71-99
Oblique and Mixed Decision Trees (Marek Kretowski)....Pages 101-113
Front Matter ....Pages 115-115
Cost-Sensitive Tree Induction (Marek Kretowski)....Pages 117-129
Multi-test Decision Trees for Gene Expression Data (Marek Kretowski)....Pages 131-142
Front Matter ....Pages 143-143
Parallel Computations for Evolutionary Induction (Marek Kretowski)....Pages 145-174
Back Matter ....Pages 175-180
โฆ Subjects
Engineering; Computational Intelligence; Big Data
๐ SIMILAR VOLUMES
<p>This book explores a proactive and domain-driven method to classification tasks. This novel proactive approach to data mining not only induces a model for predicting or explaining a phenomenon, but also utilizes specific problem/domain knowledge to suggest specific actions to achieve optimal chan
Using SAS Enterprise Miner, Barry de Ville's Decision Trees for Business Intelligence and Data Mining illustrates the application and operation of decision trees in business intelligence, data mining, business analytics, prediction, and knowledge discovery. It explains in detail the use of decision
This book offers a true enterprise view of business intelligence. IBM expert Mike Biere shows managers how to create a coherent BI plan that reflects the needs of users throughout the organization-and then implement that plan successfully. Biere explains how to objectively assess the business case f
<p><span>This is the first book to provide a systematic description of statistical properties of large-scale financial data. Specifically, the power-law and log-normal distributions observed at a given time and their changes using time-reversal symmetry, quasi-time-reversal symmetry, Gibrat's law, a