๐”– Scriptorium
โœฆ   LIBER   โœฆ

๐Ÿ“

Evolutionary Decision Trees in Large-Scale Data Mining

โœ Scribed by Marek Kretowski


Publisher
Springer International Publishing
Year
2019
Tongue
English
Leaves
184
Series
Studies in Big Data 59
Edition
1st ed.
Category
Library

โฌ‡  Acquire This Volume

No coin nor oath required. For personal study only.

โœฆ Synopsis


This book presents a unified framework, based on specialized evolutionary algorithms, for the global induction of various types of classification and regression trees from data. The resulting univariate or oblique trees are significantly smaller than those produced by standard top-down methods, an aspect that is critical for the interpretation of mined patterns by domain analysts. The approach presented here is extremely flexible and can easily be adapted to specific data mining applications, e.g. cost-sensitive model trees for financial data or multi-test trees for gene expression data. The global induction can be efficiently applied to large-scale data without the need for extraordinary resources. With a simple GPU-based acceleration, datasets composed of millions of instances can be mined in minutes. In the event that the size of the datasets makes the fastest memory computing impossible, the Spark-based implementation on computer clusters, which offers impressive fault tolerance and scalability potential, can be applied.


โœฆ Table of Contents


Front Matter ....Pages i-xi
Front Matter ....Pages 1-1
Evolutionary Computation (Marek Kretowski)....Pages 3-20
Decision Trees in Data Mining (Marek Kretowski)....Pages 21-48
Parallel and Distributed Computation (Marek Kretowski)....Pages 49-68
Front Matter ....Pages 69-69
Global Induction of Univariate Trees (Marek Kretowski)....Pages 71-99
Oblique and Mixed Decision Trees (Marek Kretowski)....Pages 101-113
Front Matter ....Pages 115-115
Cost-Sensitive Tree Induction (Marek Kretowski)....Pages 117-129
Multi-test Decision Trees for Gene Expression Data (Marek Kretowski)....Pages 131-142
Front Matter ....Pages 143-143
Parallel Computations for Evolutionary Induction (Marek Kretowski)....Pages 145-174
Back Matter ....Pages 175-180

โœฆ Subjects


Engineering; Computational Intelligence; Big Data


๐Ÿ“œ SIMILAR VOLUMES


Large-Scale Parallel Data Mining
โœ Mohammed J. Zaki (auth.), Mohammed J. Zaki, Ching-Tien Ho (eds.) ๐Ÿ“‚ Library ๐Ÿ“… 2000 ๐Ÿ› Springer-Verlag Berlin Heidelberg ๐ŸŒ English

With the unprecedented growth-rate at which data is being collected and stored electronically today in almost all fields of human endeavor, the efficient extraction of useful information from the data available is becoming an increasing scientific challenge and a massive economic need. This book pre

Proactive Data Mining with Decision Tree
โœ Haim Dahan, Shahar Cohen, Lior Rokach, Oded Maimon (auth.) ๐Ÿ“‚ Library ๐Ÿ“… 2014 ๐Ÿ› Springer-Verlag New York ๐ŸŒ English

<p>This book explores a proactive and domain-driven method to classification tasks. This novel proactive approach to data mining not only induces a model for predicting or explaining a phenomenon, but also utilizes specific problem/domain knowledge to suggest specific actions to achieve optimal chan

Decision Trees for Business Intelligence
โœ Barry De Ville ๐Ÿ“‚ Library ๐Ÿ“… 2006 ๐ŸŒ English

Using SAS Enterprise Miner, Barry de Ville's Decision Trees for Business Intelligence and Data Mining illustrates the application and operation of decision trees in business intelligence, data mining, business analytics, prediction, and knowledge discovery. It explains in detail the use of decision

Decision Trees for Business Intelligence
โœ Barry De Ville ๐Ÿ“‚ Library ๐Ÿ“… 2006 ๐Ÿ› SAS Publishing ๐ŸŒ English

This book offers a true enterprise view of business intelligence. IBM expert Mike Biere shows managers how to create a coherent BI plan that reflects the needs of users throughout the organization-and then implement that plan successfully. Biere explains how to objectively assess the business case f

Statistical Properties in Firmsโ€™ Large-s
โœ Atushi Ishikawa ๐Ÿ“‚ Library ๐Ÿ“… 2021 ๐Ÿ› Springer ๐ŸŒ English

<p><span>This is the first book to provide a systematic description of statistical properties of large-scale financial data. Specifically, the power-law and log-normal distributions observed at a given time and their changes using time-reversal symmetry, quasi-time-reversal symmetry, Gibrat's law, a