๐”– Scriptorium
โœฆ   LIBER   โœฆ

๐Ÿ“

Data Mining Algorithms in C++: Data Patterns and Algorithms for Modern Applications

โœ Scribed by Timothy Masters (auth.)


Publisher
Apress
Year
2018
Tongue
English
Leaves
296
Edition
1
Category
Library

โฌ‡  Acquire This Volume

No coin nor oath required. For personal study only.

โœฆ Synopsis


Discover hidden relationships among the variables in your data, and learn how to exploit these relationships. This book presents a collection of data-mining algorithms that are effective in a wide variety of prediction and classification applications. All algorithms include an intuitive explanation of operation, essential equations, references to more rigorous theory, and commented C++ source code.
Many of these techniques are recent developments, still not in widespread use. Others are standard algorithms given a fresh look. In every case, the focus is on practical applicability, with all code written in such a way that it can easily be included into any program. The Windows-based DATAMINE program lets you experiment with the techniques before incorporating them into your own work.
What you'll learn

  • Monte-Carlo permutation tests provide statistically sound assessment of relationships present in your data.
  • Combinatorially symmetric cross validation reveals whether your model has true power or has just learned noise by overfitting the data.
  • Feature weighting as regularized energy-based learning ranks variables according to their predictive power when there is too little data for traditional methods.
  • The eigenstructure of a dataset enables clustering of variables into groups that exist only within meaningful subspaces of the data.
  • Plotting regions of the variable space where there is disagreement between marginal and actual densities, or where contribution to mutual information is high, provides visual insight into anomalous relationships.

Who this book is for

The techniques presented in this book and in the DATAMINE program will be useful to anyone interested in discovering and exploiting relationships among variables. Although all code examples are written in C++, the algorithms are described in sufficient detail that they can easily be programmed in any language.

โœฆ Table of Contents


Front Matter ....Pages i-xiv
Information and Entropy (Timothy Masters)....Pages 1-73
Screening for Relationships (Timothy Masters)....Pages 75-166
Displaying Relationship Anomalies (Timothy Masters)....Pages 167-184
Fun with Eigenvectors (Timothy Masters)....Pages 185-265
Using the DATAMINE Program (Timothy Masters)....Pages 267-279
Back Matter ....Pages 281-286

โœฆ Subjects


Programming Languages, Compilers, Interpreters


๐Ÿ“œ SIMILAR VOLUMES


Data Mining Algorithms in C++: Data Patt
โœ Timothy Masters ๐Ÿ“‚ Library ๐Ÿ“… 2018 ๐Ÿ› Apress ๐ŸŒ English

Discover hidden relationships among the variables in your data, and learn how to exploit these relationships. This book presents a collection of data-mining algorithms that are effective in a wide variety of prediction and classification applications. All algorithms include an intuitive explanation

Data mining algorithms in C++: data patt
โœ Masters, Timothy ๐Ÿ“‚ Library ๐Ÿ“… 2018 ๐Ÿ› Apress ๐ŸŒ English

Find the various relationships among variables that can be present in big data as well as other data sets. This book also covers information entropy, permutation tests, combinatorics, predictor selections, and eigenvalues to give you a well-rounded view of data mining and algorithms in C++. Furtherm

Data Mining Algorithms in C++: Data Patt
โœ Timothy Masters ๐Ÿ“‚ Library ๐Ÿ“… 2017 ๐Ÿ› Apress ๐ŸŒ English

Discover hidden relationships among the variables in your data, and learn how to exploit these relationships. This book presents a collection of data-mining algorithms that are effective in a wide variety of prediction and classification applications. All algorithms include an intuitive explanation

Modern Data Mining Algorithms in C++ and
โœ Timothy Masters ๐Ÿ“‚ Library ๐Ÿ“… 2020 ๐Ÿ› Apress ๐ŸŒ English

<p>Discover a variety of data-mining algorithms that are useful for selecting small sets of important features from among unwieldy masses of candidates, or extracting useful features from measured variables. </p> <p>As a serious data miner you will often be faced with thousands of candidate features

Recent Advances In Data Mining Of Enterp
โœ T. Warren Liao, Evangelos Triantaphyllou ๐Ÿ“‚ Library ๐Ÿ“… 2008 ๐Ÿ› World Scientific Publishing Company ๐ŸŒ English

The main goal of the new field of data mining is the analysis of large and complex datasets. Some very important datasets may be derived from business and industrial activities. This kind of data is known as enterprise data . The common characteristic of such datasets is that the analyst wishes to a