A numeric comparison of variable selection algorithms for supervised learning
- Authors: G. Palombo; I. Narsky
- Publisher: Elsevier Science
- Year: 2009
- Language: English
- File size: 411 KB
- Volume: 612
- Category: Article
- ISSN: 0168-9002
Synopsis
Datasets in modern High Energy Physics (HEP) experiments are often described by dozens or even hundreds of input variables. Reducing a full variable set to a subset that most completely represents the information in the data is therefore an important task in the analysis of HEP data. We compare various variable selection algorithms for supervised learning using several datasets, for instance imaging gamma-ray Cherenkov telescope (MAGIC) data from the UCI repository. We use classifiers and variable selection methods implemented in the statistical package StatPatternRecognition (SPR), a free open-source C++ package developed in the HEP community (http://sourceforge.net/projects/statpatrec/). For each dataset, we select a powerful classifier and estimate its learning accuracy on variable subsets obtained by various selection algorithms. When possible, we also estimate the CPU time needed for the variable subset selection. The results of this analysis are compared with those published previously for these datasets using other statistical packages such as R and Weka. We show that the most accurate, yet slowest, method is a wrapper algorithm known as generalized sequential forward selection ("Add N Remove R") implemented in SPR.
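The "Add N Remove R" wrapper named in the synopsis alternates a greedy forward phase (add the N variables that most improve a figure of merit, one at a time) with a greedy backward phase (drop the R variables whose removal costs least), stopping when a full pass no longer improves the best score. SPR implements this in C++; the sketch below is a hypothetical Python rendition under stated assumptions — the function names `add_n_remove_r` and `toy_score` are my own, not SPR's API, and the scoring callable stands in for whatever cross-validated classifier accuracy a real wrapper would use.

```python
def add_n_remove_r(variables, score, n=2, r=1):
    """Greedy 'Add N Remove R' variable selection (illustrative sketch).

    variables: list of candidate variable names
    score: callable mapping a frozenset of selected variables to a
           figure of merit (higher is better), e.g. CV accuracy
    Returns the best-scoring subset found and its score.
    """
    selected = set()
    best_subset, best_score = set(), score(frozenset())
    while True:
        # Forward phase: greedily add the variable with the largest gain, N times.
        for _ in range(n):
            candidates = [v for v in variables if v not in selected]
            if not candidates:
                break
            selected.add(max(candidates,
                             key=lambda v: score(frozenset(selected | {v}))))
        # Backward phase: greedily drop the variable whose removal costs least, R times.
        for _ in range(r):
            if not selected:
                break
            selected.remove(max(selected,
                                key=lambda v: score(frozenset(selected - {v}))))
        current = score(frozenset(selected))
        if current > best_score:
            best_score, best_subset = current, set(selected)
        else:
            break  # a full add/remove pass gave no improvement: stop
    return best_subset, best_score


# Toy figure of merit: reward informative variables, penalize noise ones.
informative = {"x1", "x2"}

def toy_score(subset):
    return len(subset & informative) - 0.1 * len(subset - informative)
```

Because best_score must strictly increase for the loop to continue and there are finitely many subsets, the procedure terminates; with n > r it net-grows the subset each pass, which is what distinguishes it from plain forward selection (n = 1, r = 0).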
SIMILAR VOLUMES
First, the cerebellar model articulation controller (CMAC), invented in the early 1970s by Albus, and the associative memory system (AMS), developed for learning control systems by H. Tolle et al. in the early 1980s, are briefly described. The underlying mathematics of the AMS learning or training a
The range of Fourier methods can be significantly increased by extending a nonperiodic function f (x) to a periodic function f on a larger interval. When f (x) is analytically known on the extended interval, the extension is straightforward. When f (x) is unknown outside the physical interval, there
In this article we compare two contrasting methods, the active set method (ASM) and genetic algorithms, for learning the weights in aggregation operators, such as the weighted mean (WM), ordered weighted average (OWA), and weighted ordered weighted average (WOWA). We give the formal definitions for e