𝔖 Bobbio Scriptorium
✦   LIBER   ✦

Variable selection in large environmental data sets using principal components analysis

✍ Scribed by Jacquelynne R. King; Donald A. Jackson


Publisher
John Wiley and Sons
Year
1999
Tongue
English
Weight
117 KB
Volume
10
Category
Article
ISSN
1180-4009

No coin nor oath required. For personal study only.

✦ Synopsis


In many large environmental datasets redundant variables can be discarded without the loss of extra variation. Principal components analysis can be used to select those variables that contain the most information. Using an environmental dataset consisting of 36 meteorological variables spanning 37 years, four methods of variable selection are examined along with dierent criteria levels for deciding on the number of variables to retain. Procrustes analysis, a measure of similarity and bivariate plots are used to assess the success of the alternative variable selection methods and criteria levels in extracting representative variables. The Broken-stick model is a consistent approach to choosing signi®cant principal components and is chosen here as the more suitable criterion in combination with a selection method that requires one principal component analysis and retains variables by starting with selection from the ®rst component.


📜 SIMILAR VOLUMES


Fast principal component analysis of lar
✍ F. Vogt; M. Tacke 📂 Article 📅 2002 🏛 John Wiley and Sons 🌐 English ⚖ 349 KB

## Abstract Principal component analysis (PCA) and principal component regression (PCR) are routinely used for calibration of measurement devices and for data evaluation. However, their use is hindered in some applications, e.g. hyperspectral imaging, by excessive data sets that imply unacceptable