## Abstract Principal component analysis (PCA) and principal component regression (PCR) are routinely used for calibration of measurement devices and for data evaluation. However, their use is hindered in some applications, e.g. hyperspectral imaging, by excessive data sets that imply unacceptable
Variable selection in large environmental data sets using principal components analysis
✍ Scribed by Jacquelynne R. King; Donald A. Jackson
- Publisher
- John Wiley and Sons
- Year
- 1999
- Tongue
- English
- Weight
- 117 KB
- Volume
- 10
- Category
- Article
- ISSN
- 1180-4009
No coin nor oath required. For personal study only.
✦ Synopsis
In many large environmental datasets redundant variables can be discarded without the loss of extra variation. Principal components analysis can be used to select those variables that contain the most information. Using an environmental dataset consisting of 36 meteorological variables spanning 37 years, four methods of variable selection are examined along with dierent criteria levels for deciding on the number of variables to retain. Procrustes analysis, a measure of similarity and bivariate plots are used to assess the success of the alternative variable selection methods and criteria levels in extracting representative variables. The Broken-stick model is a consistent approach to choosing signi®cant principal components and is chosen here as the more suitable criterion in combination with a selection method that requires one principal component analysis and retains variables by starting with selection from the ®rst component.
📜 SIMILAR VOLUMES