𝔖 Bobbio Scriptorium
✦   LIBER   ✦

Principal component analysis for grouped data—a case study

✍ Scribed by Lukman Thalib; Roger L. Kitching; M. Ishaq Bhatti


Publisher
John Wiley and Sons
Year
1999
Tongue
English
Weight
102 KB
Volume
10
Category
Article
ISSN
1180-4009

No coin nor oath required. For personal study only.

✦ Synopsis


Two of the most popular descriptive multivariate methods currently employed are the principal component analysis and canonical variate analysis methods. Canonical variate analysis is the most appropriate technique to use whenever the multivariate data are grouped and to discriminate group dierences using multiple variables. Principal component analysis, on the other hand, is used to develop linear combinations that successively maximize the total variance of a sample where there is no known group structure. However, when there are more variables than within-group degrees of freedom, due to the singularity of the within-group covariance matrix, canonical variates cannot be derived. The main aim of this paper is to explore such situations with an example from a biodiversity study and a further computer simulation; principal component analysis can be a viable substitute, if the between-group dierences are prominent.


📜 SIMILAR VOLUMES


Principal component variable discriminan
✍ Nils B. Vogt 📂 Article 📅 1988 🏛 John Wiley and Sons 🌐 English ⚖ 271 KB 👁 1 views

Principal component analysis is a useful method for analysing data-matrices. By analysing separate class models, i.e. disjoint principal component modelling as in the SIMCA or FCVPC programs (developed for supervised and unsupervised principal component analysis respectively), the principal componen

Hierarchical principal component analysi
✍ K. Janné; J. Pettersen; N.-O. Lindberg; T. Lundstedt 📂 Article 📅 2001 🏛 John Wiley and Sons 🌐 English ⚖ 172 KB 👁 1 views

## Abstract Spectroscopic data consists of several hundred to some thousand variables, wherein most of the variables are autocorrelated. When PCA and PLS techniques are used for the interpretation of these kinds of data, the loading plots are usually complex due to the covariation in the spectrum,