## Abstract A situation where *J* blocks of variables are observed on the same set of individuals is considered in this paper. A factor analysis logic is applied to tables instead of variables. The latent variables of each block should well explain their own block and, at the same time, the l
Analysis of multiblock and hierarchical PCA and PLS models
Scribed by Johan A. Westerhuis; Theodora Kourti; John F. MacGregor
- Publisher: John Wiley and Sons
- Year: 1998
- Language: English
- File size: 168 KB
- Volume: 12
- Category: Article
- ISSN: 0886-9383
Synopsis
Multiblock and hierarchical PCA and PLS methods have been proposed in the recent literature in order to improve the interpretability of multivariate models. They have been used in cases where the number of variables is large and additional information is available for blocking the variables into conceptually meaningful blocks. In this paper we compare these methods from a theoretical or algorithmic viewpoint using a common notation and illustrate their differences with several case studies. Undesirable properties of some of these methods, such as convergence problems or loss of data information due to deflation procedures, are pointed out and corrected where possible. It is shown that the objective function of the hierarchical PCA and hierarchical PLS methods is not clear and the corresponding algorithms may converge to different solutions depending on the initial guess of the super score. It is also shown that the results of consensus PCA (CPCA) and multiblock PLS (MBPLS) can be calculated from the standard PCA and PLS methods when the same variable scalings are applied for these methods. The standard PCA and PLS methods require less computation and give better estimation of the scores in the case of missing data. It is therefore recommended that in cases where the variables can be separated into meaningful blocks, the standard PCA and PLS methods be used to build the models and then the weights and loadings of the individual blocks and super block and the percentage variation explained in each block be calculated from the results.
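The recommendation above, to fit a standard PCA and then derive the block-level quantities from it, can be illustrated with a minimal sketch for the first component only. It is not the authors' code. The function names, the block scaling (each block mean-centered and scaled to unit total sum of squares), and the form of the CPCA-style iteration are assumptions made for illustration; deflation for further components is not covered.

```python
import numpy as np

def scale_blocks(blocks):
    """Mean-center each column, then scale each block to unit sum of squares.
    This particular block scaling is an assumption chosen for illustration."""
    scaled = []
    for Xb in blocks:
        Xc = Xb - Xb.mean(axis=0)
        scaled.append(Xc / np.linalg.norm(Xc))  # Frobenius norm
    return scaled

def block_results_from_pca(blocks):
    """First-component block quantities derived from one ordinary PCA (SVD)."""
    X = np.hstack(blocks)
    U, s, Vt = np.linalg.svd(X, full_matrices=False)
    t_super = U[:, 0] * s[0]   # PCA score, used here as the super score
    p = Vt[0]                  # PCA loading over all variables
    edges = np.cumsum([0] + [Xb.shape[1] for Xb in blocks])
    results = []
    for b, Xb in enumerate(blocks):
        p_b = p[edges[b]:edges[b + 1]]   # this block's segment of the loading
        norm_b = np.linalg.norm(p_b)
        results.append({
            "loading": p_b / norm_b,           # unit-length block loading
            "score": Xb @ (p_b / norm_b),      # block score
            "super_weight": norm_b,            # weight of the block score
            # fraction of the block's variation explained by the super score
            "explained": (s[0] * norm_b) ** 2 / np.sum(Xb ** 2),
        })
    return t_super, results

def cpca_iteration(blocks, n_iter=500):
    """CPCA-style iteration (assumed form), for comparison with the PCA route."""
    t_super = blocks[0][:, 0].copy()     # arbitrary starting guess
    for _ in range(n_iter):
        loadings = [Xb.T @ t_super for Xb in blocks]
        loadings = [p / np.linalg.norm(p) for p in loadings]
        T = np.column_stack([Xb @ p for Xb, p in zip(blocks, loadings)])
        w = T.T @ t_super
        w /= np.linalg.norm(w)           # unit-length super weights
        t_super = T @ w
    return t_super

# Hypothetical demo data: two blocks driven by one shared latent variable.
rng = np.random.default_rng(0)
latent = rng.normal(size=(30, 1))
raw = [latent @ rng.normal(size=(1, 4)) + 0.1 * rng.normal(size=(30, 4)),
       latent @ rng.normal(size=(1, 6)) + 0.1 * rng.normal(size=(30, 6))]
blocks = scale_blocks(raw)
t_pca, per_block = block_results_from_pca(blocks)
t_iter = cpca_iteration(blocks)
```

Under these assumptions the iterated super score `t_iter` should agree with the PCA score `t_pca` up to sign, and the per-block `explained` values come from a single decomposition rather than a separate iterative fit.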
SIMILAR VOLUMES
A novel multiblock PLS algorithm called S-PLS (serial PLS) is presented. S-PLS models the separate predictor blocks serially, making it a supplement to hierarchical PLS. In the S-PLS algorithm the predictor blocks are connected only via the response Y. The block models are calculated using the Y res
During the Genetic Analysis Workshop 9 presentations, a brief discussion took place about the value of empirical-Bayes methods in genetic analysis. Due to the informal nature of this discussion, the improvements available for analyzing data with this approach and with the broader class of hierarch
## Abstract Spectroscopic data consists of several hundred to some thousand variables, wherein most of the variables are autocorrelated. When PCA and PLS techniques are used for the interpretation of these kinds of data, the loading plots are usually complex due to the covariation in the spectrum,