## Abstract The present paper develops a class of generalized partial least squares (GPLS) regression methods. GPLS can be regarded as a kind of weighted partial least squares regression method. Two special cases of them, ridge partial least squares (RPLS) and generalized ridge partial least square
Robust and classical PLS regression compared
โ Scribed by Bettina Liebmann; Peter Filzmoser; Kurt Varmuza
- Publisher
- John Wiley and Sons
- Year
- 2010
- Tongue
- English
- Weight
- 435 KB
- Volume
- 24
- Category
- Article
- ISSN
- 0886-9383
- DOI
- 10.1002/cem.1279
No coin nor oath required. For personal study only.
โฆ Synopsis
Abstract
Classical PLS regression is a wellโestablished technique in multivariate data analysis. Since classical PLS is known to be severely affected by the presence of outliers in the data or deviations from normality, several PLS regression methods with robust behavior towards data contamination have been proposed. We compare the performance of the classical SIMPLS approach with the partial robust M regression (PRM). Both methods are applied to three different data sets including outliers intentionally created. A simulated data set with known true model parameters allows insight in the modeling performance with increasing data contamination. QSPR data are modified with a cluster of outlying observations. A third data set from near infrared (NIR) spectroscopy is likely to include noise and experimental errors already in the original variables, and is further contaminated with outliers. To provide a sound comparison of the considered methods we apply repeated double cross validation. This validation procedure judiciously optimizes the model complexity (number of PLS components) and estimates the models' prediction performance based on testโset predicted errors. All studied robust regression models outperform the classical PLS models when outlying observations are present in the data. For uncontaminated data, the prediction performances of both the classical and the robust models are in the same range. Copyright ยฉ 2010 John Wiley & Sons, Ltd.
๐ SIMILAR VOLUMES
Three alternative approaches are discussed for finding the final calibration model (regression coefficients) in PLS regression of k-way Y on N-way X. The simplest approach is to skip the deflation of the X-data. From the observation that the specific deflation used in multiway PLS is inconsequential