𝔖 Bobbio Scriptorium
✦   LIBER   ✦

Improving the Robustness and Stability of Partial Least Squares Regression for Near-infrared Spectral Analysis

✍ Scribed by Xueguang SHAO; Da CHEN; Heng XU; Zhichao LIU; Wensheng CAI


Publisher
John Wiley and Sons
Year
2009
Tongue
English
Weight
94 KB
Volume
27
Category
Article
ISSN
0256-7660

No coin nor oath required. For personal study only.

✦ Synopsis


Abstract

Partial least‐squares (PLS) regression has been presented as a powerful tool for spectral quantitative measurement. However, the improvement of the robustness and stability of PLS models is still needed, because it is difficult to build a stable model when complex samples are analyzed or outliers are contained in the calibration data set. To achieve the purpose, a robust ensemble PLS technique based on probability resampling was proposed, which is named RE‐PLS. In the proposed method, a probability is firstly obtained for each calibration sample from its residual in a robust regression. Then, multiple PLS models are constructed based on probability resampling. At last, the multiple PLS models are used to predict unknown samples by taking the average of the predictions from the multiple models as final prediction result. To validate the effectiveness and universality of the proposed method, it was applied to two different sets of NIR spectra. The results show that RE‐PLS can not only effectively avoid the interference of outliers but also enhance the precision of prediction and the stability of PLS regression. Thus, it may provide a useful tool for multivariate calibration with multiple outliers.


📜 SIMILAR VOLUMES


Use of Fourier transform infrared spectr
✍ Holland, J K; Kemsley, E K; Wilson, R H 📂 Article 📅 1998 🏛 John Wiley and Sons 🌐 English ⚖ 315 KB 👁 1 views

Fourier transform infrared (FT-IR) spectroscopy and chemometrics have been combined to detect adulteration in strawberry pure es. The mid-IR spectra of 983 fruit pure es were used as the data for a partial least squares regression on to a binary dummy variable, that represents two sample types, stra