
Multiple regression for environmental data: nonlinearities and prediction bias

โœ Scribed by Paul Geladi; Lubomir Hadjiiski; Philip Hopke


Book ID
104309726
Publisher
Elsevier Science
Year
1999
Tongue
English
Weight
178 KB
Volume
47
Category
Article
ISSN
0169-7439


✦ Synopsis


Multiple regression models are often tested by making plots of predicted against measured values. In these plots, all observations are supposed to fall on the diagonal. Points not positioned on the diagonal show unmodeled behaviour. Some of these deviations are caused by random noise. Environmental data have quite some measurement and sampling noise and one is not supposed to model or predict this noise. However, there can also be a systematic variation, a bias. This bias is often expressed as systematically low predictions for high values. The high values fall below the diagonal in the plot. A kind of bias is a contraction around the diagonal. The high values are predicted too low and the low values are predicted too high: the predictions are contracted around the center of the data set. One factor contributing to bias or contraction is nonlinearities in the true physical relationship. The data set consists of hourly ozone measurements and parallel measurements of nitrogen oxides, temperature, UV radiation and more than 50 organic chemicals. The measurements were made on surface air in an urban environment. It may be assumed that the ozone concentrations are influenced by all the other variables, so a multivariate regression model may be made with 57 predictor variables and ozone concentration as the response variable. Because of expected collinearities and large noise, a partial least squares (PLS) regression model is chosen. The total set of 717 objects is split into a calibration and a test set of 358 and 359 objects, respectively. The data are noisy and the relationship is very nonlinear. It is shown how contraction and prediction bias occur and how extra steps of reducing and nonlinearizing the data remove these effects until only a substantial random noise is left.
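
The contraction described in the synopsis can be reproduced on synthetic data. The following is a minimal sketch, not the authors' method: it assumes a single predictor and ordinary least squares instead of the 57-variable PLS model, and the linear "truth", the noise level, and all names (`fit_line`, `diag_slope`, `low_bias`, `high_bias`) are hypothetical. It illustrates only the noise-driven part of the effect: even with a perfectly linear relationship, the slope of the predicted-versus-measured plot falls below 1, so high values are predicted too low and low values too high.

```python
import random


def mean(v):
    return sum(v) / len(v)


def cov(u, v):
    """Sample covariance of two equally long sequences."""
    mu, mv = mean(u), mean(v)
    return sum((a - mu) * (b - mv) for a, b in zip(u, v)) / (len(u) - 1)


def fit_line(x, y):
    """Ordinary least squares fit of y on x; returns (intercept, slope)."""
    slope = cov(x, y) / cov(x, x)
    return mean(y) - slope * mean(x), slope


rng = random.Random(0)  # fixed seed so the run is repeatable
n = 717  # object count borrowed from the synopsis; the data are synthetic

# Assumed linear relationship plus substantial measurement noise.
x = [rng.uniform(0.0, 10.0) for _ in range(n)]
measured = [2.0 + 3.0 * xi + rng.gauss(0.0, 8.0) for xi in x]

intercept, slope = fit_line(x, measured)
predicted = [intercept + slope * xi for xi in x]

# Slope of the predicted-versus-measured plot; 1.0 would be the diagonal.
_, diag_slope = fit_line(measured, predicted)

# For an in-sample least squares fit this slope equals R squared.
r_squared = cov(x, measured) ** 2 / (cov(x, x) * cov(measured, measured))

# Mean prediction error in the lowest and highest quarter of measured values.
order = sorted(range(n), key=lambda i: measured[i])
q = n // 4
low_bias = mean([predicted[i] - measured[i] for i in order[:q]])
high_bias = mean([predicted[i] - measured[i] for i in order[-q:]])

print(f"slope of predicted vs measured: {diag_slope:.3f}")
print(f"R squared:                      {r_squared:.3f}")
print(f"mean error, lowest quarter:     {low_bias:+.2f}")
print(f"mean error, highest quarter:    {high_bias:+.2f}")
```

In this sketch the contraction comes from noise alone. Bias caused by a nonlinear physical relationship, which the synopsis names as a further contributing factor, is not modelled here.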


📜 SIMILAR VOLUMES


A random effects nonlinear regression mo
โœ Keiko Otani; Megu Ohtaki; Shaw Watanabe ๐Ÿ“‚ Article ๐Ÿ“… 2003 ๐Ÿ› John Wiley and Sons ๐ŸŒ English โš– 91 KB ๐Ÿ‘ 2 views

The purpose of this study is to examine the relationship between dioxin concentration in humans and their living environmental factors such as diet or residential district. We develop a nonlinear random effects regression model based on a pharmacokinetic model that explains dioxin accum

Linear Predictivity: An Alternative for
โœ L. P. Lefkovitch ๐Ÿ“‚ Article ๐Ÿ“… 1986 ๐Ÿ› John Wiley and Sons ๐ŸŒ English โš– 555 KB

The maximal linear predictable combination of a set of dependent variables is defined as that linear combination maximizing the multiple correlation coefficient with the predictor set. It allows the relative importance of a number of factors to be evaluated for the joint response, rather than for th