Model clustering and its application to water quality monitoring
β Scribed by Rong Zhu; Abdel H. El-Shaarawi
- Publisher
- John Wiley and Sons
- Year
- 2009
- Tongue
- English
- Weight
- 133 KB
- Volume
- 20
- Category
- Article
- ISSN
- 1180-4009
- DOI
- 10.1002/env.922
No coin nor oath required. For personal study only.
β¦ Synopsis
Abstract
The classification of objects into groups where the objects within a group share a set of common traits is important in many areas of applications and particularly in environmental pollution studies. Consider the situation where variables are measured on different occasions for each of K objects, and the objective is to classify these objects into groups according to some common characteristics. The procedure introduced in this paper consists of two aspects: model fitting and clustering. The model fitting selects a family of models which is appropriate for the structure and nature of the available measurements, and then is performed for both individual and pooled datasets. The clustering starts with K models that represent the K objects and thus the similarity of the objects reduces to the similarity of their models. Since the models are members of the same family, the models similarity is defined as the equality of their parameters of interest. Here, we partition the parameter vector into two subβvectors corresponding to the interested parameters and ancillary parameters. The clustering will group together objects that have common interested parameters while allowing the ancillary parameters to be object specifics. The pβvalue associated with the proposed model linking test is used as the similarity measure. Several grouping strategies are proposed like cluster peeling, pairwise combining, as well as a speeding technique called splittingβandβbinding. A small simulation study is used to demonstrate the utility of the method. The paper concludes by presenting an environmental application where the interest is to classify E. coli bacteria according to their responses to antibiotic treatments. The data were collected biβweekly at several locations within three Canadian watersheds during 2005. Metric closeness in parameter space used by conventional method and likelihood closeness in model space employed by model clustering are discussed in this application. Copyright Β© 2008 John Wiley & Sons, Ltd.
π SIMILAR VOLUMES
## Abstract We present a forecasting model based on fuzzy pattern recognition and weighted linear regression. In this model fuzzy pattern recognition is used to find homogeneous fuzzy classes in a heterogeneous data set. It is assumed that the classes represent typical situations. For each class a