The One-Class Classification Approach to Data Description and to Models Applicability Domain
β Scribed by Igor I. Baskin; Natalia Kireeva; Alexandre Varnek
- Publisher
- Wiley (John Wiley & Sons)
- Year
- 2010
- Tongue
- English
- Weight
- 240 KB
- Volume
- 29
- Category
- Article
- ISSN
- 1868-1743
No coin nor oath required. For personal study only.
β¦ Synopsis
Abstract
In this paper, we associate an applicability domain (AD) of QSAR/QSPR models with the area in the input (descriptor) space in which the density of training data points exceeds a certain threshold. It could be proved that the predictive performance of the models (built on the training set) is larger for the test compounds inside the high density area, than for those outside this area. Instead of searching a decision surface separating high and low density areas in the input space, the oneβclass classification 1βSVM approach looks for a hyperplane in the associated feature space. Unlike other reported in the literature AD definitions, this approach: (i) is purely βdataβbasedβ, i.e. it assigns the same AD to all models built on the same training set, (ii) provides results that depend only on the initial descriptors pool generated for the training set, (iii) can be used for the huge number of descriptors, as well as in the framework of structured kernelβbased approaches, e.g., chemical graph kernels. The developed approach has been applied to improve the performance of QSPR models for stability constants of the complexes of organic ligands with alkalineβearth metals in water.
π SIMILAR VOLUMES
## Abstract Interaction between rotation and intrinsic motion is considered in the framework of the semiphenomenological model of the nucleus (the schemes of strong and weak coupling). The theory is applied to the description of the rotational spectra up to states with high values of the angular mo