Fast screening of large databases using clustering and PCA based on structure fragments
✍ Scribed by Johan Nouwen; Fredrik Lindgren; Bjorn Hansen; Walter Karcher; Henk J. M. Verhaar; Joop L. M. Hermens
- Publisher
- John Wiley and Sons
- Year
- 1996
- Language
- English
- File size
- 758 KB
- Volume
- 10
- Category
- Article
- ISSN
- 0886-9383
✦ Synopsis
Jarvis-Patrick clustering based on structural fragments, with the Tanimoto coefficient as the similarity measure, provides a fast tool for classifying large numbers of chemicals. This clustering technique was applied to chemicals in relation to their acute fish toxicity (LC50). Correlation analysis with log LC50 as the response variable and log Kow as the predictor variable resulted in good models for several clusters. Benzylic chemicals were not recognized as separate clusters. Including them in the training set resulted in models without any predictive capability. Based on statistical and chemical criteria, they were rejected, which improved the final model substantially. The toxicological response of phenols and some organophosphates was found to fit well into one model. The clustering resulted in smaller groupings than those listed by Verhaar et al., but the two classifications disagreed for only a minority of chemicals. PCA allowed a quick visual inspection of the application limits of the models for the HPVCs and the EINECS. The models performed well for the HPVCs but could only be used to estimate a fraction of the EINECS. PCA showed that in some cases subclusters were present.
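
As a rough illustration of the clustering step named in the synopsis, the following is a minimal sketch of one common formulation of Jarvis-Patrick clustering over fragment sets compared with the Tanimoto coefficient. It is not the authors' implementation: the fragment labels, the function names `tanimoto` and `jarvis_patrick`, and the parameters `k` (neighbour list length) and `k_min` (required shared neighbours) are assumptions chosen for the example, since the synopsis does not give the settings that were used.

```python
from itertools import combinations


def tanimoto(a, b):
    """Tanimoto coefficient between two sets of fragment identifiers."""
    if not a and not b:
        return 0.0
    shared = len(a & b)
    return shared / (len(a) + len(b) - shared)


def jarvis_patrick(fingerprints, k=2, k_min=1):
    """Cluster items with one common Jarvis-Patrick rule (assumed variant).

    Two items are joined when each appears in the other's list of k nearest
    neighbours and the two lists share at least k_min members.
    Returns one cluster label per item.
    """
    n = len(fingerprints)

    # k nearest neighbours of each item, ranked by decreasing similarity
    # (ties broken by index so the result is deterministic).
    neighbours = []
    for i in range(n):
        ranked = sorted(
            (j for j in range(n) if j != i),
            key=lambda j: (-tanimoto(fingerprints[i], fingerprints[j]), j),
        )
        neighbours.append(set(ranked[:k]))

    # Union-find structure to merge items that satisfy the joining rule.
    parent = list(range(n))

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for i, j in combinations(range(n), 2):
        mutual = i in neighbours[j] and j in neighbours[i]
        if mutual and len(neighbours[i] & neighbours[j]) >= k_min:
            parent[find(j)] = find(i)

    return [find(i) for i in range(n)]


# Hypothetical fragment sets: three phenol-like and three phosphate-like items.
toy_fingerprints = [
    {"OH", "ring", "Cl"},
    {"OH", "ring"},
    {"OH", "ring", "CH3"},
    {"P=O", "OEt"},
    {"P=O", "OEt", "S"},
    {"P=O", "OEt", "NO2"},
]

labels = jarvis_patrick(toy_fingerprints, k=2, k_min=1)
print(labels)
```

On this toy input the first three items receive one label and the last three another. Because the method needs only neighbour lists rather than a full hierarchical merge, it scales to large collections, which is consistent with the "fast screening" emphasis of the title.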
📜 SIMILAR VOLUMES