Automatic classification of chemical structure databases using a highly parallel array processor
✍ Scribed by Edie M. Rasmussen; Geoffrey M. Downs; Peter Willett
- Publisher
- John Wiley and Sons
- Year
- 1988
- Language
- English
- File size
- 874 KB
- Volume
- 9
- Category
- Article
- ISSN
- 0192-8651
✦ Synopsis
This article describes the use of the ICL Distributed Array Processor (DAP) for the automatic classification of chemical structure databases using the Jarvis-Patrick clustering method. The method is based on a table that lists the nearest neighbors of each molecule in the database to be clustered. These nearest neighbors can be identified very efficiently on the DAP, since it allows up to 4096 molecules to be compared with a specified molecule in parallel. Experiments with files of 4096 and 8192 structures from the Fine Chemicals Database show that clustering with the DAP is up to 6.7 times as fast as a highly efficient inverted-file algorithm on an IBM 3083 mainframe.
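
To make the two stages of the method concrete, here is a minimal serial sketch in Python. It is not the DAP implementation described in the article. It assumes the common formulation of Jarvis-Patrick clustering, in which two molecules join the same cluster when each appears in the other's nearest-neighbor list and the two lists share at least `k_min` entries. The representation of molecules as sets of integer features, the use of the Tanimoto coefficient, and the names `tanimoto`, `nearest_neighbor_table`, `jarvis_patrick`, `k` and `k_min` are all illustrative assumptions that do not come from the synopsis.

```python
from itertools import combinations


def tanimoto(a, b):
    """Tanimoto coefficient between two feature sets (assumed similarity measure)."""
    union = len(a | b)
    return len(a & b) / union if union else 0.0


def nearest_neighbor_table(fingerprints, k):
    """Return, for each molecule, the indices of its k most similar other molecules."""
    table = []
    for i, fp in enumerate(fingerprints):
        # Comparing one molecule against all the others is the step that the
        # synopsis says the DAP carries out in parallel; here it is a plain loop.
        scored = [(tanimoto(fp, other), j)
                  for j, other in enumerate(fingerprints) if j != i]
        # Highest similarity first; ties are broken by the lower index.
        scored.sort(key=lambda s: (-s[0], s[1]))
        table.append([j for _, j in scored[:k]])
    return table


def jarvis_patrick(table, k_min):
    """Cluster molecules from a nearest-neighbor table (assumed standard criterion)."""
    n = len(table)
    parent = list(range(n))  # union-find forest, one tree per cluster

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    neighbor_sets = [set(row) for row in table]
    for i, j in combinations(range(n), 2):
        mutual = j in neighbor_sets[i] and i in neighbor_sets[j]
        shared = len(neighbor_sets[i] & neighbor_sets[j])
        if mutual and shared >= k_min:
            parent[find(i)] = find(j)  # merge the two clusters

    clusters = {}
    for i in range(n):
        clusters.setdefault(find(i), []).append(i)
    return sorted(clusters.values())


# Toy data: two groups of three hypothetical molecules with overlapping features.
fingerprints = [
    {1, 2, 3, 4}, {1, 2, 3, 5}, {1, 2, 3, 6},
    {10, 11, 12, 13}, {10, 11, 12, 14}, {10, 11, 12, 15},
]
table = nearest_neighbor_table(fingerprints, k=2)
clusters = jarvis_patrick(table, k_min=1)
```

On this toy input the sketch separates the six molecules into the two groups of three. Building the table dominates the cost, since every molecule is compared with every other one, which is consistent with the synopsis singling out that stage for parallel execution.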