## Abstract Identification and characterization of antigenic determinants on proteins has received considerable attention utilizing both, experimental as well as computational methods. For computational routines mostly structural as well as physicochemical parameters have been utilized for predicti
Prediction of antibacterial compounds by machine learning approaches
โ Scribed by Xue-Gang Yang; Duan Chen; Min Wang; Ying Xue; Yu-Zong Chen
- Publisher
- John Wiley and Sons
- Year
- 2009
- Tongue
- English
- Weight
- 159 KB
- Volume
- 30
- Category
- Article
- ISSN
- 0192-8651
No coin nor oath required. For personal study only.
โฆ Synopsis
Abstract
The machine learning (ML) as well as quantitative structure activity relationship (QSAR) method has been explored for predicting compounds with antibacterial activities at impressive performance. It is desirable to test additional ML methods, select most representative sets of molecular descriptors, and subject the developed prediction models to rigorous evaluations. This work evaluated three ML methods, support vector classification (SVC), kโnearest neighbor (kโNN), and C4.5 decision tree, which were trained and tested by 230 antibacterial and 381 nonantibacterial compounds. A wellโestablished feature selection method was used to select representative molecular descriptors from a larger pool than that used in reported studies. The performance of the developed prediction models was tested by 5โfold crossโvalidation and independent evaluation set. SVC produced the best prediction accuracies of 96.66 and 98.15% for antibacterial compounds, and 99.50 and 98.02% for nonantibacterial compounds respectively, which are slightly improved against those of the reported ML as well as QSAR models and outperform the kโNN and C4.5 decision tree models developed in this work. Our study suggests that ML methods, particularly SVC, are potentially useful for facilitating the discovery of antibacterial agents. ยฉ 2008 Wiley Periodicals, Inc. J Comput Chem, 2009
๐ SIMILAR VOLUMES
A novel method for estimating prediction uncertainty using machine learning techniques is presented. Uncertainty is expressed in the form of the two quantiles (constituting the prediction interval) of the underlying distribution of prediction errors. The idea is to partition the input space into dif
## Abstract Consistently growing architectural complexity and machine scales make the creation of accurate performance models for largeโscale applications increasingly challenging. Traditional analytic models are difficult and time consuming to construct, and are often unable to capture full system
## Abstract In the drug discovery process, it is important to know the properties of both drug candidates and their metabolites. Fast and precise prediction of metabolites is essential. However, it has been difficult to predict metabolites because of the complexity of the mechanism of cytochrome P4
Computational methods for predicting compounds of specific pharmacodynamic and ADMET (absorption, distribution, metabolism, excretion and toxicity) property are useful for facilitating drug discovery and evaluation. Recently, machine learning methods such as neural networks and support vector machin