𝔖 Bobbio Scriptorium
✦   LIBER   ✦

Graph kernels for chemical informatics

✍ Scribed by Liva Ralaivola; Sanjay J. Swamidass; Hiroto Saigo; Pierre Baldi


Publisher
Elsevier Science
Year
2005
Tongue
English
Weight
280 KB
Volume
18
Category
Article
ISSN
0893-6080


✦ Synopsis


Increased availability of large repositories of chemical compounds is creating new challenges and opportunities for the application of machine learning methods to problems in computational chemistry and chemical informatics. Because chemical compounds are often represented by the graph of their covalent bonds, machine learning methods in this domain must be capable of processing graphical structures with variable size. Here, we first briefly review the literature on graph kernels and then introduce three new kernels (Tanimoto, MinMax, Hybrid) based on the idea of molecular fingerprints and counting labeled paths of depth up to d using depth-first search from each possible vertex. The kernels are applied to three classification problems to predict mutagenicity, toxicity, and anti-cancer activity on three publicly available data sets. The kernels achieve performances at least comparable, and most often superior, to those previously reported in the literature, reaching accuracies of 91.5% on the Mutag dataset, 65-67% on the PTC (Predictive Toxicology Challenge) dataset, and 72% on the NCI (National Cancer Institute) dataset. Properties and tradeoffs of these kernels, as well as other proposed kernels that leverage 1D or 3D representations of molecules, are briefly discussed.
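The path-counting idea in the synopsis can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes simple paths (no atom visited twice), measures depth in bonds, and uses the common set-based Tanimoto and count-based MinMax definitions. The Hybrid kernel is left out because the synopsis does not define it. The names `path_counts`, `tanimoto`, and `minmax`, and the two toy molecules (heavy atoms only), are hypothetical and chosen for illustration.

```python
from collections import Counter


def path_counts(atoms, bonds, d):
    """Count labeled paths of up to d bonds via DFS from every atom.

    atoms: list of atom labels, e.g. ["C", "C", "O"]
    bonds: dict mapping (i, j) atom-index pairs to a bond label
    Assumption: only simple paths (no repeated atoms) are counted.
    """
    adj = {i: [] for i in range(len(atoms))}
    for (i, j), label in bonds.items():
        adj[i].append((j, label))
        adj[j].append((i, label))

    counts = Counter()

    def dfs(v, visited, path, depth):
        counts[path] += 1          # every prefix of a walk is itself a path
        if depth == d:
            return
        for w, label in adj[v]:
            if w not in visited:
                dfs(w, visited | {w}, path + (label, atoms[w]), depth + 1)

    for start in range(len(atoms)):
        dfs(start, {start}, (atoms[start],), 0)
    return counts


def tanimoto(cu, cv):
    """Shared distinct paths divided by distinct paths present in either."""
    a, b = set(cu), set(cv)
    union = len(a | b)
    return len(a & b) / union if union else 0.0


def minmax(cu, cv):
    """Sum of per-path minimum counts divided by sum of maximum counts."""
    keys = set(cu) | set(cv)
    den = sum(max(cu[k], cv[k]) for k in keys)
    return sum(min(cu[k], cv[k]) for k in keys) / den if den else 0.0


# Toy inputs (hypothetical): ethanol C-C-O and dimethyl ether C-O-C.
ethanol = path_counts(["C", "C", "O"], {(0, 1): "-", (1, 2): "-"}, d=2)
ether = path_counts(["C", "O", "C"], {(0, 1): "-", (1, 2): "-"}, d=2)
print(tanimoto(ethanol, ether), minmax(ethanol, ether))
```

Tanimoto only asks whether a path occurs in each molecule, while MinMax also uses how often it occurs, so the two values generally differ for the same pair of molecules.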


📜 SIMILAR VOLUMES


Graph Kernels for Molecular Similarity
✍ Matthias Rupp; Gisbert Schneider 📂 Article 📅 2010 🏛 Wiley (John Wiley & Sons) 🌐 English ⚖ 574 KB

Molecular similarity measures are important for many cheminformatics applications like ligand‐based virtual screening and quantitative structure‐property relationships. Graph kernels are formal similarity measures defined directly on graphs, such as the (annotated) molecular structure g

Bond graph models for electrochemical en
✍ Dean Karnopp 📂 Article 📅 1990 🏛 Elsevier Science 🌐 English ⚖ 537 KB

Bond graph models for chemical kinetics are extended to electrochemical systems. Although many electrochemical systems can be considered to function in a constant temperature environment, high-power energy storage systems, such as electric vehicle batteries, undergo drastic changes in temperature in

Parallel algorithm for the computation o
✍ P. Venuvanalingam; P. Thangavel 📂 Article 📅 1991 🏛 John Wiley and Sons 🌐 English ⚖ 413 KB

A parallel algorithm is developed for the first time based on Frame's method to compute the characteristic polynomials of chemical graphs. This algorithm can handle all types of graphs: ordinary, weighted, directed, and signed. Our algorithm takes only linear time in the CRCW PRAM model with O(n9) p