Finding Patterns in Three-Dimensional Graphs: Algorithms and Applications to Scientific Data Mining

✍ Scribed by Wang X., Wang J.T.L., Shasha D.

Year: 2002
Tongue: English
Leaves: 19
Category: Library

No coin nor oath required. For personal study only.

✦ Synopsis

This paper presents a method for finding patterns in 3D graphs. Each node in a graph is an undecomposable or atomic unit and has a label. Edges are links between the atomic units. Patterns are rigid substructures that may occur in a graph after allowing for an arbitrary number of whole-structure rotations and translations as well as a small number (specified by the user) of edit operations in the patterns or in the graph. (When a pattern appears in a graph only after the graph has been modified, we call that appearance approximate occurrence.º) The edit operations include relabeling a node, deleting a node and inserting a node. The proposed method is based on the geometric hashing technique, which hashes node-triplets of the graphs into a 3D table and compresses the labeltriplets in the table. To demonstrate the utility of our algorithms, we discuss two applications of them in scientific data mining. First, we apply the method to locating frequently occurring motifs in two families of proteins pertaining to RNA-directed DNA Polymerase and Thymidylate Synthase and use the motifs to classify the proteins. Then, we apply the method to clustering chemical compounds pertaining to aromatic, bicyclicalkanes, and photosynthesis. Experimental results indicate the good performance of our algorithms and high recall and precision rates for both classification and clustering.

📜 SIMILAR VOLUMES

Graph Data Mining: Algorithm, Security a

📁 Graph Data Mining: Algorithm, Security and Application

✍ Qi Xuan 📂 Library 📅 2021 🏛 Springer Nature 🌐 English

Data Mining Algorithms in C++: Data Patt

📁 Data Mining Algorithms in C++: Data Patterns and Algorithms for Modern Applications

✍ Timothy Masters 📂 Library 📅 2018 🏛 Apress 🌐 English

Discover hidden relationships among the variables in your data, and learn how to exploit these relationships. This book presents a collection of data-mining algorithms that are effective in a wide variety of prediction and classification applications. All algorithms include an intuitive explanation

Data Mining Algorithms in C++. Data Patt

📁 Data Mining Algorithms in C++. Data Patterns and Algorithms for modern Applications

✍ Timothy Masters 📂 Library 📅 2018 🏛 Apress 🌐 English

Data Mining Algorithms in C++: Data Patt

📁 Data Mining Algorithms in C++: Data Patterns and Algorithms for Modern Applications

✍ Timothy Masters (auth.) 📂 Library 📅 2018 🏛 Apress 🌐 English

<p>Discover hidden relationships among the variables in your data, and learn how to exploit these relationships. This book presents a collection of data-mining algorithms that are effective in a wide variety of prediction and classification applications. All algorithms include an intuitive explanati

Data mining algorithms in C++: data patt

📁 Data mining algorithms in C++: data patterns and algorithms for modern applications

✍ Masters, Timothy 📂 Library 📅 2018 🏛 Apress 🌐 English

Find the various relationships among variables that can be present in big data as well as other data sets. This book also covers information entropy, permutation tests, combinatorics, predictor selections, and eigenvalues to give you a well-rounded view of data mining and algorithms in C++. Furtherm

Data Mining Algorithms in C++: Data Patt

📁 Data Mining Algorithms in C++: Data Patterns and Algorithms for Modern Applications

✍ Timothy Masters 📂 Library 📅 2017 🏛 Apress 🌐 English