[ACM Press the 2011 international conference - Athens, Greece (2011.06.12-2011.06.16)] Proceedings of the 2011 international conference on Management of data - SIGMOD '11 - Neighborhood based fast graph search in large networks
โ Scribed by Khan, Arijit; Li, Nan; Yan, Xifeng; Guan, Ziyu; Chakraborty, Supriyo; Tao, Shu
- Book ID
- 125532118
- Publisher
- ACM Press
- Year
- 2011
- Tongue
- English
- Weight
- 565 KB
- Category
- Article
- ISBN
- 1450306616
No coin nor oath required. For personal study only.
โฆ Synopsis
Complex social and information network search becomes important with a variety of applications. In the core of these applications, lies a common and critical problem: Given a labeled network and a query graph, how to efficiently search the query graph in the target network. The presence of noise and the incomplete knowledge about the structure and content of the target network make it unrealistic to find an exact match. Rather, it is more appealing to find the top-k approximate matches.In this paper, we propose a neighborhood-based similarity measure that could avoid costly graph isomorphism and edit distance computation. Under this new measure, we prove that subgraph similarity search is NP hard, while graph similarity match is polynomial. By studying the principles behind this measure, we found an information propagation model that is able to convert a large network into a set of multidimensional vectors, where sophisticated indexing and similarity search algorithms are available. The proposed method, called Ness (Neighborhood Based Similarity Search), is appropriate for graphs with low automorphism and high noise, which are common in many social and information networks. Ness is not only efficient, but also robust against structural noise and information loss. Empirical results show that it can quickly and accurately find high-quality matches in large networks, with negligible cost.
๐ SIMILAR VOLUMES
In this paper, we tackle the problem of efficiently answering queries on probabilistic RDF data graphs. Specifically, we model RDF data by probabilistic graphs, and an RDF query is equivalent to a search over subgraphs of probabilistic graphs that have high probabilities to match with a given query