✦ LIBER ✦

Author name disambiguation for collaboration network analysis and visualization

✍ Scribed by Andreas Strotmann; Dangzhi Zhao; Tania Bubela

Book ID: 102514161
Publisher: Wiley (John Wiley & Sons)
Year: 2009
Tongue: English
Weight: 194 KB
Volume: 46
Category: Article
ISSN: 0044-7870
DOI: 10.1002/meet.2009.1450460218

✦ Synopsis

Abstract

In this paper we outline a heuristic algorithm for disambiguating author names of publications via deterministic clustering based on well‐defined similarity measures between publications in which their names appear as authors. The algorithm is designed to be used in the construction of a collaboration network, i.e., a graph of author nodes and co‐author links. In this context, the goal is to produce a co‐authorship graph with network characteristics that are close to those of the “true” collaboration network, so that meaningful network metrics can be determined.

The algorithm we present here is fairly easy to comprehend, as it does not depend on any sophisticated AI techniques. This is important in the context of policy studies, in which we successfully applied it, as it enables policy makers to judge the soundness of the methodology with considerable confidence. It is also quite fast, making it possible to run large‐scale analyses (here, on the order of a hundred thousand publications and a million names to be disambiguated) on a moderately sized desktop computer within a few days.

The algorithm is, finally, open to improvement via extensions that take into account additional kinds of fields in bibliographic records of publications to provide evidence that two occurrences of similar names belong to the same individual.
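The abstract does not spell out the similarity measures, so the following Python sketch only illustrates the general idea of deterministic clustering followed by graph construction. It rests on assumptions that are not taken from the paper: names are grouped by surname and first initial, and two occurrences of a similar name are merged only when their publications share at least one other co-author. The function names `name_key`, `disambiguate`, and `coauthor_edges` are hypothetical.

```python
from collections import defaultdict
from itertools import combinations


def name_key(name):
    """Reduce 'Surname, Given' to (surname, first initial), lower-cased."""
    surname, _, given = name.partition(",")
    return surname.strip().lower(), given.strip()[:1].lower()


def disambiguate(publications):
    """Cluster author-name occurrences with a union-find structure.

    publications: dict mapping pub_id -> list of 'Surname, Given' strings.
    Returns a dict mapping (pub_id, name) -> representative occurrence.
    ASSUMPTION: shared co-authorship is the only evidence used here.
    """
    parent = {}

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    def union(a, b):
        ra, rb = find(a), find(b)
        if ra != rb:
            # Deterministic tie-break: the smaller tuple becomes the root.
            if rb < ra:
                ra, rb = rb, ra
            parent[rb] = ra

    # Block occurrences by their ambiguous (surname, initial) key.
    blocks = defaultdict(list)
    for pub_id, authors in publications.items():
        for name in authors:
            occ = (pub_id, name)
            parent[occ] = occ
            blocks[name_key(name)].append(occ)

    # Within a block, merge occurrences whose papers share another co-author.
    for key, occs in blocks.items():
        for a, b in combinations(sorted(occs), 2):
            co_a = {name_key(n) for n in publications[a[0]]} - {key}
            co_b = {name_key(n) for n in publications[b[0]]} - {key}
            if co_a & co_b:
                union(a, b)

    return {occ: find(occ) for occ in parent}


def coauthor_edges(publications, clusters):
    """Build the set of undirected co-author links between author clusters."""
    edges = set()
    for pub_id, authors in publications.items():
        nodes = sorted({clusters[(pub_id, n)] for n in authors})
        edges.update(combinations(nodes, 2))
    return edges


publications = {
    "p1": ["Smith, J", "Lee, K"],
    "p2": ["Smith, John", "Lee, K", "Chan, M"],
    "p3": ["Smith, Jane", "Park, S"],
}
clusters = disambiguate(publications)
edges = coauthor_edges(publications, clusters)
```

In this toy input, the "Smith" occurrences on p1 and p2 are merged because both papers also list "Lee, K", while the "Smith" on p3 stays separate. Additional bibliographic fields (for example affiliations or journals) could be added as further merge conditions in the same loop, in the spirit of the extensions the abstract mentions.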

📜 SIMILAR VOLUMES

A probabilistic similarity metric for Medline records: A model for author name disambiguation

✍ Vetle I. Torvik; Marc Weeber; Don R. Swanson; Neil R. Smalheiser 📂 Article 📅 2004 🏛 John Wiley and Sons 🌐 English ⚖ 573 KB

We present a model for estimating the probability that a pair of author names (sharing last name and first initial), appearing on two different Medline articles, refer to the same individual. The model uses a simple yet powerful similarity profile between a pair of articles, based on ti

A heuristic approach to author name disambiguation in bibliometrics databases for large-scale research assessments

✍ Ciriaco Andrea D'Angelo; Cristiano Giuffrida; Giovanni Abramo 📂 Article 📅 2010 🏛 John Wiley and Sons 🌐 English ⚖ 120 KB

Collaborative routing and camera selection for visual wireless sensor networks

✍ Amiri, S.M.; Nasiopoulos, P.; Leung, V.C.M. 📂 Article 📅 2011 🏛 The Institution of Engineering and Technology 🌐 English ⚖ 589 KB

Cytoscape 2.8: new features for data integration and network visualization

✍ Smoot, Michael E.; Ono, Keiichiro; Ruscheinski, Johannes 📂 Article 📅 2010 🏛 Oxford University Press 🌐 English ⚖ 160 KB

Multimodal and multimedia image analysis and collaborative networking for digestive endoscopy

✍ d’Orazio, L.; Bartoli, A.; Baetz, A.; Beorchia, S.; Calvary, G.; Chabane, Y.; Ch 📂 Article 📅 2014 🏛 Elsevier 🌐 French ⚖ 736 KB