✦ LIBER ✦

A case for automated large-scale semantic annotation

✍ Scribed by Stephen Dill; Nadav Eiron; David Gibson; Daniel Gruhl; R. Guha; Anant Jhingran; Tapas Kanungo; Kevin S. McCurley; Sridhar Rajagopalan; Andrew Tomkins; John A. Tomlin; Jason Y. Zien

Publisher: Elsevier Science
Year: 2003
Tongue: English
Weight: 441 KB
Volume: 1
Category: Article
ISSN: 1570-8268
DOI: 10.1016/j.websem.2003.07.006

✦ Synopsis

This paper describes Seeker, a platform for large-scale text analytics, and SemTag, an application written on the platform to perform automated semantic tagging of large corpora. We apply SemTag to a collection of approximately 264 million web pages, and generate approximately 434 million automatically disambiguated semantic tags, published to the web as a label bureau providing metadata regarding the 434 million annotations. To our knowledge, this is the largest-scale semantic tagging effort to date.

We describe the Seeker platform, discuss the architecture of the SemTag application, describe a new disambiguation algorithm specialized to support ontological disambiguation of large-scale data, evaluate the algorithm, and present our final results with information about acquiring and making use of the semantic tags. We argue that automated large-scale semantic tagging of ambiguous content can bootstrap and accelerate the creation of the semantic web.

📜 SIMILAR VOLUMES

GAT: a Graphical Annotation Tool for semantic regions

✍ Xavier Giro-i-Nieto; Neus Camps; Ferran Marques 📂 Article 📅 2009 🏛 Springer US 🌐 English ⚖ 942 KB

Schema mediation for large-scale semantic data sharing

✍ Alon Y. Halevy; Zachary G. Ives; Dan Suciu; Igor Tatarinov 📂 Article 📅 2005 🏛 Springer-Verlag 🌐 English ⚖ 262 KB

An Integrated Framework for Semantic Annotation and Adaptation

✍ M. Bertini; R. Cucchiara; A. Del Bimbo; A. Prati 📂 Article 📅 2005 🏛 Springer US 🌐 English ⚖ 617 KB

Large-scale mutational analysis for the annotation of the mouse genome

✍ Johannes Beckers; Martin Hrabé de Angelis 📂 Article 📅 2002 🏛 Elsevier Science 🌐 English ⚖ 73 KB

Bridge ontology: A multi-ontologies-based approach for semantic annotation

✍ Wang Peng; Xu Bao-wen; Lu Jian-jiang; Li Yan-hui; Jiang Jian-hua 📂 Article 📅 2004 🏛 Wuhan University 🌐 English ⚖ 800 KB

Semi-automated collection evaluation for large-scale aggregations

✍ Katrina Fenlon; Peter Organisciak; Jacob Jett; Miles Efron 📂 Article 📅 2011 🏛 Wiley (John Wiley & Sons) 🌐 English ⚖ 82 KB

Library and museum digital collections are increasingly aggregated at various levels. Large-scale aggregations, often characterized by heterogeneous or messy metadata, pose unique and growing challenges to aggregation administrators, not only in facilitating end-user discovery and access, but in per