
Lexical and semantic clustering by Web links

✍ Scribed by Filippo Menczer


Publisher: John Wiley and Sons
Year: 2004
Tongue: English
Weight: 196 KB
Volume: 55
Category: Article
ISSN: 1532-2882


✦ Synopsis


Abstract

Recent Web‐searching and ‐mining tools are combining text and link analysis to improve ranking and crawling algorithms. The central assumption behind such approaches is that there is a correlation between the graph structure of the Web and the text and meaning of pages. Here I formalize and empirically evaluate two general conjectures drawing connections from link information to lexical and semantic Web content. The link‐content conjecture states that a page is similar to the pages that link to it, and the link‐cluster conjecture that pages about the same topic are clustered together. These conjectures are often simply assumed to hold, and Web search tools are built on such assumptions. The present quantitative confirmation sheds light on the connection between the success of the latest Web‐mining techniques and the small world topology of the Web, with encouraging implications for the design of better crawling algorithms.
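The link‐content conjecture can be illustrated with a small sketch. The abstract does not say how similarity is measured, so the code below assumes plain cosine similarity over bag‐of‐words term counts, which is one common choice and not necessarily the measure used in the article. The function names, the toy pages, and the link list are all hypothetical.

```python
import math
from collections import Counter


def term_vector(text):
    """Bag-of-words term counts for a page's text (assumed tokenization: whitespace)."""
    return Counter(text.lower().split())


def cosine_similarity(a, b):
    """Cosine similarity between two term-count vectors; 0.0 if either is empty."""
    dot = sum(count * b[term] for term, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def mean_similarity_to_inlinks(page, pages, links):
    """Average similarity between `page` and the pages that link to it.

    `links` is a list of (source, target) pairs. Returns None when the
    page has no incoming links.
    """
    sources = [src for src, dst in links if dst == page]
    if not sources:
        return None
    target = term_vector(pages[page])
    total = sum(cosine_similarity(target, term_vector(pages[s])) for s in sources)
    return total / len(sources)


# Hypothetical toy data: page "b" links to "a", and page "c" links to "b".
pages = {
    "a": "web crawling algorithms and link analysis",
    "b": "link analysis for web search ranking",
    "c": "recipes for sourdough bread",
}
links = [("b", "a"), ("c", "b")]

for name in pages:
    print(name, mean_similarity_to_inlinks(name, pages, links))
```

If the conjecture holds for a given crawl, the average similarity between a page and its in‐linking pages should tend to exceed the similarity between that page and pages picked at random. In the toy data, "a" is linked from a topically related page and scores higher than "b", which is linked from an unrelated one.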


📜 SIMILAR VOLUMES


Distributed cell assemblies for general
✍ Friedemann Pulvermüller; Ferath Kherif; Olaf Hauk; Bettina Mohr; Ian Nimmo-Smith 📂 Article 📅 2009 🏛 John Wiley and Sons 🌐 English ⚖ 494 KB

Abstract

Here, we ask whether frontotemporal cortex is functionally dissociated into distributed lexical and category‐specific semantic networks. To this end, fMRI activation patterns elicited during the processing of words from different semantic categories were categorized using k‐means cl