𝔖 Bobbio Scriptorium
✦   LIBER   ✦

Integrating log-based and text-based methods towards automatic web thesaurus construction

✍ Scribed by Hsiao-Tieh Pu; Lee-Feng Chien


Publisher
Wiley (John Wiley & Sons)
Year
2005
Tongue
English
Weight
717 KB
Volume
41
Category
Article
ISSN
0044-7870

No coin nor oath required. For personal study only.

✦ Synopsis


Abstract

This paper presents an approach to investigating the possibility for constructing an automatic and scalable thesaurus based on Web users' vocabularies with search interests. The proposed approach mainly includes two techniques, namely, relevant term extraction and concept ciustering. The former combines query‐session‐based and text‐based methods to extract relevant terms for a given search term; and the latter organizes these relevant terms into concept classes based on the search results from search engines. Some initial experiments have been conducted to test feasibility of the proposed approach to organizing Web users' vocabularies. The obtained results show that relevant terms could be extracted efficiently and concept classes be more well organized. The approach has a great potential to benefit the automatic construction of a (arge scale thesaurus for future Web IR applications.