𝔖 Bobbio Scriptorium
✦   LIBER   ✦

Introduction to the JASIST Special Topic issue on web retrieval and mining: A machine learning perspective

✍ Scribed by Hsinchun Chen


Publisher
John Wiley and Sons
Year
2003
Tongue
English
Weight
62 KB
Volume
54
Category
Article
ISSN
1532-2882

No coin nor oath required. For personal study only.

✦ Synopsis


Web Retrieval and Mining: Introduction

Research in information retrieval (IR) has advanced significantly in the past few decades. Many tasks, such as indexing and text categorization, can be performed automatically with minimal human effort. Machine learning has played an important role in such automation by learning various patterns such as document topics, text structures, and user interests from examples.

In recent years, it has become increasingly difficult to search for useful information on the World Wide Web because of its large size and unstructured nature. Useful information and resources are often hidden in the Web. While machine learning has been successfully applied to traditional IR systems, it poses some new challenges to apply these algorithms to the Web due to its large size, link structure, diversity in content and languages, and dynamic nature. On the other hand, such characteristics of the Web also provide interesting patterns and knowledge that do not present in traditional information retrieval systems.


πŸ“œ SIMILAR VOLUMES


Introduction to the special topic sectio
✍ Wai Lam; Christopher C. Yang; Filippo Menczer πŸ“‚ Article πŸ“… 2007 πŸ› John Wiley and Sons 🌐 English βš– 180 KB πŸ‘ 1 views

## Abstract The amount of information on the Web has been expanding at an enormous pace. There are a variety of Web documents in different genres, such as news, reports, reviews. Traditionally, the information displayed on Web sites has been static. Recently, there are many Web sites offering conte

Soft approaches to information retrieval
✍ Enrique Herrera-Viedma; Gabriella Pasi πŸ“‚ Article πŸ“… 2006 πŸ› John Wiley and Sons 🌐 English βš– 57 KB πŸ‘ 2 views

## Abstract The World Wide Web is a popular and interactive medium used to collect, disseminate, and access an increasingly huge amount of information, which constitutes the mainstay of the so‐called information and knowledge society. Because of its spectacular growth, related to both Web resources