𝔖 Bobbio Scriptorium
✦   LIBER   ✦

Investigating the performance of automatic new topic identification across multiple datasets

✍ Scribed by H. Cenk Özmutlu; Fatih Cavdur; Amanda Spink; Seda Ozmutlu


Publisher
Wiley (John Wiley & Sons)
Year
2007
Tongue
English
Weight
234 KB
Volume
43
Category
Article
ISSN
0044-7870

No coin nor oath required. For personal study only.

✦ Synopsis


Abstract

Recent studies on automatic new topic identification in Web search engine user sessions demonstrated that neural networks are successful in automatic new topic identification. However most of this work applied their new topic identification algorithms on data logs from a single search engine. In this study, we investigate whether the application of neural networks for automatic new topic identification are more successful on some search engines than others. Sample data logs from the Norwegian search engine FAST (currently owned by Overture) and Excite are used in this study. Findings of this study suggest that query logs with more topic shifts tend to provide more successful results on shift‐based performance measures, whereas logs with more topic continuations tend to provide better results on continuation‐based performance measures.