Algorithms for clustering Web search results have to be efficient and robust. Furthermore they must be able to cluster a data set without using any kind of a priori information, such as the required number of clusters. Clustering algorithms inspired by the behavior of real ants generally meet these
Refining web search engine results using incremental clustering
β Scribed by Ya-Jun Zhang; Zhi-Qiang Liu
- Publisher
- John Wiley and Sons
- Year
- 2004
- Tongue
- English
- Weight
- 141 KB
- Volume
- 19
- Category
- Article
- ISSN
- 0884-8173
No coin nor oath required. For personal study only.
β¦ Synopsis
In this article, we present a new solution to improve the Web search performance. Our algorithm is based on a new clustering algorithm that classifies the results of a query from a search engine into subgroups and assigns each group a short series of keywords together with some statistics data. Then, the user may look into the group with the keywords that he/she finds interesting. Compared with the approaches available in the literature, our algorithm does not require the number of groups as the prior knowledge; it starts from a single prototype group and adaptively expands the prototype set based on a self-spawning splitting scheme until all the groups are finally identified.
π SIMILAR VOLUMES
## Abstract Clustering web search results into dynamic clusters and cluster hierarchies has been shown to be promising in reducing the information overload typically found in the ranked list search engines. The study compared sixteen participants' search performance and subjective satisfaction leve
## Abstract Most Web search tools integrate sponsored results with results from their internal editorial database in providing results to users. The goal of this research is to get a better idea of how much of the screen real estate displays βrealβ editorial results as compared to sponsored results
## Abstract Session characteristics taken from large transaction logs of three Web search environments (academic website, public search engine, consumer health information service) are modeled using cluster analysis to determine if different session groups emerge for each environment. The analysis