✦ LIBER ✦

An analysis of the relative hardness of Reuters-21578 subsets

✍ Scribed by Franca Debole; Fabrizio Sebastiani

Publisher: John Wiley and Sons
Year: 2005
Tongue: English
Weight: 180 KB
Volume: 56
Category: Article
ISSN: 1532-2882
DOI: 10.1002/asi.20147

No coin nor oath required. For personal study only.

✦ Synopsis

Abstract

The existence, public availability, and widespread acceptance of a standard benchmark for a given information retrieval (IR) task are beneficial to research on this task, because they allow different researchers to experimentally compare their own systems by comparing the results they have obtained on this benchmark. The Reuters‐21578 test collection, together with its earlier variants, has been such a standard benchmark for the text categorization (TC) task throughout the last 10 years. However, the benefits that this has brought about have somehow been limited by the fact that different researchers have “carved” different subsets out of this collection and tested their systems on one of these subsets only; systems that have been tested on different Reuters‐21578 subsets are thus not readily comparable. In this article, we present a systematic, comparative experimental study of the three subsets of Reuters‐21578 that have been most popular among TC researchers. The results we obtain allow us to determine the relative hardness of these subsets, thus establishing an indirect means for comparing TC systems that have, or will be, tested on these different subsets.

📜 SIMILAR VOLUMES

An analysis of the relative growth-rates

An analysis of the relative growth-rates within the incisor tooth of the rat

✍ Wierda, J. L. 📂 Article 📅 1942 🏛 John Wiley and Sons 🌐 English ⚖ 334 KB 👁 2 views

The multidimensionality of schizotypy in

The multidimensionality of schizotypy in nonpsychotic relatives of patients with schizophrenia and its applications in ordered subsets linkage analysis of schizophrenia

✍ Yin-Ju Lien; Hui-Chun Tsuang; Abigail Chiang; Chih-Min Liu; Ming H. Hsieh; Tzung 📂 Article 📅 2009 🏛 John Wiley and Sons 🌐 English ⚖ 175 KB

Utility of cytokeratin 7 and 20 subset a

Utility of cytokeratin 7 and 20 subset analysis as an aid in the identification of primary site of origin of malignancy in cytologic specimens

✍ Walter Blumenfeld; George K. Turi; George Harrison; Darota Latuszynski; Cunxian 📂 Article 📅 1999 🏛 John Wiley and Sons 🌐 English ⚖ 33 KB 👁 2 views

This study was undertaken to assess the utility of combined cytokeratin (CK) 7/20 immunoprofile determination in malignant cytologic cell blocks as an aid to the identification of tumor primary site of origin. Fifty-one cases in which CK 7/20 immunocytochemistry was performed as part of the initial