๐”– Bobbio Scriptorium
โœฆ   LIBER   โœฆ

Learning to classify documents according to genre

โœ Scribed by Aidan Finn; Nicholas Kushmerick


Book ID
101655399
Publisher
John Wiley and Sons
Year
2006
Tongue
English
Weight
169 KB
Volume
57
Category
Article
ISSN
1532-2882

No coin nor oath required. For personal study only.

โœฆ Synopsis


Abstract

Current documentโ€retrieval tools succeed in locating large numbers of documents relevant to a given query. While search results may be relevant according to the topic of the documents, it is more difficult to identify which of the relevant documents are most suitable for a particular user. Automatic genre analysis (i.e., the ability to distinguish documents according to style) would be a useful tool for identifying documents that are most suitable for a particular user. We investigate the use of machine learning for automatic genre classification. We introduce the idea of domain transferโ€”genre classifiers should be reusable across multiple topicsโ€”which does not arise in standard text classification. We investigate different features for building genre classifiers and their ability to transfer across multipleโ€topic domains. We also show how different featureโ€sets can be used in conjunction with each other to improve performance and reduce the number of documents that need to be labeled.


๐Ÿ“œ SIMILAR VOLUMES


Learning to classify species with barcod
โœ Paola Bertolazzi; Giovanni Felici; Emanuel Weitschek ๐Ÿ“‚ Article ๐Ÿ“… 2009 ๐Ÿ› BioMed Central ๐ŸŒ English โš– 514 KB
Learning to recognize webpage genres
โœ Ioannis Kanaris; Efstathios Stamatatos ๐Ÿ“‚ Article ๐Ÿ“… 2009 ๐Ÿ› Elsevier Science ๐ŸŒ English โš– 394 KB
Genres and Learning to Read
โœ Lynne Kirk; Henry Pearson ๐Ÿ“‚ Article ๐Ÿ“… 1996 ๐Ÿ› John Wiley and Sons ๐ŸŒ English โš– 505 KB
A method to classify neutrino events acc
โœ N. Armenise; G. Iaselli ๐Ÿ“‚ Article ๐Ÿ“… 1986 ๐Ÿ› Elsevier Science ๐ŸŒ English โš– 604 KB

Complete neutrino events are separated from the total sample with a discriminant analysis in a many-dimensional space described in the text. Two new powerful variables are found which discriminate with high efficiency and tag the event type as far as the completeness is concerned.