Learning to classify documents according to genre
โ Scribed by Aidan Finn; Nicholas Kushmerick
- Book ID
- 101655399
- Publisher
- John Wiley and Sons
- Year
- 2006
- Tongue
- English
- Weight
- 169 KB
- Volume
- 57
- Category
- Article
- ISSN
- 1532-2882
No coin nor oath required. For personal study only.
โฆ Synopsis
Abstract
Current documentโretrieval tools succeed in locating large numbers of documents relevant to a given query. While search results may be relevant according to the topic of the documents, it is more difficult to identify which of the relevant documents are most suitable for a particular user. Automatic genre analysis (i.e., the ability to distinguish documents according to style) would be a useful tool for identifying documents that are most suitable for a particular user. We investigate the use of machine learning for automatic genre classification. We introduce the idea of domain transferโgenre classifiers should be reusable across multiple topicsโwhich does not arise in standard text classification. We investigate different features for building genre classifiers and their ability to transfer across multipleโtopic domains. We also show how different featureโsets can be used in conjunction with each other to improve performance and reduce the number of documents that need to be labeled.
๐ SIMILAR VOLUMES
Complete neutrino events are separated from the total sample with a discriminant analysis in a many-dimensional space described in the text. Two new powerful variables are found which discriminate with high efficiency and tag the event type as far as the completeness is concerned.