Ranking and selecting terms for text cat
โ
Tien-Fang Kuo; Yasutoshi Yajima
๐
Article
๐
2009
๐
John Wiley and Sons
๐
English
โ 298 KB
The problem of natural language document categorization consists of classifying documents into predetermined categories based on their contents. Each distinct term, or word, in the documents is a feature for representing a document. In general, the number of terms may be extremely large and the doze