
A preprocess algorithm of filtering irrelevant information based on the minimum class difference

✍ Scribed by Zhiping Chen; Kevin Lü


Publisher: Elsevier Science
Year: 2006
Tongue: English
Weight: 298 KB
Volume: 19
Category: Article
ISSN: 0950-7051


✦ Synopsis


Whether a word (or feature) should be included or excluded during text classification can depend on several factors, such as the amount of information it carries, its frequency of occurrence, and its meaning. The application context is another important factor. A word may characterize a document in one application context yet fail to reflect its nature in another. This paper reports on an investigation into feature selection for classification that takes into account the application context of the documents to be processed. A new feature selection algorithm for text classification, called the PBMCD algorithm, is proposed. The algorithm has been implemented and tested on three different data sets. The experimental results show that it can not only filter out irrelevant features before classification but also increase classification accuracy. Experimental results for other methods are also presented for comparison.


📜 SIMILAR VOLUMES


A new algorithm for a class of linear no
✍ B.Y. Wu; X.Y. Li 📂 Article 📅 2011 🏛 Elsevier Science 🌐 English ⚖ 203 KB

In this work, we present an algorithm for solving fourth-order multi-point boundary value problems (BVPs) based on the reproducing kernel method (RKM). In previous works, the RKM has been used to solve various two-point BVPs. However, it cannot be used directly to solve multi-point BVPs, since it is

Age Classes and Sex Differences in the S
✍ Giulia Mo; Alessandro Zotti; Sabrina Agnesi; Maria Grazia Finoia; Daniele Bernar 📂 Article 📅 2009 🏛 Wiley (John Wiley & Sons) 🌐 English ⚖ 862 KB

This study analyzes morphometrically 17 skulls of the Mediterranean monk seal __Monachus monachus__ housed in different Italian Museums and collections. We considered several morphometric variables (31 linear, 1 volumetric and 1 surface area measurements). In addition, we identified, me