Logic classification and feature selection for biomedical data
โ Scribed by P. Bertolazzi; G. Felici; P. Festa; G. Lancia
- Publisher
- Elsevier Science
- Year
- 2008
- Tongue
- English
- Weight
- 314 KB
- Volume
- 55
- Category
- Article
- ISSN
- 0898-1221
No coin nor oath required. For personal study only.
โฆ Synopsis
In this paper we investigate logic classification and related feature selection algorithms for large biomedical data sets. When the data is in binary/logic form, the feature selection problem can be formulated as a Set Covering problem of very large dimensions, whose solution is computationally challenging. We propose an alternative approximated formulation for feature selection that results in an extension of Set Covering of compact size, and use the logic classifier Lsquare to test its performances on two wellknown data sets. An ad hoc metaheuristic of the GRASP type is used to solve efficiently the feature selection problem. A simple and effective method to convert rational data into logic data by interval mapping is also described. The computational results obtained are promising and the use of logic models, that can be easily understood and integrated with other domain knowledge, is one of the major strengths of this approach.
๐ SIMILAR VOLUMES
The automated analysis of patients' biomedical data can be used to derive diagnostic and prognostic inferences about the observed patients. Many noninvasive techniques for acquiring biomedical samples generate data that are characterized by a large number of distinct attributes (i.e., features) and
A pattern classification problem usually involves using high-dimensional features that make the classifier very complex and difficult to train. With no feature reduction, both training accuracy and generalization capability will suffer. This paper proposes a novel hybrid filter-wrapper-type feature
Feature selection is a crucial step in pattern recognition. Most feature selection algorithms reported are developed for continuous features. In this paper, we propose a feature selection algorithm for mixed-typed data containing both continuous and nominal features. The algorithm consists of a nove