A multistrategy approach to classification learning in databases
โ Scribed by Chang-Hwan Lee; Dong-Guk Shin
- Publisher
- Elsevier Science
- Year
- 1999
- Tongue
- English
- Weight
- 478 KB
- Volume
- 31
- Category
- Article
- ISSN
- 0169-023X
No coin nor oath required. For personal study only.
โฆ Synopsis
This paper proposes a hybrid classiยฎcation learning system for databases that integrates rule induction and lazy learning. For rule induction learning, we use an entropy function based on Hellinger divergence to measure the amount of information each inductive rule contains. For lazy learning, we also use the Hellinger measure to automatically generate attribute weights and to compute similarities between data values of non-numeric data types. Our system has been implemented and tested extensively on a number of well-known machine learning data sets. The performance of our system was favorable compared to those of other well-known classiยฎcation learning techniques based on monostrategic learning methods.
๐ SIMILAR VOLUMES
In this paper, we propose a lazy learning strategy for building classification learning models. Instead of learning the models with the whole training data set before observing the new instance, a selection of patterns is made depending on the new query received and a classification model is learnt
## Abstract Database Selection is the problem of choosing, from a finite number of databases, the one that contains the most relevant information pertaining to a query. Previous approaches to this problem consisted of deterministic search techniques in conjunction with efficient pruning of search s
Data classification is a well-organized operation in the field of data mining. This article presents an application of the k-nearest neighbor classification technique for mining a fuzzy database. We consider a data set in which attribute values have certain similarities in nature and analyze the obs
A new definition of Fuzzy Relational Database is given by using logical concepts. With this definition it is possible to consider several kinds of fuzziness for the database attributes. A Domain Calculus-based query language is also established for the model. It allows us to formulate several types