DB-HReduction: A data preprocessing algorithm for data mining applications

✍ Scribed by Xiaohua Hu

Publisher: Elsevier Science
Year: 2003
Tongue: English
Weight: 574 KB
Volume: 16
Category: Article
ISSN: 0893-9659
DOI: 10.1016/s0893-9659(03)90013-9

✦ Synopsis

Data preprocessing is an important and critical step in the data mining process, and it has a huge impact on the success of a data mining project.

In this paper, we present an algorithm, DB-HReduction, which discretizes or eliminates numeric attributes and generalizes or eliminates symbolic attributes very efficiently and effectively. This algorithm greatly decreases the number of attributes and tuples of the data set, improves the accuracy of the data mining algorithms in the later stage, and decreases their running time.
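The synopsis does not give the procedure itself, so the following is only a generic sketch of the kinds of operations it names, not the paper's algorithm. It assumes equal-width binning for numeric attributes, a hand-written concept hierarchy for symbolic attributes, and merging of tuples that become identical after those steps. The names `equal_width_bin`, `reduce_table`, and `CITY_TO_REGION`, and the sample rows, are hypothetical.

```python
from collections import Counter

# Hypothetical concept hierarchy: specific symbolic value -> broader concept.
CITY_TO_REGION = {
    "Boston": "East", "New York": "East",
    "Seattle": "West", "Portland": "West",
}


def equal_width_bin(value, low, high, bins):
    """Map a numeric value to a bin index in [0, bins - 1] (equal-width intervals)."""
    if high <= low:
        return 0
    index = int((value - low) / (high - low) * bins)
    return min(max(index, 0), bins - 1)


def reduce_table(rows, numeric_bins, hierarchies, drop=()):
    """Discretize, generalize, drop attributes, then merge identical tuples.

    rows:         list of dicts, one per tuple
    numeric_bins: {attribute: number of bins} for numeric attributes
    hierarchies:  {attribute: mapping} for symbolic attributes
    drop:         attributes to eliminate outright
    Returns a Counter mapping each reduced tuple to how many rows it covers.
    """
    if not rows:
        return Counter()
    # Observed value range of each numeric attribute, used for binning.
    ranges = {
        attr: (min(r[attr] for r in rows), max(r[attr] for r in rows))
        for attr in numeric_bins
    }
    reduced = Counter()
    for row in rows:
        out = []
        for attr in sorted(row):
            if attr in drop:
                continue  # attribute eliminated
            value = row[attr]
            if attr in numeric_bins:
                low, high = ranges[attr]
                value = equal_width_bin(value, low, high, numeric_bins[attr])
            elif attr in hierarchies:
                value = hierarchies[attr].get(value, value)
            out.append((attr, value))
        reduced[tuple(out)] += 1  # identical reduced tuples collapse into one
    return reduced


rows = [
    {"id": 1, "age": 23, "city": "Boston"},
    {"id": 2, "age": 25, "city": "New York"},
    {"id": 3, "age": 61, "city": "Seattle"},
    {"id": 4, "age": 64, "city": "Portland"},
]
result = reduce_table(rows, {"age": 2}, {"city": CITY_TO_REGION}, drop=("id",))
for reduced_tuple, count in result.items():
    print(reduced_tuple, count)
```

In this toy input, four tuples over three attributes collapse into two tuples over two attributes, which is the sort of reduction in attributes and tuples the synopsis describes.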

📜 SIMILAR VOLUMES

MS-Analyzer: preprocessing and data mining services for proteomics applications on the Grid

✍ Mario Cannataro; Pierangelo Veltri 📂 Article 📅 2007 🏛 John Wiley and Sons 🌐 English ⚖ 425 KB

A co-training algorithm for multi-view data with applications in data fusion

✍ Mark Culp; George Michailidis 📂 Article 📅 2009 🏛 John Wiley and Sons 🌐 English ⚖ 385 KB

In several scientific applications, data are generated from two or more diverse sources (views) with the goal of predicting an outcome of interest. Often it is the case that the outcome is not associated with any single view. However, the synergy of all measurements from each view may y

LICRA: A replicated-data management algorithm for distributed synchronous groupware applications

✍ Rushed Kanawati 📂 Article 📅 1997 🏛 Elsevier Science 🌐 English ⚖ 1023 KB

Replicated data consistency is a key issue in the design of distributed real time groupware applications. In this paper, we propose a new protocol to cope with this problem. The proposed algorithm guarantees an optimal response time while ensuring data consistency at system quiescence. The originali

A data structure for bicategories, with application to speeding up an approximation algorithm

✍ Philip N. Klein 📂 Article 📅 1994 🏛 Elsevier Science 🌐 English ⚖ 521 KB

A direct LDA algorithm for high-dimensional data — with application to face recognition

✍ Hua Yu; Jie Yang 📂 Article 📅 2001 🏛 Elsevier Science 🌐 English ⚖ 80 KB

An architecture and a dynamic scheduling algorithm of grid for providing security for real-time data-intensive applications

✍ Mohd Rafiqul Islam; Mohd Toufiq Hasan; G. M. Ashaduzzaman 📂 Article 📅 2011 🏛 John Wiley and Sons 🌐 English ⚖ 261 KB