๐”– Bobbio Scriptorium
โœฆ   LIBER   โœฆ

[IEEE 2012 IEEE/ACIS 11th International Conference on Computer and Information Science (ICIS) - Shanghai (2012.05.30-2012.06.1)] 2012 IEEE/ACIS 11th International Conference on Computer and Information Science - On the Use of Data Mining Tools for Data Preparation in Classification Problems

โœ Scribed by Goncalves, P. M.; Barros, R. S. M.; Vieira, D. C. L.


Book ID
118053948
Publisher
IEEE
Year
2012
Weight
522 KB
Category
Article
ISBN
1467315362

No coin nor oath required. For personal study only.

โœฆ Synopsis


The data preparation phase is a critical step in the KDD (Knowledge Discovery in Databases) process. This phase is crucial for a good data mining result because if data is not correctly prepared, all the next phases of the process are compromised. DMPML is a framework that stores preprocessed data for different data mining algorithms in an XML document and retrieves the correct codification by the use of an XSLT document according to the needs of the data mining algorithm. This paper presents a comparison between DMPML and three data mining applications (Weka, RapidMiner, and KNIME) that implement the directed graph approach, concerning the time spent to create and execute the data preparation tasks for two data mining algorithms. The tests were executed using different types of data sets: numerical, categorical, and mixed. We observed that the scheme used by DMPML can simplify the usage of different data mining algorithms and significantly reduce the time spent creating the data preparation tasks.


๐Ÿ“œ SIMILAR VOLUMES