<p><em>Mining Very Large Databases with Parallel Processing</em> addresses the problem of large-scale data mining. It is an interdisciplinary text, describing advances in the integration of three computer science areas, namely `intelligent' (machine learning-based) data mining techniques, relational
Parallel data mining for very large relational databases
โ Scribed by Freitas A.A., Lavington S.H.
- Tongue
- English
- Leaves
- 6
- Category
- Library
No coin nor oath required. For personal study only.
โฆ Synopsis
Data mining, or Knowledge Discovery in Databases (KDD), is of little benefit to commercial enterprises unless it can be carried out efficiently on realistic volumes of data. Operational factors also dictate that KDD should be performed within the context of standard DBMS. Fortunately, relational DBMS have a declarative query interface (SQL) that has allowed designers of parallel hardware to exploit data parallelism efficiently. Thus, an effective approach to the problem of efficient KDD consists of arranging that KDD tasks execute on a parallel SQL server. In this paper we devise generic KDD primitives, map these to SQL and present some results of running these primitives on a commercially-available parallel SQL server.
๐ SIMILAR VOLUMES
With the unprecedented growth-rate at which data is being collected and stored electronically today in almost all fields of human endeavor, the efficient extraction of useful information from the data available is becoming an increasing scientific challenge and a massive economic need. This book pre
A data mining system, DBMiner, has been developed for interactive mining of multiple-level knowledge in large relational databases and data warehouses. The system implements a wide spectrum of data mining functions, including characterization, comparison, association, classification, prediction, and
This thin book presents eight academic papers discussing handling of sequences. I did not find any of them interesting on its own or good as a survey, but academics doing research in machine learning may disagree. If you are one, you most likely can get the original papers. If you are a practitioner
Adding the time dimension to real-world databases produces TimeSeries Databases (TSDB) and introduces new aspects and difficultiesto data mining and knowledge discovery. This book covers thestate-of-the-art methodology for mining time series databases. Thenovel data mining methods presented in the b
Adding the time dimension to real-world databases produces Time Series Databases (TSDB) and introduces new aspects and difficulties to data mining and knowledge discovery. This book covers the state-of-the-art methodology for mining time series databases. The novel data mining methods presented in t