𝔖 Bobbio Scriptorium
✦   LIBER   ✦

Predictive weighting for cluster ensembles

✍ Scribed by Christine Smyth; Danny Coomans


Publisher: John Wiley and Sons
Year: 2007
Tongue: English
Weight: 414 KB
Volume: 21
Category: Article
ISSN: 0886-9383


✦ Synopsis


Abstract

An ensemble of regression models predicts by taking a weighted average of the predictions made by individual models. Calculating the weights such that they reflect the accuracy of individual models (post processing the ensemble) has been shown to increase the ensemble's accuracy. However, post processing cluster ensembles has not received as much attention because of the inherent difficulty in assessing the accuracy of an individual cluster model. By enforcing the notion that clusters must be ‘predictable’, this paper suggests a means of implicitly post processing cluster ensembles by drawing analogies with regression post processing techniques. The product of the post processing procedure is an intelligently weighted co‐occurrence matrix. A new technique, similarity‐based k‐means (SBK), is developed to split this matrix into clusters. The results using three real‐life datasets underpinned by chemical and biological phenomena show that splitting an intelligently weighted co‐occurrence matrix gives accuracy that approaches that of supervised classification methods. Copyright © 2007 John Wiley & Sons, Ltd.
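
As a rough illustration of what a weighted co‐occurrence matrix is, the sketch below accumulates, for every pair of objects, the normalised weight of the ensemble members that place the pair in the same cluster. The abstract does not say how the paper derives its weights or how SBK splits the matrix, so none of that is reproduced here: the function name `weighted_cooccurrence`, the example labelings, and the weights are all hypothetical and chosen only for illustration.

```python
from typing import List, Sequence


def weighted_cooccurrence(
    labelings: Sequence[Sequence[int]],
    weights: Sequence[float],
) -> List[List[float]]:
    """Build a weighted co-occurrence matrix for a cluster ensemble.

    Entry (i, j) is the weight-normalised share of ensemble members
    that assign objects i and j to the same cluster.
    """
    if len(labelings) != len(weights):
        raise ValueError("need exactly one weight per labeling")
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must sum to a positive value")

    n = len(labelings[0])
    matrix = [[0.0] * n for _ in range(n)]
    for labels, w in zip(labelings, weights):
        if len(labels) != n:
            raise ValueError("all labelings must cover the same objects")
        for i in range(n):
            for j in range(n):
                if labels[i] == labels[j]:
                    matrix[i][j] += w / total
    return matrix


# Three hypothetical clusterings of four objects (cluster ids are arbitrary).
ensemble = [
    [0, 0, 1, 1],
    [0, 0, 0, 1],
    [1, 0, 1, 0],
]
# Hypothetical weights (assumption): a larger value means the member is trusted more.
weights = [0.5, 0.25, 0.25]

C = weighted_cooccurrence(ensemble, weights)
for row in C:
    print(row)
```

With equal weights this reduces to the ordinary (unweighted) co‐occurrence matrix; unequal weights let more trusted ensemble members contribute more to each pairwise similarity.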


📜 SIMILAR VOLUMES


Predicting structural models for silicon
✍ Carlos Renato Zacharias; Maurício Ruv Lemes; Arnaldo Dal Pino Júnior; David Sant 📂 Article 📅 2003 🏛 John Wiley and Sons 🌐 English ⚖ 167 KB

Abstract: This article introduces an efficient method to generate structural models for medium‐sized silicon clusters. Geometrical information obtained from previous investigations of small clusters is initially sorted and then introduced into our predictor algorithm in order to generate structur

Rainfall-runoff models using artificial
✍ Dae-Il Jeong; Young-Oh Kim 📂 Article 📅 2005 🏛 John Wiley and Sons 🌐 English ⚖ 865 KB

Abstract: Previous ensemble streamflow prediction (ESP) studies in Korea reported that modelling error significantly affects the accuracy of the ESP probabilistic winter and spring (i.e. dry season) forecasts, and thus suggested that improving the existing rainfall‐runoff model, TANK, would be cr