𝔖 Bobbio Scriptorium
✦   LIBER   ✦

Clustering for binary data and mixture models—choice of the model

✍ Scribed by Nadif, M. ;Govaert, G.


Publisher
John Wiley and Sons
Year
1997
Tongue
English
Weight
101 KB
Volume
13
Category
Article
ISSN
8755-0024

No coin nor oath required. For personal study only.

✦ Synopsis


When cluster analysis is based on mixture models, choosing an appropriate model is a difficult problem. Previous studies usually addressed a part of this problem by estimating the number of clusters and assuming the type of model to be known. Various criteria to be minimized have been proposed to measure a model's suitability by balancing model fit and model complexity. In this work, we extend the work of and to the use of some of these information criteria in the detection of the type of Bernoulli mixture model while assuming that the number of clusters is known. We simulated samples with various underlying types of model and separations of components using Monte Carlo simulations. These simulations show the advantages and the weaknesses of the considered information criteria with a view to determining the type of model. In addition, they underline the importance of a judicious choice of model type in order to obtain a good clustering.


📜 SIMILAR VOLUMES


An exponential family model for clustere
✍ Geert Molenberghs; Louise M. Ryan 📂 Article 📅 1999 🏛 John Wiley and Sons 🌐 English ⚖ 217 KB 👁 1 views

This paper focuses on the analysis of clustered multivariate binary data that arise from developmental toxicity studies. In these studies, pregnant mice are exposed to chemicals to assess possible adverse eects on developing fetuses. Multivariate binary outcomes arise when each fetus in a litter is

Modelling the EuroQol data: a comparison
✍ Zafar Hakim; Dev S. Pathak 📂 Article 📅 1999 🏛 John Wiley and Sons 🌐 English ⚖ 98 KB 👁 2 views

This article compares two measurement strategies for measuring EuroQol health state preferences: (a) conditional preference modelling, implemented using rating scale and standard gamble scaling methods and (b) discrete choice conjoint modelling. The nature of the model form of the EuroQol health sta

On the Multivariate Probit Model for Exc
✍ Catalina Stefanescu; Bruce W. Turnbull 📂 Article 📅 2005 🏛 John Wiley and Sons 🌐 English ⚖ 153 KB

This paper considers the use of a multivariate binomial probit model for the analysis of correlated exchangeable binary data. The model can naturally accommodate both cluster and individual level covariates, while keeping a fairly flexible intracluster association structure. We discuss Bayesian esti