A co-training algorithm for multi-view data with applications in data fusion
β Scribed by Mark Culp; George Michailidis
- Publisher
- John Wiley and Sons
- Year
- 2009
- Tongue
- English
- Weight
- 385 KB
- Volume
- 23
- Category
- Article
- ISSN
- 0886-9383
- DOI
- 10.1002/cem.1233
No coin nor oath required. For personal study only.
β¦ Synopsis
Abstract
In several scientific applications, data are generated from two or more diverse sources (views) with the goal of predicting an outcome of interest. Often it is the case that the outcome is not associated with any single view. However, the synergy of all measurements from each view may yield a more predictive classifier. For example, consider a drug discovery application in which individual molecules are described partially by several assay screens based on diverse profiles and partially by their chemical structural fingerprints. A common classification problem is to determine whether the molecule is associated with a particular disease. In this paper, a coβtraining algorithm is developed to utilize data from diverse sources to predict the common class variable. Novel enhancements for variable importance, robustness to a mislabeled class variable, and a technique to handle unbalanced classes are applied to the motivating data set, highlighting that the approach attains strong performance and provides useful diagnostics for data analytic purposes. In addition, comparisons to a framework with data fusion using partial least squares (PLS) are also assessed on real data. An R package for performing the proposed approach is provided as Supporting information. Copyright Β© 2003 John Wiley & Sons, Ltd.
π SIMILAR VOLUMES
Oral practice examinations (OPEs) are used in many anaesthesiology programmes to familiarize anaesthesiology residents with the format of the oral examination administered by the American Board of Anesthesiology. The OPE outcome ("nal grade) consists of &De"nite Not Pass', &Probable Not Pass', &Prob
Molybdenum(CO) 3 compounds / Conformational ensembles in solution / Modelling NOE contacts / MM2\* Force field calculations / Packing forces NMR-NOE analysis of the three compounds (RRS/SSR)-observed and calculated NOE distances is highly satisfactory in each case (rms = 0.2 A Λto 0.3 A Λ). By a st