𝔖 Bobbio Scriptorium
✦   LIBER   ✦

Estimation of the average correlation coefficient for stratified bivariate data

✍ Scribed by Linda M. Rubenstein; Charles S. Davis


Publisher
John Wiley and Sons
Year
1999
Tongue
English
Weight
109 KB
Volume
18
Category
Article
ISSN
0277-6715

No coin nor oath required. For personal study only.

✦ Synopsis


If the relationship between two ordered categorical variables X and Y is in uenced by a third categorical variable with K levels, the Cochran-Mantel-Haenszel (CMH) correlation statistic QC is a useful stratumadjusted summary statistic for testing the null hypothesis of no association between X and Y . Although motivated by and developed for the case of K I × J contingency tables, the correlation statistic QC is also applicable when X and Y are continuous variables. In this paper we derive a corresponding estimator of the average correlation coe cient for K I × J tables. We also study two estimates of the variance of the average correlation coe cient. The ÿrst is a restricted variance based on the variances of the observed cell frequencies under the null hypothesis of no association. The second is an unrestricted variance based on an asymptotic variance derived by Brown and Benedetti. The estimator of the average correlation coe cient works well in tables with balanced and unbalanced margins, for equal and unequal stratum-speciÿc sample sizes, when correlation coe cients are constant over strata, and when correlation coe cients vary across strata. When the correlation coe cients are zero, close to zero, or the cell frequencies are small, the conÿdence intervals based on the restricted variance are preferred. For larger correlations and larger cell frequencies, the unrestricted conÿdence intervals give superior performance.

We also apply the CMH statistic and proposed estimators to continuous non-normal data sampled from bivariate gamma distributions. We compare our methods to statistics for data sampled from normal distributions. The size and power of the CMH and normal theory statistics are comparable. When the stratum-speciÿc sample sizes are small and the distributions are skewed, the proposed estimator is superior to the normal theory estimator. When the correlation coe cient is zero or close to zero, the restricted conÿdence intervals provide the best performance. None of the conÿdence intervals studied provides acceptable performances across all correlation coe cients, sample sizes and non-normal distributions.


📜 SIMILAR VOLUMES


Comparison of geostatistical methods for
✍ Pardo-Igúzquiza, Eulogio 📂 Article 📅 1998 🏛 John Wiley and Sons 🌐 English ⚖ 354 KB 👁 2 views

The results of estimating the areal average climatological rainfall mean in the Guadalhorce river basin in southern Spain are presented in this paper. The classical Thiessen method and three different geostatistical approaches (ordinary kriging, cokriging and kriging with an external drift) have bee