We discuss a sample size calculation for a pair-matched case-control study to test for interaction between a speci"c exposure and a second risk factor. The second risk factor could be either binary or continuous. An algorithm for the calculation of sample size is suggested which is based on a logist
Sample Size Determination for Designing a Strata-Matched Case-Control Study to Detect Multiple Risk Factors
โ Scribed by Jun-Mo Nam
- Publisher
- John Wiley and Sons
- Year
- 1997
- Tongue
- English
- Weight
- 729 KB
- Volume
- 39
- Category
- Article
- ISSN
- 0323-3847
No coin nor oath required. For personal study only.
โฆ Synopsis
Investigations of sample size for planning casecontrol studies have usually been limited to detecting a single factor. In this paper, we investigate sample size for multiple risk factors in strata-matched casecontrol studies. We construct an omnibus statistic for testing M different risk factors based on the jointly sufficient statistics of parameters associated with the risk factors. The statistic is non-iterative, and it reduces to the Cochran statistic when M = 1. The asymptotic power function of the test is a noncentral chi-square with M degrees of free3om and the sample size required for a specific power can be obtained by the inverse relationship. We find that the equal sample allocation is optimum. A Monte Carlo experiment demonstrates that an approximate formula for calculating sample size is satisfactory in typical epidemiologic studies. An approximate sample size obtained using Bonfemni's method for multiple comparisons is much larger than that obtained using the omnibus test. Approximate sample size formulas investigated in this paper using the omnibus test, as well as the individual tests, can be useful in designing case-control studies for detecting multiple risk factors.
๐ SIMILAR VOLUMES
The analysis of case-control data using logistic regression involves making some critical model assumpt.ions. In the analysis of matched data for which the values of the matching variable(s) are unavailable, use of logistic regression presuppoees that odds ratios are homogeneous over the levels of t