
Testing for heterogeneity among the components of a binary composite outcome in a clinical trial.
MedLine Citation:
PMID:  20529275     Owner:  NLM     Status:  MEDLINE    
BACKGROUND: Investigators designing clinical trials often use composite outcomes to overcome many statistical issues. Trialists want to maximize power to show a statistically significant treatment effect and avoid inflation of Type I error rate due to evaluation of multiple individual clinical outcomes. However, if the treatment effect is not similar among the components of this composite outcome, we are left not knowing how to interpret the treatment effect on the composite itself. Given significant heterogeneity among these components, a composite outcome may be judged as being invalid or un-interpretable for estimation of the treatment effect. This paper compares the power of different tests to detect heterogeneity of treatment effect across components of a composite binary outcome.
METHODS: Simulations were done comparing four different models commonly used to analyze correlated binary data. These models included: logistic regression ignoring correlation, logistic regression weighted by the intra cluster correlation coefficient, population average logistic regression using generalized estimating equations (GEE), and random effects logistic regression.
RESULTS: We found that the population average model based on generalized estimating equations (GEE) had the greatest power across most scenarios. Adequate power to detect possible composite heterogeneity or variation between treatment effects of individual components of a composite outcome was seen when the power for detecting the main study treatment effect for the composite outcome was also reasonably high.
CONCLUSIONS: It is recommended that authors report tests of composite heterogeneity for composite outcomes and that this accompany the publication of the statistically significant results of the main effect on the composite along with individual components of composite outcomes.
Janice Pogue; Lehana Thabane; P J Devereaux; Salim Yusuf
Publication Detail:
Type:  Clinical Trial; Journal Article     Date:  2010-06-07
Journal Detail:
Title:  BMC medical research methodology     Volume:  10     ISSN:  1471-2288     ISO Abbreviation:  BMC Med Res Methodol     Publication Date:  2010  
Date Detail:
Created Date:  2010-07-26     Completed Date:  2011-03-29     Revised Date:  2013-05-29    
Medline Journal Info:
Nlm Unique ID:  100968545     Medline TA:  BMC Med Res Methodol     Country:  England    
Other Details:
Languages:  eng     Pagination:  49     Citation Subset:  IM    
Department of Clinical Epidemiology and Biostatistics, McMaster University, Hamilton, Ontario, Canada.
MeSH Terms
Clinical Trials as Topic / statistics & numerical data
Data Interpretation, Statistical*
Logistic Models*
Outcome Assessment (Health Care) / statistics & numerical data*

From MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine

Full Text
Journal Information
Journal ID (nlm-ta): BMC Med Res Methodol
ISSN: 1471-2288
Publisher: BioMed Central
Article Information
Copyright ©2010 Pogue et al; licensee BioMed Central Ltd.
Received: 29 October 2009
Accepted: 7 June 2010
Collection publication date: 2010
Electronic publication date: 7 June 2010
Volume: 10; First Page: 49; Last Page: 49
Publisher Id: 1471-2288-10-49
PubMed Id: 20529275
DOI: 10.1186/1471-2288-10-49

Testing for heterogeneity among the components of a binary composite outcome in a clinical trial
Janice Pogue (1,2)
Lehana Thabane (1)
PJ Devereaux (1,2)
Salim Yusuf (1,2)
(1) Department of Clinical Epidemiology and Biostatistics, McMaster University, Hamilton, Ontario, Canada
(2) Faculty of Health Sciences, McMaster University, Hamilton, Ontario, Canada


Background

Composite outcomes can often be difficult to interpret, especially when the treatment effects on some of their components individually show differences in magnitude or even in direction. For example, in a trial of localized intracoronary gamma-radiation therapy versus placebo [1] the primary composite outcome of death, myocardial infarction, or revascularization of target lesion showed an overall benefit of gamma-radiation compared to placebo (24.4% vs. 42.1%, p = 0.02); however, myocardial infarction individually had a non-significant effect in the opposite direction (9.9% vs. 4.1%, p = 0.09). Many authors have expressed concerns regarding interpretation of a treatment effect for a composite outcome when it appears that there is heterogeneity in the treatment effect across the composite components [2-4]. How then can we best determine the existence of important composite heterogeneity in treatment effect among the individual components of a composite outcome?

A composite outcome is defined as having occurred if one of a group of outcomes occurs. The main treatment effect is defined as the absolute or relative difference between treatment and control in the proportions of participants who have at least one component of the composite. The problems with interpreting composite outcomes are well known. The treatment effect observed on the components may go in opposite directions and reduce the power of the trial [5,6]. The components may not have similar importance or frequency to one another [2-4,7]. These issues make composite outcomes difficult to interpret in many trials.
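
The definition above can be sketched in a few lines. This is only an illustration: the component names and the eight participants are hypothetical, not trial data. It shows the composite indicator (at least one component occurred) and the absolute and relative treatment effects on the composite.

```python
from typing import Dict, List


def composite_event(components: Dict[str, int]) -> int:
    """Return 1 if at least one component outcome occurred, else 0."""
    return int(any(components.values()))


def composite_proportion(participants: List[Dict[str, int]]) -> float:
    """Proportion of participants with at least one component outcome."""
    return sum(composite_event(p) for p in participants) / len(participants)


# Hypothetical participants (illustration only, not data from any trial).
treatment = [
    {"mi": 0, "stroke": 0, "cv_death": 0},
    {"mi": 1, "stroke": 0, "cv_death": 0},
    {"mi": 0, "stroke": 0, "cv_death": 0},
    {"mi": 0, "stroke": 1, "cv_death": 1},  # two components, one composite event
]
control = [
    {"mi": 1, "stroke": 0, "cv_death": 0},
    {"mi": 0, "stroke": 1, "cv_death": 0},
    {"mi": 0, "stroke": 0, "cv_death": 0},
    {"mi": 1, "stroke": 0, "cv_death": 1},
]

p_trt = composite_proportion(treatment)
p_ctl = composite_proportion(control)
absolute_difference = p_trt - p_ctl  # absolute treatment effect
relative_risk = p_trt / p_ctl        # relative treatment effect
```

Note that a participant with several component events still counts once toward the composite, which is why the component-level data carry information the composite alone does not.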

Despite difficulties with interpretation, trialists are unlikely to abandon composite outcomes. Trials in cardiovascular disease commonly use composite endpoints as their primary outcome [8] and there are efforts in many other areas of research to follow suit. Many authors have expressed the need to use composite outcomes to increase the feasibility of conducting clinical trials research in their areas including: cardiology [9,10], HIV/AIDS [11], organ transplantation [12], psychiatric disorders [13], adverse event reporting [14], and obstetrics and gynecology [15]. The reasons for use of composite outcomes are well documented and include: reduced sample size due to increased outcome rates, the ability to answer important questions quickly, capturing the multi-dimensional nature of disease, seeking a better understanding of total disease burden, the inability to select the most important of many outcomes, concerns with multiplicity for testing many outcomes, and addressing competing risks.

Various approaches have been suggested for the analysis and interpretation of composite outcomes. For example, a multivariate global test across all the components could be used to look for simultaneous demonstrated benefit; but readers may find it difficult to interpret such a result [16,17]. Alternatively, if the composite shows a statistically significant treatment effect, the component specific tests can be performed using a closed test procedure. Many authors recommend that each component of the composite should be defined as a secondary outcome for the trial [6]. However, it is doubtful that there would be sufficient power to detect effects on the individual components for the very reason that the composite outcome was chosen (i.e. there are too few events for each outcome). Individual tests on each component would also inflate the overall Type I error rate for the study. Berger [18] has suggested the use of information-preserving composite endpoints and the use of omnibus test functions. However, trialists have rarely utilized this procedure. Finally, another method would involve analysis of the weighted components of the composite. Although many different weighting schemes have been suggested [6,9,19,20], these methods are not in common use by trialists [5]. Further, weighting systems can introduce their own set of problems with interpretation, due to the perceived subjectivity of the weights.

Composites may be used either under the assumption of homogeneity of treatment effect across components or to summarize a risk-benefit profile of an intervention. In this manuscript we address the former use, where the best knowledge of the disease being studied points to a likely similarity of treatment effect on all component outcomes, based on known physiological pathways and theoretical models. While the treatment effect is assumed to be similar across each of the components in terms of direction, it is recognized that the magnitude may differ [2,5]. Many authors recommend reviewing suspected treatment homogeneity through visual inspection of the direction of relative risk estimates for individual components of the composite in a trial [2,7]. However, it is possible to test for heterogeneity of these treatment effects across components directly using standard methods for correlated binary data. If significant heterogeneity is found then the composite outcome may be invalidated or inappropriate for use. If not, we may have more confidence in the composite outcome, viewing it as meaningful, interpretable to represent treatment effect as a whole, and likely free from evidence of heterogeneity. However, tests for heterogeneity have been shown to lack power in meta-analyses and subgroup analyses [21]. The purpose of this paper is to compare the power of different tests to detect heterogeneity of treatment effect across components of composite binary outcomes. We then explore the usefulness of such tests for detecting composite heterogeneity when the power is high for the treatment comparison on the composite outcome as a whole.

Methods

A. Methods for analysis of correlated binary outcomes

Participants in a trial who are followed beyond their first outcome may experience more than one component of the composite primary outcome. For example, for a trial with the primary outcome of myocardial infarction, stroke or cardiovascular death, a participant may experience a stroke and then die a cardiovascular death. Thus there is a repeated measurement of the different component outcomes for each individual. This binary data then has an intra cluster correlation due to repeated outcomes on the same individuals.

All models used contain parameters that estimate the treatment effect, the specific individual outcome component in the composite outcome, and the interaction of these two factors. These are presented for the jth treatment group, the kth component of the composite outcome, and the ith participant in the trial. The test of the interaction term will allow detection of possible heterogeneity or difference in the study treatment effect across the composite components.

The following models will be studied using SAS 9.1 [22] as presented in Shoukri and Chaudhary [23]:

Model 1 Logistic regression ignoring correlation

It is possible that the intra cluster correlation seen among outcomes in typical cardiovascular trials is too small to make a difference to this analysis of composite homogeneity. We will fit a simple logistic regression to test this hypothesis (implemented in SAS using proc logistic [22]). The model fit will be: Logit(yijk) = β0 + β1x1 + β2x2 + β3x3 + εijk

Here yijk is a binary response representing whether an event (i.e. one of the components of a composite outcome) has occurred (coded 1) or not (coded 0). The fixed factors for all participants are the intercept β0, treatment effect β1, composite outcome component β2, and interaction of treatment and outcome β3. With more than two component outcomes to the composite, there would be additional regression coefficients for each additional component and an additional term for its interaction with treatment. The error term εijk here does not take into account the correlation of composite outcome components within each individual. Therefore, the fitted regression coefficients are:

For example, the following matrices display the outcome status (Y) and independent variables (X) for the first two participants in our simulation. Since our composite outcome has two components, the vector Y has two rows for each participant, with the first row containing the outcome status (0,1) for the first component and the second row the outcome status for the second component. Both of the following participants have experienced a composite outcome. Participant 1 experienced both components of the composite outcome and participant 2 experienced only the second component.
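
As a hedged illustration of this layout, the sketch below builds Y and X for two participants with the outcome pattern just described. The treatment assignments (participant 1 treated, participant 2 control) and the 0/1 coding of the component indicator x2 are assumptions made only for this example; the helper `to_long_format` is hypothetical.

```python
from typing import List, Tuple

# (participant, y, intercept, x1 = treatment, x2 = component, x3 = x1 * x2)
Row = Tuple[int, int, int, int, int, int]


def to_long_format(participant: int, treated: int, components: List[int]) -> List[Row]:
    """One row per participant-component pair, with the interaction column."""
    rows = []
    for k, y in enumerate(components):
        x1 = treated   # treatment indicator (assumed 1 = treatment, 0 = control)
        x2 = k         # component indicator (assumed 0 = first, 1 = second)
        x3 = x1 * x2   # treatment-by-component interaction
        rows.append((participant, y, 1, x1, x2, x3))
    return rows


# Participant 1: both components occurred; participant 2: only the second.
# Treatment assignments here are hypothetical.
rows = to_long_format(1, 1, [1, 1]) + to_long_format(2, 0, [0, 1])

Y = [r[1] for r in rows]         # outcome vector, two rows per participant
X = [list(r[2:]) for r in rows]  # columns: intercept, x1, x2, x3
```

Each participant contributes two rows, which is the source of the within-participant correlation that Models 2 to 4 try to accommodate.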

For this and all subsequent models, the test for heterogeneity will test whether β3 is significantly different from zero at the p < 0.05 level.
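
For the independence model with two components, the interaction test can be sketched without fitting software: in a saturated logistic model with treatment, component and their interaction, the estimate of β3 equals the difference between the two component log odds ratios, and its model-based variance is the sum of the reciprocal cell counts. The code below is a minimal sketch under that assumption. The counts are hypothetical, and, like Model 1, it ignores the correlation between components within a participant.

```python
import math
from typing import Tuple

# (events in treatment, n in treatment, events in control, n in control)
Counts = Tuple[int, int, int, int]


def log_odds_ratio(events_trt: int, n_trt: int, events_ctl: int, n_ctl: int) -> Tuple[float, float]:
    """Log odds ratio (treatment vs control) and its large-sample variance."""
    a, b = events_trt, n_trt - events_trt
    c, d = events_ctl, n_ctl - events_ctl
    return math.log((a * d) / (b * c)), 1 / a + 1 / b + 1 / c + 1 / d


def interaction_wald_test(component1: Counts, component2: Counts) -> Tuple[float, float, float, float]:
    """Wald test of beta3 = log(OR2) - log(OR1), treating components as independent."""
    log_or1, var1 = log_odds_ratio(*component1)
    log_or2, var2 = log_odds_ratio(*component2)
    beta3 = log_or2 - log_or1
    se = math.sqrt(var1 + var2)
    z = beta3 / se
    p_value = math.erfc(abs(z) / math.sqrt(2))  # two-sided normal p-value
    return beta3, se, z, p_value


# Hypothetical counts: component 1 shows benefit, component 2 is neutral.
beta3, se, z, p_value = interaction_wald_test((200, 1000, 280, 1000), (250, 1000, 250, 1000))
heterogeneity_detected = p_value < 0.05
```

The correlated-data models that follow change how the standard error is obtained, not the hypothesis being tested.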

Model 2 Weighted logistic regression

Simple methods for the analysis of binary correlated data have been suggested using weighted logistic regression. Donald and Donner [24] proposed a weighting based directly on the intra cluster correlation (ρ) calculated for the trial overall and Rao and Scott [25] base the weights on the variance inflation factor (υ) estimated per treatment group (proc logistic [22] with weights ρ or υ). Note that a single weight may not be appropriate with more than two components to the composite outcome. The fitted regression coefficients are:
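
One common form of correlation-based down-weighting uses the design effect 1 + (m - 1)ρ, where m is the number of observations per cluster. The sketch below assumes that form purely for illustration; it is not taken from the source and may differ in detail from the weights of Donald and Donner [24] or Rao and Scott [25]. Here m = 2 (two components per participant) and ρ = 0.10, the correlation used in the simulations.

```python
def design_effect(icc: float, cluster_size: int) -> float:
    """Variance inflation for clusters of equal size (assumed form: 1 + (m - 1) * rho)."""
    return 1.0 + (cluster_size - 1) * icc


def observation_weight(icc: float, cluster_size: int) -> float:
    """Weight applied to each observation so that the effective sample size shrinks."""
    return 1.0 / design_effect(icc, cluster_size)


rho = 0.10              # intra cluster correlation between the two components
m = 2                   # observations (components) per participant
n_observations = 2000 * m

deff = design_effect(rho, m)
weight = observation_weight(rho, m)
effective_n = n_observations * weight  # effective number of independent observations
```

With a correlation this small the weights are close to 1, which is consistent with the weighted and unweighted analyses giving broadly similar results.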

Model 3 Population average logistic models (GEE)

Here treatment and outcome component effects are estimated at the margin by averaging across individuals. The generalized estimating equations (GEE) method will be used, which treats the correlation among outcomes within an individual as a nuisance factor. This correlation is modeled through a working correlation matrix, and adjustments for misspecification are made using the sandwich variance formula [26]. The covariance matrix will be unstructured to allow for different variances for each composite component (proc genmod [22]). The model is: Logit(μijk) = β0* + β1*x1 + β2*x2 + β3*x3, where μijk = E(yijk) is the marginal expectation and the β*'s estimate the population average response parameters.

Model 4 Random effects logistic models

This model incorporates a term for the individual in the analysis and allows the intercept to vary across individuals. Individuals are considered to be randomly selected from a population that has a normally distributed intercept component [27]. The model is

Logit(E[yijk|γi]) = β0 + β1x1 + β2x2 + β3x3 + γi + ϕijk, where γi is the random effect of the ith participant, with composite outcome components clustered within the individual, and ϕijk is the error term (proc glimmix [22]). The covariance matrix will be unstructured, or determined by the random effect.

B. Simulation data

The purpose of this simulation was to examine the power to detect heterogeneity among the components of a composite outcome for a well-designed trial. We began with a study design that had good power to detect a modestly estimated main treatment effect on the odds ratio (OR). Such a design was chosen since it is unlikely that a composite outcome heterogeneity test would be performed if the main treatment effect were not statistically significant. The total study sample size was 2000 for a two-arm trial with equal allocation to each treatment group, and a 50% composite outcome event rate in the control group. Power was calculated using a continuity corrected chi-square test of equal proportions with a two-sided type I error rate of 0.05. There was 88% power to detect a 25% reduction in the OR and 97% power to detect a 30% reduction in the OR. A composite with two components was simulated with a correlation between the two components of ρ = 0.10 (estimated using cardiovascular outcomes from the HOPE trial [28], unpublished data). Simulations were run with 10,000 iterations and we recorded both the power for the test of treatment effect on the composite outcome and the power for the test of heterogeneity of treatment across the composite components for each model. We examined the power for these tests by varying the following:
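
The design power can be approximated with a normal-approximation formula for comparing two proportions with a continuity correction. This is a sketch under stated assumptions: the exact formula or software routine used for the design is not given in the text, so the function below is one standard approximation and its results need not match the reported 88% and 97% to the last digit.

```python
import math
from statistics import NormalDist


def treated_probability(p_control: float, odds_ratio: float) -> float:
    """Event probability in the treatment arm implied by the control rate and the OR."""
    odds = odds_ratio * p_control / (1.0 - p_control)
    return odds / (1.0 + odds)


def power_two_proportions(p_control: float, odds_ratio: float, n_per_group: int,
                          alpha: float = 0.05, continuity: bool = True) -> float:
    """Approximate power of a two-sided test of equal proportions (equal group sizes)."""
    p_trt = treated_probability(p_control, odds_ratio)
    diff = abs(p_control - p_trt)
    if continuity:
        diff -= 1.0 / n_per_group  # continuity correction: 0.5 * (1/n + 1/n)
    p_bar = (p_control + p_trt) / 2.0
    se_null = math.sqrt(2.0 * p_bar * (1.0 - p_bar) / n_per_group)
    se_alt = math.sqrt((p_control * (1.0 - p_control) + p_trt * (1.0 - p_trt)) / n_per_group)
    z_alpha = NormalDist().inv_cdf(1.0 - alpha / 2.0)
    return NormalDist().cdf((diff - z_alpha * se_null) / se_alt)


# Design described above: 1000 per arm, 50% control event rate.
power_or_075 = power_two_proportions(0.50, 0.75, 1000)  # 25% reduction in the OR
power_or_070 = power_two_proportions(0.50, 0.70, 1000)  # 30% reduction in the OR
```

Under these assumptions both values land close to the powers quoted for the design.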

a) Degree of treatment heterogeneity of the composite components: The odds ratio of the first component (OR1) was kept constant, while the second component odds ratio (OR2) was varied to simulate composite heterogeneity. Low heterogeneity is demonstrated by both OR's showing the same direction of treatment effect, moderate is indicated by a neutral effect in one component, and large is seen where the OR's have opposite patterns of risk.

b) Balance of the components: Simulations included cases where the components occurred equally (1:1) or unequally. For the unequal case, the composite outcome contained one component that occurred three or five times more often than the other.

Multivariate binary correlated data was generated using the method described in Park et al. [29]. Sums of independent Poisson random variables were generated which share components such that the resulting sums are multiple correlated Poisson variables. Indicator functions were used to transform these variables into correlated binary data with the desired correlational structure.
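
A minimal two-variable sketch of this kind of generator is given below. It assumes the construction in which each binary outcome is the indicator that a Poisson sum equals zero, with the shared Poisson term having rate log(1 + ρ·sqrt((1 - p1)(1 - p2)/(p1·p2))). Because only "zero or not" matters, the code draws the zero indicators directly instead of full Poisson counts; that shortcut, the event probabilities of 0.3, and the seed are choices made for this illustration.

```python
import math
import random
from typing import Tuple


def shared_rate(p1: float, p2: float, rho: float) -> float:
    """Rate of the Poisson term shared by both outcomes (requires rho >= 0)."""
    return math.log(1.0 + rho * math.sqrt((1 - p1) * (1 - p2) / (p1 * p2)))


def correlated_binary_pair(p1: float, p2: float, rho: float, rng: random.Random) -> Tuple[int, int]:
    """Draw (z1, z2) with P(z1=1)=p1, P(z2=1)=p2 and correlation rho."""
    a12 = shared_rate(p1, p2, rho)
    a1 = -math.log(p1) - a12  # rate of the term unique to outcome 1
    a2 = -math.log(p2) - a12  # rate of the term unique to outcome 2
    if rho < 0 or min(a1, a2) < 0:
        raise ValueError("requested correlation is not attainable with this method")
    # A Poisson(a) variable equals zero with probability exp(-a).
    shared_zero = rng.random() < math.exp(-a12)
    own1_zero = rng.random() < math.exp(-a1)
    own2_zero = rng.random() < math.exp(-a2)
    # Each outcome is the indicator that its Poisson sum is zero.
    return int(shared_zero and own1_zero), int(shared_zero and own2_zero)


rng = random.Random(2010)
n = 100_000
pairs = [correlated_binary_pair(0.3, 0.3, 0.10, rng) for _ in range(n)]

mean1 = sum(z1 for z1, _ in pairs) / n
mean2 = sum(z2 for _, z2 in pairs) / n
mean12 = sum(z1 * z2 for z1, z2 in pairs) / n
corr = (mean12 - mean1 * mean2) / math.sqrt(mean1 * (1 - mean1) * mean2 * (1 - mean2))
```

The empirical means and correlation should sit close to the targets, which is a useful sanity check before running a full power simulation.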


Results

As expected, the power to detect heterogeneity among the composite outcome components increased as the difference between the two component odds ratios became larger (see Table 1 and Figure 1). The population average logistic regression had the greatest power across all levels of composite heterogeneity. The next largest power was seen in both the independent and random effects logistic regressions. Lastly, the weighted logistic regression displayed the least power for this test. It should also be noted that the population average model had a type I error rate of 0.053 for the case of no composite heterogeneity, slightly exceeding the nominal level of 0.05.

When imbalance existed between the frequencies of the two components, the power to demonstrate heterogeneity decreased as this imbalance increased (see Table 2). This power was greater when the component displaying moderate treatment heterogeneity was also the less frequent of the two components. Note again that the population average logistic model had the greatest power, except for the single case of a 1:5 imbalance, where the component with the larger OR was the more frequent. For this case only, the weighted logistic regression had the greatest power and the population average logistic regression had the second greatest power.

Table 3 and Figure 2 show the relationship between power for the test of treatment on the composite outcome as a whole and power to detect treatment heterogeneity among its components, using the population average model. Both the effect size of the composite outcome and the degree of composite heterogeneity are varied to show the relationship in power for both tests. The region in bold for this table indicates the conditions when both tests show greater than 50% power, over various combinations of the two odds ratios for each component. This is illustrated in Figure 2, where the region between the vertical dotted lines indicates the range where the test of the composite outcome and the test for composite heterogeneity both have 50% power or greater. When the odds ratio for the most effective component is 0.75, this region is the narrowest.
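
As a rough check of this region, the values reported in Table 3 can be scanned for the OR2 values at which both powers are at least 50%. The sketch below transcribes the Table 3 entries (the entry reported as ">99.9" is stored as 99.9); the function name and the data layout are illustrative choices.

```python
from typing import Dict, List, Tuple

# TABLE3[OR1][OR2] = (power for treatment effect, power for heterogeneity test), in %.
TABLE3: Dict[float, Dict[float, Tuple[float, float]]] = {
    0.65: {0.65: (99.9, 5.3), 0.70: (99.9, 8.1), 0.75: (99.6, 17.9), 0.80: (98.2, 33.4),
           0.85: (95.5, 51.1), 0.90: (90.7, 67.8), 0.95: (82.2, 80.7), 1.00: (70.7, 89.9),
           1.05: (57.7, 95.0), 1.10: (44.6, 97.8), 1.15: (31.3, 99.0), 1.20: (21.5, 99.7)},
    0.70: {0.70: (99.4, 5.0), 0.75: (98.2, 8.3), 0.80: (95.7, 16.7), 0.85: (89.5, 30.5),
           0.90: (81.6, 44.1), 0.95: (70.4, 63.5), 1.00: (57.8, 78.8), 1.05: (42.9, 86.3),
           1.10: (30.2, 92.8), 1.15: (19.6, 96.8), 1.20: (8.1, 98.3)},
    0.75: {0.75: (95.7, 5.5), 0.80: (89.8, 8.3), 0.85: (81.5, 16.0), 0.90: (68.7, 28.6),
           0.95: (55.5, 43.7), 1.00: (41.5, 58.9), 1.05: (28.2, 72.4), 1.10: (18.9, 82.4),
           1.15: (11.3, 90.5), 1.20: (7.2, 94.8)},
}


def both_adequate(or1: float, threshold: float = 50.0) -> List[float]:
    """OR2 values at which both tests reach at least `threshold` percent power."""
    return [or2 for or2, (treatment, heterogeneity) in sorted(TABLE3[or1].items())
            if treatment >= threshold and heterogeneity >= threshold]


region = {or1: both_adequate(or1) for or1 in TABLE3}
```

On the tabulated grid the region shrinks as OR1 moves toward 1, and for OR1 = 0.75 no tabulated OR2 value qualifies, so any such region would have to lie between the grid points 0.95 and 1.00.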


Discussion

These simulations demonstrate that, of the four methods investigated, the population average (GEE) model generally has the greatest power to detect composite outcome treatment heterogeneity. This is further supported by the conclusion that population average models (GEE) are the more powerful test among possible methods for analyzing cluster randomized trials data [30]. It should be noted that the GEE and random effects models do not estimate the same parameters, since GEE is a marginal model and the random effects model allows the estimation of individual effects. For effect estimation the GEE models are known to bias model parameter estimates towards the null, but at the same time have smaller parameter standard deviations compared to random effects models [31]. Since the focus for this application is on the test statistic itself, rather than estimation, it seems reasonable that the population average model would have the greatest power. We found only one exception to this conclusion. When there was a large imbalance between the two composite components, where the most frequent of these had the smaller treatment effect, the weighted regression model had higher power, with the population average (GEE) model being second. We should also consider the fact that the GEE model was somewhat liberal in its type I error rate for the case of no composite outcome heterogeneity.

Even small amounts of component heterogeneity can reduce study power to detect a treatment effect for the composite outcome. However, we did find regions where the power for both the test of the composite outcome and the test of composite heterogeneity was greater than 50%. This indicates a range of results where tests for composite heterogeneity would be useful. One may only want to perform a test of composite outcome heterogeneity when the main effect is statistically significant. However, regardless of the statistical significance of the composite outcome, a test for composite heterogeneity may provide insight into the differing mechanisms for each component outcome. This information could then aid in the design of future trials. For the current trial, though, the presence of composite heterogeneity should never lead researchers to assume that the composite outcome as a whole would have been statistically significant if only the mix of components were slightly altered.

The use of models for correlated binary data to explore composite outcome heterogeneity has some important advantages. It can easily be implemented in common statistical software packages using currently available repeated/recurrent outcomes methods. The methodology suggested in this manuscript can be generalized to other outcomes types in addition to binary, including continuous outcomes, time to first event and time to recurrent events. Given the ease of implementation and application to a variety of outcome types, trialists may be encouraged to address the issue of potential composite heterogeneity more often and more directly in the presentation of trial results.

There are limitations to the results presented here. We have not explored differing event rates, component correlations, extreme imbalance in component ratios, and the effects of more than two composite components. This area will require more research and such simulations could be a productive exercise when designing a randomized clinical trial. The methods presented would not be appropriate to use when the composite components are expected to show differing treatment directions, as in a risk-benefit composite outcome. Lastly, failure to detect statistically significant composite heterogeneity may be a result of low power, rather than true treatment homogeneity across the composite components. Trialists would be wise to consider the power to detect composite heterogeneity in the design of trials with composite outcomes.

The methods of exploring composite outcome heterogeneity directly, using the tests described here, may partially address the concerns raised about using composite outcomes in many fields. When reporting trial results, it would seem reasonable to expect to see such a test for composite heterogeneity presented alongside a statistically significant treatment effect test for the composite outcome.


Conclusions

We compared the power of different tests to detect composite heterogeneity for treatment effect across components of a composite binary outcome. Simulations were done comparing four different models commonly used to analyze correlated binary data. The results of these simulations are quite clear. Generally, the GEE model should be chosen for investigating possible heterogeneity among the components of a binary composite outcome, since it demonstrated the greatest power. This is particularly true when the power for the test of treatment effect on the composite outcome as a whole is also reasonably high. It is recommended that tests of composite heterogeneity for composite outcomes accompany the publication of the results for statistically significant composite outcomes along with individual components of composite outcomes. Further simulations are still required to explore the impact on power of differing event rates, component correlations, extreme imbalance in component ratios, and the effects of more than two composite components.

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

JP conceived of applying the methods presented to analysis of binary composite outcomes, performed the simulations, and produced the figures. All authors helped define the conditions of the simulations and participated in drafting the manuscript. All authors have read and approved the final manuscript.


References

1. Leon MB, Teirstein PS, Moses JW, Tripuranenin P, Lansky AJ, Jani S, Wong SC, Fish D, Ellis S, Holmes DR, Kerieakes D, Kuntz RE. Localized intracoronary gamma-radiation therapy to inhibit the recurrence of restenosis after stenting. The New England Journal of Medicine. 2001;344:250-6. doi:10.1056/NEJM200101253440402. PMID: 11172151
2. Montori VM, Busse JW, Permanyer-Miralda G, Ferreira I, Guyatt GH. How should clinicians interpret results reflecting the effect of an intervention on composite endpoints: Should I dump this lump? ACP Journal Club. 2005;143:A8-9.
3. Montori VM, Permanyer-Miralda G, Ferreira-Gonzalez I, Busse JW, Pacheco-Huergo V, Bryant D, Alonso J, Akl EA, Domingo-Salvany A, Mills E, Wu P, Schunemann HJ, Jaeschke R, Guyatt GH. Validity of composite outcomes in clinical trials. British Medical Journal. 2005;330:594-6. doi:10.1136/bmj.330.7491.594. PMID: 15761002
4. Ferreira-Gonzalez I, Busse JW, Heels-Ansdell, Montori VM, Akl EA, Bryant DM, Alonso-Coello P, Alonso J, Worster A, Upadhye S, Jaeschke R, Schunemann HJ, Permanyer-Miralda G, Pacheco-Huergo V, Domingo-Salvany A, Wu P, Mills EJ, Guyatt GH. Problems with use of composite end points in cardiovascular trials: systematic review of randomized controlled trials. British Medical Journal. 2007;334(7597):786. doi:10.1136/bmj.39136.682083.AE. PMID: 17403713
5. DeMets DL, Califf RM. Lessons learned from recent cardiovascular clinical trials: Part I. Circulation. 2002;106:746-51. doi:10.1161/01.CIR.0000023219.51483.66. PMID: 12163438
6. Neaton JD, Gray G, Zuckerman BD, Konstam MA. Key issues in end point selection for heart failure trials: Composite end points. Journal of Cardiac Failure. 2005;11:567-75. doi:10.1016/j.cardfail.2005.08.350. PMID: 16230258
7. Moye LA. Multiple analyses in clinical trials. New York: Springer; 2003.
8. Bergman S, Feldman LS, Barkun JS. Evaluating surgical outcomes. Surgical Clinics of North America. 2006;86:129-49. doi:10.1016/j.suc.2005.10.007. PMID: 16442425
9. Califf RM, Harrelson-Woodlief L, Topol EJ. Left ventricular ejection fraction may not be useful as an end point of thrombolytic therapy comparative trials. Circulation. 1990;82:1847-53. PMID: 2225381
10. Braunwald E, Cannon CP, McCabe CH. An approach to evaluating thrombolytic therapy in acute myocardial infarction. The 'unsatisfactory outcome' end point. Circulation. 1992;86:683-7. PMID: 1638732
11. Follmann D, Duerr A, Tabet S, Gilber P, Moddie Z, Fast P, Cardinali M, Self S. Endpoints and regulatory issues in HIV vaccine clinical trials. Journal of Acquired Immune Deficiency Syndrome. 2007;44:49-60. doi:10.1097/01.qai.0000247227.22504.ce
12. Hariharan S, McBride MA, Cohen EP. Evolution of endpoints for renal transplant outcome. American Journal of Transplantation. 2003;3:933-41. doi:10.1034/j.1600-6143.2003.00176.x. PMID: 12859527
13. Davis SM, Koch GG, Davis CE, LaVange LM. Statistical approaches to effectiveness measurement and outcome-driven re-randomizations in the clinical antipsychotic trials of intervention effectiveness (CATIE) studies. Schizophrenia Bulletin. 2003;29:73-80. PMID: 12908662
14. Tugwell P, Judd MG, Fries JF, Singh G, Wells GA. Powering our way to the elusive side effect: A composite outcome 'basket' of predefined designated endpoints in each organ system should be included in all controlled trials. Journal of Clinical Epidemiology. 2005;58:785-90. doi:10.1016/j.jclinepi.2004.11.028. PMID: 16018913
15. Ross S. Composite outcomes in randomized clinical trial: arguments for and against. American Journal of Obstetrics & Gynecology. 2007;196:119.e1-e6.
16. Huque MF, Sankoh AJ. A reviewer's perspective on multiple endpoint issues in clinical trials. Journal of Biopharmaceutical Statistics. 1997;7:545-64. doi:10.1080/10543409708835206. PMID: 9358328
17. Sankoh AJ, D'Argostina RB Sr, Huque MF. Efficacy endpoint selection and multiplicity adjustment methods in clinical trials with inherent multiple endpoint issues. Statistics in Medicine. 2003;22:3133-50. doi:10.1002/sim.1557. PMID: 14518019
18. Berger V. Improving the information content of categorical clinical trials endpoints. Controlled Clinical Trials. 2002;23:502-14. doi:10.1016/S0197-2456(02)00233-7. PMID: 12392864
19. Hallstrom AP, Litwin PE, Weaver WD. A method of assigning scores to the components of a composite outcome: An example from the MITI trial. Controlled Clinical Trials. 1992;13:148-55. doi:10.1016/0197-2456(92)90020-Z. PMID: 1316829
20. Bjorling LE, Hodges JS. Rule-based ranking schemes for antiretroviral trials. Statistics in Medicine. 1997;16:1175-91. doi:10.1002/(SICI)1097-0258(19970530)16:10<1175::AID-SIM522>3.0.CO;2-G. PMID: 9179982
21. Hardy RJ, Thompson SG. Detecting and describing heterogeneity in meta-analysis. Statistics in Medicine. 1998;17:841-56. doi:10.1002/(SICI)1097-0258(19980430)17:8<841::AID-SIM781>3.0.CO;2-D. PMID: 9595615
22. SAS Institute. SAS version 9.1. SAS Institute, Cary, NC.
23. Shoukri MM, Chaudhary MA. Analysis of Correlated Data with SAS and R. 3rd edition. London: Chapman & Hall; 2007.
24. Donald A, Donner A. Adjustment to the Mantel-Haenszel chi-squared statistic and odds ratio estimator when the data are clustered. Statistics in Medicine. 1987;6:491-9. doi:10.1002/sim.4780060408. PMID: 3629050
25. Rao JNK, Scott AJ. A simple method for the analysis of clustered binary data. Biometrics. 1992;48:577-85. doi:10.2307/2532311. PMID: 1637980
26. Liang KY, Zeger SL. Longitudinal data analysis using generalized linear models. Biometrika. 1986;73:13-22. doi:10.1093/biomet/73.1.13
27. McCullagh P, Nelder JA. Generalized Linear Models. London: Chapman and Hall; 1989.
28. The Heart Outcomes Prevention Evaluation (HOPE) Study Investigators. Effect of an angiotensin-converting-enzyme inhibitor, ramipril on cardiovascular events in high-risk patients. The New England Journal of Medicine. 2000;342:145-53. doi:10.1056/NEJM200001203420301. PMID: 10639539
29. Park CG, Park T, Shin DW. A simple method for generating correlated binary variates. The American Statistician. 1996;50:306-10. doi:10.2307/2684925
30. Austin PC. A comparison of the statistical power of different methods for the analysis of cluster randomization trials with binary outcomes. Statistics in Medicine. 2007;26:3550-65. doi:10.1002/sim.2813. PMID: 17238238
31. Hosmer DW, Lemeshow S. Applied Logistic Regression. New York: John Wiley & Sons, Inc; 2000.


Figure 1. Power for composite outcome heterogeneity by model as a function of treatment effect for the second component. Note that power curves for both weighted models completely overlap in this figure. The Independent and Random Effects lines also overlap to a large degree.

Figure 2. The power for the main effect of treatment (black line) and the power for the test of heterogeneity of the composite components (blue line) by degree of composite heterogeneity.

Table 1. Power (%) to detect heterogeneity between the two components of a composite outcome by degree of heterogeneity (equal balance among components) with OR1 = 0.65. DD = Donald and Donner weights [24]; RS = Rao and Scott weights [25].

Heterogeneity | OR2  | Composite overall OR | Weighted DD | Weighted RS | Independent | Random effects | GEE
None          | 0.65 | 0.65                 | 3.0         | 3.2         | 3.9         | 4.0            | 5.3
              | 0.70 | 0.67                 | 5.1         | 5.2         | 6.3         | 6.4            | 8.1
Low           | 0.75 | 0.70                 | 13.1        | 13.2        | 15.6        | 15.6           | 17.9
              | 0.80 | 0.72                 | 26.0        | 26.2        | 29.5        | 29.8           | 33.4
Moderate      | 0.85 | 0.75                 | 42.7        | 42.9        | 46.9        | 46.9           | 51.1
              | 0.90 | 0.78                 | 60.2        | 60.3        | 63.9        | 64.0           | 67.8
              | 0.95 | 0.80                 | 74.6        | 74.6        | 77.7        | 77.8           | 80.7
              | 1.00 | 0.83                 | 85.3        | 85.4        | 87.6        | 87.5           | 89.9
High          | 1.05 | 0.85                 | 92.2        | 92.3        | 93.8        | 93.8           | 95.0
              | 1.10 | 0.88                 | 96.6        | 96.7        | 97.4        | 97.4           | 97.8
              | 1.15 | 0.91                 | 98.4        | 98.4        | 98.8        | 98.8           | 99.0
              | 1.20 | 0.93                 | 99.4        | 99.4        | 99.6        | 99.5           | 99.7

Table 2. Power (%) for detecting heterogeneity of treatment effect by varying degrees of balance among the components of the composite, for a moderate heterogeneity pattern (OR1, OR2) = (0.65, 1.00) and ratio (p1:p2) of occurrence of components 1 and 2.

Balance (p1:p2) | Weighted DD | Weighted RS | Independent | Random effects | GEE
1:1             | 85.3        | 85.8        | 88.1        | 88.2           | 90.0
1:3             | 77.0        | 77.1        | 75.4        | 75.4           | 78.7
1:5             | 65.0        | 65.0        | 59.4        | 59.4           | 62.8
3:1             | 79.1        | 79.1        | 79.5        | 79.9           | 82.3
5:1             | 70.3        | 70.3        | 68.2        | 68.6           | 71.1

Table 3. Comparison of power (%) for the main treatment effect with power (%) for the interaction (heterogeneity) test, using the population average model (GEE). A dash indicates a combination that was not reported.

     | OR1 = 0.65       | OR1 = 0.65         | OR1 = 0.70       | OR1 = 0.70         | OR1 = 0.75       | OR1 = 0.75
OR2  | Treatment effect | Heterogeneity test | Treatment effect | Heterogeneity test | Treatment effect | Heterogeneity test
0.65 | >99.9            | 5.3                | -                | -                  | -                | -
0.70 | 99.9             | 8.1                | 99.4             | 5.0                | -                | -
0.75 | 99.6             | 17.9               | 98.2             | 8.3                | 95.7             | 5.5
0.80 | 98.2             | 33.4               | 95.7             | 16.7               | 89.8             | 8.3
0.85 | 95.5             | 51.1               | 89.5             | 30.5               | 81.5             | 16.0
0.90 | 90.7             | 67.8               | 81.6             | 44.1               | 68.7             | 28.6
0.95 | 82.2             | 80.7               | 70.4             | 63.5               | 55.5             | 43.7
1.00 | 70.7             | 89.9               | 57.8             | 78.8               | 41.5             | 58.9
1.05 | 57.7             | 95.0               | 42.9             | 86.3               | 28.2             | 72.4
1.10 | 44.6             | 97.8               | 30.2             | 92.8               | 18.9             | 82.4
1.15 | 31.3             | 99.0               | 19.6             | 96.8               | 11.3             | 90.5
1.20 | 21.5             | 99.7               | 8.1              | 98.3               | 7.2              | 94.8

Article Categories:
  • Research Article
