MetaDiSc: a software for metaanalysis of test accuracy data.  
Jump to Full Text  
MedLine Citation:

PMID: 16836745 Owner: NLM Status: MEDLINE 
Abstract/OtherAbstract:

BACKGROUND: Systematic reviews and metaanalyses of test accuracy studies are increasingly being recognised as central in guiding clinical practice. However, there is currently no dedicated and comprehensive software for metaanalysis of diagnostic data. In this article, we present MetaDiSc, a Windowsbased, userfriendly, freely available (for academic use) software that we have developed, piloted, and validated to perform diagnostic metaanalysis. RESULTS: MetaDiSc a) allows exploration of heterogeneity, with a variety of statistics including chisquare, Isquared and Spearman correlation tests, b) implements metaregression techniques to explore the relationships between study characteristics and accuracy estimates, c) performs statistical pooling of sensitivities, specificities, likelihood ratios and diagnostic odds ratios using fixed and random effects models, both overall and in subgroups and d) produces high quality figures, including forest plots and summary receiver operating characteristic curves that can be exported for use in manuscripts for publication. All computational algorithms have been validated through comparison with different statistical tools and published metaanalyses. MetaDiSc has a Graphical User Interface with rolldown menus, dialog boxes, and online help facilities. CONCLUSION: MetaDiSc is a comprehensive and dedicated test accuracy metaanalysis software. It has already been used and cited in several metaanalyses published in highranking journals. The software is publicly available at http://www.hrc.es/investigacion/metadisc_en.htm. 
Authors:

Javier Zamora; Victor Abraira; Alfonso Muriel; Khalid Khan; Arri Coomarasamy 
Related Documents
:

19282505  The effectiveness of intervention on the behavior of individuals with autism: a metaan... 20942855  The influence of the assessment method on the incidence of visual field progression in ... 23366065  A cad system for atherosclerotic plaque assessment. 12723775  Hierarchical linear models for the quantitative integration of effect sizes in singlec... 11315045  Empirical bayes estimation for combinations of multivariate bioassays. 20835375  Generalized weighted likelihood density estimators with application to finite mixture o... 
Publication Detail:

Type: Journal Article; Research Support, NonU.S. Gov't Date: 20060712 
Journal Detail:

Title: BMC medical research methodology Volume: 6 ISSN: 14712288 ISO Abbreviation: BMC Med Res Methodol Publication Date: 2006 
Date Detail:

Created Date: 20060822 Completed Date: 20060908 Revised Date: 20130607 
Medline Journal Info:

Nlm Unique ID: 100968545 Medline TA: BMC Med Res Methodol Country: England 
Other Details:

Languages: eng Pagination: 31 Citation Subset: IM 
Affiliation:

Clinical Biostatistics Unit, Ramón y Cajal Hospital, Spain. javier.zamora@hrc.es 
Export Citation:

APA/MLA Format Download EndNote Download BibTex 
MeSH Terms  
Descriptor/Qualifier:

Clinical Laboratory Techniques
/
standards* Data Interpretation, Statistical Diagnostic Imaging / standards* Diagnostic Services / standards* Endometrium / pathology, ultrasonography Female Humans MetaAnalysis as Topic* Research Design* Sensitivity and Specificity Software* UserComputer Interface Uterine Diseases / ultrasonography 
Comments/Corrections 
Full Text  
Journal Information Journal ID (nlmta): BMC Med Res Methodol ISSN: 14712288 Publisher: BioMed Central, London 
Article Information Copyright ? 2006 Zamora et al; licensee BioMed Central Ltd. openaccess: This is an Open Access article distributed under the terms of the Creative Commons Attribution License (), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Received Day: 31 Month: 3 Year: 2006 Accepted Day: 12 Month: 7 Year: 2006 collection publication date: Year: 2006 Electronic publication date: Day: 12 Month: 7 Year: 2006 Volume: 6First Page: 31 Last Page: 31 ID: 1552081 Publisher Id: 14712288631 PubMed Id: 16836745 DOI: 10.1186/14712288631 
MetaDiSc: a software for metaanalysis of test accuracy data  
Javier Zamora1  Email: javier.zamora@hrc.es 
Victor Abraira1  Email: Victor.abraira@hrc.es 
Alfonso Muriel1  Email: Alfonso.muriel@hrc.es 
Khalid Khan2  Email: k.s.khan@bham.ac.uk 
Arri Coomarasamy2  Email: arricoomar@blueyonder.co.uk 
1Clinical Biostatistics Unit, Ram?n y Cajal Hospital, Madrid, Ctra. Colmenar km 9.100 Madrid 28034, Spain 

2University of Birmingham and Birmingham Women's Hospital, Edgbaston, Birmingham, UK 
Accurate diagnosis forms the basis of good clinical care, as without it one can neither prognosticate correctly nor choose the right treatment. Indeed, a wrong diagnosis can harm patients by exposing them to inappropriate or suboptimal therapy [^{1}]. Thus studies of diagnostic accuracy, and particularly their systematic reviews and metaanalyses, are being recognised as instrumental in underpinning evidencebased clinical practice. Initiatives such as STARD [^{2}] and developments within the Cochrane Collaboration [^{3}] to accept protocols and reviews of test accuracy studies highlight the emphasis being given to evidencebased diagnosis.
Currently, there is only one test accuracy metaanalysis package, MetaTest [^{4}], which addresses some of the unique statistical issues related to test accuracy, such as pooling of sensitivities and specificities and summary receiver operating characteristics (sROC) analysis. However, it is a DOSbased application with an interface that many find difficult to use, and integrate into Windowsbased applications. Moreover, it lacks crucial analytical tools such as pooling of likelihood ratios (LRs), tests for heterogeneity and metaregression facilities.
We, therefore, developed, piloted and validated a comprehensive, Windowsbased test accuracy metaanalysis software, MetaDiSc, which is presented in this article, with a worked example.
MetaDiSc software was created in Microsoft Visual Basic 6, and some mathematical routines have been linked from the NAG C mathematical library [^{5}]. The software is distributed as a single file, downloadable freely from URL: . Its installation is simple, guided by onscreen instructions. The programme has a userfriendly interface with rolldown menus, dialog boxes and online HTML compiled help files. These help files include a user manual and a description of the implemented statistical methods.
MetaDiSc allows data entry into its datasheet in three different ways: a) directly by typing data into the datasheet using the keyboard, b) copying from another spreadsheet (e.g. Microsoft Excel) and pasting into MetaDiSc datasheet, or c) importing text files from other sources (for example, in the comma delimited format). Several variables can be defined in the datasheet, including study identifiers, accuracy data from each study (true positives, false positives, true negatives and false negatives) and study level covariates, such as those defining population spectrum or methodological quality of the studies.
Once the data have been entered into the datasheet of MetaDiSc, various statistical analyses can be implemented (Figure 1). The implementation of these statistical procedures needs to be carefully thought through and judicious, as it may be inappropriate (or indeed misleading) to use all the procedures (particularly statistical pooling) in all reviews. MetaDiSc provides analysts with adequate tools to assess the appropriateness of pooling. Readers interested in details of these methods are referred to statistical methods section of the help files (also available as a PDF standalone document [^{6}] and to existing texts and guidelines on diagnostic metaanalysis [^{7}^{}^{10}].
When describing accuracy results from several studies, it is important to get an indication of the magnitude and precision of the accuracy estimates derived from each study, as well as to assess the presence or absence of inconsistencies in accuracy estimates across studies (heterogeneity). As accuracy estimates are paired and often interrelated (sensitivity and specificity, or LR positive and LR negative), it is necessary to report these simultaneously [^{11}]. One accuracy measure that combines these paired measures is diagnostic odd ratio (dOR) [^{12}], which has limited clinical use, although useful in procedures like metaregression (see below).
MetaDiSc computes accuracy estimates and confidence intervals from individual studies and shows results either as numerical tabulations or graphical plots in two formats: a) forest plots, for sensitivities, specificities, LRs or dOR, with respective confidence intervals; and b) plots of individual study results in ROC space, with or without an sROC curve.
Exploring heterogeneity is a critical issue to a) understand the possible factors that influence accuracy estimates, and b) to evaluate the appropriateness of statistical pooling of accuracy estimates from various studies. One of the primary causes of heterogeneity in test accuracy studies is threshold effect, which arises when differences in sensitivities and specificities or LRs occur due to different cutoffs or thresholds used in different studies to define a positive (or negative) test result. When threshold effect exists, there is a negative correlation between sensitivities and specificities (or a positive correlation between sensitivities and 1specificities), which results in a typical pattern of "shoulder arm" plot in a sROC space [^{8}]. It is worth noting that correlation between sensitivity and specificity could arise due to a number of reasons other than threshold (e.g. partial verification bias, different spectrum of patients or different settings).
MetaDiSc allows assessment for threshold effect in three different ways: a) visual inspection of relationship between pairs of accuracy estimates in forest plots. If threshold effect is present, the forest plots will show increasing sensitivities with decreasing specificities, or vice versa. The same inverse relationship will be apparent with LR positive and LR negative; b) representation of accuracy estimates from each study in a sROC space ? a typical "shoulder arm" pattern would suggest presence of threshold effect; and c) computation of Spearman correlation coefficient between the logit of sensitivity and logit of 1specificity. A strong positive correlation would suggest threshold effect.
Apart from variations due to threshold effect, there are several other factors that can result in variations in accuracy estimates amongst different test accuracy studies in a review. These reasons include chance as well as variations in study population (e.g. severity of disease and comorbidities), index test (differences in technology, assays, operator etc.), reference standard, and the way a study was designed and conducted [^{13}]. Since such heterogeneity is almost always present in accuracy systematic reviews, testing for the presence and the extent of heterogeneity of results between primary studies, prior to undertaking any metaanalysis, is a critical part of any diagnostic review, as is exploration of the possible causes of heterogeneity [^{14}].
MetaDiSc allows users to test for heterogeneity amongst various studies in two different ways: a) Visual inspection of forest plots of accuracy estimates. If the studies are reasonably homogeneous, the accuracy estimates from individual studies will lie along a line corresponding to the pooled accuracy estimate. Large deviations from this line will indicate possible heterogeneity; b) statistical tests, including Chisquare and CochranQ, which are automatically implemented during analysis to evaluate if the differences across the studies are greater than expected by chance alone. A low pvalue will suggest presence of heterogeneity beyond what could be expected by chance alone. In addition to these heterogeneity statistics, MetaDiSc computes the inconsistency index (Isquared) which has been proposed as a measure to quantify the amount of heterogeneity [^{15}].
If substantial heterogeneity is found to be present from the analyses detailed above, then reasons for such heterogeneity can be explored by relating study level covariates (e.g., population, test, reference standard or methodological features) to an accuracy measure, using metaregression techniques. The accuracy measure that is normally used is dOR, as it is a unitary measure of diagnostic performance that encompasses both sensitivity and specificity or both LR positive and LR negative. Using dOR as a global measure of accuracy is a suitable method to compare the overall diagnostic accuracy of different tests [^{13}]. However, its use is limited because it cannot be used directly in clinical practice and, furthermore, possible opposing effects of a study characteristic on sensitivity or specificity may be masked by using dOR.
MetaDiSc implements metaregression using a generalization of Littenberg and Moses Linear model [^{8},^{13}] weighted by inverse of the variance or study size or unweighted. Random effects between studies can be estimated by different methods and added to the weighting scheme [^{16}]. Estimations of coefficients of the model are performed by least squares method as implemented in NAG mathematical routines. The outcome variable is ln(dOR) which is related via a linear model to any number of study level covariates, and optionally including the variable representing threshold effect [^{13}]. The outputs from metaregression modelling in MetaDiSc are the coefficients of the model, as well as ratio of dOR (rdOR) with respective confidence intervals. If a particular study level covariate is significantly associated with diagnostic accuracy, then its coefficient will have a low pvalue, and the rdOR will give a measure of magnitude of the association.
More advanced metaregression techniques such as Hierarchical sROC model [^{17}] and bivariate analysis of sensitivity and specificity [^{18}] has been developed. These methods overcome some of the statistical shortcomings inherent to Littenberg and Moses model [^{8},^{19}].
Statistical pooling is not always appropriate or necessary in every systematic review of test accuracy studies. However, when used appropriately, pooling can provide useful summary information. The necessary precondition for simple pooling (weighted averaging) of each of sensitivities, specificities, LR positives and LR negatives, is that the studies and results are reasonably homogeneous (i.e. no substantial heterogeneity, including threshold effect, is present). If heterogeneity due to threshold effect were present, the accuracy data can be pooled by fitting a sROC curve and summarising that curve by means of the Area Under the Curve (AUC) or using other statistics such as the Q* index [^{19}] (i.e. the point of the curve in which sensitivity equals specificity). If there is heterogeneity due to sources other than threshold effect, then pooling should only be attempted within homogeneous subsets, which would normally have been defined a priori.
MetaDiSc has comprehensive functionality for statistical pooling: a) It allows pooling of sensitivities, specificities, LR positive and LR negative each separately, using either fixed or random effect [^{10},^{20}] models. The output from these analyses are presented numerically in tables, and graphically as forest plots. Pooled estimates are provided with their respective confidence intervals; b) It implements several ways to fit a sROC curve when threshold effect is present. Default option is to compute a symmetrical sROC curve after fitting the linear model proposed by Littenberg and Moses. However, users can choose different options to fit this curve, for example, combining individual dORs by the MantelHaenszel or the DerSimonian Laird methods [^{10},^{20}] to estimate an overall dOR, and then fitting an sROC curve. When the dOR changes with diagnostic threshold, the sROC curve is asymmetrical. MetaDiSc allows the user to check for asymmetry of the sROC curve, and fit an asymmetrical sROC curve if appropriate. Finally, MetaDiSc allows estimation of AUC and the Q* index, along with their standard errors, as a summary measure of global accuracy which also aids intertest comparisons; c) MetaDiSc allows pooling of various summary measures within subgroups defined by study level covariates with the help of a filter utility.
Wherever possible, the results of the above statistical procedures were validated using different general purpose statistical software such as STATA (ver 8.2) and SAS (8.2) using actually published and simulated data sets (Table 1).
We illustrate the various procedures that MetaDiSc implements in a casestudy of ultrasound test in the diagnosis of uterine pathology [^{21},^{22}]. Ultrasound measurement of the lining of the uterus (endometrium) can predict pathology such as endometrial hyperplasia (a precancerous condition) or cancer. The greater the thickness of endometrium, the more likely that the target condition is present. Various thresholds (such as 3, 4 or 5 mm etc) have been used to define a positive ultrasound result.
A systematic review of test accuracy studies identified 57 studies. Figure 2 shows a datasheet in MetaDiSc which has been loaded with information from these 57 studies. The information includes study identifiers, accuracy data, thresholds, and some study level covariates (such as hormone replacement therapy use).
As the first step in the analysis, we have used MetaDiSc to present accuracy measures from each individual study in forest plots for sensitivities (figure 3a), specificities (figure 3b), LRs (figures 4a and 4b) and dOR (figure 5). All these indices can also be represented in tabular form as shown in table 2. Although the forest plots and the tables contain a pooled summary at the bottom, at this early stage in the analysis, it is recommended that the plots are used to obtain a general overview of the accuracy estimates from each study, and the interpretation of the pooled summary is left to later stages of analysis.
The next step is the representation of sensitivity against 1specificity from each study in a ROC space (figure 6), which can be used for exploration for threshold effect. The pattern of the points in this plot suggest a "shoulderarm" shape, indicating the possibility of threshold effect. We, therefore, performed a Spearman rank correlation as a further test for threshold effect, and found that there was further indication of threshold effect (Table 3, Spearman correlation coefficient = 0.394; p = 0.006). Having found some clues about the presence of threshold effect, we now focus on a subgroup of 21 studies that used a singular threshold of >5 mm to define test positivity. Although an explicit threshold of 5 mm was used in these studies, there can still be an implicit threshold effect due to, for example, variation in the interpretation of the test results. Therefore, within this subgroup with an explicit threshold of 5 mm, it is still recommended that the above explorations for threshold effect are undertaken. We performed such analyses for this subgroup in MetaDiSc, and found no evidence of further threshold effect (data not shown). There are a number of other more advanced methods not implemented in MetaDiSc that allow to incorporate explicitly information about tests thresholds defined between or within studies [^{17}].
As the next step, heterogeneity arising from factors other than threshold effect is explored. We performed a visual exploration of the forest plots of accuracy measures for these 21 studies as well as statistical tests for heterogeneity (MetaDiSc output not shown). In addition, possible sources of heterogeneity across the studies were explored using metaregression analysis with the following covariates as predictor variables: use or nonuse of hormone replacement therapy (HRT); technique of ultrasound measurement (single or double layer); and population enrolment (consecutive or other). Results are shown in Table 4, which suggest that the number of layers is strongly associated with accuracy. The double layer technique is associated with two times higher accuracy compared to single layer measurement (rdOR = 2.04; 95% CI: 1.01?4.13; p = 0.048)
The final step in the analysis is pooling if this is considered appropriate. We illustrate pooling of the LRs for negative test results in one homogenous subgroup of studies of nonHRT users, with a test threshold of ? 5 mm, and using a single layer technique (Figure 7). Finally, we demonstrate sROC curve fitting in the presence of threshold effect for the whole dataset in Figure 8.
MetaDiSc allows description of individual study results; exploration of heterogeneity with a variety of statistics including chisquare, Isquared and Spearman correlation tests; implements metaregression techniques to explore the relationships between study characteristics and accuracy estimates; performs statistical pooling of sensitivities, specificities, likelihood ratios and diagnostic odds ratios, using fixed and random effects models, both overall and in subgroups; and produces high quality figures, including forest plots and summary receiver operating characteristic curves that can be exported for use in manuscripts for publication.
MetaDiSc is an evolving software. As new diagnostic metaanalytic methods become established over time, they will be implemented into the program in the future. For example, bivariate method of pooling sensitivity and specificity [^{18}] is currently being developed. We will carefully follow the progress in this field. Once accepted as an established metaanalytic method, it will be implemented in MetaDiSc. On similar lines, methods of data extraction from individual studies that only provide accuracy measures are currently being developed within our department. Once these methods have been verified, we will implement this option to assist systematic reviewers in extracting 2by2 tables from such studies.
MetaDiSc is a comprehensive and dedicated test accuracy metaanalysis software. All computational algorithms in it have been validated through comparison with different statistical tools and published metaanalyses. Its use and citation in several metaanalyses published in highranking journals is evidence of external validation of its high quality [^{23}^{}^{28}].
The software is publicly available at .
Operating system: The software runs on Windows based personal computers (Windows 95 or higher) with Pentiumclass processor or equivalent, with minimum of 32 MB of RAM and minimum of 20 MB of hard disk space. SVGA color monitor; minimum 800 ? 600 screen resolution and 256 colors.
Licence: Freeware for academic use.
The author(s) declare that they have no competing interests.
JZ conceived the idea. AM, VA and JZ developed the software. AC and KSK tested the software on a number of reviews and gave suggestions for improvements. All authors participated in preparing this manuscript.
The prepublication history for this paper can be accessed here:
MetaDisc data set. This file contains simulated data. It is provided to help users to validate statistical procedures shown in table 1.
Additional File 2
STATA data set. This file contains simulated data. It is provided to help users to validate statistical procedures shown in table 1.
This work has been partly funded by Spanish Health Ministry Grants no PI02/0954, G03/090 and PI04/1055.
References
Thomson R,McElroy H,Sudlow M. Guidelines on anticoagulant treatment in atrial fibrillation in Great Britain: variation in content and implications for treatmentBMJ 1998;316:509–513. [pmid: 9501712]  
Bossuyt PM,Reitsma JB,Bruns DE,Gatsonis CA,Glasziou PP,Irwig LM,Lijmer JG,Moher D,Rennie D,de Vet HC. Towards complete and accurate reporting of studies of diagnostic accuracy: The STARD InitiativeRadiology 2003;226:24–28. [pmid: 12511664]  
Collaboration C. Methods Groups Newsletterhttp://www cochrane org/newslett/MGNews2004 pdf. 2006  
Lau J. MetaTest. 1997Boston: New England Medical Center;  
The NAG C Library, Mark 6. 2004Oxford: Numerical Algorithms Group;  
Zamora J,Muriel A,Abraira V. MetaDiSc Statistical Methods2006  
Irwig L,Tosteson ANA,Gatsonis C,Lau J,Colditz G,Chalmers TC,Mosteller F. Guidelines for Metaanalyses Evaluating DiagnosticTestsAnnals of Internal Medicine 1994;120:667–676. [pmid: 8135452]  
Moses LE,Shapiro D,Littenberg B. Combining independent studies of a diagnostic test into a summary ROC curve: dataanalytic approaches and some additional considerationsStat Med 1993;12:1293–1316. [pmid: 8210827]  
Deville WL,Buntinx F,Bouter LM,Montori VM,de Vet HC,van der Windt DA,Bezemer PD. Conducting systematic reviews of diagnostic studies: didactic guidelinesBMC Med Res Methodol 2002;2:9. [pmid: 12097142] [doi: 10.1186/1471228829]  
Deeks JJ. Egger M, Davey SG and Altman DGSystematic reviews of evaluations of diagnostic and screening tests studiesSystematic reviews in health care: metaanalysis in context (2nd Edition). 20012nd Edition. BMJ Books;  
Honest H,Khan KS. Reporting of measures of accuracy in systematic reviews of diagnostic literatureBmc Health Services Research 2002;2 [pmid: 11884248]  
Glas AS,Lijmer JG,Prins MH,Bonsel GJ,Bossuyt PM. The diagnostic odds ratio: a single indicator of test performanceJ Clin Epidemiol 2003;56:1129–1135. [pmid: 14615004] [doi: 10.1016/S08954356(03)00177X]  
Lijmer JG,Bossuyt PM,Heisterkamp SH. Exploring sources of heterogeneity in systematic reviews of diagnostic testsStat Med 2002;21:1525–1537. [pmid: 12111918] [doi: 10.1002/sim.1185]  
Dinnes J,Deeks J,Kirby J,Roderick P. A methodological review of how heterogeneity has been examined in systematic reviews of diagnostic test accuracyHealth Technol Assess 2005;9:1–113.  
Higgins JP,Thompson SG. Quantifying heterogeneity in a metaanalysisStat Med 2002;21:1539–1558. [pmid: 12111919] [doi: 10.1002/sim.1186]  
Thompson SG,Sharp SJ. Explaining heterogeneity in metaanalysis: a comparison of methodsStat Med 1999;18:2693–2708. [pmid: 10521860] [doi: 10.1002/(SICI)10970258(19991030)18:20<2693::AIDSIM235>3.0.CO;2V]  
Rutter CM,Gatsonis CA. A hierarchical regression approach to metaanalysis of diagnostic test accuracy evaluationsStat Med 2001;20:2865–2884. [pmid: 11568945] [doi: 10.1002/sim.942]  
Reitsma JB,Glas AS,Rutjes AWS,Scholten RJPM,Bossuyt PM,Zwinderman AH. Bivariate analysis of sensitivity and specificity produces informative summary measures in diagnostic reviewsJournal of Clinical Epidemiology 2005;58:982–990. [pmid: 16168343] [doi: 10.1016/j.jclinepi.2005.02.022]  
Walter SD. Properties of the summary receiver operating characteristic (SROC) curve for diagnostic test dataStat Med 2002;21:1237–1256. [pmid: 12111876] [doi: 10.1002/sim.1099]  
DerSimonian R,Laird N. Metaanalysis in clinical trialsControl Clin Trials 1986;7:177–188. [pmid: 3802833] [doi: 10.1016/01972456(86)900462]  
Gupta JK,Chien PF,Voit D,Clark TJ,Khan KS. Ultrasonographic endometrial thickness for diagnosing endometrial pathology in women with postmenopausal bleeding: a metaanalysisActa Obstet Gynecol Scand 2002;81:799–816. [pmid: 12225294] [doi: 10.1034/j.16000412.2001.810902.x]  
Khan KS,Kunz R,Kleijnen J,Antes G. Case study 4: Reviewing evidence on test accuracySystematic Review to Support Evidencebased Medicine (2003) 20032003. London, The Royal Society of Medicine; :109–119.  
Morgan M,Kalantri S,Flores L,Pai M. A commercial line probe assay for the rapid detection of rifampicin resistance in Mycobacterium tuberculosis: a systematic review and metaanalysisBMC Infectious Diseases 2005;5:62. [pmid: 16050959] [doi: 10.1186/14712334562]  
Flores L,Pai M,Colford JM,Riley LW. Inhouse nucleic acid amplification tests for the detection of Mycobacterium tuberculosis in sputum specimens: metaanalysis and metaregressionBMC Microbiol 2005;5:55. [pmid: 16202138] [doi: 10.1186/14712180555]  
Gisbert J,Abraira V. Accuracy of Helicobacter pylori Diagnostic Tests in Patients with Bleeding Peptic Ulcer: A Systematic Review and MetaanalysisThe American Journal of Gastroenterology 2006;101:848–863. [pmid: 16494583] [doi: 10.1111/j.15720241.2006.00528.x]  
Shiga T,Wajima Z,Inoue T,Sakamoto A. Predicting difficult intubation in apparently normal patients: a metaanalysis of bedside screening test performanceAnesthesiology 2005;103:429–437. [pmid: 16052126] [doi: 10.1097/0000054220050800000027]  
Zijlstra JM,van der Werf G,Hoekstra OS,Hooft L,Huijgens PC. Ffluorodeoxyglucose positron emission tomography for posttreatment evaluation of malignant lymphoma: a systematic reviewHaematologica 2006;91:522–9. [pmid: 16585017]  
Goodacre S,Sutton AJ,Sampson FC. MetaAnalysis: The Value of Clinical Assessment in the Diagnosis of Deep Venous ThrombosisAnnals of Internal Medicine 2005;143:129–139. [pmid: 16027455] 
Figures
Tables
Validation of statistical procedures. Validation of different statistical procedures using a simulated dataset. Results of MetaDiSc (version 1.4) are compared with those obtained with metan (version 1.86) and metareg (version 1.06) STATA commands. Prior to the analyses, all four cells of all studies were added with 1/2 to avoid division by zero when computing some indices or standard errors. MetaDiSc and STATA dataset are provided as additional files [see Additional file 1] and [see Additional file 2].
Results  


Procedure  MetaDiSc (version 1.4)  STATA (ver 8.2) 
Random Effect Model  
Pooled +ve LR  2.447  2.447 
(95%(CI)  (2.085 ? 2.871)  (2.085 ? 2.871) 
Tausquare  0.0932  0.0932 
CochraneQ  139.71  139.71 
Pooled ve LR  0.157  0.157 
(95%(CI)  (0.095 ? 0.257)  (0.095 ? 0.257) 
Tausquare  0.4631  0.46357 
CochraneQ  33.00  33.07 
Fixed Effect Model  
Pooled +ve LR  2.330  2.330 
(95%(CI)  (2.208 ? 2.459)  (2.208 ? 2.459) 
CochraneQ  139.71  139.71 
Pooled ve LR  0.105  0.104 
(95%(CI)  (0.073 ? 0.149)  (0.073 ? 0.148) 
CochraneQ  33.00  33.07 
MetaRegression^{1}  
TauSquare  0.1141  0.1141 
Constant coefficient (SE)  2.520 (0.8370)  2.5197 (0.83699) 
S coefficient (SE)  0.330 (0.1912)  0.3304 (0.19123) 
Covariable coefficient (SE)  0.036 (0.0904)  0.0355 (0.09041) 
(1) Metaregression was weighted by the inverse of the variance of dOR and between study variance was estimated by REML.
Tabulation of Likelihood ratio for positive test result (LR+) with respective 95% confidence intervals from all test accuracy studies included in systematic review of ultrasound for prediction of endometrial cancer.
Study  LR+  [95% Conf. Iterval.]  % Weight  
Auslender  1,994  1,623  2,449  2,54 
Zannoni  2,092  1,919  2,280  2,77 
Bakour  1,895  1,490  2,408  2,45 
Botsis  7,360  4,437  12,208  1,69 
Fistonic  1,200  1,045  1,378  2,69 
Garuti  1,471  1,358  1,593  2,78 
Granberg  2,066  1,935  2,206  2,79 
Guner  1,834  1,569  2,144  2,65 
Haller  1,321  1,118  1,561  2,63 
Tsuda  2,517  1,964  3,225  2,43 
Varner  1,795  0,842  3,826  1,13 
Abu Ghazzeh  1,215  0,538  2,745  1,03 
Briley  1,855  1,396  2,465  2,33 
Cacciatore  1,239  0,877  1,752  2,15 
DeSilva  1,306  0,245  6,957  0,34 
Granberg  3,937  2,933  5,284  2,30 
Grigoriou  2,946  2,430  3,572  2,57 
Gu  1,307  0,956  1,787  2,25 
Gupta  1,846  0,783  4,350  0,96 
H?nggi  4,000  2,472  6,473  1,76 
Ivanov  2,273  1,691  3,054  2,30 
Karlsson  2,649  1,936  3,627  2,24 
Loverro  5,957  3,648  9,729  1,73 
Malinova  1,963  1,591  2,421  2,53 
Merz  1,697  1,287  2,236  2,35 
Nasri  2,740  1,833  4,096  1,98 
Nasri  2,400  1,711  3,367  2,17 
Pertl  1,293  1,115  1,499  2,67 
Suchocki  1,120  1,027  1,222  2,77 
Taviani  1,802  0,983  3,304  1,44 
Weber  1,618  1,374  1,904  2,64 
Wolman  2,481  1,556  3,956  1,80 
Moreles  2,312  1,845  2,896  2,49 
Rudigoz  2,981  1,638  5,426  1,46 
Todorova  1,667  0,729  3,808  1,01 
Gruboeck  7,036  3,689  13,422  1,35 
Chan  2,543  1,779  3,635  2,12 
Degenhardt  2,516  1,856  3,411  2,27 
Dijkhuizen  1,859  1,389  2,489  2,31 
Brolmann  2,017  1,487  2,736  2,27 
Ceccini  3,267  2,655  4,021  2,54 
Masearetti  2,059  1,096  3,866  1,38 
Mortakis  2,213  1,602  3,058  2,22 
Schramm  1,241  0,899  1,714  2,22 
Smith  1,938  1,252  3,001  1,88 
Osmers  1,964  1,699  2,271  2,68 
SeelbachG?bel  1,680  1,455  1,940  2,68 
Altuncu et al.  29,167  4,089  208,02  0,25 


(REM) pooled LR+  2,087  1,881  2,315 
Heterogeneity chisquared = 506,06 (d.f.= 47) p = 0,000
Inconsistency (Isquare) = 90,7%
No. studies = 48.
Filter OFF
Add 1/2 only zero cell studies
Results of Spearman rank correlation of sensitivity against (1 ? specificity) to assess the threshold effect in all test accuracy studies included in systematic review of ultrasound for prediction of endometrial cancer.
Var.  Coeff.  Std. Error  T  pvalue 
A  2.412  0.292  8.266  0.0000 
b(1)  0.187  0.101  1.857  0.0697 
Spearman correlation coefficient: 0,394 pvalue = 0,006 (Logit(TPR) vs Logit(FPR)
Moses' model (D = a + bS)
Unweighted regression
Tausquared estimate = 0,3540
(Convergence is achieved after 2 iterations)
Restricted Maximum Likelihood estimation (REML)
No. studies = 48
Filter OFF
Results of metaregression analysis for predicting the presence or absence of endometrial carcinoma with variables: use or nonuse of hormone replacement therapy (HRT); technique of ultrasound measurement (single or double layer); and population enrolment (consecutive or other).
MetaRegression(Inverse Variance weights) (1)  
Var.  Coeff.  pvalue  RDOR  [95%CI] 
Cte.  0,857  0,1571     
S  0,263  0,0208     
Layers  0,709  0,0610  2,03  (0,97;4,27) 
Consecutive  0,206  0,7398  1,23  (0,35;4,26) 
HRT  0,324  0,4152  1,38  (0,63;3,06) 
MetaRegression(Inverse Variance weights) (2)  
Var.  Coeff.  pvalue  RDOR  [95%CI] 


Cte.  0,849  0,1565     
S  0,253  0,0194     
Layers  0,739  0,0424  2,09  (1,03;4,27) 
HRT  0,320  0,4152  1,38  (0,63;3,02) 
MetaRegression(Inverse Variance weights) (3)  
Var.  Coeff.  pvalue  RDOR  [95%CI] 


Cte.  0,959  0,0999     
S  0,258  0,0166     
Layers  0,712  0,0482  2,04  (1,01;4,13) 
Article Categories:

Previous Document: Energy expenditure during overfeeding.
Next Document: Prevalence of soil transmitted nematodes on Nukufetau, a remote Pacific island in Tuvalu.