Wk 8 Assign Epid

profileNursebeauty
MetaAnalysis.pdf

HIPPOKRATIA 2010, 14 (Suppl 1) 2�

REVIEW ARTICLE

PASCHOS KAHIPPOKRATIA 2010, 14 (Suppl 1): 2�-��

Meta-analysis in medical research Haidich AB Department of Hygiene and Epidemiology, Aristotle University of Thessaloniki School of Medicine, Thessaloniki, Greece

Abstract The objectives of this paper are to provide an introduction to meta-analysis and to discuss the rationale for this type of research and other general considerations. Methods used to produce a rigorous meta-analysis are highlighted and some aspects of presentation and interpretation of meta-analysis are discussed. Meta-analysis is a quantitative, formal, epidemiological study design used to systematically assess previous research studies to derive conclusions about that body of research. Outcomes from a meta-analysis may include a more precise estimate of the effect of treatment or risk factor for disease, or other outcomes, than any individual study contributing to the pooled analysis. The examination of variability or heterogeneity in study results is also a critical outcome. The ben- efits of meta-analysis include a consolidated and quantitative review of a large, and often complex, sometimes apparently conflicting, body of literature. The specification of the outcome and hypotheses that are tested is critical to the conduct of meta-analyses, as is a sensitive literature search. A failure to identify the majority of existing studies can lead to er- roneous conclusions; however, there are methods of examining data to identify the potential for studies to be missing; for example, by the use of funnel plots. Rigorously conducted meta-analyses are useful tools in evidence-based medicine. The need to integrate findings from many studies ensures that meta-analytic research is desirable and the large body of research now generated makes the conduct of this research feasible. Hippokratia 2010; 14 (Suppl 1): 29-37

Key words: meta-analysis, systematic review, randomized clinical trial, bias, quality, evidence-based medicine

Corresponding author: Anna-Bettina Haidich, Department of Hygiene and Epidemiology Aristotle University of Thessaloniki, School of Medicine, 54124 Thessaloniki, Greece, Tel: +302310-999143, Fax: +302310-999701, e-mail:[email protected]

Important medical questions are typically studied more than once, often by different research teams in dif- ferent locations. In many instances, the results of these multiple small studies of an issue are diverse and con- flicting, which makes the clinical decision-making dif- ficult. The need to arrive at decisions affecting clinical practise fostered the momentum toward “evidence-based medicine”1-2. Evidence-based medicine may be defined as the systematic, quantitative, preferentially experimen- tal approach to obtaining and using medical information. Therefore, meta-analysis, a statistical procedure that in- tegrates the results of several independent studies, plays a central role in evidence-based medicine. In fact, in the hierarchy of evidence (Figure 1), where clinical evidence is ranked according to the strength of the freedom from various biases that beset medical research, meta-analy- ses are in the top. In contrast, animal research, laboratory studies, case series and case reports have little clinical value as proof, hence being in the bottom.

Meta-analysis did not begin to appear regularly in the medical literature until the late 1970s but since then a plethora of meta-analyses have emerged and the growth is exponential over time (Figure 2)3. Moreover, it has been shown that meta-analyses are the most frequently cited form of clinical research4. The merits and perils of the somewhat mysterious procedure of meta-analysis, however, continue to be debated in the medical com-

munity5-8. The objectives of this paper are to introduce meta-analysis and to discuss the rationale for this type of research and other general considerations.

Meta-Analysis and Systematic Review Glass first defined meta-analysis in the social science

literature as “The statistical analysis of a large collection of analysis results from individual studies for the purpose of integrating the findings”9. Meta-analysis is a quanti- tative, formal, epidemiological study design used to sys-

Figure 1: Hierarchy of evidence.

�0

tematically assess the results of previous research to de- rive conclusions about that body of research. Typically, but not necessarily, the study is based on randomized, controlled clinical trials. Outcomes from a meta-analysis may include a more precise estimate of the effect of treat- ment or risk factor for disease, or other outcomes, than any individual study contributing to the pooled analysis. Identifying sources of variation in responses; that is, ex- amining heterogeneity of a group of studies, and general- izability of responses can lead to more effective treatments or modifications of management. Examination of hetero- geneity is perhaps the most important task in meta-analy- sis. The Cochrane collaboration has been a long-standing,

rigorous, and innovative leader in developing methods in the field10. Major contributions include the development of protocols that provide structure for literature search methods, and new and extended analytic and diagnostic methods for evaluating the output of meta-analyses. Use of the methods outlined in the handbook should provide a consistent approach to the conduct of meta-analysis.

Moreover, a useful guide to improve reporting of system- atic reviews and meta-analyses is the PRISMA (Preferred Reporting Items for Systematic reviews and Meta-analy- ses) statement that replaced the QUOROM (QUality Of Reporting of Meta-analyses) statement11-13.

Meta-analyses are a subset of systematic review. A systematic review attempts to collate empirical evidence that fits prespecified eligibility criteria to answer a specif- ic research question. The key characteristics of a system- atic review are a clearly stated set of objectives with pre- defined eligibility criteria for studies; an explicit, repro- ducible methodology; a systematic search that attempts to identify all studies that meet the eligibility criteria; an assessment of the validity of the findings of the included studies (e.g., through the assessment of risk of bias); and a systematic presentation and synthesis of the attributes and findings from the studies used. Systematic methods

are used to minimize bias, thus providing more reliable findings from which conclusions can be drawn and de- cisions made than traditional review methods14,15. Sys- tematic reviews need not contain a meta-analysis—there are times when it is not appropriate or possible; however, many systematic reviews contain meta-analyses16.

The inclusion of observational medical studies in meta-analyses led to considerable debate over the valid- ity of meta-analytical approaches, as there was necessar- ily a concern that the observational studies were likely to be subject to unidentified sources of confounding and risk modification17. Pooling such findings may not lead to more certain outcomes. Moreover, an empirical study showed that in meta-analyses were both randomized and non-randomized was included, nonrandomized studies tended to show larger treatment effects18.

Meta-analyses are conducted to assess the strength of evidence present on a disease and treatment. One aim is to determine whether an effect exists; another aim is to determine whether the effect is positive or negative and, ideally, to obtain a single summary estimate of the effect. The results of a meta-analysis can improve precision of estimates of effect, answer questions not posed by the in- dividual studies, settle controversies arising from appar- ently conflicting studies, and generate new hypotheses. In particular, the examination of heterogeneity is vital to the development of new hypotheses.

Individual or Aggregated Data The majority of meta-analyses are based on a series

of studies to produce a point estimate of an effect and measures of the precision of that estimate. However, methods have been developed for the meta-analyses to be conducted on data obtained from original trials19,20. This approach may be considered the “gold standard” in meta- analysis because it offers advantages over analyses using aggregated data, including a greater ability to validate the quality of data and to conduct appropriate statistical anal- ysis. Further, it is easier to explore differences in effect across subgroups within the study population than with aggregated data. The use of standardized individual-level information may help to avoid the problems encountered in meta-analyses of prognostic factors21,22. It is the best way to obtain a more global picture of the natural his- tory and predictors of risk for major outcomes, such as in scleroderma23-26.This approach relies on cooperation between researchers who conducted the relevant studies. Researchers who are aware of the potential to contribute or conduct these studies will provide and obtain addition- al benefits by careful maintenance of original databases and making these available for future studies.

Literature Search A sound meta-analysis is characterized by a thor-

ough and disciplined literature search. A clear definition of hypotheses to be investigated provides the framework for such an investigation. According to the PRISMA statement, an explicit statement of questions being ad-

Figure 2: Cumulative number of publications about meta- analysis over time, until 17 December 2009 (results from Medline search using text “meta-analysis”).

HAIDICH AB

HIPPOKRATIA 2010, 14 (Suppl 1) �1

dressed with reference to participants, interventions, com- parisons, outcomes and study design (PICOS) should be provided11,12. It is important to obtain all relevant studies, because loss of studies can lead to bias in the study. Typi- cally, published papers and abstracts are identified by a computerized literature search of electronic databases that can include PubMed (www.ncbi.nlm.nih.gov./en- trez/query.fcgi), ScienceDirect (www.sciencedirect.com), Scirus (www.scirus.com/srsapp ), ISI Web of Knowledge (http://www.isiwebofknowledge.com), Google Scholar (http://scholar.google.com) and CENTRAL (Cochrane Central Register of Controlled Trials, http://www.mrw. interscience.wiley.com/cochrane/cochrane_clcentral_ar- ticles_fs.htm). PRISMA statement recommends that a full electronic search strategy for at least one major database to be presented12. Database searches should be augmented with hand searches of library resources for relevant pa- pers, books, abstracts, and conference proceedings. Cross- checking of references, citations in review papers, and communication with scientists who have been working in the relevant field are important methods used to provide a comprehensive search. Communication with pharmaceu- tical companies manufacturing and distributing test prod- ucts can be appropriate for studies examining the use of pharmaceutical interventions.

It is not feasible to find absolutely every relevant study on a subject. Some or even many studies may not be published, and those that are might not be indexed in computer-searchable databases. Useful sources for un- published trials are the clinical trials registers, such as the National Library of Medicine’s ClinicalTrials.gov Web- site. The reviews should attempt to be sensitive; that is, find as many studies as possible, to minimize bias and be efficient. It may be appropriate to frame a hypothesis that considers the time over which a study is conducted or to target a particular subpopulation. The decision whether to include unpublished studies is difficult. Although lan- guage of publication can provide a difficulty, it is impor- tant to overcome this difficulty, provided that the popula- tions studied are relevant to the hypothesis being tested.

Inclusion or Exclusion Criteria and Potential for Bias Studies are chosen for meta-analysis based on inclu-

sion criteria. If there is more than one hypothesis to be tested, separate selection criteria should be defined for each hypothesis. Inclusion criteria are ideally defined at the stage of initial development of the study protocol. The rationale for the criteria for study selection used should be clearly stated.

One important potential source of bias in meta-analy- sis is the loss of trials and subjects. Ideally, all random- ized subjects in all studies satisfy all of the trial selection criteria, comply with all the trial procedures, and provide complete data. Under these conditions, an “intention-to- treat” analysis is straightforward to implement; that is, statistical analysis is conducted on all subjects that are enrolled in a study rather than those that complete all stages of study considered desirable. Some empirical

studies had shown that certain methodological character- istics, such as poor concealment of treatment allocation or no blinding in studies exaggerate treatment effects27. Therefore, it is important to critically appraise the quality of studies in order to assess the risk of bias.

The study design, including details of the method of randomization of subjects to treatment groups, criteria for eligibility in the study, blinding, method of assess- ing the outcome, and handling of protocol deviations are important features defining study quality. When studies are excluded from a meta-analysis, reasons for exclusion should be provided for each excluded study. Usually, more than one assessor decides independently which studies to include or exclude, together with a well-defined checklist and a procedure that is followed when the assessors dis- agree. Two people familiar with the study topic perform the quality assessment for each study, independently. This is followed by a consensus meeting to discuss the studies excluded or included. Practically, the blinding of reviewers from details of a study such as authorship and journal source is difficult.

Before assessing study quality, a quality assessment protocol and data forms should be developed. The goal of this process is to reduce the risk of bias in the estimate of effect. Quality scores that summarize multiple compo- nents into a single number exist but are misleading and unhelpful28. Rather, investigators should use individual components of quality assessment and describe trials that do not meet the specified quality standards and probably assess the effect on the overall results by excluding them, as part of the sensitivity analyses.

Further, not all studies are completed, because of pro- tocol failure, treatment failure, or other factors. Nonethe- less, missing subjects and studies can provide important evidence. It is desirable to obtain data from all relevant randomized trials, so that the most appropriate analysis can be undertaken. Previous studies have discussed the significance of missing trials to the interpretation of in- tervention studies in medicine29,30. Journal editors and reviewers need to be aware of the existing bias toward publishing positive findings and ensure that papers that publish negative or even failed trials be published, as long as these meet the quality guidelines for publication.

There are occasions when authors of the selected pa- pers have chosen different outcome criteria for their main analysis. In practice, it may be necessary to revise the inclusion criteria for a meta-analysis after reviewing all of the studies found through the search strategy. Varia- tion in studies reflects the type of study design used, type and application of experimental and control therapies, whether or not the study was published, and, if published, subjected to peer review, and the definition used for the outcome of interest. There are no standardized criteria for inclusion of studies in meta-analysis. Universal criteria are not appropriate, however, because meta-analysis can be applied to a broad spectrum of topics. Published data in journal papers should also be cross-checked with con- ference papers to avoid repetition in presented data.

�2

Clearly, unpublished studies are not found by search- ing the literature. It is possible that published studies are systemically different from unpublished studies; for ex- ample, positive trial findings may be more likely to be published. Therefore, a meta-analysis based on literature search results alone may lead to publication bias.

Efforts to minimize this potential bias include work- ing from the references in published studies, searching computerized databases of unpublished material, and in- vestigating other sources of information including con- ference proceedings, graduate dissertations and clinical trial registers.

Statistical analysis The most common measures of effect used for dichot-

omous data are the risk ratio (also called relative risk) and the odds ratio. The dominant method used for continuous data are standardized mean difference (SMD) estimation. Methods used in meta-analysis for post hoc analysis of findings are relatively specific to meta-analysis and in- clude heterogeneity analysis, sensitivity analysis, and evaluation of publication bias.

All methods used should allow for the weighting of studies. The concept of weighting reflects the value of the evidence of any particular study. Usually, studies are weighted according to the inverse of their variance31. It is important to recognize that smaller studies, therefore, usually contribute less to the estimates of overall effect. However, well-conducted studies with tight control of measurement variation and sources of confounding con- tribute more to estimates of overall effect than a study of identical size less well conducted.

One of the foremost decisions to be made when conducting a meta-analysis is whether to use a fixed-ef- fects or a random-effects model. A fixed-effects model is based on the assumption that the sole source of variation in observed outcomes is that occurring within the study; that is, the effect expected from each study is the same. Consequently, it is assumed that the models are homoge- neous; there are no differences in the underlying study population, no differences in subject selection criteria, and treatments are applied the same way32. Fixed-effect methods used for dichotomous data include most often the Mantel-Haenzel method33 and the Peto method 34(only for odds ratios).

Random-effects models have an underlying assump- tion that a distribution of effects exists, resulting in het- erogeneity among study results, known as τ2. Conse- quently, as software has improved, random-effects mod- els that require greater computing power have become more frequently conducted. This is desirable because the strong assumption that the effect of interest is the same in all studies is frequently untenable. Moreover, the fixed effects model is not appropriate when statistical het- erogeneity (τ2) is present in the results of studies in the meta-analysis. In the random-effects model, studies are weighted with the inverse of their variance and the het- erogeneity parameter. Therefore, it is usually a more con-

servative approach with wider confidence intervals than the fixed-effects model where the studies are weighted only with the inverse of their variance. The most com- monly used random-effects method is the DerSimonian and Laird method35. Furthermore, it is suggested that comparing the fixed-effects and random-effect models developed as this process can yield insights to the data36.

Heterogeneity Arguably, the greatest benefit of conducting meta-

analysis is to examine sources of heterogeneity, if pres- ent, among studies. If heterogeneity is present, the sum- mary measure must be interpreted with caution 37. When heterogeneity is present, one should question whether and how to generalize the results. Understanding sources of heterogeneity will lead to more effective targeting of prevention and treatment strategies and will result in new research topics being identified. Part of the strategy in conducting a meta-analysis is to identify factors that may be significant determinants of subpopulation analysis or covariates that may be appropriate to explore in all stud- ies.

To understand the nature of variability in studies, it is important to distinguish between different sources of het- erogeneity. Variability in the participants, interventions, and outcomes studied has been described as clinical di- versity, and variability in study design and risk of bias has been described as methodological diversity10. Variability in the intervention effects being evaluated among the dif- ferent studies is known as statistical heterogeneity and is a consequence of clinical or methodological diversity, or both, among the studies. Statistical heterogeneity mani- fests itself in the observed intervention effects varying by more than the differences expected among studies that would be attributable to random error alone. Usually, in the literature, statistical heterogeneity is simply referred to as heterogeneity.

Clinical variation will cause heterogeneity if the inter- vention effect is modified by the factors that vary across studies; most obviously, the specific interventions or par- ticipant characteristics that are often reflected in different levels of risk in the control group when the outcome is dichotomous. In other words, the true intervention effect will differ for different studies. Differences between stud- ies in terms of methods used, such as use of blinding or differences between studies in the definition or measure- ment of outcomes, may lead to differences in observed effects. Significant statistical heterogeneity arising from differences in methods used or differences in outcome as- sessments suggests that the studies are not all estimating the same effect, but does not necessarily suggest that the true intervention effect varies. In particular, heterogene- ity associated solely with methodological diversity indi- cates that studies suffer from different degrees of bias. Empirical evidence suggests that some aspects of design can affect the result of clinical trials, although this may not always be the case.

The scope of a meta-analysis will largely determine

HAIDICH AB

HIPPOKRATIA 2010, 14 (Suppl 1) ��

the extent to which studies included in a review are di- verse. Meta-analysis should be conducted when a group of studies is sufficiently homogeneous in terms of sub- jects involved, interventions, and outcomes to provide a meaningful summary. However, it is often appropriate to take a broader perspective in a meta-analysis than in a single clinical trial. Combining studies that differ sub- stantially in design and other factors can yield a mean- ingless summary result, but the evaluation of reasons for the heterogeneity among studies can be very insightful. It may be argued that these studies are of intrinsic interest on their own, even though it is not appropriate to produce a single summary estimate of effect.

Variation among k trials is usually assessed using Cochran’s Q statistic, a chi-squared (χ2) test of heteroge- neity with k-1 degrees of freedom. This test has relatively poor power to detect heterogeneity among small numbers of trials; consequently, an α-level of 0.10 is used to test hypotheses38,39.

Heterogeneity of results among trials is better quanti- fied using the inconsistency index I 2, which describes the percentage of total variation across studies40. Uncertainty intervals for I 2 (dependent on Q and k) are calculated us- ing the method described by Higgins and Thompson41. Negative values of I 2 are put equal to zero, consequently I 2 lies between 0 and 100%. A value >75% may be con- sidered substantial heterogeneity41. This statistic is less influenced by the number of trials compared with other methods used to estimate the heterogeneity and provides a logical and readily interpretable metric but it still can be unstable when only a few studies are combined42.

Given that there are several potential sources of het- erogeneity in the data, several steps should be considered in the investigation of the causes. Although random-ef- fects models are appropriate, it may be still very desirable to examine the data to identify sources of heterogeneity and to take steps to produce models that have a lower lev- el of heterogeneity, if appropriate. Further, if the studies examined are highly heterogeneous, it may be not appro- priate to present an overall summary estimate, even when random effects models are used. As Petiti notes43, statis- tical analysis alone will not make contradictory studies agree; critically, however, one should use common sense in decision-making. Despite heterogeneity in responses, if all studies had a positive point direction and the pooled confidence interval did not include zero, it would not be logical to conclude that there was not a positive effect, provided that sufficient studies and subject numbers were present. The appropriateness of the point estimate of the effect is much more in question.

Some of the ways to investigate the reasons for het- erogeneity; are subgroup analysis and meta-regression. The subgroup analysis approach, a variation on those described above, groups categories of subjects (e.g., by age, sex) to compare effect sizes. The meta-regression approach uses regression analysis to determine the influ- ence of selected variables (the independent variables) on the effect size (the dependent variable). In a meta-regres-

sion, studies are regarded as if they were individual pa- tients, but their effects are properly weighted to account for their different variances44.

Sensitivity analyses have also been used to examine the effects of studies identified as being aberrant concern- ing conduct or result, or being highly influential in the analysis. Recently, another method has been proposed that reduces the weight of studies that are outliers in meta-analyses45. All of these methods for examining het- erogeneity have merit, and the variety of methods avail- able reflects the importance of this activity.

Presentation of results A useful graph, presented in the PRISMA statement11,

is the four-phase flow diagram (Figure 3).

This flow-diagram depicts the flow of information through the different phases of a systematic review or meta-analysis. It maps out the number of records identi- fied, included and excluded, and the reasons for exclu- sions. The results of meta-analyses are often presented in a forest plot, where each study is shown with its ef- fect size and the corresponding 95% confidence interval (Figure 4).

The pooled effect and 95% confidence interval is shown in the bottom in the same line with “Overall”. In the right panel of Figure 4, the cumulative meta-analysis is graphically displayed, where data are entered succes- sively, typically in the order of their chronological ap- pearance46,47. Such cumulative meta-analysis can retro- spectively identify the point in time when a treatment

Υear of publication

Figure 3: PRISMA 2009 Flow Diagram (From Moher D, Liberati A, Tetzlaff J, Altman DG; PRISMA Group. Preferred reporting items for systematic reviews and meta-analyses: the PRISMA statement. J Clin Epidemiol 2009;62:1006-12, For more information, visit www.prisma-statement.org).

�4

effect first reached conventional levels of significance. Cumulative meta-analysis is a compelling way to exam- ine trends in the evolution of the summary-effect size, and to assess the impact of a specific study on the overall conclusions46. The figure shows that many studies were performed long after cumulative meta-analysis would have shown a significant beneficial effect of antibiotic prophylaxis in colon surgery.

Biases in meta-analysis Although the intent of a meta-analysis is to find and

assess all studies meeting the inclusion criteria, it is not always possible to obtain these. A critical concern is the papers that may have been missed. There is good reason to be concerned about this potential loss because stud- ies with significant, positive results (positive studies)

are more likely to be published and, in the case of interventions with a commercial value, to be promoted, than studies with non-significant or “negative” results (negative stud- ies). Studies that produce a positive result, especially large studies, are more likely to have been published and, conversely, there has been a re- luctance to publish small studies that have non-significant results. Further, publication bias is not solely the responsibility of editorial policy as there is reluctance among research- ers to publish results that were either uninteresting or are not random- ized48. There are, however, problems with simply including all studies that have failed to meet peer-review standards. All methods of retrospec- tively dealing with bias in studies are imperfect.

It is important to examine the re- sults of each meta-analysis for evi- dence of publication bias. An estima- tion of likely size of the publication bias in the review and an approach to dealing with the bias is inherent to the conduct of many meta-analy- ses. Several methods have been de- veloped to provide an assessment of publication bias; the most common- ly used is the funnel plot. The funnel plot provides a graphical evaluation of the potential for bias and was de- veloped by Light and Pillemer49 and discussed in detail by Egger and col- leagues50,51. A funnel plot is a scat- terplot of treatment effect against a measure of study size. If publication bias is not present, the plot is ex- pected to have a symmetric inverted

funnel shape, as shown in Figure 5A. In a study in which there is no publication bias, larger

studies (i.e., have lower standard error) tend to cluster closely to the point estimate. As studies become less pre- cise, such as in smaller trials (i.e., have a higher standard error), the results of the studies can be expected to be more variable and are scattered to both sides of the more precise larger studies. Figure 5A shows that the smaller, less precise studies are, indeed, scattered to both sides of the point estimate of effect and that these seem to be sym- metrical, as an inverted funnel-plot, showing no evidence of publication bias. In contrast to Figure 5A, Figure 5B shows evidence of publication bias. There is evidence of the possibility that studies using smaller numbers of sub- jects and showing an decrease in effect size (lower odds ratio) were not published.

Figure 4: Forest plots of the meta-analysis addressing the use of antibiotic prophy- laxis compared with no treatment in colon surgery. The outcome is would infection and 32 studies were included in the meta-analysis. Risk ratio <1 favors use of pro- phylactic antibiotics, whereas risk ratio > 1 suggests that no treatment is better. Left panel, Studies displayed chronologically by year of publication, n represents study size. Each study is represented by a filled circle (denoting its risk ratio estimate) and the horizontal line denotes the corresponding 95% confidence interval. Studies that intersect the vertical line of unity (RR=1), indicate no difference between the antibi- otic group and the control group. Pooled results from all studies are shown at bottom with the random-effect model. Right panel,Cumulative meta-analysis of same stud- ies with random-effects model, where the summary risk ratio is re-estimated each time a study is added over time. It reveals that antibiotic prophylaxis efficacy could have been identified as early as 1971 after 5 studies involving about 300 patients (n in this panel represents cumulative number of patients from included studies). (From Ioannidis JP, Lau J. State of the evidence: current status and prospects of meta- analysis in infectious diseases. Clin Infect Dis 1999;29:1178–85).

HAIDICH AB

HIPPOKRATIA 2010, 14 (Suppl 1) ��

Asymmetry of funnel plots is not solely attributable to publication bias, but may also result from clinical het- erogeneity among studies. Sources of clinical heterogene- ity include differences in control or exposure of subjects to confounders or effect modifiers, or methodological heterogeneity between studies; for example, a failure to conceal treatment allocation. There are several statistical tests for detecting funnel plot asymmetry; for example, Egger’s linear regression test50, and Begg’s rank correla- tion test52 but these do not have considerable power and are rarely used. However, the funnel plot is not without problems. If high precision studies really are different than low precision studies with respect to effect size (e.g., due different populations examined) a funnel plot may give a wrong impression of publication bias53. The ap- pearance of the funnel plot plot can change quite dramati- cally depending on the scale on the y-axis - whether it is the inverse square error or the trial size54.

Other types of biases in meta-analysis include the time lag bias, selective reporting bias and the language bias. The time lag bias arises from the published stud- ies, when those with striking results are published earlier than those with non-significant findings55. Moreover, it has been shown that positive studies with high early ac-

crual of patients are published sooner than negative trials with low early accrual56. However, missing studies, either due to publication bias or time-lag bias may increasingly be identified from trials registries.

The selective reporting bias exists when published articles have incomplete or inadequate reporting. Empiri- cal studies have shown that this bias is widespread and of considerable importance when published studies were compared with their study protocols29,30. Furthermore, re- cent evidence suggests that selective reporting might be an issue in safety outcomes and the reporting of harms in clinical trials is still suboptimal57. Therefore, it might not be possible to use quantitative objective evidence for harms in performing meta-analyses and making thera- peutic decisions.

Excluding clinical trials reported in languages other

than English from meta-analyses may introduce the lan- guage bias and reduce the precision of combined estimates of treatment effects. Trials with statistically significant results have been shown to be published in English58. In contrast, a later more extensive investigation showed that trials published in languages other than English tend to be of lower quality and produce more favourable treatment effects than trials published in English and concluded that excluding non-English language trials has generally only modest effects on summary treatment effect estimates but

the effect is difficult to predict for individual meta-analy- ses59.

Evolution of meta-analyses The classical meta-analysis compares two treatments

while network meta-analysis (or multiple treatment meta- analysis) can provide estimates of treatment efficacy of multiple treatment regimens, even when direct compari- sons are unavailable by indirect comparisons60. An ex- ample of a network analysis would be the following. An initial trial compares drug A to drug B. A different trial studying the same patient population compares drug B to drug C. Assume that drug A is found to be superior to drug B in the first trial. Assume drug B is found to be equivalent to drug C in a second trial. Network analysis then, allows one to potentially say statistically that drug A is also superior to drug C for this particular patient popu- lation. (Since drug A is better than drug B, and drug B is equivalent to drug C, then drug A is also better to drug C even though it was not directly tested against drug C.)

Meta-analysis can also be used to summarize the per- formance of diagnostic and prognostic tests. However, studies that evaluate the accuracy of tests have a unique design requiring different criteria to appropriately assess the quality of studies and the potential for bias. Addition- ally, each study reports a pair of related summary statis- tics (for example, sensitivity and specificity) rather than a single statistic (such as a risk ratio) and hence requires different statistical methods to pool the results of the stud- ies61. Various techniques to summarize results from diag- nostic and prognostic test results have been proposed62-64. Furthermore, there are many methodologies for advanced

Figure 5: A) Symmetrical funnel plot. B) Asymmetrical fun- nel plot, the small negative studies in the bottom left corner is missing.

��

meta-analysis that have been developed to address specif- ic concerns, such as multivariate meta-analysis65–67, and special types of meta-analysis in genetics68 but will not be discussed here.

Meta-analysis is no longer a novelty in medicine. Nu- merous meta-analyses have been conducted for the same medical topic by different researchers. Recently, there is a trend to combine the results of different meta-analyses, known as a meta-epidemiological study, to assess the risk of bias79,70.

Conclusions The traditional basis of medical practice has been

changed by the use of randomized, blinded, multicenter clinical trials and meta-analysis, leading to the widely used term “evidence-based medicine”. Leaders in initiating this change have been the Cochrane Collaboration who have produced guidelines for conducting systematic reviews and meta-analyses10 and recently the PRISMA statement, a helpful resource to improve reporting of systematic re- views and meta-analyses has been released11. Moreover, standards by which to conduct and report meta-analyses of observational studies have been published to improve the quality of reporting71.

Meta-analysis of randomized clinical trials is not an infallible tool, however, and several examples exist of meta-analyses which were later contradicted by single large randomized controlled trials, and of meta-analyses addressing the same issue which have reached opposite conclusions72. A recent example, was the controversy be- tween a meta-analysis of 42 studies73 and the subsequent publication of the large-scale trial (RECORD trial) that did not support the cardiovascular risk of rosiglitazone74. However, the reason for this controversy was explained by the numerous methodological flaws found both in the meta-analysis and the large clinical trial75.

No single study, whether meta-analytic or not, will provide the definitive understanding of responses to treat- ment, diagnostic tests, or risk factors influencing disease. Despite this limitation, meta-analytic approaches have demonstrable benefits in addressing the limitations of study size, can include diverse populations, provide the opportunity to evaluate new hypotheses, and are more valuable than any single study contributing to the analy- sis. The conduct of the studies is critical to the value of a meta-analysis and the methods used need to be as rigor- ous as any other study conducted.

References 1. Guyatt GH, Sackett DL, Sinclair JC, Hayward R, Cook DJ,

Cook RJ. User’s guide to the medical literature. IX. A method for grading health care recommendations. Evidence-Based Med- icine Working Group. JAMA. 1995; 274: 1800–1804.

2. Sackett DL, Rosenberg WM, Gray JA, Haynes RB, Richardson WS. Evidence based medicine: what it is and what it isn’t. BMJ. 1996: 312: 71–72.

3. Chalmers TC, Matta RJ, Smith H Jr, Kunzler AM. Evidence fa- voring the use of anticoagulants in the hospital phase of acute myocardial infarction. N Engl J Med. 1977; 297: 1091–1096.

4. Patsopoulos NA, Analatos AA, Ioannidis JP. Relative citation

impact of various study designs in the health sciences. JAMA. 2005; 293: 2362-2366.

5. Hennekens CH, DeMets D, Bairey-Merz CN, Borzak S, Borer J. Doing more good than harm: the need for a cease fire. Am J Med. 2009; 122: 315-316.

6. Naylor CD. Meta-analysis and the meta-epidemiology of clini- cal research. BMJ. 1997; 315: 617-619.

7. Bailar JC. The promise and problems of meta-analysis [edito- rial]. N Engl J Med. 1997; 337: 559-561.

8. Meta-analysis under scrutiny [editorial]. Lancet 1997; 350: 675.

9. Glass GV. Primary, secondary and meta-analysis of research. Educational Researcher. 1976; 5: 3–8.

10. Higgins JPT, Green S (editors). Cochrane Handbook for System- atic Reviews of Interventions Version 5.0.2 [updated September 2009]. The Cochrane Collaboration, 2009. Available from www. cochrane-handbook.org.

11. Moher D, Liberati A, Tetzlaff J, Altman DG; PRISMA Group. Preferred reporting items for systematic reviews and meta-anal- yses: the PRISMA statement. J Clin Epidemiol 2009;62:1006- 1012.

12. Liberati A, Altman DG, Tetzlaff J, Mulrow C, Götzsche PC, Io- annidis JP, et al. The PRISMA statement for reporting systematic reviews and meta-analyses of studies that evaluate health care interventions: explanation and elaboration. J Clin Epidemiol. 2009;62:e1-34.

13. Moher D, Cook DJ, Eastwood S, Olkin I, Rennie D, Stroup DF. Improving the quality of reports of metaanalyses of randomised controlled trials: the QUOROM statement. Quality of Reporting of Meta-analyses. Lancet 1999; 354: 1896–1900.

14. Antman, EM, Lau J, Kupelnick B, Mosteller F, Chalmers TC. A comparison of results of meta-analyses of randomized control trials and recommendations of clinical experts: Treatments for myocardial infarction. JAMA. 1992; 268: 240–248.

15. Oxman, AD, Guyatt GH. The science of reviewing research. Ann NY Acad Sci. 1993; 703: 125–133.

16. Lau J, Ioannidis JP, Schmid CH. Quantitative synthesis in sys- tematic reviews. Ann Intern Med. 1997; 127: 820–826.

17. Greenland S. Invited commentary: A critical look at some popu- lar meta-analytic methods. Am J Epidemiol. 1994; 140: 290– 296.

18. Ioannidis JPA, Haidich A-B, Pappa M, Pantazis N, Kokori SI, Tektonidou MG, et al. Comparison of evidence of treatment ef- fects in randomized and non-randomized studies. JAMA. 2001; 286: 821-830.

19. Simmonds MC, Higgins JPT, Stewart LA, Tierney JA, Clarke MJ, Thompson SG. Meta-analysis of individual patient data from randomized trials: A review of methods used in practice. Clin Trials. 2005; 2: 209–217.

20. Stewart LA, Clarke MJ. Practical methodology of meta-analy- ses (overviews) using updated individual patient data. Cochrane working group. Stat Med. 1995; 14: 2057–2079.

21. Altman DG. Systematic reviews of evaluations of prognostic variables. BMJ. 2001; 323: 224 –228.

22. Simon R, Altman DG: Statistical aspects of prognostic factor studies in oncology. Br J Cancer.1994; 69: 979 –985.

23. Ioannidis JPA, Vlachoyiannopoulos PG, Haidich AB, Medsger TA Jr., Lucas M, Michet CJ, et al. Mortality in systemic sclero- sis: an international meta-analysis of individual patient data. Am J Med. 2005; 118: 2-10.

24. Oxman AD, Clarke MJ, Stewart LA. From science to practice. Meta-analyses using individual patient data are needed. JAMA. 1995; 274: 845- 846.

25. Stewart LA, Tierney JF: To IPD or not to IPD? Advantages and disadvantages of systematic reviews using individual patient data. Eval Health Prof. 2002; 25: 76 –97.

26. Trikalinos TA, Ioannidis JPA: Predictive modeling and hetero- geneity of baseline risk in meta-analysis of individual patient data. J Clin Epidemiol. 2001; 54: 245–252.

HAIDICH AB

HIPPOKRATIA 2010, 14 (Suppl 1) ��

27. Pildal J, Hróbjartsson A, Jorgensen KJ, Hilden J, Altman DG, Götzsche PC. Impact of allocation concealment on conclusions drawn from meta-analyses of randomized trials. Int J Epidemiol. 2007; 36: 847-857.

28. Juni P, Altman DG, Egger M. Systematic reviews in health care: Assessing the quality of controlled clinical trials. BMJ. 2001; 323: 42–46.

29. Chan AW, Hróbjartsson A , Haahr MT, Götzsche PC, Altman DG. Empirical evidence for selective reporting of outcomes in randomized trials: comparison of protocols to published articles. JAMA. 2004; 291: 2457–2465.

30. Dwan K, Altman DG, Arnaiz JA, Bloom J, Chan A, Cronin E, et al. Systematic review of the empirical evidence of study publica- tion bias and outcome reporting bias. PLoS One. 2008; 3:e3081.

31. Egger M, Smith GD, Phillips AN. Meta-analysis: principles and procedures. BMJ. 1997; 315: 1533-1537.

32. Stangl DK, Berry DA. Meta-analysis in medicine and health policy. Marcel Dekker, New York, NY. 2000.

33. Mantel N, Haenszel W. Statistical aspects of the analysis of data from retrospective studies of disease. J Natl Cancer Inst. 1959; 22: 719–748.

34. Yusuf S, Peto R, Lewis J, Collins R, Sleight P. Beta blockade during and after myocardial infarction: an overview of the ran- domized trials. Prog Cardiovasc Dis. 1985; 27: 335–371.

35. DerSimonian R, Laird N. Meta-analysis in clinical trials. Con- trol Clin Trials. 1986; 7: 177–188.

36. Whitehead A. Meta-analysis of controlled clinical trials. JohnWiley & Sons Ltd. Chichester, UK. 2002.

37. Greenland S. Quantitative methods in the review of epidemio- logic literature. Epidemiol Rev. 1987; 9: 1–30.

38. Cochran W. The combination of estimates from different experi- ments. Biometrics. 1954; 10: 101–129.

39. Hardy RJ, Thompson SG. Detecting and describing heterogene- ity in meta-analysis. Stat Med.1998; 17: 841–856.

40. Higgins JP, Thompsom SG, Deeks JJ, Altman DG. Measuring inconsistency in meta-analysis. BMJ. 2003; 327: 557–560.

41. Higgins JP, Thompson SG. Quantifying heterogeneity in a meta- analysis. Stat Med. 2002; 21:1539–1558.

42. Huedo-Medina TB, Sanchez-Meca J, Marin-Martinez F, Botella J. Assessing heterogeneity in meta-analysis: Q statistic or I2 in- dex? Psychol Methods. 2006; 11: 193–206.

43. Petiti DB. Meta-analysis, decision analysis and cost-effective- ness analysis. Oxford University Press, New York, NY. 1994.

44. Lau J, Ioannidis JP, Schmid CH. Quantitative synthesis in sys- tematic reviews. Ann Intern Med. 1997; 127: 820–826.

45. Baker R, Jackson D. A new approach to outliers in meta-analy- sis. Health Care Manage Sci. 2008; 11:121–131.

46. Lau J, Schmid CH, Chalmers TC. Cumulative meta-analysis of clinical trials builds evidence for exemplary medical care. J Clin Epidemiol. 1995; 48: 45–57.

47. Ioannidis JP, Lau J. State of the evidence: current status and prospects of meta-analysis in infectious diseases. Clin Infect Dis. 1999; 29: 1178–1185.

48. Dickersin K, Berlin JA. Meta-analysis: State of the science. Epi- demiol Rev. 1992; 14: 154–176.

49. Light J, Pillemer DB. Summing up: The Science of Review- ing Research. Harvard University Press, Cambridge, Massachu- setts,1984.

50. Egger M, Smith GD, Schneider M, Minder C. Bias in meta- analysis detected by a simple, graphical test. BMJ. 1997; 315: 629–624.

51. Sterne JA, Egger M. Funnel plots for detecting bias in meta- analysis: guidelines on choice of axis. J Clin Epidemiol. 2001; 54: 1046–1045.

52. Begg CB, Mazumdar M. Operating characteristics of a rank cor- relation test for publication bias. Biometrics. 1994; 50: 1088- 1101.

53. Lau J, Ioannidis JP, Terrin N, Schmid CH, Olkin I. The case of the misleading funnel plot. BMJ. 2006; 333: 597–590.

54. Tang JL, Liu JL. Misleading funnel plot for detection of bias in meta-analysis. J Clin Epidemiol. 2000; 53: 477–484.

55. Ioannidis JP. Effect of the statistical significance of results on the time to completion and publication of randomized efficacy trials. JAMA. 1998; 279: 281-286.

56. Haidich AB, Ioannidis JP. Effect of early patient enrollment on the time to completion and publication of randomized controlled trials. Am J Epidemiol. 2001; 154: 873–880.

57. Haidich AB, Charis B, Dardavessis T, Tirodimos I, Arvanitidou M. The quality of safety reporting in trials is still suboptimal: Survey of major general medical journals. J Clin Epidemiol. 2010; (in press).

58. Egger M, Zellweger-Zähner T, Schneider M, Junker C, Lengeler C, Antes G. Language bias in randomised controlled trials pub- lished in English and German. Lancet. 1997; 350: 326–329.

59. Jόni P, Holenstein F, Sterne J, Bartlett C, Egger M. Direction and impact of language bias in meta-analyses of controlled tri- als: empirical study. Int J Epidemiol. 2002; 31:115–123.

60. Lumley T. Network meta-analysis for indirect treatment com- parisons. Stat Med. 2002; 21: 2313–2324.

61. Deeks JJ. Systematic reviews in health care: Systematic reviews of evaluations of diagnostic and screening tests. BMJ. 2001; 323: 157–162.

62. Littenberg B, Moses LE. Estimating diagnostic accuracy from multiple conflicting reports: a new meta-analytic method. Med Decis Making. 1993; 13: 313–321.

63. Reitsma JB, Glas AS, Rutjes AW, Scholten RJ, Bossuyt PM, Zwinderman AH. Bivariate analysis of sensitivity and specificity produces informative summary measures in diagnostic reviews. J Clin Epidemiol. 2005; 58: 982-990.

64. Rutter CM, Gatsonis CA. A hierarchical regression approach to meta-analysis of diagnostic test accuracy evaluations. Stat Med. 2001; 20: 2865–2884.

65. Arends LR. Multivariate meta-analysis: modelling the heteroge- neity. Rotterdam: Erasmus University; 2006.

66. Riley RD, Abrams KR, Sutton AJ, Lambert PC, Thompson JR. Bivariate random-effects meta-analysis and the estimation of be- tween-study correlation. BMC Med Res Methodol. 2007; 7: 3.

67. Trikalinos TA, Olkin I. A method for the meta-analysis of mutu- ally exclusive binary outcomes. Stat Med. 2008; 27: 4279–4300.

68. Trikalinos TA, Salanti G, Zintzaras E, Ioannidis JP. Meta-analy- sis methods. Adv Genet. 2008; 60: 311–334.

69. Nόesch E, Trelle S, Reichenbach S, Rutjes AW, Bόrgi E, Scherer M, et al. The effects of excluding patients from the analysis in randomized controlled trials: meta-epidemiological study. BMJ. 2009; 339: b3244.

70. Wood L, Egger M, Gluud LL, Schulz KF, Jόni P, Altman DG, et al. Empirical evidence of bias in treatment effect estimates in controlled trials with different interventions and outcomes: meta-epidemiological study. BMJ. 2008; 336: 601-605.

71. Stroup DF, Berlin JA, Morton SC, Olkin I, Williamson GD, Rennie D, et al. Meta-analysis of observational studies in epi- demiology: a proposal for reporting. Meta-analysis of Observa- tional Studies in Epidemiology (MOOSE) group. JAMA. 2000; 283:2008–2012.

72. LeLorier J, Grégoire G, Benhaddad A, Lapierre J, Derderian F. Discrepancies between meta-analyses and subsequent large ran- domized, controlled trials. N Engl J Med. 1997; 337: 536–542.

73. Nissen SE, Wolski K. Effect of rosiglitazone on the risk of myo- cardial infarction and death from cardiovascular causes. N Engl J Med. 2007; 356: 2457–2471.

74. Home PD, Pocock SJ, Beck-Nielsen H, Curtis PS, Gomis R, Hanefeld M, et al; RECORD study team. Rosiglitazone Evalu- ated for Cardiovascular Outcomes in Oral Agent Combination Therapy for Type 2 Diabetes (RECORD). Lancet. 2009; 373: 2125-2135.

75. Hlatky MA, Bravata DM. Review: rosiglitazone increases risk of MI but does not differ from other drugs for CV death in type 2 diabetes. Evid Based Med. 2007; 12:169-170.