Top#2
©Copyright by Croatian Society of Medical Biochemistry and Laboratory Medicine. This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc-nd/3.0/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
Abstract
Statistics are the primary tools for assessing relationships and evaluating study questions. Unfortunately, these tools are often misused, either inad- vertently because of ignorance or lack of planning, or conspicuously to achieve a specified result. Data abuses include the incorrect application of statistical tests, lack of transparency and disclosure about decisions that are made, incomplete or incorrect multivariate model building, or exclusion of outliers. Individually, each of these actions may completely invalidate a study, and often studies are victim to more than one offense. Increasingly there are tools and guidance for researchers to look to, including the development of an analysis plan and a series of study specific checklists, in order to prevent or mitigate these offenses. Key words: a priori; analytical plan; statistical methods; disclosure; transparency; biostatistics
Received: November 30, 2014 Accepted: January 03, 2015
The misuse and abuse of statistics in biomedical research
Matthew S. Thiese*, Zachary C. Arnold, Skyler D. Walker
Rocky Mountain Center for Occupational and Environmental Health, University of Utah, Salt Lake City, Utah, USA
*Corresponding author: [email protected]
http://dx.doi.org/10.11613/BM.2015.001 Biochemia Medica 2015;25(1):5–11
5
Lessons in biostatistics
Introduction
Utility of biomedical research is a product of ap- propriate study design, high quality measures, proper selection and application of statistical methods, and correct interpretations of analytical results. Biostatistics are a set of tools that are used to evaluate relationships between results in bio- medical research. They are essential for furthering scientific knowledge and understanding. Unfortu- nately, statistics can be appropriately used, mis- used and abused, either through concept or appli- cation. These concerns have been voiced in the lit- erature since the 1980s, and are still cogent con- cerns today (1,2). Concerns include poor planning, communication, or understanding of the concep- tual framework that the statistical tool is being used to evaluate. Inappropriate study design is of- ten the first actions in research, and has been dis- cussed in the literature (3-5). Problems with data collection and analyses follow directly from study design and are often poorly described in the litera- ture. Recently, additional guidance has been pro-
posed regarding reporting of study methods and results to improve quality and reduce misconduct within biomedical research (6-8).
Computers and statistical software packages have increased the complexity with which data can be analyzed and, consequently, the use of statistics in medical research has also increased. Unfortunate- ly, though the types of errors may have changed, the frequency of statistical misuse has not (9,10). These errors are primarily due to inadequate knowledge and researchers not seeking support from statisticians (9). There is no consensus of opti- mal methods for biomedical research among bio- statisticians, either currently or in the past. There are many differences of opinion in methodological approaches, as exemplified by Frequentist and Bayesianist statistical methodology, or statistical estimation methods proposed by Fischer, Ney- man, and Wald among others (11-15). The differ- ences of opinion are so longstanding entrenched that most statistical packages simultaneously pre-
Biochemia Medica 2015;25(1):5–11 http://dx.doi.org/10.11613/BM.2015.001
6
Thiese MS. et al. Misuse and abuse of statistics
sent multiple estimation results and allow for se- lection of different methods depending on the type of data and relative strength of each method.
While most misuses of statistics are inadvertent and arise from a lack of knowledge or planning, others may be deliberate decisions in order to achieve a desired statistical result. A recent sys- tematic review and meta-analysis investigating fabrication and falsification of research found that 33.7% of those surveyed admitted to questionable research practices, including modifying results to improve the outcome, questionable interpretation of data, withholding methodological or analytical details, dropping observations or data points from analyses because of a “gut feeling that they were inaccurate” and deceptive or misleading report of design, data or results (3). While it is difficult to dis- cern the differences between the two, the end re- sult is often the same, erroneous relationships and flawed conclusions that are printed and relied upon by others in the field. This report will discuss common misuses and abuses of biostatistics from an epidemiological perspective and provide some guidance on methods to reduce the likelihood for these wrongdoings.
Errors in statistical design
Each statistical test requires certain assumptions to be met and types of data (categorical, continu- ous, etc.) in order to produce valid results. If these assumptions are not appropriately considered dur- ing selection of statistical tests, meaningful errors and misinterpretation of results are possible. At best errors of this nature may be a slight limitation, and at worst may completely invalidate results and their associated conclusions (16). In worst case scenari- os, the research study itself may be entirely com- promised. It is possible that errors in the applica- tion of biostatistics may occur at any or all stages of a study. Furthermore, a single statistical error can be adequate to invalidate any study results (17). Any re- search investigation can be appropriately planned and performed, however, if incorrect analytical ap- proach is applied the repercussions may be as grave as if the investigation was fundamentally flawed in either design or execution (17).
Errors in the description and presentation of data
Discussions of statistical assumptions are com- monly absent from many research articles (18,19). One study reported that nearly 90% of all the pub- lished articles evaluated lacked any discussion of statistical assumptions (19). More concerning is that many articles fail to report which statistical tests were utilized during data analysis (20). Only stating that tests were used “where appropriate” is also grossly inadequate, yet commonly done (18,21). Statistical tests are precisely designed for specific types of data, and with the vast array of tests now available, thorough consideration must be given to the assumptions, which guide their se- lection.
The prevalence of statistical misuse can be ex- plained by the widespread absence of basic statis- tical knowledge among the medical community (22,23). In a cross-sectional study of faculty and students from colleges of medicine, Gore reported the 53.87% found statistics to be very difficult, 52.9% could not correctly define the meaning of P value, 36.45% ill-defined standard deviation, and 50.97% failed to correctly calculate sample size.
Appropriate treatment of outliers
Outliers are observations beyond what is expect- ed, which may be identified by some statistical variation (e.g. 3 standard deviations above or be- low the mean) or simple face validity (e.g. body mass index of 65.0) or consensus based upon on clinical reasons. Traditionally, outliers were exclud- ed from analyses because they were thought to be unduly influencing the statistical model, particu- larly in studies with small sample size (24). While this may be true in some instances, researchers may consciously or unconsciously exclude valid data that don’t fit a pre-defined data pattern or hypothesis, therefore committing an error. This may be a simple error that will have minimal im- pact on results, or it can be a fatal error, which will completely invalidate results. Arguments for iden- tification and omission of outliers are common, however there is little consensus on the appropri-
http://dx.doi.org/10.11613/BM.2015.001 Biochemia Medica 2015;25(1):5–11
7
Thiese MS. et al. Misuse and abuse of statistics
ate treatment of an outlier. The most comprehen- sive approach is to analyze data with the extreme observations included and run a second set of analyses excluding these data. Disclosure and complete presentation of both sets of analyses will allow the readers to arrive at their own conclusions regarding relationships between exposures and outcomes. Unfortunately, due to multiple reasons ranging from malfeasance to word count limita- tions, these analyses are often not performed or presented. Caution should be taken when exclud- ing any data point, and ideally decisions about what should be excluded should be made prior to data collection during the design of the study.
Data transformation and testing for normality
Data may be skewed in biomedical research, re- quiring different statistical tests. Assessment for the magnitude of skewness through testing of normality is not uniformly performed, and rarely reported. Normality can be assessed graphically or statistically. If data are not normally distributed, ei- ther non-parametric analytical techniques should be employed or data need to be transformed to a normal distribution. Mathematically, data transfor- mation is relatively simple, however interpretation of results can be difficult.
Parametric and non-parametric tests
There are numerous types of statistical misuse. Misapplication of nonparametric and parametric tests, failure to apply corrections, and disregard for statistical independence are just a few (25). Over the years, some have attempted to quantify the amount of statistical errors present in published research articles. Four articles have each reported that approximately 50% of articles in medical and dental research contain one or more statistical er- rors (26-28). It is likely that these percentages are underestimates because many research publica- tions omit or conceal data, rendering post-exami- nation impossible (26).
Similarly, many statistical tests have various ver- sions and applications. Like the tests themselves,
the selection of each version must be in accord- ance with the required assumptions (18). For ex- ample, student’s t-test is used to compare the means for two sets of continuous sample data. If the data are paired, meaning each observation in one sample has a corresponding observation in the other, then a paired t-test is used. For inde- pendent data, there are different forms of the t- test depending upon the variance of the samples. In cases of equal variance, using a two-sample t- test is appropriate. For unequal variance, a modi- fied two-sample test is required. When more than two samples are compared, ANOVA should be uti- lized. For both t-tests and ANOVA, multiple com- parisons may necessitate adjustment through the use of corrections. There is little agreement on when or how to adjust for multiple comparisons (29).
In an examination of the American Journal of Physi- ology, Williams et al. discovered that greater than half of all the articles employed unpaired or paired t-tests (19). Of those articles, approximately 17% failed to correctly utilize the t-test for multiple comparisons by modifying the test with either the Bonferroni or some other correction method (19). In the same study, the authors also reported that articles which used the ANOVA test did not specify whether one-way or two-way designs were select- ed (19). Likewise, Glantz found, while inspecting two journals that approximately half of the articles that used statistics employed the t-test in situations that required a test for multiple comparisons (2).
In some cases, these errors have led to incorrect conclusions (26). More commonly, the conclusions have not been supported by the statistical results. In one publication, 72% or f articles lacked statisti- cal validation for their conclusions (25).
Transparency - disclosure and a-priori vs. post-hoc analytical decisions
Research transparency is an increasingly impor- tant topic in biomedical research. The decisions that are made, as well as when those decisions are made can play a strong role in the interpretation of study results. An ideal study has one where all potential outcomes are explored prior to data col-
Biochemia Medica 2015;25(1):5–11 http://dx.doi.org/10.11613/BM.2015.001
8
Thiese MS. et al. Misuse and abuse of statistics
lection, including common elements such as how data are collected, a detailed statistical analysis plan, the alpha level for statistical significance and what tests of association are going to be per- formed. There are many advantages to be gained having a thorough approach to the design and analysis of the study and documenting the deci- sions that were made. However, there are unfore- seen situations that arise that often require prompt and proper decisions. These can be innocuous such as failure of a data collection tool, to blatant selection bias to achieve a desired outcome. These decisions can easily, and often are, not mentioned in article manuscripts.
Hawthorne effects are potentially found in re- search studies that conduct observational or inter- ventional study designs with human participants. It is the theory that study participants act differ- ently when they know they are being watched or are aware of their participation in a research study. This change in behavior can commonly be found in audits for companies, it is not uncommon to see an increase in productivity when employees are made aware of an audit. Operant conditioning can also be blamed on Hawthorne effects, leading to results that stray from true behavior or statistical results in studies. Although it is difficult to elimi- nate any deceptive results deriving from Haw- thorne effects, a pre-planned approached can help maintain true and strong statistically signifi- cant results (30).
Some of the misuse is because of the nature of re- search dissemination. There is a publication bias, where statistically significant results are more like- ly to be published (31). Publication is important for many reasons, including obtaining grants or other funding and achieving tenure in an academic insti- tution. This external pressure to find statistically significant results from research may bias some scientists to select a statistical method that is more likely to yield statistically significant results. Addi- tionally, the inclusion or exclusion of outliers, or even fabrication of data, may be justified in some scientists mind. There is a proposal for a registry of unpublished social science data that has statisti- cally insignificant results (32).
The decision to analyze an exposure-outcome re- lationship should ideally be made prior to data collection, i.e. a priori. When analytical decisions are made a priori, the data collection process is more efficient and researchers are much less likely to find spurious relationships. A priori analyses are needed for hypothesis testing, and are generally considered the stronger category of analytical de- cisions. Post-hoc or after the fact analyses can be useful in exploring relationships and generating hypotheses. Often post-hoc analyses are not fo- cused and include multiple analyses to investigate potential relationships without full consideration for the suspected causal pathway. These can be “fishing” for results where all potential relation- ships are analyzed. The hazard arises when re- searchers perform post-hoc analyses and report re- sults without disclosing that they are post-hoc findings. Based on the alpha level of 0.05, it is likely that by random chance 1 in 20 relationships will be statistically significant but not clinically meaning- ful. Proper disclosure of how many analyses were performed post-hoc, the decision process for how those analyses were selected for evaluation, and both the statistically significant and insignificant results is warranted.
Epidemiological vs . biostatistical model building
Multivariate regression is often used to control for confounding and assess for effect modification (33). Often when assessing the relationship be- tween an exposure and outcome there are many potential confounding variables to control for through statistical adjustment in a multivariate model (34). The selection of variables to include in a multivariate model is often more art than sci- ence, with little agreement on the selection pro- cess, which is often compounded by the complex- ity of the adjusting variables and theoretical rela- tionships (34). Purely statistical approaches to model building, including forward and backward stepwise building may result in different “final” main effects models, both in relation to variables included and relationships identified (35,36). Reli- ance on a pre-determined set of rules regarding
http://dx.doi.org/10.11613/BM.2015.001 Biochemia Medica 2015;25(1):5–11
9
Thiese MS. et al. Misuse and abuse of statistics
stoppage of the model building process can im- prove this process, and have been proposed since the 1970s (37).
Directed acyclic graphs (DAGs) have been utilized to both mitigate bias and control for confounding factors (38). DAGs hold strong potential for proper model selection, and may be a viable option for proper covariate selection and model creation (39). Although there is no consensus on which method of model building is most appropriate, certain consistencies remain regardless of the model building method used. Proper planning pri- or to data collection and well before analyses helps to ensure that variables are appropriately collected and analyzed.
Variables to consider as potential confounders
Clinically meaningful relationships identified from past studies.
• Biologically plausible factors based on the pur- ported causal pathway between the exposure and outcome.
• Other factors that the researcher may suspect would confound the exposure-outcome rela- tionship.
• After identifying a comprehensive list of varia- bles that may be effect modifiers or confound- ers, additional analytical elements need to be considered and decided upon.
Decisions to make prior to data collection
• P-value criteria for potential inclusion in multi- variate model.
• Assessment for colinearity of variables and de- termination of treatment if colinearity is identi- fied.
• P-value for inclusion in final model. • P-value for inclusion in effect modification (if
assessing for effect modification). Relatively few peer-reviewed articles contain any description of the number of variables collected, criteria for potential inclusion in a multivariate model, type of multivariate model building meth- od used, how many potential variables were in-
cluded in the model, and how many different as- sessments were performed.
Interpretation of results
Many statistical packages allow for a multitude of analyses and results, however proper interpreta- tion is key to translation from research to practice. Understanding the implications of committing ei- ther a type I or type II error are key. Type I error is the false rejection of the null when the null is true. Conversely, type II error is the false acceptance of the null hypothesis when the null hypothesis is false. Setting alpha levels prior to analyses are im- portant; however, there are many elements that can influence the P-value, including random error, bias and confounding. A P-value of 0.051 com- pared to an alpha level of 0.05 does not mean that there is no association, moreover it means that this study was not able to detect a statistically signifi- cant result. Many researchers would argue that there may in fact be a relationship but the study was not able to detect it. Additionally, committing a type II error can most often be influenced by bias and lack of sufficient statistical power. Complete understanding the implications of potentially committing either of these errors, as well as meth- ods to minimize the likelihood of committing these errors should be achieved prior to begin- ning a study.
How to combat misuse and abuse of statistics
There is increasing interest in improvement of sta- tistical methods for epidemiological studies. These improvements include consideration and imple- mentation of more rigorous epidemiological and statistical methods, improved transparency and disclosure regarding statistical methods, appropri- ate interpretation of statistical results and exclu- sion of data must be explained.
There are two initiatives aimed at biomedical re- searchers to improve the design, execution and in- terpretation of biomedical research. One is termed “Statistical Analyses and Methods in the Published Literature”, or commonly the “SAMPL Guidelines”,
Biochemia Medica 2015;25(1):5–11 http://dx.doi.org/10.11613/BM.2015.001
10
Thiese MS. et al. Misuse and abuse of statistics
and provides detailed guidelines to reporting of statistical methods and analyses by analysis type (40). While relatively new, the SAMPL Guidelines are a valuable resource when designing a study or writing study results. Another initiative is “Strength- ening Analytical Thinking for Observational Studies” (STRATOS) which aims to provide guidance in the design, execution and interpretation of observa- tional studies (4). Additional resources, including checklists and guidelines have been presented for specific study design types (STROBE, STARD, CON- SORT, etc.).
Textbooks and biostatistical journals, including Bi- ometrika, Statistical Methods in Medical Research, Statistics in Medicine, and Journal of the American Statistical Association, can provide up to date re- sources for application of statistical analytical plans, interpretation of results, and improvement of statistical methods. Additionally, there are many statistical societies that hold annual meetings that can provide additional instruction, guidance, and insight.
Furthermore, researchers should strive to stay in- formed regarding the development and applica- tion of statistical tests. Statistical tools including
splines, multiple imputation, and ordinal regres- sion analyses are becoming increasingly accepted and applied within biomedical research. As new methods are evaluated and accepted in research, there will be an increasing potential for abuse and misuse of these methods.
Perhaps most importantly, researchers should in- vest adequate time in developing the theoretical construct, whether that is through a DAG or sim- ple listing of exposure measures, outcome meas- ures, and confounders.
Conclusion
There has been, and will likely continue to be mis- use and abuse of statistical tools. Through proper planning, application, and disclosure, combined with guidance and tools, hopefully researchers will continue to design, execute and interpret cutting edge biomedical research to further our knowl- edge and improve health outcomes.
Potential conflict of interest
None declared.
References 1. Bland JM, Altman DG. Misleading statistics: errors in textbo-
oks, software and manuals. Int J Epidemiol 1988;17:245-7. http://dx.doi.org/10.1093/ije/17.2.245.
2. Glantz SA. Biostatistics: how to detect, correct and prevent errors in the medical literature. Circulation 1980;61:1-7. http://dx.doi.org/10.1161/01.CIR.61.1.1.
3. Fanelli D. How many scientists fabricate and falsify resear- ch? A systematic review and meta-analysis of survey data. PloS one 2009;4:e5738. http://dx.doi.org/10.1371/journal. pone.0005738.
4. Sauerbrei W, Abrahamowicz M, Altman DG, Cessie S, Car- penter J. STRengthening Analytical Thinking for Observa- tional Studies: the STRATOS initiative. Statistics in medicine 2014. http://dx.doi.org/10.1002/sim.6265.
5. Thiese MS. Observational and interventional study design types; an overview. Biochem Med 2014;24:199-210. http:// dx.doi.org/10.11613/BM.2014.022.
6. Moher D, Schulz KF, Altman DG. The CONSORT statement: revised recommendations for improving the quality of re- ports of parallel group randomized trials. BMC Med Res Methodol 2001;1:2. http://dx.doi.org/10.1186/1471-2288- 1-2.
7. Bossuyt PM, Reitsma JB, Bruns DE, Gatsonis CA, Glasziou PP, Irwig LM, et al. The STARD statement for reporting studies of diagnostic accuracy: explanation and elabo- ration. Ann Intern Med 2003;138:W1-12. http://dx.doi. org/10.7326/0003-4819-138-1-200301070-00010.
8. Von Elm E, Altman DG, Egger M, Pocock SJ, Gøtzsche PC, Vandenbroucke JP. The Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) sta- tement: guidelines for reporting observational studi- es. Prev Med 2007;45:247-51. http://dx.doi.org/10.1016/j. ypmed.2007.08.012.
9. Ercan I. Misusage of Statistics In Medical Research. Eur J Gen Med 2007;4:128-34.
10. Felson DT, Cupples LA, Meenan RF. Misuse of statistical met- hods in Arthritis and Rheumatism. 1982 versus 1967-68. Ar- thritis Rheum 1984;27:1018-22. http://dx.doi.org/10.1002/ art.1780270908.
11. Bayarri MJ, Berger JO. The interplay of Bayesian and frequ- entist analysis. Statistical Science 2004:58-80.
12. Efron B. Bayesians, frequentists, and scientists. J Am Stat Assoc 2005;100:1-5. http://dx.doi.org/10.1198/01621450 5000000033.
http://dx.doi.org/10.11613/BM.2015.001 Biochemia Medica 2015;25(1):5–11
11
Thiese MS. et al. Misuse and abuse of statistics
13. Fisher R. Statistical methods and scientific induction. J R Stat Soc Series B Stat Methodol 1955:69-78.
14. Wald A. Estimation of a parameter when the number of un- known parameters increases indefinitely with the number of observations. Ann Math Stat 1948:220-7. http://dx.doi. org/10.1214/aoms/1177730246.
15. Lenhard J. Models and statistical inference: The contro- versy between Fisher and Neyman–Pearson. Brit J Phil Sci 2006;57:69-91. http://dx.doi.org/10.1093/bjps/axi152.
16. Jamart J. Statistical tests in medical research. Acta Oncol 1992;31:723-7. http://dx.doi.org/10.3109/028418692090 83860.
17. Altman DG. Statistics and ethics in medical research. Mi- suse of statistics is unethical. BMJ 1980;281:1182-4. http:// dx.doi.org/10.1136/bmj.281.6249.1182.
18. Strasak AM, Zaman Q, Pfeiffer KP, Gobel G, Ulmer H. Sta- tistical errors in medical research--a review of common pitfalls. Swiss Med Wkly 2007;137:44-9.
19. Williams JL, Hathaway CA, Kloster KL, Layne BH. Low power, type II errors, and other statistical problems in re- cent cardiovascular research. Am J Physiol 1997;273:H487- 93.
20. Feinstein AR. Clinical biostatistics. XXV. A survey of the sta- tistical procedures in general medical journals. Clin Phar- macol Ther 1974;15:97-107.
21. Welch GE, 2nd, Gabbe SG. Statistics usage in the American Journal of Obstetrics and Gynecology: has anything chan- ged? Am J Obstet Gynecol 2002;186:584-6. http://dx.doi. org/10.1067/mob.2002.122144.
22. Gore A, Kadam Y, Chavan P, Dhumale G. Application of bi- ostatistics in research by teaching faculty and final-year postgraduate students in colleges of modern medicine: A cross-sectional study. Int J Appl Basic Med Res 2012;2:11-6. http://dx.doi.org/10.4103/2229-516X.96792.
23. Wulff HR, Andersen B, Brandenhoff P, Guttler F. What do doctors know about statistics? Stat Med 1987;6:3-10. http://dx.doi.org/10.1002/sim.4780060103.
24. Hawkins DM. Identification of outliers: Springer; 1980. http://dx.doi.org/10.1007/978-94-015-3994-4.
25. Schor S, Karten I. Statistical evaluation of medical jour- nal manuscripts. JAMA 1966;195:1123-8. http://dx.doi. org/10.1001/jama.1966.03100130097026.
26. Gore SM, Jones IG, Rytter EC. Misuse of statistical met- hods: critical assessment of articles in BMJ from January to March 1976. BMJ 1977;1:85-7. http://dx.doi.org/10.1136/ bmj.1.6053.85.
27. Kim JS, Kim DK, Hong SJ. Assessment of errors and misu- sed statistics in dental research. Int Dent J 2011;61:163-7. http://dx.doi.org/10.1111/j.1875-595X.2011.00037.x.
28. White SJ. Statistical errors in papers in the British Journal of Psychiatry. Br J Psychiatry 1979;135:336-42. http://dx.doi. org/10.1192/bjp.135.4.336.
29. Perneger TV. What’s wrong with Bonferroni adjus- tments. BMJ 1998;316:1236-8. http://dx.doi.org/10.1136/ bmj.316.7139.1236.
30. Holden JD. Hawthorne effects and research into professi- onal practice. J Eval Clin Pract 2001;7:65-70. http://dx.doi. org/10.1046/j.1365-2753.2001.00280.x.
31. Dwan K, Gamble C, Williamson PR, Kirkham JJ. Systema- tic review of the empirical evidence of study publicati- on bias and outcome reporting bias—an updated review. PloS one 2013;8:e66844. http://dx.doi.org/10.1371/journal. pone.0066844.
32. Franco A, Malhotra N, Simonovits G. Publication Bias in the Social Sciences: Unlocking the File Drawer. Lancet 1991;337:867-72.
33. Hair JF. Multivariate data analysis. 2009. 34. Sauerbrei W, Royston P, Binder H. Selection of important
variables and determination of functional form for conti- nuous predictors in multivariable model building. Stat Med 2007;26:5512-28. http://dx.doi.org/10.1002/sim.3148.
35. Derksen S, Keselman H. Backward, forward and stepwi- se automated subset selection algorithms: Frequency of obtaining authentic and noise variables. Br J Math Stat Psychol 1992;45:265-82. http://dx.doi.org/10.1111/j. 2044- 8317.1992.tb00992.x.
36. Wiegand RE. Performance of using multiple stepwise algo- rithms for variable selection. Stat Med 2010;29:1647-59.
37. Bendel RB, Afifi AA. Comparison of stopping rules in forward “stepwise” regression. J Am Stat Assoc 1977;72:46- 53.
38. Shrier I, Platt RW. Reducing bias through directed acyclic graphs. BMC Med Res Methodol 2008;8:70. http://dx.doi. org/10.1186/1471-2288-8-70.
39. Weng H-Y, Hsueh Y-H, Messam LLM, Hertz-Picciotto I. Methods of covariate selection: directed acyclic graphs and the change-in-estimate procedure. Am J Epidemiol 2009:kwp035. http://dx.doi.org/10.1093/aje/kwp035.
40. Lang TA, Altman DG. Basic Statistical Reporting for Articles Published in Biomedical Journals: The “Statistical Analyses and Methods in the Published Literature” or The SAMPL Gu- idelines”. Science Editors’ Handbook, European Association of Science Editors. 2013.
Copyright of Biochemia Medica is the property of Biochemia Medica and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use.