|
|
||||||||
Guest Access | Sign In via User Name/Password |
|||||||||
* From the Department of Medicine (Drs. Moss and Wellman), Division of Pulmonary and Critical Care Medicine, and the Rollins School of Public Health (Mr. Cotsonis), Emory Unversity, Atlanta, GA.
Correspondence to: Marc Moss, MD, Division of Pulmonary and Critical Care, Grady Memorial Hospital, 69 Jesse Hill Junior Dr, Suite 2C007, Atlanta, GA 30335; e-mail: marc_moss{at}emory.org
| Abstract |
|---|
|
|
|---|
Design: We examined all of the published manuscripts for 12 potential limitations in the reporting of important statistical methodologies over a 6-month period from July 1, 2000, until December 31, 2000.
Results: Of the 81 articles that included multivariable logistic regression analyses, only 65% (53 analyses) properly reported the coding classification of pertinent independent variables that were included in the final model. An odds ratio and confidence interval were reported for the independent variables included in the final model for 79% (64 analyses) and 74% (60 analyses), respectively. Only 12% (10 articles) referenced whether interaction terms or effect modifications were examined, 1% (1 article) reported testing for collinearity, and only 16% (13 articles) included a goodness-of-fit analysis of the logistic model. The type of statistical package was reported in 69% (56 articles). Finally, approximately 39% of the articles (22 of 57) may have overfit the logistic regression model, leading to potentially unreliable regression coefficients and odds ratios.
Conclusions: Our results indicate that the reporting of multivariable logistic regression analyses in the pulmonary and critical care literature is often incomplete, therefore making it difficult for the reader to accurately interpret the manuscript. We recommend the implementation of adequate guidelines that will lead to overall improvements in the reporting and possibly to the conducting of multivariable analyses in the pulmonary medicine and critical care medicine literature.
Key Words: critical care medicine logistic models pulmonary medicine study design
| Introduction |
|---|
|
|
|---|
In 1993, Concato and colleagues1 reported a review of 60 randomly selected articles from the Lancet and the New England Journal of Medicine that utilized multivariable statistical methods. In 73% of the articles, they identified violations and omissions of methodological guidelines that would make the reported results potentially inaccurate, misleading, or difficult to interpret. Based on their observations, the authors concluded that there was a need for improvement in the reporting and perhaps in the conducting of multivariable analysis in the general medical literature.
It has been almost a decade since this important observational study was published. The methodological and statistical review policies of many major medical journals have improved over the last decade.6 However, deficiencies in the reporting of multivariable regression models still exist in some subspecialty journals, including the obstetrics and gynecology literature.4 It is presently unknown whether similar deficits remain in the pulmonary medicine and critical care medicine literature. Therefore, we used preexisting guidelines for assessing multivariable models and investigated the quality of reporting multivariable logistic regression analyses in five major pulmonary and critical care journals.
| Materials and Methods |
|---|
|
|
|---|
Various guidelines have been recommended that can improve the appropriate execution, interpretation, and reporting of multivariable logistic regression methods.1 5 7 Using some of the criteria established by Concato et al,1 we examined published manuscripts for 12 potential limitations in their reporting of multivariable logistic regression analysis, including the following: (1) inclusion of odds ratio and confidence intervals for variables in the final model; (2) listing the type of statistical software package that was used to perform the analysis; (3) classification or coding of the independent variables included in the final model; (4) testing for interaction terms; (5) collinearity of independent variables; (6) goodness of fit; (7) potential overfitting of the model; (8) conformity of linear gradients; and (9) explanation of how the explanatory variables that were used in the model were initially selected. The 10th potential limitation was reserved only for those articles that used pair-matched case-control data. We determined whether the authors reported that a conditional logistic regression model was used. Finally, if a logistic regression analysis was used to create a predictive model, we determined (11) whether a receiver operating characteristic (ROC) curve was reported for the final model and (12) whether a separate validation set was used.
Study Criteria
The following criteria were used to categorize each article in a systematized manner for each of the potential limitations in reporting style:
65 years.1
Articles were considered to properly report the coding of variables if the method of coding for all of the variables that remained in the final statistical model could easily be determined (such as gender being a dichotomous variable) or were referenced anywhere in the article. If the odds ratio could not be calculated accurately because the coding of all of the variables in the final model was not designated or able to be determined, the article was considered to have improperly reported the coding of variables. All of the articles were distributed and examined by two of the three authors to determine the accuracy of logistic regression analysis reporting. If there was disagreement between the initial reviews, then the third person, who had not previously examined the article, rendered the final decision.
| Results |
|---|
|
|
|---|
The overall results are summarized in Table 1 . Odds ratios for the independent variables included in the final multivariable logistic model were reported in 79% (64 articles). Confidence intervals for these odds ratios were included in 74% of the articles (60). Only 22 of the articles (27%) specified whether the multivariable regression analysis was performed in a forward or backward manner. Of these 22 articles, 12 used a backward regression analysis and 10 used a forward process. The type of statistical package was reported in 69% (56 articles). The most commonly used programs were SPSS (SPSS; Chicago, IL) [39 articles; 39%], SAS (SAS Institute; Cary, NC) [18 articles; 32%], and STATA (STATA; College Station, TX) [9 articles; 16%].
|
Overfitting of the regression model could be assessed in 57 of the manuscripts. Using the criteria of maintaining < 10 outcome events per each explanatory (independent) variable, 39% of the articles (22 articles) included final logistic regression models that were suspicious for overfitting.1 The criterion for nonconformity to a linear gradient did not apply to 42 of the articles in which the analyses used only binary independent variables. For the remaining 39 articles, only 6 (15%) included any indication of assessing for nonconformity to a linear gradient in the text of the manuscript. Finally, the reason that the specific explanatory variables were chosen for the multivariable logistic regression analysis was reported in only 23% of the 81 articles (19 articles). The most common reason for the inclusion of a specific variable was a significant association in a univariate analysis (95%; 18 of 19 articles). One manuscript (5%) reported that the selection of explanatory variables was due to reasons of biological plausibility.
The majority of articles (94%; 76 of 81 articles) used multivariable logistic regression modeling in a descriptive manner in order to determine the effect of an individual variable on a specific outcome while adjusting for differences in other factors. The remaining articles (6%; 5 articles) were primarily concerned with creating a predictive modeling strategy to estimate the likelihood of an outcome for a given patient from a set of observations particular to that patient. Goodness-of-fit tests and ROC curves were reported in 80% (four of five articles). However, a validation set was included in only three of the articles (60%). Five other articles were pair-matched case-control studies. Four of the five articles properly indicated that a conditional multivariable logistic regression analysis had been performed.
| Discussion |
|---|
|
|
|---|
For example, the apparent effect of an independent variable will depend on the corresponding units of measure and the coding of that variable.1 If the values of the regression coefficients are reported without identifying the units of coding for the independent variable, then readers will be unable to interpret the actual magnitude of the risk estimates. The importance of testing for interaction terms is evident in a 1999 trial9 examining the effect of body position on the incidence of nosocomial pneumonia in mechanically ventilated patients. If interaction was not considered in the logistic regression model, then the regression coefficient for the independent variable (eg, the body position of an intubated patient that had been coded as semi-recumbent or horizontal) represents the impact of this variable on the outcome event (such as the development of ventilator-associated pneumonia) for all levels of another variable (eg, the use of enteral nutrition coded as present or absent). If body position and enteral nutrition have a significant interaction, the impact of body position depends on the presence or absence of enteral feedings.9 Without attention to interaction, the coefficient for body position will report a misleading quantitative estimate of the impact of the body position of an intubated patient on the development of ventilator-associated pneumonia.1 Finally, goodness-of-fit indexes evaluate how effectively the calculated model fits the actual data for estimating the outcome variable,1 and, therefore, the validity of all of the results and conclusions strongly depends on assessment of the adequacy of the regression model.10
Most clinicians and researchers depend on journals, via the editorial and peer review processes, to ensure that statistical methods in published articles are being used and interpreted appropriately.6 Goodman and colleagues6 conducted a cross-sectional survey of medical journals to determine the general policies of the statistical review process. Approximately one third of the 114 journals that responded to the survey require statistical review for all accepted manuscripts. The statistical review policies differed between journals according to the size of their circulation. Eighty-two percent of journals with a circulation of > 25,000 maintained a statistical consultant on staff compared to only 31% for those journals with a circulation of < 4,100. In addition, the managing editors of these sampled journals estimated that a formal statistical review resulted in an important change in approximately 50% of the manuscripts. Goodman and colleagues6 further evaluated the peer review process by examining the quality of manuscripts before and after editing at the Annals of Internal Medicine.11 The quality of multivariable reporting was rated as one of the most deficient factors at the time of submission. However, after the manuscript was edited and revised, the clarity in the reporting of multivariable analyses was markedly improved.
Given the large number of papers submitted for publication and the limitations on journal space, editors must make important decisions concerning what type of information about regression models should be included, excluded, or made available on request. Campillo12
suggested that clear-cut publication criteria for generalized regression models should be made available for contributors to medical journals. Lang and Secic13
have recommended that an article with a multivariable logistic regression analysis should include a table with the coefficient (ß), SE, Wald
2 test value, p value, odds ratio, and 95% confidence interval for each independent variable. In addition, the coding of each independent variable and a statement concerning whether the model was validated should be included. Finally, they recommended that the important issues of interaction and collinearity should be addressed in these manuscripts (Table 2
). One of the major pulmonary and critical care journals has changed its "Instructions for Contributors" in order to limit the length of each article and to utilize the online reporting of the medical literature.14
Authors will now be required to limit the "Methods" section to 500 words; however, an extended account of methods, when appropriate, will be included online in the journals Web repository. Therefore, an additional venue for reporting multivariable statistical analyses in the future may be online, thereby optimally utilizing a Web repository site.
|
Accurate and understandable results are required to properly communicate medical research.1 Several deficiencies in the reporting of multivariable logistic regression analyses in the pulmonary and critical care medicine literature were identified. We recommend that the editorial boards of these pulmonary and critical care journals implement sufficient guidelines for the proper statistical reporting of multivariate analyses that will enable the reader to accurately interpret an articles results (Table 2) . In addition, it may be reasonable for the editorial boards to consider increasing the frequency of involving statistical consultants to review manuscripts to a similar degree as journals use copy editors to correct the grammar and format.
It has been nearly a decade since Cancato et al1 first identified problems in the reporting of multivariable statistical analyses in the medical literature. Presently, the pulmonary and critical care medicine literature has not fully addressed these issues or implemented acceptable guidelines or criteria for the reporting of multivariable analyses. Hopefully, this article will promote improvements in the reporting of multivariable analyses and will allow readers to more accurately interpret study results in future articles.
| Footnotes |
|---|
Received for publication May 4, 2001. Accepted for publication June 18, 2002.
| References |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
A. X. Freire, L. Bridges, G. E. Umpierrez, D. Kuhl, and A. E. Kitabchi Admission Hyperglycemia and Other Risk Factors as Predictors of Hospital Mortality in a Medical ICU Population Chest, November 1, 2005; 128(5): 3109 - 3116. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. N. Wijeysundera, W. S. Beattie, G. Djaiani, V. Rao, M. A. Borger, K. Karkouti, and R. J. Cusimano Off-Pump Coronary Artery Surgery for Reducing Mortality and Morbidity: Meta-Analysis of Randomized and Observational Studies J. Am. Coll. Cardiol., September 6, 2005; 46(5): 872 - 882. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Joffe, J. Barr, and M. K. Gould Pitfalls in the Interpretation of Multivariable Models in the Critical Care Literature Chest, January 1, 2005; 127(1): 411 - 412. [Full Text] [PDF] |
||||
![]() |
A. V. Hernandez, Y. Vergouwe, E. W. Steyerberg, and M. Moss Reporting of Predictive Logistic Models Should Be Based on Evidence-Based Guidelines Chest, November 1, 2003; 124(5): 2034 - 2035. [Full Text] [PDF] |
||||
![]() |
S. D. Barnett Increasing Statistical Accuracy Chest, August 1, 2003; 124(2): 771 - 771. [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |