Chest Email Content Delivery
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     

Guest Access | Sign In via User Name/Password
This Article
Right arrow Full Text (PDF) Free
Right arrow Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when eLetters are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Article Archive
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via ISI Web of Science (5)
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Wood, K. E.
Right arrow Articles by Grounds, R. M.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Wood, K. E.
Right arrow Articles by Grounds, R. M.
(Chest. 1999;115:614-616.)
© 1999 American College of Chest Physicians

Critical Care Outcomes in the United Kingdom

Sobering Wake-up Call or Stability of the Lamppost?

Kenneth E. Wood, DO, FCCP; Douglas B. Coursin, MD, FCCP (Madison, WI) and R. Michael Grounds, MD*(London, UK ).

Dr. Wood is the Director of the Trauma and Life Support Center and the Assistant Professor of Internal Medicine; Dr. Coursin is the Associate Director of the Trauma and Life Support Center and the Professor of Anesthesiology and Internal Medicine, University of Wisconsin Hospitals and Clinics; and Dr. Grounds is the Consultant Anaesthetist/Intensivist at St. Georges Hospital.

Correspondence to: Douglas B. Coursin, MD, Professor of Anesthesiology and Internal Medicine, B6/319 UW CSC, Madison, WI 53792-3272; e-mail: dcoursin@facstaff.wisc.edu

It has been suggested that physicians should use statistics "as the drunken man uses the lamppost—for support rather than illumination."1 In this issue of CHEST (see page 802), Pappachan and colleagues report on their attempt to perform a validation study of APACHE III (acute physiology and chronic health evaluation) in 17 general adult ICUs in the South of England (hospital size range 300 to 800 beds). The significantly high standard mortality ratio (SMR = observed/predicted mortality ratio) in the UK hospitals raises the questions of whether a sobering wake-up call to a health-care system has been sounded or whether the statistical foundation upon which the lamppost is situated needs to be stabilized and reevaluated. Although the use of SMR as a measure of ICU performance is debatable,2 ,3 ,4 ,5 the significant excess in mortality beyond the severity stratified predictions warrants close examination and explanation.

As with other external validation studies of general ICU scoring systems,6 ,7 ,8 APACHE III showed good discrimination (ability to assign higher probability of mortality to patients who die), but poor calibration (correspondence between the estimated probability and the actual mortality) indicating poor fit. Accordingly, the APACHE III risk predictions were consistently lower than the actual mortality. This excess in observed to predicted mortality was evident in virtually all risk groups and was particularly striking in those patients in the low risk groups. According to APACHE III, the admission of this group to US hospitals would have resulted in 25% less mortality, the equivalent of a 747 jet crashing in southern England annually. If APACHE III is as robust as its developers and entrepreneurial marketing experts would lead us to believe, one could only conclude that the care to the population in southern England is substandard to that in the US. With candid honesty, the authors readily acknowledge this point and thoroughly review the recognized differences and perceived shortcomings in their system. These include the following: less resource allocation; previous failure to recognize critical care as a specialty; fewer ICU directors and dedicated training programs; the logistics of refusal/denial of admissions to the ICU; early ICU discharge with increased readmission rates; and the high requirements for interhospital transfer of critically ill patients (7.7% in the UK vs 2.3% in the US). Therefore, it is quite possible that the standard of care in major ICUs in the US is superior to the UK. Alternatively, the black box of APACHE III,9 which forms the foundation upon which the lamppost is built, may be unstable or inaccurate when applied cross-culturally.

Selection bias and case mix variability, lead time bias, and methodologic problems are criticisms of APACHE III that potentially render it unreliable for ICU performance comparison.2 ,10 ,11 Selection bias and case mix variability refer to the potential differences in the original database and the population in question. Insofar as APACHE III is newer, expensive, and not frequently reported; a review and comparison of reported issues with APACHE II is also instructive. The APACHE II database was not sufficiently broad enough to allow for accurate predictions in specific disease groups.12 ,13 ,14 ,15 ,16 Despite the expanded database in APACHE III, it cannot predict mortality within a specific disease group. Multiple disease groups (424) are combined into the 78 disease categories for which predictive equations exist. Therefore, one can compare only severity, not outcome among patients within the same disease group.

The published APACHE III database is derived from 17,440 patients at 40 US hospitals; 14 were tertiary care centers, and 26 were randomly chosen hospitals with over 200 beds, half of which had medical school affiliations. Of the 12,793 patients evaluated in the present UK study, 94% were from district general hospitals and only 6% were from one teaching center; the size of which was not reported. The case mix in the United Kingdom was also significantly different: older men; greater comorbidity; increased incidence of transfer into the ICU from the hospital wards; fewer patients directly admitted from the ED; and fewer elective, but more emergency surgical patients. The relative weights of these factors in the predictive linear regression APACHE III equations are for all practical purposes in the proprietary domain of the for-profit APACHE Medical Systems company, and consequently, true investigative comparison is virtually impossible. Lead time bias and its analogue, pre-ICU treatment bias, can significantly contribute to the underestimation of mortality as was evident in APACHE II.17 ,18 ,19 ,20 ,21 APACHE III allegedly corrects for this shortcoming by accounting for the patients' pre-ICU location, although the actual statistical weight remains unpublished and unknown. As a greater number of UK patients originated from the ward, one can only speculate that APACHE II shortcomings persist. Lastly, errors in diagnostic labeling and data collection remain problematic. APACHE III requires a single diagnosis in each patient (424 diseases placed in 78 disease categories), and disease labeling in the United Kingdom may be different than the United States. This could significantly impact upon the mortality prediction. Accurate and reliable data collection is crucial to the success of any predictive system. In APACHE II, there was a high degree of interobserver reliability, utilizing trained and dedicated data collectors.22 In this era of cost containment, few hospitals can afford such a luxury. When data collection is accomplished by registered nurses and residents, the error rate for the principal diagnostic category was between 9 and 18% and there was a clinically significant lack of agreement once predicted mortality exceeded 20%.23 As APACHE III retains this essential data collection plus five new physiologic variables, one can only wonder if the error is further magnified, particularly, in this study where no attempts to correct "illogical, extreme or unlikely values" were performed. In our unit (University of Wisconsin multidisciplinary 24-bed unit), due to increasing cost containment strategies, 30% of patients do not have all of the reference laboratory values (eg, albumin and bilirubin) required for the accurate generation of an APACHE III score.

Despite the preceding criticisms, previous applications of the US derived APACHE database to international populations have shown reasonable correlation,24 ,25 ,26 ,27 although results from the United Kingdom have been mixed.8 ,28 ,29 An increased SMR in Brazil30 and Tunisia6 was ascribed to lack of technology and senior physicians and nurses, respectively. As such, it is crucial to reconcile the data reported in this issue of CHEST by Pappachan and coworkers. A desirable characteristic of risk adjusted morbidity predictors is that they be "open to inspection and testing,"31 so prior to offering a sobering indictment of a health-care system, it would be prudent to inspect the black box foundation upon which the statistical lamppost is secured. To do less would only provide validation that entrepreneurial investigation is supplanting scientific investigation in critical care medicine.

References

  1. . Anonymous. (1986) TPN and APACHE. Lancet 1,1478[CrossRef][ISI][Medline]
  2. Boyd, O, Grounds, RM (1993) Physiologic scoring systems and the audit. Lancet 341,1573-1574[CrossRef][ISI][Medline]
  3. Boyd, O, Grounds, RM (1994) Can standardized morality ratio be used to compare quality of intensive care performance [letter]? Crit Care Med 22,1706-1709[ISI][Medline]
  4. Sherck, JP, Shatney, CH (1996) ICU Scoring Systems do not allow predictions of patient outcomes or comparison of ICU performance. Crit Care Clin 12,515-523[CrossRef][ISI][Medline]
  5. Lemeshow, S, LeGall, J-R (1994) Modeling the severity of illness of ICU patients: a systems update. JAMA 272,1049-1055[Abstract]
  6. Nouira, S, Belghitti, M, Elastrous, S, et al (1998) Predictive value of severity scoring systems: a comparison of four models in Tunisian adult intensive care units. Crit Care Med 26,852-859[CrossRef][ISI][Medline]
  7. Bastos, PG, Sun, X, Wagner, DP, et al (1996) Application of the APACHE III prognostic system in Brazilian intensive care units: a prospective multicenter study. Intensive Care Med 22,564-570[ISI][Medline]
  8. Beck, DH, Taylor, BL, Millar, B, et al (1997) Prediction of outcome from intensive care: a prospective cohort study comparing acute physiology and chronic health evaluations II and III prognostic systems in a United Kingdom intensive care unit. Crit Care Med 25,9-15[CrossRef][ISI][Medline]
  9. Teres, D (1994) Comment on "The case for using objective scoring systems to predict intensive care outcome." Crit Care Clin 10,91-92
  10. Teres, D, Lemeshow, S (1994) Why severity models should be used with caution. Crit Care Clin 10,93-110[ISI][Medline]
  11. Cowen, JS, Kelly, MA (1994) Errors and bias in using predictive scoring systems. Crit Care Clin 10,53-78[ISI][Medline]
  12. Cerra, FB, Negro, F, Abrams, J (1990) APACHE III score does not predict multiple organ failure or mortality in postoperative surgical patients. Arch Surg 125,519-522[Abstract]
  13. Marsh, HM, Krishan, I, Naesseus, JM, et al (1990) Assessment of prediction of mortality by using the APACHE II scoring system in intensive care units. Mayo Clin Proc 65,1549-1557[ISI][Medline]
  14. Civetta, JM, Hudson-Civetta, JA, Nelson, LD (1990) Evaluation of APACHE II for cost containment and quality assurance. Ann Surg 212,266-276[ISI][Medline]
  15. Brown, MC, Crede, WB (1995) Predictive ability of acute physiology and chronic health evaluation II scoring when applied to human immunodeficiency virus-positive patients. Crit Care Med 23,848-853[CrossRef][ISI][Medline]
  16. McAnena, OJ, Moore, FA, Moore, EE, et al (1992) An evaluation of the APACHE II scoring system for patients with acute trauma. J Trauma 33,504-507[ISI][Medline]
  17. Dragsted, L, Jorgensen, J, Jensen, NM, et al (1989) Interhospital comparisons of patients' outcomes from intensive care: importance of lead-time bias. Crit Care Med 17,418-422[ISI][Medline]
  18. Escore, JJ, Kelly, MA (1990) Admission source to medical intensive care unit predicts death independently of APACHE II score. JAMA 264,2389-2394[Abstract]
  19. Tunnel, RD, Millar, BW, Smith, GB (1998) The effect of lead-time bias on severity of illness scoring, mortality prediction and standardized mortality ratio in intensive care: a plot study. Anaesthesia 53,566-576
  20. Rapoport, J, Teres, D, Lemeshow, S, et al (1990) Timing of intensive care unit admission in relation to ICU outcome. Crit Care Med 18,1231-1235[ISI][Medline]
  21. Goldhill, DR, Withington, PS (1996) Mortality predicted by APACHE II. The effects of changes in physiological values and post-ICU hospital mortality. Anaesthesia 51,719-723[ISI][Medline]
  22. Knaus, WA, Draper, EA, Wagner, DP, et al (1985) APACHE II: a severity of disease classification system. Crit Care Med 13,818-829[ISI][Medline]
  23. Holt, AW, Burry, LK, Bersten, AD, et al (1992) Prospective evaluation of residents and nurses as severity score data collectors. Crit Care Med 20,1688-1691[ISI][Medline]
  24. Knaus, WA, LeGall, JR, Wagner, DP, et al (1982) A comparison of intensive care in the USA and France. Lancet 2,642-646[CrossRef][ISI][Medline]
  25. Zimmerman, JE, Knaus, WA, Judson, JA, et al (1988) Patient selection for intensive care: a comparison of New Zealand and US hospitals. Crit Care Med 16,318-326[ISI][Medline]
  26. Wong, DT, Crofts, SL, Gomiez, M, et al (1995) Evaluation of predictive ability of APACHE II system and hospital outcome in Canadian intensive care unit patients. Crit Care Med 23,1177-1783[CrossRef][ISI][Medline]
  27. Castella, X, Artigas, A, Bion, J, et al (1995) A comparison of severity of illness scoring systems for intensive care unit patients: results of a multicenter study. Crit Care Med 23,1327-1335[CrossRef][ISI][Medline]
  28. Rowan, KM, Kerr, JH, Major, E, et al (1994) Intensive care society's acute physiology and chronic health evaluation (APACHE II) study in Britain and Ireland: a prospective multicenter cohort study comparing two methods for predicting outcome for adult intensive care patients. Crit Care Med 22,1392-1401[ISI][Medline]
  29. Goldhill, DR, Withington, PS (1996) The effect of casemix adjustment on mortality as predicted by APACHE II. Intensive Care Med 22,415-419[CrossRef][ISI][Medline]
  30. Bastos, PG, Knaus, WA, Zimmerman, JE (1996) The importance of technology for achieving superior outcomes from intensive care. Intensive Care Med 22,664-669[ISI][Medline]
  31. Selker, HP (1993) Systems for comparing actual and predicted mortality rates: characteristics to promote cooperation in improving hospital care. Ann Intern Med 118,820-822[Free Full Text]




This Article
Right arrow Full Text (PDF) Free
Right arrow Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when eLetters are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Article Archive
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via ISI Web of Science (5)
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Wood, K. E.
Right arrow Articles by Grounds, R. M.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Wood, K. E.
Right arrow Articles by Grounds, R. M.


HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS