|
|
||||||||
Guest Access | Sign In via User Name/Password |
|||||||||
* From the Department of Intensive Care Medicine (Drs. Pappachan and Smith), Queen Alexandra Hospital, Portsmouth, UK; Critical Audit, Ltd. (Mr. Millar), St. George's Hospital Medical School, London, UK; and Department of Intensive Care Medicine (Dr. Bennett), St. George's Hospital, London, UK.
| Abstract |
|---|
|
|
|---|
Design: Prospective, noninterventional, cohort study.
Setting: Seventeen general adult ICUs in a discrete area of southwest England.
Patients: 12,793 patients admitted between April 1, 1993 and December 31, 1995.
Measurements: Sociodemographic and severity-of-illness data were collected for all patients admitted to the study units. Formal goodness-of-fit tests were applied and observed mortality was compared with that predicted by using the APACHE III system.
Results: For the group of ICUs as a whole, the risk-adjusted standardized mortality ratio (SMR) was 1.23 (95% confidence intervals, 1.121.25). For 11 out of 17 ICUs, the SMR was significantly greater than unity (p < 0.05). Calibration, as tested by Hosmer-Lemeshow statistics, was poor (H2 = 312.54; C2 = 332.85; df = 8; p < 0.01); however, model discrimination was good with a total correct classification rate of 82.9% and an area under the receiver operating characteristic curve of 0.89.
Conclusions: The excess mortality observed after case-mix adjustment using the APACHE III system in this study may be the result of either poor intensive care performance as compared with the United States or a failure of the APACHE III equation to fit the UK data.
Key Words: APACHE III ICU outcome scoring systems severity of illness standardized mortality ratio
| Introduction |
|---|
|
|
|---|
| Materials and Methods |
|---|
|
|
|---|
Patients were excluded from the study if they were younger than 16 years; had a diagnosis of primary burn injuries; died within 4 h of ICU admission; or had been admitted to the ICU because either a separate coronary care or surgical recovery unit did not exist or was not available in the hospital concerned (ie, they did not require intensive care). For the purposes of calculating ICU and hospital mortality ratios, only data from the first admission to ICU in any one hospital admission were considered.
Using the definitions of the APACHE III prognostic system,7 where appropriate, the following data were collected for each patient: sociodemographic factors (age and sex); preexisting comorbidity; reason for admission (medical, elective surgical, or emergency surgical); diagnostic category; and severity of illness (using acute physiologic data). Outcome data (alive or dead on discharge from ICU and hospital) were also recorded. Hospital mortality was measured as the proportion of deaths during hospitalization and therefore included ICU deaths.
Personnel were identified at each unit to be responsible for data collection. Strict written guidelines were provided regarding data definition to minimize inaccuracies in data collection. Standard documentation and training were provided to all ICU personnel. Within each ICU, the collection and storage of the minimum data set was undertaken using a stand-alone computer (Macintosh LC 475; Apple Computers; Cupertino, CA) and specially designed software (Ward Watcher; Critical Audit, Ltd; London, UK). On a quarterly basis, the stored data set of a random 20% of patients in each ICU was cross-checked against the patient's medical and nursing records and other standard documentation. All data were checked for illogical, extreme, or unlikely values but no attempt was made to alter original data. Approval from local Ethics Committees for participation in the study was not required since data used for the study had already been collected for clinical purposes.
For each of 12,793 patients admitted to the study, APACHE III scores
and estimates of hospital mortality were calculated using the APACHE
III prognostic system in the manner described by Knaus et
al.7
Individual patient APACHE III estimates of hospital
mortality were summed to give an average predicted risk of mortality
for all patients and for given groups of ICU admissions (ie,
by age, APACHE III score, surgical status, and system diagnostic
category). Mortality ratios, obtained by dividing the observed number
of deaths for each group by the predicted number, were used to compare
actual with estimated outcome. In addition, 95% confidence intervals
(CIs) were calculated for each mortality ratio and continuity-corrected
2 tests were used to analyze the difference
between the observed and expected number of deaths.
The statistical analysis of the overall goodness-of-fit was undertaken using several techniques designed to test both discrimination (the ability of the model to discriminate between survivors and nonsurvivors) and calibration (the accuracy of risk predictions). Discrimination was tested using classification matrices and receiver operating characteristic (ROC) curves; 95% confidence intervals were computed for the total correct classification rate (TCCR), ie, the sum of the true-positive and true-negative rates derived from the classification tables. Calibration was tested using the Hosmer-Lemeshow H2 and C2 tests10 ,11 ; the H2 statistic divides the population into equal deciles of risk and the C2 statistic uses bands of risk with approximately equal numbers of patients. In addition, a calibration curve was constructed by plotting the predicted death rates stratified by 10% intervals of mortality (x-axis) against the observed mortality rates (y-axis).
| Results |
|---|
|
|
|---|
2 = 23.6; p < 0.01), older than 45 years
of age (
2 = 22.0; p < 0.01), have greater
APACHE III comorbidity (
2 = 29.3;
p < 0.01), and were more frequently transferred to an ICU from
hospital wards and other hospitals
(
2 = 310.2 and
2 = 246.3, respectively; p < 0.01). More
American patients were admitted from an emergency room or other ICU
(
2 = 349.0 and
2
= 116.8, respectively; p < 0.01). In the UK database, there were
significantly fewer elective surgical patients
(
2 = 34.0; p < 0.01) and significantly
more emergency surgical patients (
2 = 217.3;
p < 0.01) than in the US database. In the South of England, fewer
ICU admissions were the result of trauma
(
2 = 75.7; p < 0.01), congestive heart
failure (
2 = 70.6; p < 0.01), or drug
overdose (
2 = 8.0; p < 0.01). We were
unable to compare mean age, mean APACHE III scores, or mean length of
ICU stay because insufficient data were available from the original
APACHE III database.12
|
|
|
|
|
|
|
|
|
| Discussion |
|---|
|
|
|---|
If it is assumed that the APACHE III system is accurate and is sufficiently robust to control for international variations in ICU admission practice, it must be accepted that the performance of 64% (11 of 17) of ICUs in the South of England is poor compared with ICUs in the United States. Indeed, this finding is particularly worrying, as it suggests that the admission of identical groups of patients to units in the United States and the South of England would result in 25% greater hospital mortality in the latter group. Such an excess might be explained by the well-recognized differences between the two countries in the structure and organization of intensive care, availability of technology, and the training of medical staff. For instance, in the United States, resource allocation for intensive care is considerably greater,14 ,15 ,16 intensive care is a recognized specialty,17 ICUs are more likely to have a full-time clinical director, and dedicated critical care training programs have existed for some time.18 Resource shortages in the United Kingdom often lead to refusal of ICU admission,19 ,20 delay in ICU admission,20 a high requirement for interhospital transfers,21 ,22 early ICU discharge,23 and increased ICU re-admission rates,24 all of which have the potential to worsen patient outcome. Many patients are referred late for intensive care25 ; this is reflected in our study by the greater number of ICU patients who were admitted from general wards, rather than from the emergency department as often happens in the United States. Results similar to ours have been reported by Bastos et al,8 who recorded an SMR of 1.67 (range, 1.01 to 2.31) for 10 Brazilian ICUs using APACHE III. Subsequently, the performance of individual units was attributed to the availability of ICU technology.26
A far more likely explanation of the disparity between actual and predicted outcome for our group of patients may be that the APACHE III equation does not fit the South of England data. This hypothesis is supported by the poor calibration and uniformity of fit seen in our study. Although there is good discrimination, some would argue that this has nothing to do with model fit and only with the ordering of the predicted chance of the patients dying. In such circumstances, the performance of UK and US ICUs could be similar, but masked by the inability of the APACHE III system to control for international differences in case-mix factors, ICU admission practices, or lead-time bias.27 Indeed, considerable differences in case-mix factors have been demonstrated between our data and that of the APACHE III database.7 ,12 The APACHE III equation was formulated from a population that was more likely to be female, younger, less sick, with less comorbidity and a greater chance of being admitted directly from the emergency room. Each of these factors might "bias" the APACHE III equation to outcome such that it does not function accurately when applied to a population with a different composition. Similar disparity between the case-mix of test and reference databases were also reported by Bastos et al,8 and they too discovered that the APACHE III showed good discrimination but poor calibration and uniformity of fit.
To our knowledge, there have only been two other large, published multicenter analyses of case-mix-adjusted outcome undertaken in the United Kingdom, both using the APACHE II system.28 ,29 One, undertaken by the UK Intensive Care Society, demonstrated that the goodness of fit of the APACHE II equation to data from 26 ICUs was good.28 However, the Intensive Care Society found poor uniformity of fit when patients were grouped by age, diagnosis, or APACHE II score.28 Nevertheless, the overall SMR was 1.02 (95% CI, 0.981.06) and this has been taken to imply that the performance of these 26 ICUs was as good as similar units in the United States. In contrast, the work of Goldhill and Withington29 demonstrated a significantly higher SMR (1.11; 95% CI, 1.061.16) for the group as a whole and for certain subroups of patients. They also demonstrated "... substantial differences in case mix... " between their data and the original APACHE II study.29 This pattern of apparent excess mortality and a disparity of case-mix factors between study and reference populations mirrors the results of both our study and that from Brazil.8 The fact that the APACHE III equation does not fit our study data might thus be explained by differences in case mix between the two countries.
Other reasons why there may be a poor fit of the APACHE III equation to the South of England data might include systematic differences in medical definitions or diagnostic labeling, in the effectiveness of therapy, and in data collection (automated data collection systems record measurements more frequently than manual chart systems and are more common in the United States), or a failure of the APACHE III system to weight certain comorbidities sufficiently for their impact on UK mortality. In addition, the effect of lead-time bias may also have a greater impact in the United Kingdom, where Tunnell et al27 have demonstrated significant increases in APACHE III severity-of-illness scores and estimated risk of hospital mortality, if data collected before definitive intervention outside the ICU were included. This might suggest that the factor within APACHE III that corrects for lead-time bias may not necessarily adjust across all patient subgroups. Finally, the validity of using the SMR as an indicator of ICU performance has been questioned previously.27 ,30 ,31 It must be accepted that, although the SMR has been proposed as a measure of ICU performance, there is no alternative independent standard by which SMRs themselves have been or can be compared or validated. Additionally, the SMR reflects the ratio of observed to predicted hospital mortality and looks at the overall system of patient management, not just ICU activity.
In summary, the excess mortality observed in 17 ICUs in the South of England after adjustment for case mix using APACHE III is almost certainly a reflection of a failure of the APACHE III equation to fit the UK data. However, the lack of a universally recognized and verified marker of ICU performance makes it impossible to exclude poor UK intensive care performance as the cause of this excess. Two potential methods of determining whether clinical underperformance is responsible exist. The first involves a change in the method of delivery of health care in the South of England with subsequent reassessment of outcome using the APACHE III system. The second would be to match patients in our study with those of the APACHE III reference database on the basis of case-mix profile and risk predictions and then compare outcome in these groups.
| Acknowledgements |
|---|
| Footnotes |
|---|
Correspondence to: G.B. Smith, FRCA, Department of Intensive Care Medicine, Queen Alexandra Hospital, Portsmouth PO6 3LY, UK
Abbreviations: A&E = accident and emergency; APACHE = acute physiology, age, chronic health evaluation; CI = confidence interval; ROC = receiver operating characteristic; SMR = standardized mortality ratio; TCCR = total correct classification rate
Received for publication March 31, 1998. Accepted for publication October 21, 1998.
| References |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
A. Garland Improving the ICU: Part 1 Chest, June 1, 2005; 127(6): 2151 - 2164. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Stefanec Diuretics in Critically Ill Patients With Acute Renal Failure JAMA, March 19, 2003; 289(11): 1380 - 1380. [Full Text] [PDF] |
||||
![]() |
C. A. Sirio, K. Tajimi, N. Taenaka, Y. Ujike, K. Okamoto, and H. Katsuya A Cross-Cultural Comparison of Critical Care Delivery : Japan and the United States Chest, February 1, 2002; 121(2): 539 - 548. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. A. Cook Performance of APACHE III Models in an Australian ICU Chest, December 1, 2000; 118(6): 1732 - 1738. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. M. Poses, D. K. McClish, W. R. Smith, E. C. Huber, F. L. W. Clemo, B. P. Schmitt, D. Alexander, E. M. Racht, and C. C. Colenda III Results of Report Cards for Patients with Congestive Heart Failure Depend on the Method Used To Adjust for Severity Ann Intern Med, July 4, 2000; 133(1): 10 - 20. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |