|
|
||||||||
Guest Access | Sign In via User Name/Password |
|||||||||
* From the Department of Medicine, Division of Pulmonary and Critical Care Medicine (Dr. Carson), University of North Carolina, Chapel Hill, NC; and the Health Outcomes Research Group, Department of Epidemiology and Biostatistics and Department of Medicine (Dr. Bach), Memorial Sloan-Kettering Cancer Center, New York, NY.
Correspondence to: Shannon S. Carson, MD, Division of Pulmonary and Critical Care Medicine, University of North Carolina, 420 Burnette Womack Bldg, CB# 7020, Chapel Hill, NC 27599; e-mail: scarson{at}med.unc.edu
| Abstract |
|---|
|
|
|---|
Design: Retrospective inception cohort study.
Setting: A single, urban, long-term, acute-care hospital in Chicago.
Patients: One hundred eighty-two patients transferred from 37 acute-care hospital ICUs.
Measurements and results: We assessed four indexes: the acute physiology and chronic health evaluation II, the simplified acute physiology score II, the mortality prediction model II, and the logistic organ dysfunction system using variables measured on admission to the long-term acute-care hospital ICU. We found that none of these indexes distinguished well between the patients who lived and the patients who died (area under ROC [receiver operating characteristics] curve < 0.70 for all), nor did they assign correct probabilities of death to individual patients (Hosmer-Lemeshow goodness-of-fit statistics, p < 0.01 for all).
Conclusions: Investigators and clinicians should use caution in using severity-of-illness measures developed for acutely ill patients to describe critically ill patients admitted to long-term care units. As clinical practice and research focus more on these latter patients, development of adequately performing severity-of-illness measures appropriate to this patient population will be needed.
Key Words: chronically critically ill epidemiology (methods) hospital mortality intensive care long-term care outcome assessment (health care) prolonged critical illness severity-of-illness index
| Introduction |
|---|
|
|
|---|
Several investigators have used severity-of-illness measures developed in the acute-care hospital ICU setting to describe, for research purposes, illness severity in patients with prolonged critical illness.6 7 8 9 10 Others have described their use of these measures in triage and hospital admission decisions.11 12 While it is a fact that many patients admitted to LTAC hospitals are critically ill and require intensive care services, existing acute-care ICU scoring systems were not developed in the LTAC ICU setting and, therefore, may not be appropriate to this population. Before the use of acute-care ICU scoring systems in this patient population becomes more common, a detailed analysis of the performance of acute-care ICU severity-of-illness measures in critically ill patients admitted to LTAC hospitals is warranted.
Severity-of-illness measures widely used in acute- care hospital ICUs include the APACHE (acute physiology and chronic health evaluation) scores,13 the simplified acute physiology score (SAPS),14 the mortality prediction model (MPM),15 and the logistic organ dysfunction system (LODS).16 These measures combine physiologic variables and sometimes measures of chronic health problems to calculate a score that corresponds to relative illness severity. In each case, the score can be transformed into a probability of death through the use of a logistic regression equation. In this study, we assessed the ability of APACHE II, SAPS II, MPM II, and LODS to predict mortality rates in a group of critically ill patients admitted to an LTAC hospital ICU.
| Materials and Methods |
|---|
|
|
|---|
Study Patients
Study patients include all patients transferred to the LTAC
hospital between August 1, 1995 and July 31, 1996, from ICUs,
respiratory care units, or step-down units at 37 acute-care hospitals.
Only the initial hospital admission was considered for analysis.
Patients who received mechanical ventilation long-term before their
index hospitalizations were excluded.
Patient Data
Acute physiology variables measured during the first 24 h
of LTAC admission were obtained by review of medical records. Records
from the transferring facility were reviewed for comorbidities, the
primary diagnosis leading to initial ICU admission, and the diagnosis
responsible for continued ICU admission. Patients were considered to
have survived to discharge from the LTAC if they were discharged to any
other facility or to home. Any patient who died at the LTAC was
considered a death, regardless of the level of care he or she was
receiving at the time of death. All patients were observed until one of
these events occurred.
Severity-of-Illness Calculations
The techniques and scoring algorithms for each of these scores
have been published previously.13
14
15
16
Approaches and
adaptations specific to this patient population are mentioned below:
APACHE II: All patients received five chronic health points because of their ventilator dependence or other organ system dysfunction that was characteristic of the patient population. The diagnosis leading to continued ICU admission, as determined by the investigators, was used as the input for the diagnostic category weight.
MPM II: Complete data were not available on prior GI bleeding. All patients were considered to be physician-recommended ICU admissions, as none had undergone surgery within 1 week of admission. For the cardiovascular support score, the requirement of vasoactive drug therapy was recorded, but specific dosages of vasopressor drugs were not available.
LODS: With the exception of serum urea measurement within 24 h of ICU admission, data were complete for calculation of this score.
SAPS II: Data were complete for calculation of this score.
Analytic Methods
Severity-of-illness indexes were evaluated for two separate
performance characteristics: discrimination and
calibration.17
Discrimination refers to the
ability of the index to correctly rank a group of patients in order of
the likelihood that each individual patient dies. To assess each
instruments ability to discriminate, we examined three types of
outcomes. For the binary outcome (dead or alive at hospital discharge),
we examined the mean difference in scores between survivors and
nonsurvivors using the Students t test and the empiric
area under the receiver operating characteristic (ROC) curve (AUC)
generated from a logistic regression model.18
AUC values
> 0.5 reflect discriminative power, with higher values reflecting
better discrimination. For the time-dependent outcome death, we
examined the Harrells concordance probability (or C
statistic).19
20
Similar to ROC analysis, the Harrell
statistic is a measure of the likelihood that, of two randomly chosen
patients, the one with the lower score will live longer than the one
with the higher score. As such, better scores on the concordance
probability reflect the ability of the index to predict survival time.
For this latter analysis, patients were censored at the time of
hospital discharge.
Calibration refers to the ability of the index to accurately determine the probability of death. We calculated an individual patients probability of death as proposed by the developers of each index. We evaluated both general calibration and specific calibration of each index. To determine the general level of calibration of each index, we compared the calculated probability of death for the cohort to the observed death rate. We then determined the patient-specific calibration of each index by calculating the Hosmer-Lemeshow goodness of fit statistic for eight equally sized groups and 8 degrees of freedom (df), ordered based on their predicted probability of death.18 Each analysis was performed on all available data. All p values are two-sided. Analyses were performed with STATA 6.0 software (Statacorp; College Station, TX).
The project was reviewed and approved by the Institutional Review Board of the University of Chicago, Chicago, IL.
| Results |
|---|
|
|
|---|
|
|
Concordance Probability
In Table 2
, we present the values for the Harrells concordance
probability for each statistic. Similar to ROC analysis, a value of 0.5
would denote that the scale provided no discrimination. The values in
the table reflect a slight improvement over chance but in no case
suggest a high degree of discriminatory power.
Calibration of Severity-of-Illness Measures
Overall, 42% of the patients at the LTAC died. Generally, only
the APACHE II scoring system was calibrated correctly, producing an
estimated mortality rate of 42% as well. All other indexes, which
relied more heavily on acute physiology and less on chronic health
impairments, underpredicted mortality. SAPS II predicted a 23%
mortality rate, MPM II predicted a 21% mortality rate, and LODS
predicted a 17% mortality rate.
Because the APACHE II system was calibrated correctly to the cohort overall, we display the results of the Hosmer-Lemeshow test on our cohort for this severity-of-illness measure (Table 3 ). The Hosmer-Lemeshow goodness-of-fit test provides a further assessment of calibration, in that it assesses the extent to which each scale predicts an individual patients risk of death. In this test, the cohort is broken into a number of groups based on the predicted probability of death for each individual. For the example in Table 3 , the entire cohort is broken up into eight groups. Group 1 has 24 patients in it, and, based on their APACHE II scores, this group has a predicted probability of death of 15.6%. In other words, based on the APACHE II scores of these 24 patients, we would expect 0.156 x 24 patients (or 3.74 patients) to die. We actually observed five deaths in this group. Group 2 contains the patients with the next highest probabilities of death, averaging 23.6%. For the 22 patients in this group, we anticipated 0.236 x 22, or 5.19 deaths. We observed nine deaths. At each level of probability, the predicted and the actual mortalities are compared. Greater deviations between predicted deaths and actual deaths increase the C statistic, where higher values reflect poorer calibration. In the case of the APACHE II, the patient-specific calibration was poor (C = 20.5, df = 8, p = 0.0086).
|
| Discussion |
|---|
|
|
|---|
We also evaluated the calibration of these indexes and found that the indexes were poorly calibrated to the patient population. Three of the four indexes underpredicted mortality in our cohort, whereas APACHE II predicted overall mortality correctly but was poor at discriminating between individual patients in terms of risk of death. Whereas all four of the indexes are based on measures of acute physiologic derangements, APACHE II takes into account chronic organ dysfunction and diagnosis leading to ICU admission. If outcomes for chronically critically ill patients are more dependent on underlying disease, long-term organ dysfunction, and physical functioning than on physiologic derangements measured during the first day of hospital admission, this result would make sense. It is also possible that the similarity between observed mortality rates and mortality rates predicted by APACHE II was a chance occurrence. Therefore, the findings of this study should be confirmed in other groups of LTAC patients.
We must emphasize that, at present, none of the indexes, including APACHE II, are calibrated well for individual patients. When patients were stratified based on risk of death, the differences in the number of observed vs predicted deaths were large, as evidenced by a large Hosmer-Lemeshow test statistic (Table 3) . Therefore, we can conclude that none of these scoring systems were well calibrated to individuals in this LTAC hospital, despite the fact that one of the systems was well calibrated to the group overall. There are other examples in the literature where severity-of-illness indexes performed very well when applied to large groups of patients, but mortality rate predictions for individual patients were less reliable.21 22
This study has several limitations. Data were obtained by review of medical records from the LTAC and referring hospitals; therefore, data for all variables in the MPM II model were not complete. Additionally, this study was performed at a single institution, so the data may not be applicable to all LTACs or long-term ventilator units.
Severity-of-illness scoring systems such as the ones in this study were designed and validated in acutely ill patients using variables gathered during the first 24 h of admission to acute-care hospital ICUs. The patients, facilities, and approaches to care are different in LTAC hospitals, so it is not surprising that these indexes do not perform at the same level in LTAC patients as they do in the populations for whom they were intended. These instruments should not be used for clinical decision making in this patient population, and their use for description of research cohorts should be interpreted with caution.
To optimally perform the tasks expected of severity-of-illness measures, new scales that are developed and validated in a large population of patients with prolonged critical illnesses are necessary. Development of a severity-of-illness measure for chronically critically ill patients may require a novel approach. Survival to discharge may not be an appropriate outcome, as patients whose conditions clinically worsen can be transferred to acute-care hospitals causing the most severely ill patients to be classified as hospital survivors. Other outcomes, such as long-term survival, return to independence, or other qualitative measures may be more appropriate. In a model that we described, but which has not yet been validated, poor functional status prior to acute illness and advanced age were stronger predictors of 1-year mortality in patients requiring prolonged treatment with mechanical ventilation at an LTAC than were any measures of organ dysfunction.7
| Footnotes |
|---|
This study was supported by an unrestricted grant from the Transitional Hospitals Corporation and by the Robert Wood Johnson Foundation.
Received for publication August 21, 2000. Accepted for publication February 21, 2001.
| References |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
N. R. MacIntyre, S. K. Epstein, S. Carson, D. Scheinhorn, K. Christopher, and S. Muldoon Management of Patients Requiring Prolonged Mechanical Ventilation: Report of a NAMDRC Consensus Conference Chest, December 1, 2005; 128(6): 3937 - 3954. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |