|
|
||||||||
Guest Access | Sign In via User Name/Password |
|||||||||
* From the Department of Medicine (Drs. Mahon, Paterson, and Wood) and the Department of Epidemiology and Biostatistics (Dr. Donner), University of Western Ontario; and Department of Medicine (Drs. Laupacis, Hodder, and McKim), University of Ottawa. Canada.
| Abstract |
|---|
|
|
|---|
Design: Randomized study of n of 1 trials vs standard practice.
Setting: Outpatient departments in two tertiary care centers.
Patients: Sixty-eight patients with irreversible CAL who were symptomatic despite the use of inhaled bronchodilators, and who were unsure whether theophylline was helping them following open treatment, were randomized into n of 1 trials (N = 34) or standard practice.
Interventions: The n of 1 trials (single-patient, randomized, double-blind, multiple crossover comparisons of the effect on dyspnea of theophylline vs a placebo) followed published guidelines. Standard practice patients stopped taking theophylline but resumed it if their dyspnea worsened. If their dyspnea then improved, theophylline was continued. In both groups, a decision about continuing or stopping the use of theophylline was made within 3 months of randomization.
Measurements and results: The primary outcomes (the chronic respiratory disease questionnaire [CRQ] and 6-min walk) were measured at baseline, 6 months, and 12 months by personnel blinded to treatment group allocation. No between-group differences (n of 1 minus standard practice) were seen in within-group changes over time (1 year minus baseline) in the CRQ Physical Function score (point estimate on the difference, 2.8; 95% confidence limits [CLs], 8.2, 2.5), CRQ Emotional Function score (point estimate on the difference, 0.5; 95% CLs, 4.7, 5.7), or 6-min walk (point estimate on the difference, 8 m; 95% CLs, 26, 44 m). No differences between groups were seen in the secondary outcome of the proportion of patients taking theophylline at 6 and 12 months. In 7 of 34 n of 1 trial patients (21%), dyspnea improved during theophylline treatment compared with placebo treatment.
Conclusions: Using n of 1 trials to guide theophylline therapy in patients with irreversible CAL did not improve their quality of life or exercise capacity, or reduce drug usage, over 1 year compared to standard practice. Under the objective conditions of an n of 1 trial, 21% of patients with CAL responded to theophylline. There remains a rationale for considering theophylline in patients with irreversible CAL who remain symptomatic despite the use of inhaled bronchodilators, but the use of n of 1 trials to guide this decision did not yield clinically important advantages over standard practice.
Key Words: irreversible chronic airflow limitations n of 1 trials randomized trial theophylline
| Introduction |
|---|
|
|
|---|
Open therapeutic trials in single patients carry inherent biases. These include the placebo effect, the tendency for physicians and patients to want and to expect the therapy to work, and regression to the mean.19 ,20 These biases probably favor false conclusions that a treatment is effective and can, therefore, be expected to lead to the unnecessary use of a therapy in some patients.19 ,20 ,21 As well, many chronic diseases including irreversible CAL show fluctuations in severity that cannot be explained clearly by extraneous events. This variability in "natural history" may result in false conclusions that a treatment is not effective in some patients if it is started simultaneously with a period of decline. The n of 1 trial (randomized, double-blind, multiple crossover comparisons of active and placebo treatments in a single patient) is a potential way to limit such biases.19 ,20 ,21
We hypothesized that the use of n of 1 trials to guide theophylline therapy in a group of patients with CAL would lead to a better outcome than that a group in which theophylline use was guided by open therapeutic trials.21 This improvement would result from more reliable identification of those patients who do, and do not, benefit from the drug. Randomization, blinding, and multiple crossovers would limit false conclusions that theophylline is effective by controlling for the placebo effect, patient and physician expectations, and regression to the mean; multiple crossovers would also limit false conclusions that theophylline is not effective by controlling for temporal variation in the underlying course of the patients' CAL. To the extent that the biases of open trials appear weighted toward false conclusions of efficacy, we also predicted that n of 1 trials would result in less overall theophylline use than standard practice.
In a previous randomized study of theophylline use for irreversible CAL guided by n of 1 trials vs standard practice, we found that n of 1 trials significantly reduced theophylline use over 6 months.21 However, improved quality of life and exercise capacity among n of 1 trial patients was not seen, possibly because of the small sample size.21 We report a larger study with follow-up to 1 year that tested two a priori hypotheses. Our primary hypothesis was that patients with irreversible CAL who were given theophylline guided by n of 1 trials would have better quality of life and exercise capacity than patients treated according to standard practice. Our secondary hypothesis was that n of 1 trials would result in less theophylline use than standard practice. We also report the results of a post hoc analysis in which patients from this and our previous study were combined.
| Materials and Methods |
|---|
|
|
|---|
15% and
200 mL after the use of inhaled salbutamol. Patients were recruited
from four outpatient clinics in two tertiary care Canadian centers. Two
strata were defined before randomization. "Prior-theophylline-use"
patients were taking theophylline at the time of recruitment and were
using inhaled bronchodilators (ipratropium bromide, 40 µg qid,
salbutamol, 200 µg bid, or another inhaled
ß2-agonist in adequate dosage).
"No-prior-theophylline-use" patients were also using inhaled
bronchodilators but were not taking theophylline at the time of
recruitment. Before being entered into the study, these patients were
prescribed open-label theophylline (Theo-Dur; Key Pharmaceuticals;
Kenilworth, NJ), to be used for at least 2 weeks, and the dose
was adjusted as required to obtain theophylline blood levels between 30
and 110 µmol/L without side effects. Patients were excluded before randomization if (1) they could not tolerate theophylline at any dose, or (2) they answered "yes" to the question, "Are you certain that theophylline is helping you?" while having theophylline blood levels between 30 and 110 µmol/L. These exclusions arose from the prediction that n of 1 trials are of little use in patients who demonstrate obvious beneficial effects or adverse effects during an open run-in with the treatment.23 In both cases, n of 1 trials are likely to simply confirm these responses and are, therefore, a waste of time and effort.23
| Interventions |
|---|
|
|
|---|
|
Immediately on stopping the n of 1 trial, the code was opened and the mean symptom score for each 10-day treatment period was determined. If the patient had received at least one treatment pair, the point estimate on the difference in mean symptom score within treatment pairs (theophylline minus placebo) was determined; if two or more treatment pairs were received, the 90% confidence limits (CLs) on this estimate were also determined.23 If diary scores were not available for at least one treatment pair, a recommendation to stop or continue theophylline was made according to clinical judgment and patient preference. For all other n of 1 trials, the recommendation about theophylline therapy was guided by a previously described algorithm that considered the point estimate on the mean difference in symptom score and, where available, its 90% CLs (Fig 2 ).21 We accepted 90% CLs, rather than the conventional 95% CLs, in deciding about a treatment effect within the n of 1 trials as a compromise between the inherent low power of n of 1 trials having four or less treatment pairs and the need to avoid a type I (false-positive) statistical error.20 ,21 Assumptions implicit in the algorithm were that the minimal important difference for n of 1 trials using a 7-point Likert scale was a mean change of 0.5/symptom24 and that within-patient serial correlation in mean responses could be ignored.25
|
Patients in both groups returned to their primary care physicians once the initial 3-month phase of the study was completed. Changes to their treatment including changes regarding theophylline were left to the discretion of the primary care physician.
| Outcome Measures and Sample Size |
|---|
|
|
|---|
The recommendation within the first 3 months to continue or stop theophylline usage for all patients was recorded, as was the proportion of patients taking theophylline at 6 and 12 months. Patient and physician confidence in the treatment decisions were independently assessed by the question, "How confident are you that theophylline should be stopped?" or "... continued?," with responses ranging from 1 (totally comfortable) to 7 (extremely uncomfortable). Cointerventions (inhaled agents [beclamethasone diproprionate, budesonide, flunisolide, ß2-agonists, and ipratropium bromide], prednisone, and home oxygen) in the week before the 6- and 12-month assessments were also recorded.
A sample of 80 patients was chosen based on a two-tailed
-error of
5%, 80% power, a 20% loss to follow-up, and the use of the CRQ
Physical and Emotional function scores and the 6-min walk as primary
outcomes. Minimal important differences for within-group changes over 1
year were assumed to be 4.5 for the Physical Function
score,29
5.5 for the Emotional Function
score,29
and 30 m for the 6-min
walk.9
,27
Estimates for within-group variance over time
for these measures were derived from our previous trial.21
An a priori sample size estimate for the differences in drug
usage between the two groups was not done because it was a secondary
hypothesis.
| Analysis |
|---|
|
|
|---|
| Assignment and Masking |
|---|
|
|
|---|
| Results |
|---|
|
|
|---|
|
|
|
|
Comparison of n of 1 Trial and Standard Practice Patients According
to Duration of Theophylline Use at Entry (Secondary Analysis)
Tables 2
and 3
also show the CRQ scores, 6-min walk results, and
results of theophylline therapy over time among n of 1 trial patients
and standard practice patients according to the prerandomization
strata. An apparent interaction was present between previous
theophylline use and theophylline treatment groups in the CRQ Emotional
Function scores at 6 months (p = 0.034) and 12 months (p = 0.039).
Comparisons between n of 1 trial patients and standard practice
patients within the subgroups at 6 and 12 months were not significant,
with the exception of an increase in the Emotional Function score among
n of 1 trial patients ( + 5.6) compared with a decrease among standard
practice patients (2.2) at 6 months (p = 0.013, unpaired
t test) within the no-prior-theophylline-use group.
The mean baseline theophylline level was higher in the prior-theophylline-use group than in the no-prior-theophylline-use groups (64 µmol/L vs 49 µmol/L, respectively; p = 0.002). A greater number of prior-theophylline-use patients than no-prior-theophylline-use patients were taking theophylline at 12 months (22/35 patients [63%] vs 9/26 patients [35%], respectively; p = 0.02). Among no-prior-theophylline-use patients, no differences were seen in the proportion of n of 1 trial patients and standard practice patients taking theophylline over time. Among prior-theophylline-use patients, more standard practice patients than n of 1 trial patients were taking theophylline at 3 months (26% more) and 6 months (21% more). However, these differences were not statistically significant (3 months, p = 0.18; 6 months, p = 0.19; Fisher's exact test).
Conduct and Results of n of 1 Trials
Four n of 1 trial patients stopped their n of 1 trials
before or on completion of the first treatment period and refused to
continue. Of these, three patients in the prior-theophylline-use group
chose to remain on theophylline openly (two patients had received
theophylline only and one patient had received placebo only), and one
patient in the no-prior-theophylline-use group had received
theophylline only and chose not to continue the drug. Of the remaining
30 patients, 4 provided diary scores for one pair of theophylline and
placebo treatments from which a point estimate on the difference in
dyspnea score could be determined. The other 26 patients completed at
least two treatment pairs (8 patients, two pairs; 6 patients, three
pairs; 12 patients, four pairs) from which CLs on the point estimate
could also be determined. Early crossovers to alternate treatments
within pairs occurred in six patients, and four patients were treated
for a respiratory infection during the n of 1 trials but did not
crossover or end the treatment period early.
Seven of 34 patients (21%) undergoing n of 1 trials demonstrated a potentially useful improvement in dyspnea while on theophylline (Table 4 ). In each case, the point estimate on the improvement in mean daily dyspnea score, as measured by the 7-point Likert scale during theophylline therapy, exceeded zero (range, 0.3 to 3.1), and the 90% CLs on the estimates were either removed from zero (five patients) or marginally overlapped zero (two patients). Seven of 18 prior-theophylline-use patients (39%) and 1 of 13 no-prior-theophylline-use patients (8%) (p = 0.10; Fisher's exact test) chose either to resume theophylline following a negative n of 1 trial, or to abandon their n of 1 trial in favor of remaining on theophylline, even though they had not received at least one course of both theophylline and placebo treatments.
|
| Discussion |
|---|
|
|
|---|
We doubt that biased assessment of quality of life and exercise capacity explains our negative result. Personnel ascertaining these measures were masked to treatment- group allocation. We also suspect that the isolated finding of a higher CRQ Emotional Function score at 6 months among n of 1 trial patients within the No Prior Theophylline Use subgroup reflected chance. Thus, other reasons for our negative finding, beyond the main explanation that n of 1 trials are not effective in this setting, should be considered.
First, the question of inadequate statistical power can be addressed by comparing the observed 95% CLs for the between-group differences (n of 1 trial minus standard patients) in within-group changes over time (12 months minus baseline) in the primary outcomes to their corresponding minimal important differences. Given the respective CLs and minimal important differences for the CRQ Physical Function score (8.2, 2.5; 4.5),29 the CRQ Emotional Function score (4.7, 5.7; 5.5), 29 and the 6-min walk (26, 42 m; 30 m),9 ,27 it remains possible, but unlikely, that clinically important effects in favor of n of 1 trials were missed. A second possible reason for our negative result is that theophylline is not effective for irreversible CAL through 1 year. This question has not been answered in existing randomized, placebo-controlled trials because no studies have assessed theophylline's effect beyond 2 months.4 ,5 ,6 ,7 ,8 ,9 ,10 ,11 ,12 ,13 ,14 ,15 ,16 ,17 If the drug's impact on patient well-being is not maintained in the longer term, then our study could not have been expected to show a difference.
A third possible reason for the negative result was that the n of 1 trials were conducted suboptimally. Nearly one half (16/34) of the n of 1 trials were based on two or less pairs of active and placebo treatments, including four patients in whom only one course of either active or placebo treatment was received. Because the experimental unit in an n of 1 trial is a treatment period, the power to detect differences between two therapies depends directly on the number of treatment periods. Thus, n of 1 trials having very few treatment periods are unlikely to detect weak treatment effects. This tendency toward low power is accentuated by the recommendation, to which we adhered, of excluding patients who were sure that theophylline was helpful following an open trial.23 However, our rate of stopping one half of the n of 1 trials before completing at least three treatment pairs is consistent with other studies,20 ,31 and it suggests that the failure of patients to complete multiple treatment crossovers is an inherent limitation of n of 1 trials. Our decision to exclude patients who were sure that theophylline was helpful following open therapy should not have invalidated conclusions regarding the effects of n of 1 trials or standard practice on group outcomes because of differing baseline perceptions about the drug's value. This conclusion is based on our randomizing patients to n of 1 trials or standard practice only after confirming that they were uncertain about the effects of theophylline. However, this approach does limit the generalizability of our findings. Specifically, we did not assess the impact of initiating theophylline by either n of 1 trials or standard practice, and without an open run-in, in patients with CAL.
A final possible reason for the negative result may reflect our decision to return patients to their own physician following the first 3 months of the study. In so doing , we adopted an effectiveness ("real world") rather than efficacy ("ideal world") strategy to assess the impact of n of 1 trials. In seven patients, nonstudy physicians subsequently chose to resume theophylline despite conclusions during the n of 1 trials or standard practice approach that it was ineffective. We cannot determine the impact of these reversals on the comparisons of quality of life and exercise capacity between the n of 1 trial group and the standard practice group beyond noting that (1) there was not a large difference between the two groups in the number of patients who subsequently resumed theophylline (five and two patients, respectively), and (2) excluding these patients did not yield different results from the primary, intent-to-treat analysis. It is, therefore, less likely that these switches materially affected our overall findings. However, investigators comparing outcomes between patients randomized to n of 1 trials or standard practice in future studies should consider limiting subsequent treatment changes (by directly controlling patient therapy beyond the initial phase) or allowing such reversals to occur but maintaining study power to detect differences after excluding patients who switch (by randomizing a larger number) .
Our secondary hypothesis that n of 1 trials would lead to less theophylline use than standard practice was also not confirmed in the present study. It was based on the premise that, overall, the biases of single-patient open trials favor false conclusions that the treatment is effective, and our previous randomized study of 26 patients in which 47% (95% CLs, 14%, 80%) fewer n of 1 trial patients than standard practice patients were taking theophylline at 6 months without differences in CRQ scores and the 6-min walk.21 Patients in our previous study had been on theophylline at the time of recruitment for at least a year, making them more comparable to prior-theophylline-use patients in the present study. Both patient groups were probably "theophylline survivors," being less sensitive to the drug's side effects and having a stronger perception that it was relieving symptoms compared to no-prior-theophylline-use patients. This inference is supported by our observations of prior-theophylline-use patients versus no-prior-theophylline-use patients of higher theophylline levels at entry, a greater proportion of patients taking theophylline at 1 year, and a larger proportion of patients resuming theophylline following a negative n of 1 trial or prematurely abandoning the n of 1 trial in favor of continuing the drug.
Despite the greater tendency among prior-theophylline-use patients to tolerate and remain on theophylline, we found that and fewer n of 1 trial patients than standard practice patients within this stratum were taking theophylline at 3 months (26%) and 6 months (21%), without clinically important changes in CRQ scores and exercise capacity. While the reduction in theophylline use and the lack of changes in quality of life and exercise capacity did not reach statistical significance, given the risks of theophylline1 and the large number of patients currently on the drug, these findings would be of potential clinical and economic importance if confirmed in a larger sample. We explored this possibility by combining prior-theophylline-use patients in the present trial with patients from our previous trial. We found a statistically and clinically significant reduction in theophylline use (32%) at 6 months among n of 1 patients vs standard practice patients, without significant reductions in the CRQ Physical and Emotional Function scores and the 6-min walk. This finding supports the presence of an overall bias toward false conclusions of efficacy when stopping theophylline openly in patients with CAL who have been taking it for some time, and it provides justification to use n of 1 trials in this setting. However, this conclusion must be qualified because it is based on a post hoc analysis for which confirmation is needed.
Our study neither strengthens nor diminishes the case for considering a trial of theophylline in some patients with irreversible CAL. In particular, our overall negative result does not mean that theophylline should be abandoned because it is ineffective in all such patients. This is because 7 of 34 patients (21%) undergoing n of 1 trials demonstrated a potentially useful response to the drug on the basis of their daily dyspnea scores (Table 4 ). In each case, the improvement in dyspnea score on theophylline exceeded zero, and the 90% CLs on the estimates made it unlikely, though not impossible, that chance explained the difference. Therefore, given the objectivity of n of 1 trials, the improvement in dyspnea during theophylline therapy in these patients seems genuine. Furthermore, these results corroborate those of previous parallel-group, placebo-controlled trials in which theophylline improved the quality of life and/or exercise capacity.7 ,9 ,10 ,14 ,15 Thus, there remains a rationale for a trial of theophylline in selected patients with irreversible CAL who remain symptomatic despite inhaled bronchodilators.
In conclusion, guiding theophylline therapy for patients with irreversible CAL by n of 1 trials did not improve their quality of life or exercise capacity, or reduce overall drug usage through 1 year compared to standard practice. Although some patients with CAL who underwent n of 1 trials of theophylline noted an improvement in dyspnea, we did not show that routine use of n of 1 trials to establish the value of theophylline in this setting yielded important advantages compared to the standard practice of open, before-after trials. In a post hoc analysis requiring confirmation, using n of 1 trials in patients with CAL who were already taking theophylline reduced drug usage over 6 months compared to standard practice without changes in their quality of life or exercise capacity. Further randomized studies of n of 1 trials vs standard practice for this and other problems are needed before widespread adoption of n of 1 trials can be recommended.32
| Acknowledgements |
|---|
| Footnotes |
|---|
Correspondence to: Dr. Jeffrey Mahon, Rm 6OF11, University Hospital, PO Box 5339, 339 Windermere Rd, London, ON N6A 5A5, Canada; e-mail: Jl.Mahon@lhsc.on.ca
Abbreviations:CAL = irreversible chronic airflow limitations; CL = confidence limit; CRQ = chronic respiratory disease questionnaire
Received for publication December 22, 1997. Accepted for publication July 16, 1998.
| References |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
G. E. Vist, K. B. Hagen, P J Devereaux, D. Bryant, D. T. Kristoffersen, and A. D. Oxman Systematic review to determine whether participation in a trial influences outcome BMJ, May 21, 2005; 330(7501): 1175. [Abstract] [Full Text] [PDF] |
||||
![]() |
J.R. Curtis and D.L. Patrick The assessment of health status among patients with COPD Eur. Respir. J., June 1, 2003; 21(41_suppl): 36S - 45s. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. A. PAUWELS, A. S. BUIST, P. M. A. CALVERLEY, C. R. JENKINS, and S. S. HURD Global Strategy for the Diagnosis, Management, and Prevention of Chronic Obstructive Pulmonary Disease . NHLBI/WHO Global Initiative for Chronic Obstructive Lung Disease (GOLD) Workshop Summary Am. J. Respir. Crit. Care Med., April 1, 2001; 163(5): 1256 - 1276. [Full Text] |
||||
![]() |
D. A. Mahler How Should Health-Related Quality of Life Be Assessed in Patients With COPD? Chest, February 1, 2000; 117(2_suppl): 54S - 57S. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE |