|
|
||||||||
Guest Access | Sign In via User Name/Password |
|||||||||
* From the Mexican National Institute of Respiratory Diseases (Drs. Pérez-Padilla and Regalado-Pineda), the National Institute of Public Health (Drs. Mendoza and Rojas), and the Environmental Health Directorate of the Secretariat of Health (Drs. Torres, Borja-Aburto, and Olaiz), Mexico City, Mexico.
Other members of the EMPECE study group are given in the Appendix.
Correspondence to: Rogelio Pérez-Padilla, MD, Instituto Nacional de Enfermedades Respiratorias, Tlalpan 4502, México DF, México; e-mail: perezpad{at}servidor.unam.mx
| Abstract |
|---|
|
|
|---|
Subjects and methods: Three thousand three hundred forty-seven participants from the third through sixth grades of elementary school were recruited to perform biannual spirometry, yielding a maximum of seven evaluations and a total of 15,563 tests. Standard recommendations of the American Thoracic Society (ATS) were followed, using dry rolling-seal volume spirometers.
Results: During their first spirometric test, > 95% of the subjects fulfilled each of the quality criteria proposed by ATS for adults, though not all of them did so simultaneously. For example, only 72.4% obtained three acceptable maneuvers, reproducibility for FEV1 and FVC to < 200 mL, and a small back-extrapolated volume that increased to 92.3% by the second test. Between phase 1 and phase 7 of the study, spirometry quality increased significantly, as a result of subject and technician training. Intratest and intertest (with a 6-month difference) spirometric variability was less in boys than in girls. Intratest variability was also lower in younger and taller subjects. Technicians contributed significantly to intratest and intertest variability, the latter decreasing if the same technician performed both evaluations.
Conclusion: Children > 7 years old can fulfill ATS criteria of quality after the first spirometric evaluation. To maintain quality of spirometric tests in longitudinal studies of children, a strict control is required, especially of technician performance.
Key Words: American Thoracic Society criteria children European Respiratory Society criteria quality control spirometry
| Introduction |
|---|
|
|
|---|
| Materials and Methods |
|---|
|
|
|---|
Methodology for Spirometric Examination
Tests were done with six identical dry rolling-seal horizontal volume spirometers (SensorMedics Model 922; SensorMedics; Yorba Linda, CA). These spirometers fulfill the quality criteria recommended by the American Thoracic Society (ATS).1
2
Routine calibration was performed with a 3-L syringe (SensorMedics), and quality control checks were performed with a flow-volume syringe (Flow-Volume Calibrator; Jones Medical Instruments; Oak Brook, IL).
Before each study, height and weight without shoes were measured. Spirometer temperature was measured by a digital wall thermometer and included in the data along with the mean barometric pressure of Mexico City (583 mm Hg).
Technicians were trained specifically for this project, using the guidelines of the Spirometry Course of the National Institute of Occupational Safety and Health. Four full-time field supervisors participated, two of them in charge of spirometry quality control, along with 12 technicians. Standard procedures suggested by ATS1 were followed, with additions coming from a similar study3 that involved students of the same age.
To avoid inhaling from spirometers, only the expiratory part of forced expiratory maneuvers was registered. A maximum of nine forced expiratory maneuvers were performed to complete three acceptable readings according to ATS criteria; each was identified automatically by a nil error code. That is, each maneuver identified with a nil error code has a back-extrapolated volume (Vbe)
5% of FVC or
100 mL, an expiratory time of
6 s, or a zero volume change (< 50 mL) for the last 2 s of expiration. In addition, reproducibility for FVC and FEV1 within 5% was identified automatically.
The three best maneuvers were saved by the computer. In general, < 5 maneuvers (done while standing) were required to obtain three acceptable readings. If the child had a tendency to inspire during the FVC maneuver and continue expiration later, the maneuver was repeated using nasal clips.
At the end of each study, technicians reviewed a quality control screen on the monitor to certify the correct selection of best maneuvers and values by the machine. Spirometric technique was supervised at least once a week by the quality control staff. To analyze only reliable data, at the end of a season all spirometric tests carried out were reviewed once again.
The quality of spirometric tests was assessed by several criteria in addition to the automatic evaluation done by the software. One was the number of acceptable maneuvers according to ATS,1 ranging from 0 to 3, the highest kept by the spirometric software. Another indicator of quality was reproducibility. FEV1 and FVC were considered reproducible according to ATS criteria when the best two trials differed by not more than 200 mL. In addition, intratest reproducibility was quantified by the difference between the two best measurements of FEV1, FVC, forced mid-expiratory flow rate (FEF2575), and peak expiratory flow rate (PEFR) in original units and expressed as percentages (percentage difference between the two highest FEV1 values [dFEV1%], percentage difference between the two highest FVC values [dFVC%], percentage difference between the two highest FEF2575 values [dFEF2575%], and percentage difference between the two highest PEFR values [dPEFR%]) and the coefficient of variation (COV)4 of the same variables.
Quality criteria were also calculated as described by Enright et al5 in the Lung Health Study. These criteria include, in addition to dFEV1% and dFVC%, the reproducibility of PEFR (absolute difference between the two highest PEFR values [dPEFR]), which is much more difficult to achieve. In addition, performance for flows and volume is estimated separately. The original classification was used for flow quality, but for volume it was modified slightly because spirometers did not provide the expiratory time. For the purposes of this study, volume "A" grade was assigned if the average time to reach vital capacity (FET100%VC) in the three best maneuvers was > 6 s. FET100%VC is shorter than expiratory time if vital capacity can be reached before expiratory effort ends; in other words, when a volume plateau is present, a common event in children.
As indicators of intrasubject, intertest reproducibility (6-month difference), the Pearson correlation coefficient,6 the intraclass correlation coefficient, and the COV4 were all used. Also calculated was the percentage of subjects who fulfilled quality criteria for a good beginning of test, as proposed by ATS1 (Vbe <150 mL or 5% FVC) and the European Respiratory Society (ERS)7 [Vbe <100 mL or 5% FVC]. The end-of-test criteria was not assessed independently because the end-of-test volume and forced expiratory time are not provided by the software.
Spirometric variability was analyzed with longitudinal models (generalized estimation equation) using as the dependent variable one indicator of reproducibility (the COV or the percentage difference between the best two maneuvers), and as independent variables, the phase of the study (indicator variables from 1 to 7), spirometry order (indicator variables from 1 to 7), height, age, gender, and technician. Similar models were designed to assess intertest (intrasubject with 6-month difference) variability adjusting by study phase, age, gender, height, and technician. Calculations were done with the help of Stata Statistical Software (Stata Corporation; College Station, TX).
| Results |
|---|
|
|
|---|
|
|
|
|
Adjusting for the rest of the variables, including size, age, and gender, a 6% increase of FEV1 in the seventh evaluation was observed, which could be attributed to training; however, one technician obtained on average values 8% higher and another 6% lower than the rest, adjusting by the rest of the variables. Spirometric reproducibility of FEV1, FVC, FEF2575, PEFR, and other instantaneous flows such as maximal expiratory flow when 50%, 25%, 75% of the FVC remains to be expired, increased in the study after the first evaluation. Variability adjusted by study phase (generalized estimation equation model) was lower in boys than in girls for the main measurements: FEV1, FVC, and PEFR. Similar results were obtained using the flow and volume scores of Enright et al5 as the dependent variables. In longitudinal models adjusted by age, height, phase of study, and technician, the group with the least intratest variability was that made up of tall young boys. In longitudinal multivariate models, the intratest variability of FEV1 decreased with study phase (time), but additional association with spirometry order was not present. However, study phase and spirometry order contributed significantly to intratest variability of FVC and instantaneous flows, decreasing with time and number of spirometric tests done, likely due to the training of children and technicians.
Technicians contributed substantially to intratest and intertest variability. When consecutive spirometric tests were done by the same technician, variability decreased significantly. Those technicians who performed better, obtained higher spirometric values despite adjusting simultaneously by height, weight, gender, phase of study, and order of spirometry in a longitudinal model. Compared with the technician who performed more studies, the mean estimated difference of the rest of technicians was + 15.9 mL for FEV1 and + 34.6 mL for FVC, with 75% of the values between - 15 mL and + 63 mL for FEV1 and - 4 to + 82 for FVC. Twelve technicians did 71% of all tests, and their performance did not differ significantly from the rest.
Intertest variability was assessed in 10,097 spirometric tests done in 2,746 subjects with repeated tests with a 6-month difference. In this group, the mean number of tests was 3.7. Table 5 shows the 6-month intertest (intrasubject) reproducibility, estimated by using the Pearson correlation coefficient, the intraclass correlation coefficient, and the COV for all tests done. The intraclass correlation coefficient for FEV1 and FVC was > 0.8, and the COV was approximately 5% and changed little, excluding outliers. Variability increased between the fifth and sixth evaluations, but then decreased again (Table 5) . Reproducibility was poorer for PEFR but also improved considerably during the study.
|
The FEV1 was on average 103.4% (SD, 113.1) of that predicted by the third National Health and Nutrition Evaluation Survey for Mexican Americans,10 and 100.6% (SD, 12.0) of that predicted for Mexicans;11 whereas similar numbers for FVC were 104.6% (SD, 13.5) and 99.9% (SD, 11.7).
| Discussion |
|---|
|
|
|---|
It is also important to assess reproducibility periodically to ascertain if quality was maintained, a difficult issue in longitudinal studies; however, children and their lung function grow between two consecutive tests (6 months) and therefore spirometric measurements differ even with good quality control. Despite lung growth, the correlation between two consecutive tests will be higher and the COV lower in better-performed tests. Intertest reproducibility was maintained during the study, except for a transient drop during the fifth phase that coincided with an episode of high air pollution. During this episode, some of the technicians participated in an epidemiologic surveillance team that returned to the evaluation of students at a later date. Due to the time constraints of evaluation, it is likely that a group of students was evaluated quickly in order to finish before the summer break.
Children were able to fulfill the ATS criteria of acceptability used with adults, including the Vbe, end-of-test criteria, and an expiration lasting for at least 6 s. Almost 91% of the tests evaluated had three acceptable maneuvers, and only 1.4% obtained no acceptable maneuvers. ATS requirements for adults can be achieved in children with few modifications as suggested by Enright et al.8 It was noticeable that 93.9% of the population was able to reproduce FEV1 and FVC to < 150 mL or 5%, slightly better than previously reported.8 Approximately 95% of the children studied fulfilled each of the standards proposed by Enright et al8 for children, but not all of them together. For example, 92.3% of the tests in the subjects have an extrapolated volume < 5% of FVC, 94.9% have an absolute difference between the two highest FVC values (dFVC) < 200 mL and 5%, 94.9% have an absolute difference between the two highest FEV1 values (dFEV1) < 200 mL and 5%, and 94.0% have a dPEFR < 1,000 mL/s and 5%. However, only 79.5% of the subjects fulfilled all those four criteria simultaneously. Neither the forced expiratory time, PEFR time, nor the end-of-test volume could be evaluated quantitatively because they are not provided in the database of the spirometers, but it is likely that the percentage of children complying simultaneously with all proposed quality criteria will be low. It is clear that specific quality criteria for spirometry in children must be agreed on.12
The impact of technicians on the pulmonary function in longitudinal multivariate models was important. In this study, two technicians with extreme performance produced estimated differences in FEV1 of 200 mL (approximately 12%), adjusted for the rest of the variables, a magnitude that is clinically significant, although three fourths of the technicians differ < 70 mL on average. Technician training and permanent evaluation is therefore essential; in addition, its contribution should be taken into account during analysis.
The quality of spirometry increased from the second phase of the present project, due to learning by both children and technicians, as well as a better interaction between both. Ideally, the best level of performance in technicians and children should be attained in a pilot phase, before obtaining data, but in this study that was unfeasible. Another priority in prolonged studies is keeping the interest of people who are taking part: as enthusiasm tends to wean, so does spirometric quality. This goal was also achieved, despite a minor decrease in spirometric quality during the fifth phase that was solved afterwards.
Clinical spirometry in children also requires quality assurance. Although the specific levels of reproducibility achievable in routine pediatric care have to be determined, the issues discussed in the present document are relevant to clinical practice including the importance of technician training, evaluation and supervision.
| Appendix |
|---|
|
|
|---|
| Acknowledgements |
|---|
| Footnotes |
|---|
This work was presented at the American Thoracic Society Meetings held in May 2001.
Supported by CONACYT (National Council of Science and Technology of Mexico), the Ministry of Health, the National Institutes of Respiratory Diseases and Public Health, the CONSERVA Foundation and the following pharmaceutical laboratories: Ciba, Glaxo, Promeco, Eli Lilly, and Smith, Kline, and Beecham.
Received for publication May 15, 2002. Accepted for publication October 11, 2002.
| References |
|---|
|
|
|---|
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |