Skip to main content

Main menu

  • Home
  • Current issue
  • ERJ Early View
  • Past issues
  • ERS Guidelines
  • Authors/reviewers
    • Instructions for authors
    • Submit a manuscript
    • Open access
    • COVID-19 submission information
    • Peer reviewer login
  • Alerts
  • Subscriptions
  • ERS Publications
    • European Respiratory Journal
    • ERJ Open Research
    • European Respiratory Review
    • Breathe
    • ERS Books
    • ERS publications home

User menu

  • Log in
  • Subscribe
  • Contact Us
  • My Cart
  • Log out

Search

  • Advanced search
  • ERS Publications
    • European Respiratory Journal
    • ERJ Open Research
    • European Respiratory Review
    • Breathe
    • ERS Books
    • ERS publications home

Login

European Respiratory Society

Advanced Search

  • Home
  • Current issue
  • ERJ Early View
  • Past issues
  • ERS Guidelines
  • Authors/reviewers
    • Instructions for authors
    • Submit a manuscript
    • Open access
    • COVID-19 submission information
    • Peer reviewer login
  • Alerts
  • Subscriptions

Use of forced vital capacity and forced expiratory volume in 1 second quality criteria for determining a valid test

John L. Hankinson, Bill Eschenbacher, Mary Townsend, Janet Stocks, Philip H. Quanjer
European Respiratory Journal 2015 45: 1283-1292; DOI: 10.1183/09031936.00116814
John L. Hankinson
1Hankinson Consulting, Inc., Athens, GA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: john@hankconsulting.com
Bill Eschenbacher
2Cincinnati VA Medical Center, Cincinnati, OH, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Mary Townsend
3MC Townsend Associates, Pittsburgh, PA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Janet Stocks
4Respiratory, Critical Care and Anaesthesia Section (Portex Unit), UCL Institute of Child Health, London, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Philip H. Quanjer
5Dept of Pulmonary Diseases, Erasmus University Medical Centre, Rotterdam, The Netherlands
6Dept of Paediatrics, Division of Respiratory Medicine, Erasmus University Medical Centre – Sophia Children's Hospital, Rotterdam, The Netherlands
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • Article
  • Figures & Data
  • Info & Metrics
  • PDF
Loading

Abstract

The 2005 American Thoracic Society (ATS)/European Respiratory Society (ERS) spirometry guidelines define valid tests as having three acceptable blows and a repeatable forced vital capacity (FVC) and forced expiratory volume in 1 s (FEV1). The aim of this study was to determine how reviewer and computer-determined ATS/ERS quality could affect population reference values for FVC and FEV1.

Spirometry results from 7777 normal subjects aged 8–80 years (NHANES (National Health and Nutrition Examination Survey) III) were assigned quality grades A to F for FVC and FEV1 by a computer and one reviewer (reviewer 1). Results from a subgroup of 1466 Caucasian adults (aged 19–80 years) were reviewed by two additional reviewers. Mean deviations from NHANES III predicted for FVC and FEV1 were examined by quality grade (A to F).

Reviewer 1 rejected (D and F grade) 5.2% of the 7777 test sessions and the computer rejected ∼16%, primarily due to end-of-test (EOT) failures. Within the subgroup, the computer rejected 11.5% of the results and the three reviewers rejected 3.7–5.9%. Average FEV1 and FVC were minimally influenced by grades A to C allocated by reviewer 1.

Quality assessment of individual blows including EOT assessments should primarily be used as an aid to good quality during testing rather than for subsequently disregarding data. Reconsideration of EOT criteria and its application, and improved grading standards and training in over-reading are required. Present EOT criteria results in the exclusion of too many subjects while having minimal impact on predicted values.

Abstract

ATS/ERS end of test criteria for spirometry exclude too many subjects while having minimal impact on predicted values http://ow.ly/DQRCD

Introduction

The first step in deciding if a subject's spirometry results should be included in a study is by assessing the quality and validity of their test session. The 2005 American Thoracic Society (ATS)/European Respiratory Society (ERS) statement [1] defines a valid test during test performance as one that has three acceptable blows and a repeatable forced vital capacity (FVC) and forced expiratory volume in 1 s (FEV1). The ATS/ERS state that “It is desirable to use a computer-based system that provides feedback to the technician. […] The repeatability criteria are used to determine when more than three acceptable FVC manoeuvres are needed; these criteria are not to be used to exclude results from reports or to exclude subjects from a study” [1]. However, it is unclear when quality is insufficient for acceptance of results into research studies. There are no specific recommendations as to how strictly these criteria should be applied during test inclusion decisions, or the extent to which a reviewer should over-ride strict or computerised decisions regarding test validity. Which tests (subjects) are included in a study may impact study results, particularly in widely used reference values studies. Thus, amendments to quality criteria during the past two decades could potentially impact on spirometry reference standards. The NHANES (National Health and Nutrition Examination Survey) III spirometry data were collected with the aim of each subject trying to exhale for at least 6 s, with a minimum of five blows and a repeatable FVC and FEV1 within 5% or 100 mL. By contrast, the 1987 ATS criteria defined an end-of-test (EOT) plateau as a “<40 mL increment over 2 s” [2]. This was manually determined using an expanded EOT display using no change in volume for 1–2 s and/or a good expiratory effort. The ATS EOT criteria were subsequently amended, first to “no change in volume over 1 s”, based on the spirometer's limit of detection volume of ≤0.030 L [3], and then to “no change in volume (<0.025 L) for ≥1 s, and the subject has tried to exhale for >6 s in subjects aged >10 years” [1]. The extent to which available spirometry software has implemented EOT and other quality assessment algorithms is varied and unclear.

Therefore, the aims of the present study were as follows. 1) To assess what proportion of spirometry test results from healthy nonsmokers in the NHANES III study [4] would be included in a study if quality control decisions had been based on computerised application of the 1994 ATS [3] and 2005 ATS/ERS [1] EOT definitions. 2) To assess the extent to which different grading systems might affect published reference data. 3) To compare computer-generated quality grades for FEV1 and FVC with those from a reviewer, as well as comparing quality grades between three reviewers.

Since test quality is an integral part of clinical test interpretation and therefore differs considerably from its application when used in the research context, we limited the scope of our investigations to healthy research subjects.

Study design

Spirometry curves and results from the NHANES III reference value study [2] were re-analysed to determine to what extent observed values for FVC and FEV1 are affected by quality control using published grading systems [5–7]; modified here to accommodate ATS and ATS/ERS recommendations (table 1) when applied by a computer and reviewers.

View this table:
  • View inline
  • View popup
TABLE 1

Computer algorithms for assigning quality criteria grades for forced vital capacity (FVC)

Since “normal” subjects were used to derive the reference equations for NHANES III and there was one test per subject, the impact of alterations in quality control criteria and relative grades can be measured by any deviation from the original predicted values. Consequently, predicted values [2] for the entire dataset were compared across computer-determined quality grades, using ≤25 mL EOT criterion, with those from one reviewer (reviewer 1).

In a separate analysis, a subset of Caucasian adults was reviewed by two additional experienced reviewers to assess the agreement between reviewers. Apart from being told that “D” and “F” grades would result in a test (subject) being excluded from a study, no specific instructions or training were provided prior to these reviews.

Methods

Test results from 7777 healthy, nonsmoking subjects (58% male, 41% Caucasian, 31% aged <18 years) from the NHANES III study [4] were assigned quality grades (A through F) for FVC and FEV1, separately as previously described [5–7], using both computer algorithms (table 1) and grades allocated by reviewer 1. Data from 348 normal subjects previously excluded from the NHANES reference dataset [4], due to unreliable results as judged by two original reviewers, were included in this re-analysis. Data from all 7777 subjects (table E1) were reviewed and graded both by computer and reviewer 1 according to the different quality criteria. Test results from the subset of all healthy Caucasian adults (n=1466) were reviewed by two additional reviewers. Predicted or lower limit of normal values were not provided to the reviewers.

When assigning test session quality grades, the computer visual display of all the subject's blows were available to the independent reviewers (i.e. all spirograms and flow–volume curves, including any that did not meet quality criteria), together with: 1) the corresponding values for FVC, FEV1, FEV1/FVC and peak expiratory flow; 2) their variability (i.e. how much individual values differed from the largest (best) values); and 3) a series of green or red bars for each of these outcomes to indicate whether or not acceptability criteria had been met according to the computer, and whether the largest FVC and/or FEV1 were repeatable. Criteria for acceptability included: no artifacts, no abrupt termination, no glottis closure or cough within the first second of the test, no early termination, no leaks and no large back extrapolated volume, as well as a maximal continuous effort. The computer algorithm was set to determine if the EOT criterion was met, i.e. a volume change of <25 mL for ≥1 s, and whether forced expiratory time (FET) was ≥6 s. The computer grades and acceptability criteria, as shown in table 1, were provided as a guide but could be modified if judged appropriate by each reviewer following inspection of each blow, e.g. inspection of the tail of the curve appeared satisfactory even if it did not meet the computerised criteria of <25 mL in 1 s or FET ≥6 s . In contrast to the computer, reviewers used visual inspection to judge a satisfactory EOT rather than basing this decision on a specific value for FET. Using all the information above, FVC and FEV1 were scored separately; FVC and FEV1 values could be used from blows where EOT criteria had not been met.

Analysis

FVC and FEV1 were expressed as the difference (mL) between results observed in this study and the NHANES predicted values (ΔFVC and ΔFEV1) [4]. The mean (95% confidence intervals) for ΔFVC and ΔFEV1 were then calculated by quality grade. Mean differences should equal zero, with higher or lower values suggesting a bias according to quality grades. As we were assessing the influence of quality control on reference values obtained with the whole dataset [4], a subject's test session was included in the analysis even if not all EOT criteria were met. Therefore, any differences from predicted were not due to exclusion of blows where the FVC did not meet EOT criteria as such blows could still be used when deriving the subject's best FVC and FEV1.

While the ATS/ERS standardisation document states that manoeuvres that do not meet EOT criteria should not be used to satisfy the requirements of three acceptable blows [1], this requirement was ignored for the purposes of this study so that its potential effect on results could be explored.

Statistical analyses were performed using SigmaPlot (Version 12.5; Systat Software, Inc., San Jose, CA, USA). ANOVA and linear regression were used to test for within-reviewer trends between ΔFVC and ΔFEV1 and quality control grades. A p-value <0.05 was considered significant.

Results

Review of entire dataset

FVC

As expected, the rejection rate based on FVC quality alone increased with increasing quality and EOT requirements (table 2). When re-analysing the entire dataset, reviewer 1 rejected (D or F grade) 5.2% of the FVC results (4.5% for males, 6.5% for females), compared to 16% for the computer when using <25 mL EOT criterion (fig. 1). Requiring a quality grade of “A” would exclude >25% of tests, based on either reviewer or computer results (fig. 1). While the computer rejected a far higher percentage of tests (16% being graded F primarily due to failure to achieve an appropriate EOT, based on the 25 mL plateau criterion for any of the blows) than reviewer 1, the reviewer used additional information from inspecting the shape and consistency of curves, allocating fewer grade A but more grades B and C than the computer. For the test shown in figure 2, the computer assigned an FVC grade of “F” primarily due to less than two acceptable blows due to the technical lack of a plateau in the volume–time curve, while reviewer 1 assigned grade “B” since the FVC would only have increased by ∼70 mL had exhalation continued for 15 s [4].

FIGURE 1
  • Download figure
  • Open in new tab
  • Download powerpoint
FIGURE 1

Distribution of forced vital capacity quality grades by reviewer 1 and the computer. The computer rejected a far higher percentage of tests (16% being graded F due to failure to achieve an appropriate end-of-test, based on the 25 mL plateau criteria for at least two blows) than reviewer 1. The reviewer used additional information from inspecting the shape and consistency of the curves, allocating fewer grade A but more grades B and C than the computer.

FIGURE 2
  • Download figure
  • Open in new tab
  • Download powerpoint
FIGURE 2

Sample curve showing end-of-test (EOT) or plateau failure as determined by the computer. FVC: forced vital capacity; FEV1: forced expiratory volume in 1 s; FET: forced expiratory time.

View this table:
  • View inline
  • View popup
TABLE 2

Computer rejection rates based on application of original quality control criteria used in the NHANES III study and application of the 2005 ATS/ERS end-of-test (EOT) criteria#

There were no differences in reviewer rejection rates between ethnic groups (5.5%, 5.6% and 5.9% rejected by reviewer 1 for Caucasian, Mexican–American and African–American subjects, respectively).

FVC differed significantly from predicted for grades A, C, D and F for reviewer 1 (fig. 3). The results were minimally, but statistically (ANOVA p=0.03), different for grades A–C with the mean (95% confidence interval) ΔFVC across these grades being 13.5 (3.1–23.9) mL. There was only a slight trend for FVC decreasing between grades A–C, with grade C providing results that were essentially equivalent to grades A and B. The ΔFVC from predicted for reviewer-allocated grades D and F (−309 and −518 mL, respectively) were much larger than those associated with computerised grading (−78 and −158 mL, respectively; ANOVA p<0.001) (fig. 3). Similarly, while FVC did decrease with poor computer-determined grades, ΔFVC differed significantly from zero only for grades A and F.

FIGURE 3
  • Download figure
  • Open in new tab
  • Download powerpoint
FIGURE 3

Differences between predicted and observed forced vital capacity (FVC) according to quality control grades allocated by reviewer 1 and the computer when applied to the entire dataset. The quality control grades are presented in table 1. n=7777. *: p<0.05.

The total FET for all 7777 subjects (fig. 4a) was >6 s for virtually all subjects because this had been the EOT testing goal during test performance. However, in 54% of subjects aged ≥19 years and 74% of subjects aged <19 years, the expiratory time actually required to reach a plateau defined as a volume change of <25 mL over 1 s was <6 s. (table 2). The <25 mL EOT plateau requirement resulted in FVC differences from predicted that increased with age to ∼85 mL in older subjects (fig. 4b) because the criterion terminates the manoeuvre earlier.

FIGURE 4
  • Download figure
  • Open in new tab
  • Download powerpoint
FIGURE 4

a) Total expiratory time was consistently longer at any age than the time required to reach an end-expiratory plateau of 25 mL. Note that in subjects aged <37 years the average time to reach a 25 mL plateau is <6 s. b) Applying a 25 mL over 1 s criteria for the plateau leads to an age-related underestimation of the maximum forced vital capacity (FVC).

FEV1

The FEV1 grades were higher than FVC since they were not affected by EOT issues and relatively few poor (D or F) FEV1 manoeuvres were identified by either reviewer 1 or the computer (3.1% and 3.9%, respectively) (fig. 5). There was a large decrease in FEV1 from grade A to F with ΔFEV1 being significantly below zero for both grade D and F results allocated by reviewer 1 and for computer-determined grade F. Reviewer 1 had more grades B and C than the computer (21.5–6.45%), primarily because, even if back extrapolated volume did not exceed 150 mL or 5% of FVC, lower grading occurred if the peak flow was suboptimal, with the exception of one study [8]. Suboptimal effort has been shown to affect flows and hence the FEV1 [9–12].

FIGURE 5
  • Download figure
  • Open in new tab
  • Download powerpoint
FIGURE 5

a) Differences between measured and predicted forced expiratory volume in 1 s (FEV1) for the entire dataset, according to quality control grades allocated by reviewer 1 and the computer. The numbers above the bars represent the number of subjects. Reviewer 1 had fewer grade A and more grades B and C than the computer. b) Distribution of FEV1 quality grades by reviewer 1 and the computer. *: p<0.05.

Subset analysis

Among the 1466 Caucasian adults that comprised the subset, the computer rejected two to three times as many subjects as all reviewers (table 3). Figure 6a shows the differences between the observed and predicted FVC results by quality grade. Only the ΔFVC results for D and F grades from reviewer 1 and the F grades from reviewer 2 were statistically different from zero. However, the trend of positive difference associated with grade A to more negative differences with grade F found in figure 3 is repeated for all reviewers and the computer.

FIGURE 6
  • Download figure
  • Open in new tab
  • Download powerpoint
FIGURE 6

a) The difference between measured and predicted forced vital capacity (FVC) in 1466 Caucasian adults varied between reviewers and the computer, and increases the lower the quality grade. b) The computer rejected several measurements (grades D and F) which had little impact on predicted FVC, whereas reviewers were more discriminative, rejecting a low percentage of measurements with a considerable impact on FVC (a and b). See also figure E1for similar data relating to analysis of forced expiratory volume in 1 s in this subset. *: p<0.05.

View this table:
  • View inline
  • View popup
TABLE 3

Relative rates of rejection (D or F grade) and mean differences# in rejected values according to reviewers and computerised assessment for healthy subjects and Caucasian adults

Reviewers 1 and 2 found a significantly lower FEV1 for grade F (mean (95% CI) −243.6 (−428.0–  −59.2) mL and −263.1 (−416.3– −109.9) mL, respectively). An association between ΔFVC and ΔFEV1 with decreasing quality grade was found by reviewer 1 (all subjects: ANOVA p<0.001 and p=0.04; Caucasian adults: ANOVA p=0.04 and p=0.03, for ΔFVC and ΔFEV1, respectively). This trend was also observed for ΔFVC when using the 25 mL EOT criterion (p=0.03), but not for reviewer 3.

Differences between reviewers

Differences in rejection rates between reviewers were greater for FVC than FEV1 grades (3.7–5.9% and 1.9–2.1%, respectively) (table 3); thus, reflecting the more complex EOT judgments needed for FVC. Mean values for rejected FVC test results were significantly lower than predicted for all reviewers and the computer. FEV1 rejected test rates were similar for all reviewers and the computer (fig. E1). Only reviewer 1 and the computer had values for rejected tests that were significantly lower than predicted.

Discussion

The main finding in this study was the proportion of tests regarded as being of unsatisfactory quality when assessed by strict application of the ATS/ERS guidelines using computer algorithms, corroborating findings in children [13]. Including data rejected by the computer had no significant effect on overall results, whereas the reviewers were more discriminative. The large number of tests that would be rejected, particularly in younger subjects, due to failure to complete a 6-s exhalation but which could be included if a test was terminated as soon as an expiratory plateau had been obtained (table 2, fig. 4a), strongly suggests that the 2005 ATS/ERS EOT recommendations [1] need to be reconsidered. Our results also emphasise that quality assessment regarding the acceptability of individual blows including EOT assessments should be primarily used as an aid to good quality during testing rather than an excuse to subsequently disregard data. Of particular note is that use of the >25 mL EOT criterion resulted in an FVC error (fig. 4b), increasing with age up to a maximum error of 100 mL. Although these differences may not be clinically significant, such differences do indicate EOT quality may influence FVC results and are age dependent, even when using the current ATS/ERS recommended 25 mL EOT criterion.

There are several lessons to be learned from this study. The NHANES III study produced high-quality data because it adhered to a strict protocol [14], which included rigorous training of technicians and refresher sessions, well-defined test conditions, and care and maintenance of equipment, as well as online feedback of the quality of each manoeuvre and test. The maximum expiratory time limit was 20 s and a minimum of five blows were obtained. Yet a computer algorithm that applied the 2005 ATS/ERS recommendations (table 2) identified 18.2% of measurements as being of insufficient quality. A particularly important finding of this study was the large proportion of subjects, particularly younger subjects, who did not require 6 s to reach the 2005 ATS/ERS defined plateau volume of 25 mL (table 2). The fact that 53.8% of healthy subjects aged ≥19 years and 74.3% of subjects aged <19 years would have their FVC results rejected for lack of exhaling for 6 s strongly suggests that the 2005 ATS/ERS EOT criteria should be reconsidered and that computerised quality control should not be relied upon in isolation. Clearly, the 2005 ATS/ERS quality goals, while providing an appropriate incentive during test performance, should not be used to determine whether a subject's results are included in a study. Several other studies underpin this suggestion [15–17], with Müller-Brandes et al. [13] concluding that a fixed cut-off for FET should be abandoned.

Enright et al. [15] found that ∼80% of their subjects met the 2005 ATS/ERS spirometry goals of three acceptable blows and a repeatable test to within 150 mL (grade A or B) [1]. The authors used an EOT requirement of 40 mL in the last second instead of the ATS/ERS recommended 25 mL, closer to the EOT used by reviewer 1 who rejected ∼5.2% of the NHANES III tests due to FVC failures. Since the 50th and 75th percentiles for the EOT volume in the study by Enright et al. [15] were 23 mL and 38 mL, respectively, somewhere between 25% and 50% of their subjects would not have met the ATS/ERS requirements for an EOT volume of 25 mL; similar to our results using a computer determined EOT. In another study of patients aged 20–89 years, 33% of test results needed to be excluded, mainly because the 6 s criterion could not be satisfied [16], whereas among children aged 8–11 years only 13.3% could exhale for 6 s [17].

While aiming for a 3 s exhalation in children aged <10 years, and 6 s beyond that age serves as a good incentive during test performance, many people simply cannot expel any gas from their lungs for that long and FET should not be used as a criterion for disqualifying a manoeuvre. Signal drift and errors in the spirometry software may also lead to inaccurate recording of expiratory duration and EOT [18]. If a volume plateau is not reached during an exhalation extended to up to 15 s, despite a good effort (fig. 2), it seems unjustified to qualify the results as unsatisfactory. This situation is most likely to arise in subjects with airways obstruction, where the largest FVC from a 15-s manoeuvre with good effort will still provide clinically useful information. Measurements of lung function may be used for various purposes including clinical diagnosis or management, as outcome measures in research studies or for deriving predicted values, these do not necessarily require the same accuracy and precision. FVC and FEV1 quality grades should allow the use of different grading limits according to underlying purpose, or allow investigation of the effects of eliminating subjects with low-quality control grades from a study. We found that grades D and F were associated with lower test results whether allocated by reviewers or the computer, with these differences being highly significant for some reviewers.

Inter-reviewer differences

Differences in grading between reviewers were generally small (fig. 6b), with all reviewers showing decreasing test results with progressively poorer quality control grades. Reviewer 1 routinely reviews or grades tests for research studies where the concern is to prevent poor-quality tests from influencing the results and to provide quality feedback to technicians. Thus, reviewer 1 may be biased towards obtaining the most accurate expected value rather than determining whether the test results are “clinically” useful. Reviewer 2 runs spirometry training courses, conducts reviews for occupational clients and provides feedback to technicians. Therefore, reviewer 2 is also concerned with high levels of accuracy as job placement and costly unnecessary referrals may result from falsely low values. By contrast, reviewer 3 conducts reviews for clinical interpretation where the objective is detection or classification of disease and where false-negative results are more of a concern. Thus, for reviewer 3, the tendency is to be more concerned with test repeatability and particularly EOT issues, which can significantly impact the FEV1/FVC due to early termination resulting in a false negative classification of patients with the obstructive lung disease pattern.

One potential limitation of this study is that results are based on NHANES III data which were collected using a volume-type spirometer, whereas flow-type spirometers are more frequently used nowadays. However, since the errors associated with incorrect registration of zero flow that may occur in flow-type spirometers do not occur when using volume-type spirometers, the use of a volume-type spirometer may in fact be an advantage when investigating EOT quality issues.

Due to time constraints, the subset used for comparisons between reviewers was limited to Caucasian adults. However, as reviewer 1 did not find differences in quality between ethnic groups, this is unlikely to have affected the results.

Differences between reviewers might have been decreased slightly had an initial formal training period been undertaken to ensure internal consistency. We did not study the extent to which within-reviewer variability contributed to overall variability. Regardless, the differences between the reviewers were small and probably reflect the experience of our reviewers as discussed above. The largest differences were between reviewers and the computer, with the computer rejecting at least twice the number of tests.

Conclusion

The results from this study emphasise that quality assessment regarding the acceptability of individual blows including EOT assessments should be primarily used as an aid to good quality during testing rather than a reason to subsequently disregard data. Although important when used as a testing goal, application of the 2005 ATS/ERS EOT criteria will reject many tests judged acceptable by experienced reviewers [1]; specifically, the need for a fixed minimum FET needs to be reconsidered. In adults, provided that at least two acceptable blows are obtained wherein values of FVC and FEV1 are within 200 mL, slight deviations in quality grading are unlikely to significantly impact study results or clinical interpretation. Although differences in quality grading between experienced reviewers were minimal, the background of the reviewer or the context of the review may influence the grading system in detecting poor quality tests. Computer quality assessments appear to reject far more tests than human reviewers, primarily due to the application of EOT requirements. Based on our results, strict computer-generated quality grading will probably have a minimal effect on predicted values, but will result in far fewer subjects being included in a study. Similarly, as application of quality criteria may not always improve clinical interpretations, visual inspection [13] will always be required to grade slightly lower quality tests and needs to be standardised as far as possible.

Acknowledgements

We thank M.R. Miller and the European Respiratory Society anonymous reviewers for their critical and constructive comments, which were instrumental in improving the presentation of our findings and clarifying the scope of our research.

Footnotes

  • This article has supplementary material available from erj.ersjournals.com

  • Conflict of interest: Disclosures can be found alongside the online version of this article at erj.ersjournals.com

  • Received June 26, 2014.
  • Accepted October 23, 2014.
  • Copyright ©ERS 2015

ERJ Open articles are open access and distributed under the terms of the Creative Commons Attribution Non-Commercial Licence 4.0.

References

  1. ↵
    1. Miller MR,
    2. Hankinson J,
    3. Brusasco V, et al.
    Standardisation of spirometry. Eur Respir J 2005; 26: 319–338.
    OpenUrlAbstract/FREE Full Text
  2. ↵
    Standardization of spirometry – 1987 update. Statement of the American Thoracic Society. Am Rev Respir Dis 1987; 136: 1285–1298.
    OpenUrlCrossRefPubMedWeb of Science
  3. ↵
    Standardization of spirometry, 1994 update. American Thoracic Society. Am J Respir Crit Care Med 1995; 152: 1107–1136.
    OpenUrlCrossRefPubMedWeb of Science
  4. ↵
    1. Hankinson JL,
    2. Odencrantz JR,
    3. Fedan KB
    . Spirometric reference values from a sample of the general U.S. population. Am J Respir Crit Care Med 1999; 159: 179–187.
    OpenUrlCrossRefPubMedWeb of Science
  5. ↵
    1. Enright PL,
    2. Johnson LR,
    3. Connett JE, et al.
    Spirometry in the Lung Health Study. 1. Methods and quality control. Am Rev Respir Dis 1991; 143: 1215–1223.
    OpenUrlCrossRefPubMedWeb of Science
    1. Enright PL
    . How to make sure your spirometry tests are of good quality. Respir Care 2003; 48: 773–776.
    OpenUrlAbstract/FREE Full Text
  6. ↵
    1. Ferguson GT,
    2. Enright PL,
    3. Buist AS, et al.
    Office spirometry for lung health assessment in adults: a consensus statement from the National Lung Health Education Program. Chest 2000; 117: 1146–1161.
    OpenUrlCrossRefPubMedWeb of Science
  7. ↵
    1. Hegewald MJ,
    2. Lefor MJ,
    3. Jensen RL, et al.
    Peak expiratory flow is not a quality indicator for spirometry: peak expiratory flow variability and FEV1 are poorly correlated in an elderly population. Chest 2007; 131: 1494–1499.
    OpenUrlCrossRefPubMedWeb of Science
  8. ↵
    1. Ingram RH,
    2. Schilder DP
    . Effect of thoracic gas compression on the flow–volume curve of the forced vital capacity. Am Rev Respir Dis 1966; 94: 56–63.
    OpenUrlPubMedWeb of Science
    1. Sadoul P.
    Mesure de la capacité vitale et des débits maximaux. In: Denolin H, Sadoul P, Orie NGM, eds. L'exploration fonctionnelle pulmonaire, 2me partie. Paris, Flammarion, 1971.
    1. Van de Woestijne KP,
    2. Afschrift M
    . Airway dynamics during forced expiration in patients with chronic obstructive lung disease. In: Orie NGM, Van der Lende R, eds. Bronchitis III. Assen, Royal Van Gorcum, 1970; pp. 195–206.
  9. ↵
    1. Krowka MJ,
    2. Enright PL,
    3. Rodarte JR, et al.
    Effect of effort on measurement of forced expiratory volume in one second. Am Rev Respir Dis 1987; 136: 829–833.
    OpenUrlCrossRefPubMedWeb of Science
  10. ↵
    1. Müller-Brandes C,
    2. Krämer U,
    3. Gappa M, et al.
    LUNOKID: can numerical American Thoracic Society/European Respiratory Society quality criteria replace visual inspection of spirometry? Eur Respir J 2014; 43: 1347–1356.
    OpenUrlAbstract/FREE Full Text
  11. ↵
    Centers for Disease Control and Prevention. Third National Health and Nutrition Examination Survey III. Spirometry Procedure Manual. Rockville, Westat Inc., 1988.
  12. ↵
    1. Enright PL,
    2. Skloot GS,
    3. Cox-Ganser JM, et al.
    Quality of spirometry performed by 13,599 participants in the World Trade Center Worker and Volunteer Medical Screening Program. Respir Care 2010; 55: 303–309.
    OpenUrlAbstract/FREE Full Text
  13. ↵
    1. Swanney MP,
    2. Jensen RL,
    3. Crichton DA, et al.
    FEV6 is an acceptable surrogate for FVC in the spirometric diagnosis of airway obstruction and restriction. Am J Respir Crit Care Med 2000; 162: 917–919.
    OpenUrlCrossRefPubMedWeb of Science
  14. ↵
    1. Arets HG,
    2. Brackel HJ,
    3. van der Ent CK
    . Forced expiratory manoeuvres in children: do they meet ATS and ERS criteria for spirometry? Eur Respir J 2001; 18: 655–660.
    OpenUrlAbstract/FREE Full Text
  15. ↵
    1. Aurora P,
    2. Stocks J,
    3. Oliver C, et al.
    Quality control for spirometry in preschool children with and without lung disease. Am J Respir Crit Care Med 2004; 169: 1152–1159.
    OpenUrlCrossRefPubMedWeb of Science
View Abstract
PreviousNext
Back to top
View this article with LENS
Vol 45 Issue 5 Table of Contents
European Respiratory Journal: 45 (5)
  • Table of Contents
  • Table of Contents (PDF)
  • About the Cover
  • Index by author
Email

Thank you for your interest in spreading the word on European Respiratory Society .

NOTE: We only request your email address so that the person you are recommending the page to knows that you wanted them to see it, and that it is not junk mail. We do not capture any email address.

Enter multiple addresses on separate lines or separate them with commas.
Use of forced vital capacity and forced expiratory volume in 1 second quality criteria for determining a valid test
(Your Name) has sent you a message from European Respiratory Society
(Your Name) thought you would like to see the European Respiratory Society web site.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Print
Citation Tools
Use of forced vital capacity and forced expiratory volume in 1 second quality criteria for determining a valid test
John L. Hankinson, Bill Eschenbacher, Mary Townsend, Janet Stocks, Philip H. Quanjer
European Respiratory Journal May 2015, 45 (5) 1283-1292; DOI: 10.1183/09031936.00116814

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero

Share
Use of forced vital capacity and forced expiratory volume in 1 second quality criteria for determining a valid test
John L. Hankinson, Bill Eschenbacher, Mary Townsend, Janet Stocks, Philip H. Quanjer
European Respiratory Journal May 2015, 45 (5) 1283-1292; DOI: 10.1183/09031936.00116814
Reddit logo Technorati logo Twitter logo Connotea logo Facebook logo Mendeley logo
Full Text (PDF)

Jump To

  • Article
    • Abstract
    • Abstract
    • Introduction
    • Methods
    • Results
    • Discussion
    • Acknowledgements
    • Footnotes
    • References
  • Figures & Data
  • Info & Metrics
  • PDF

Subjects

  • Lung structure and function
  • Tweet Widget
  • Facebook Like
  • Google Plus One

More in this TOC Section

Original Articles

  • Ambulatory management of secondary spontaneous pneumothorax
  • Systematic assessment of respiratory health in illness susceptible athletes
  • Identifying early PAH biomarkers in systemic sclerosis
Show more Original Articles

Lung function

  • Reference equations for spirometry function tests in South Asia
  • GLI 2017 reference equations and interpretation of TLCO
  • Reference equations for CO and NO diffusing capacity
Show more Lung function

Related Articles

Navigate

  • Home
  • Current issue
  • Archive

About the ERJ

  • Journal information
  • Editorial board
  • Press
  • Permissions and reprints
  • Advertising

The European Respiratory Society

  • Society home
  • myERS
  • Privacy policy
  • Accessibility

ERS publications

  • European Respiratory Journal
  • ERJ Open Research
  • European Respiratory Review
  • Breathe
  • ERS books online
  • ERS Bookshop

Help

  • Feedback

For authors

  • Instructions for authors
  • Publication ethics and malpractice
  • Submit a manuscript

For readers

  • Alerts
  • Subjects
  • Podcasts
  • RSS

Subscriptions

  • Accessing the ERS publications

Contact us

European Respiratory Society
442 Glossop Road
Sheffield S10 2PX
United Kingdom
Tel: +44 114 2672860
Email: journals@ersnet.org

ISSN

Print ISSN:  0903-1936
Online ISSN: 1399-3003

Copyright © 2023 by the European Respiratory Society