Influence of secular trends and sample size on reference equations for lung function tests

P.H. Quanjer; J. Stocks; T.J. Cole; G.L. Hall; S. Stanojevic

doi:10.1183/09031936.00110010

Abstract

The aim of our study was to determine the contribution of secular trends and sample size to lung function reference equations, and establish the number of local subjects required to validate published reference values.

30 spirometry datasets collected between 1978 and 2009 provided data on healthy, white subjects: 19,291 males and 23,741 females aged 2.5–95 yrs. The best fit for forced expiratory volume in 1 s (FEV₁₎, forced vital capacity (FVC) and FEV₁/FVC as functions of age, height and sex were derived from the entire dataset using GAMLSS. Mean z-scores were calculated for individual datasets to determine inter-centre differences. This was repeated by subdividing one large dataset (3,683 males and 4,759 females) into 36 smaller subsets (comprising 18–227 individuals) to preclude differences due to population/technique.

No secular trends were observed and differences between datasets comprising >1,000 subjects were small (maximum difference in FEV₁ and FVC from overall mean: 0.30– -0.22 z-scores). Subdividing one large dataset into smaller subsets reproduced the above sample size-related differences and revealed that at least 150 males and 150 females would be necessary to validate reference values to avoid spurious differences due to sampling error.

Use of local controls to validate reference equations will rarely be practical due to the numbers required. Reference equations derived from large or collated datasets are recommended.

Pulmonary function test, in particular spirometry, play an important role in diagnosing obstructive lung disease 1–6, assessing the severity of lung disease, monitoring treatment of patients with respiratory disorders, and allocating patients to treatment groups in drug intervention studies. Since spirometric lung function depends on body size, age, sex and ethnic group, reference equations derived from healthy individuals are imperative for interpreting results. While there is ample choice of published equations [7], there are significant differences between reference equations which have implications for the interpretation of results 8–14. Furthermore, the observed differences between equations raise important questions about the causes of these discrepancies, how to select the most appropriate equations and whether or not “local healthy controls” should be used to validate the selected reference.

It has been proposed 10 that the observed differences between prediction equations are due to technical and procedural differences (including quality control), true (biological) differences between populations, including secular trends, and chance differences between populations due to sampling (“sampling error”). Smaller samples have greater uncertainty about the true mean and thus increase the chance that they will differ from other datasets simply due to sampling error. Since many prediction equations are based on limited sample sizes, to date, it has been difficult to separate the effects of sampling error from those due to biological or technical differences.

Given the practical limitations of measuring large samples of healthy individuals to derive population-specific reference equations, especially those derived from across the entire life span, it has been proposed that available datasets should be collated to produce globally applicable reference equations 13, 15. Earlier attempts to achieve this goal relied on constructing summary equations derived from published equations for specific age groups 16–18. Although such summary equations have been widely used and served a useful role, their limitations soon became apparent 19. The potential advantages of collating original data and storing such data within an international database supported by the major international respiratory societies was proposed in 1995, but has yet to be implemented 13. The recent emergence of flexible and sophisticated statistical methods such as GAMLSS to model lung function data [20–22] provides a powerful tool to underpin such initiatives, and overcomes many of the previous difficulties, including those relating to the rapid age-related changes in childhood and adolescence when trying to derive “all-age” equations 13, 23–26. The benefits of collating data include a larger sample size over a wider age range which will be more generalisable. This may, however, come at a cost of increased variability from outlying datasets due to the biological or technical differences mentioned previously, which may in turn inflate the lower limit of normal (LLN). Furthermore, if secular trends in pulmonary function arise from a trend in physical development 27, 28 and/or in technical improvements, equations based on collated datasets could be internally biased 29–34.

Although many users remain unaware of which prediction equations are being used to interpret their data, it is generally recommended to apply reference equations that have been derived from a similar population to that being tested, using comparable instruments, methods and quality standards 8, 10, 35. It has also been suggested that results from a group of local healthy subjects should be used to validate the selected reference equations. However, in practice these recommendations are difficult to apply, since evidence regarding the number of healthy subjects actually required is limited and inconsistent, with estimates ranging from 20 to100 8–10, 36.

The objectives of our study were to: 1) explore the differences between centres in a collated dataset, including those due to secular trends, and how these affect predicted values of spirometric lung function; 2) investigate the influence of sample size per se on the interpretation of lung function results; and 3) to estimate the minimum number of subjects required for local validation of reference equations.

MATERIAL AND METHODS

Within the framework of the Global Lungs Initiative (see [37] for further information), 64 centres from 28 countries across five continents have now shared spirometric data on 149,759 healthy, lifelong nonsmokers aged 2.5–95 yrs. The overall aim of this group is to derive prediction equations for different ethnic groups that are valid worldwide. The analysis presented in this paper is confined to white subjects, as they currently represent the majority of available data. 30 centres provided data on 19,291 white males and 23,741 white females aged 2.5–95 yrs. The distribution by age is shown in the supplementary data (fig. E1). Sample size in each of the individual datasets varied between 20 to 4,759 female subjects and 17 to 3,683 male subjects; 12 datasets comprised <100 females and 10 had <100 males. Data were collected between 1978 and 2009. Although published data from the Health Survey for England 1995–1996 (HSE) study was limited to individuals aged ≥16 yrs 38, for the purposes of this analysis these were complemented with data from subjects aged 7–16 yrs olds that had been collected in the same study using identical protocol and techniques.

All datasets complied with international recommendations with regard to equipment, methods, procedures, data selection and quality control, valid at the time of data collection. All persons, groups or organisations sharing data with the Lung Function in Growth and Aging initiative have specified that data were collected with appropriate ethical approval and permission has been given to publish results from the collated dataset.

GAMLSS [20–22] was used to derive the best fitting function for forced expiratory volume in 1 s (FEV₁), forced vital capacity (FVC) and FEV₁/FVC as a function of age and height in males and females. The statistical methods used were as described by Cole et al. 23, details of which are provided in the supplementary data. Once the equations had been derived from the entire dataset, they were used to calculate the average standardised residuals (z-scores) for each centre. Ideally, the residuals from each centre should have a mean of 0 and an sd of 1. To investigate the influence of secular trends, we displayed the residuals (z-scores) as a function of the year of measurement; in studies conducted over several years the mid-point of the study was used.

In order to explore the influence of sample size per se on predicted values, without any confounding effects due to biological or technical differences, and to estimate the minimal number of local controls that would be needed to validate published reference equations, we repeated this exercise after dividing data from the HSE study into smaller subsets. A random selection was performed (without replacement) so that 36 subgroups of males (n = 3,683; subgroup size 38 to 225) and 46 subgroups of females (n = 4,759; subgroup size 18 to 227) were formed. Using all the data in the HSE study, GAMLSS was again used to derive the best fitting function for FEV₁, FVC and FEV₁/FVC as a function of age and height in females and males. Subsequently, the average standardised residuals (z-scores) were obtained for each of the 82 individual HSE subsets.

RESULTS

Difference in mean z-score between centres

Differences between the largest datasets were small and clinically minimal (fig. 1). The mean of the z-scores in the largest datasets was virtually zero. Maximum differences from the overall mean for FEV₁, FVC and FEV₁/FVC in datasets with >1,000 subjects ranged 0.30– -0.22 z-score (i.e. were within ∼3% of that predicted from the collated dataset). For datasets with n>150 but n<1,000 the corresponding range for FEV₁ and FVC was 0.4– -0.3 z-scores, with slightly larger differences for FEV₁/FVC. The smaller the number of subjects in a dataset, the larger the offset from zero (fig. 1). Details of the numbers of subjects in each study and the centre specific z-scores are given in the supplementary data (table E1).

Figure 1–

Distribution of mean z-scores (standardised residuals) for a) forced expiratory volume in 1 s (FEV₁) and b) forced vital capacity (FVC) in 30 datasets as a function of sample size. z-scores for each centre were calculated using the equations derived from the entire collated dataset by using GAMLSS [21] to regress FEV₁ and FVC on age and height separately for males and females. ○: females; •: males.

Secular trends

There was no significant correlation between year of measurement and either sample size or the residuals for FEV₁, FVC and FEV₁/FVC (p = 0.833 for FEV₁ and p = 0.447 for FVC) (fig. 2). Height differed between datasets from that in the HSE study by up to 2%; these differences were unrelated to the mean z-score for that centre.

Figure 2–

Distribution of mean z-scores for a) forced expiratory volume in 1 s (FEV₁₎ and b) forced vital capacity (FVC) in 30 datasets as a function of the year of measurement. No secular trends were evident in data collected from white subjects over the past 30 yrs. ○: females; •: males.

Differences in scatter between centres

The scatter (sd of the z-scores) in each centre was also related to sample size (fig. E2 in supplementary data). Thus, while the sd was fairly close to the ideal of 1 in datasets comprising >1,000 subjects, there was increasing variability for sample sizes <1,000, the extremes being 0.60 and 1.29. The average sd of the z-scores from all the centres was slightly less than 1 (0.97 in females, 0.95 in males for FEV₁; 0.96 in females, 0.94 in males for FVC). The LLN in collated data is determined by the largest datasets; these have a slightly lower LLN for FEV₁ and FVC than in small datasets. This effect was more pronounced for the FEV₁/FVC ratio (0.90 in both females and males). Further details are given in the supplementary data.

The effect of sample size on predicted values

To investigate whether including the smaller datasets biased the predicted values and their scatter, we derived prediction equations that were limited to datasets where n>100 for each sex and compared these to the values derived when data from all 30 centres had been included. This had minimal effect on either the predicted values (maximum difference in FEV₁ or FVC 17 mL) or the scatter (maximum difference in sd up to 2 mL).

What size of sample is needed to validate equations?

To confirm whether differences observed between centres (fig. 1) were primarily due to sample size, we examined the differences within subsets of varying sizes from the HSE study. As can be seen from figure 3, exactly the same pattern was observed as when examining differences in residuals between centres. Thus, there was minimal offset from the predicted value derived from the entire HSE dataset in the largest subsets (n>150), with increasing scatter in the residuals with decreasing group size (fig. 3).This is in keeping with results from other studies 39–41. As these data derive from the same population and were collected by the same staff, using the same methods and quality control, differences can only be explained by sampling error; the smaller the sample, the more the offset and variability from the expected mean.

Figure 3–

Distribution of mean z-scores (standardised residuals) for a) forced expiratory volume in 1 s (FEV₁₎ and b) forced vital capacity (FVC) in 3,683 males (•) and 4,759 females (○) as a function of sample size in subsets of the Health Survey for England population 38.

The sd of the z-scores was also calculated for each of the 82 subsets of data from the HSE study. The sd for FEV₁, FVC and FEV₁/FVC ranged from 0.80 to 1.21 in females and from 0.72 to 1.25 in males for the various subsets, the average being 0.99–1.00. Once again, the pattern was similar to that observed from the 30 centres, i.e. the sd decreased as sample size increased, with close agreement once the sample size was >150. This is in keeping with results from other studies 39–41. These findings suggest that at least 300 local healthy controls (150 males and 150 females) would be needed to validate published reference equations with any degree of certainty, since with smaller sample sizes differences of up to 0.5 z-scores may occur purely by chance.

DISCUSSION

This study provides evidence with respect to several important issues related to the interpretation of spirometry data using reference equations. First, differences between large datasets were clinically insignificant, and no evidence was found of secular trends in data collected from white subjects over the past 30 yrs. Secondly, reference equations derived from collated datasets are a reasonable approach to interpret lung function results across different centres. Thirdly, differences between datasets based on a relatively homogeneous population (in this case white subjects of European origin) are largely explained by sampling variability. Fourthly, validation of published reference values for spirometry using local controls is unlikely to be practical due to the numbers required to avoid sampling error. Finally, prediction equations derived from <150 individuals for each sex are unlikely to be reliable. This is likely to hold true for any ethnic group.

Despite being collected over a period of 30 yrs (1978–2009), there was no evidence of a secular trend in pulmonary function (fig. 2). This strongly suggests that differences between centres can only be attributed to a minor extent to biological and technical differences. Secular trends in pulmonary function have been described previously 28–34. They may be due to a cohort effect, i.e. that persons born for example 50 yrs ago had a different lung development during growth than those born 30 yrs ago. Infectious diseases, nutrition, smoking during pregnancy, exposure to environmental factors, different age of maturation, etc. might all leave their marks so that the lung function of a 50-yr-old person born in 1930 would be different from that in a 50-yr-old person born in 1960. The fact that we did not find evidence for a secular trend in spirometric indices over the past 25–30 yrs may be due to the fact that the present study is based on data derived from societies in Europe, North America, Australia and New Zealand with comparable and relatively stable socio-economic conditions. Alternatively, since secular changes in height with increasing affluence tend to reflect changes in leg length rather than trunk height 27, it is possible that subtle improvements in lung function over time may be masked when lung function is based on standing height. Since data in this study complied with different standards issued by ECCS, ECCS/European Respiratory Society (ERS), American Thoracic Society (ATS) and ATS/ERS [17, 35, 42–44], secular trends might spuriously arise from the application of more rigorous standards and differences in instruments used. For example, Hankinson et al. 45 found that applying the 1994 ATS standards 43 rather than the 1987 standards 42 made a difference in the level of FEV₁ and FVC equivalent to z-scores of ∼0.14 and ∼0.12, respectively. Our results suggest that any trend due to implementation of more rigorous standards is obscured by small differences between populations. This may not hold true for measurements of lung volumes and transfer factor for the lung of carbon monoxide.

A major concern when working with collated data is that differences in inclusion criteria or in methodological and other issues will adversely affect the accuracy and precision of predicted values, i.e. the average and the LLN. Our findings suggest that biological differences between populations were small, that standards of administering pulmonary function tests and quality control were relatively uniform, and that differences between centres seem to be mainly due to sampling error. By creating subgroups of different sizes from the large HSE study, we controlled for secular trends, biological differences and technical differences in administering pulmonary function tests, as well as in quality control. This analysis confirmed that the offset from predicted values is related to sampling error, such that smaller sample sizes have greater offset from the predicted mean. This provides further evidence to support the collation of data in order to derive more robust prediction equations.

When combining data from 30 sources around the world, we would expect greater offset and variability than in the sub-samples of datasets derived from a large population (HSE). Instead, there was remarkable similarity, with only the smallest datasets differing more from the overall mean rather than the sub-samples from the HSE population (fig. 3). However, the findings differed with respect to the residual scatter which was, on average, smallest in the smallest datasets (fig. E2), indicating that populations in the largest datasets were less homogeneous. Because the results of regression analysis are dominated by the largest datasets, which reflect a representative population sample, the LLN derived from the collated data will be slightly lower than those in datasets comprising <1,000 individuals (fig. E2). As the largest datasets arose from random population samples, the predicted values and their LLNs are representative of such populations. Our results also indicate that while inclusion of the smaller datasets does not lead to biased results, predicted values and LLNs derived from small local datasets of healthy controls are unlikely to be representative of a general population. This does not diminish the value of recruiting prospective controls in research studies.

In the past, equations based on collated data were generated using published prediction equations 16–18, 35. This leaves much to be desired as the prediction equations were simple in structure and, therefore, did not allow appropriate modelling of the complex relationship of pulmonary function with age and height from derived data. This also holds for determining the LLN. A clear advantage of deriving prediction equations from collated data using modern statistical techniques and the actual results from each individual is that the age-related changes in the level of pulmonary function, as well as the LLN, can be modelled properly 15, 23, 25, 26.

Our findings have practical implications in that laboratories seeking to validate reference equations for spirometry need to collect data on at least 300 healthy subjects. Even then, results from a local reference population are still likely to differ from predicted values by up to 0.2 z-scores. Since the between-subject variability (coefficient of variation) in this study varied from ∼11% in adolescents to 17% in elderly subjects and those aged <6 yrs, a difference of 0.2 z-scores equates to 2–3.5% of predicted, depending on the age of the local reference group. Thus, while validation of reference equations is potentially important, it appears to be largely impractical. Given the cost and effort required to form such large local reference populations, it is probably more practical and acceptable for laboratories to adhere closely to international standards for pulmonary function testing and adopt reference equations derived from large or collated studies.

Conclusions

The collation of spirometric data to produce reference equations is a robust approach to interpret lung function results. The potential and minimal inflation of the limits of normality when using a collated dataset is balanced against a greater bias in predicted values when smaller datasets are used. Differences between centres are unlikely to be due to secular trends, technical or population differences, but rather sampling variability due to small sample sizes. The validation of external reference values by lung function laboratories is quite costly and impractical. In practice it is advisable for laboratories which adhere closely to international standards for pulmonary function testing to adopt reference equations derived from large studies. This study provides evidence in support of the collation of spirometry data to create worldwide all-age reference equations.

Acknowledgments

The persons and centres contributing data to this manuscript as part of the Global Lungs Initiative were as follows. C.S. Beardsmore (Dept of Infection, Immunity and Inflammation, Division of Child Health, University of Leicester, Leicester, UK); B. Brunekreef (Institute for Risk Assessment Sciences, Universiteit Utrecht, Utrecht, the Netherlands); H. Eigen (Section of Pulmonology and Intensive Care, James Whitcomb Riley Hospital for Children, Indiana University School of Medicine, Indianapolis, IN, USA); E. Falaschetti (Dept of Epidemiology and Public Health, UCL, London, UK); B. Fallon (Respiratory Laboratory, Nepean Hospital, Penrith, Australia); M.W. Gerbase (Division of Pulmonary Medicine, University Hospitals of Geneva, Geneva, Switzerland); C.J. Gore (Physiology Dept, Australian Institute of Sport, Belconnen, Australia); G. Hall (Respiratory Medicine, Princess Margaret Hospital for Children, Perth, Australia); J.L. Hankinson (Hankinson Consulting, Valdosta, GA, USA); A.J. Henderson (University of Bristol, Bristol, UK); M. Gappa and the LUNOKID study group (Children’s Hospital and Research Institute, Marienhospital Wesel, Germany); G.S. Kerby and the Lung Function Measures in Preschool Children with Cystic Fibrosis study group (University of Colorado Denver School of Medicine Pulmonary Medicine, The Children's Hospital, Aurora, CO, USA); J. Kühr (Klinik für Kinder- und Jugendmedizin, Städtisches Klinikum Karlsruhe, Karlsruhe, Germany); S. Kuster (Lungenliga Zürich, Zürich, Switzerland); A. Langhammer (HUNT Research Centre, NTNU, Verdal, Norway); S. Lum (Portex Respiratory Unit, UCL, Institute of Child Health, London, UK); W. Nystad (Division of Epidemiology, Norwegian Institute of Public Health, Oslo, Norway); P. Piccioni (SC Pneumologia CPA ASL Torino 2, Torino, Italy); F. Pistelli (Pulmonary and Respiratory Pathophysiology Unit, Cardiothoracic Dept, University Hospital of Pisa and Pulmonary Environmental Epidemiology Unit, CNR Institute of Clinical Physiology, Pisa, Italy); P.H. Quanjer (Dept of Pulmonary Diseases and Pediatrics, Erasmus Medical Centre – Sophia Children’s Hospital, Erasmus University, Rotterdam, the Netherlands); M. Rosenthal (Royal Brompton Hospital, London, UK); S. Stanojevic (Portex Respiratory Unit, UCL, Institute of Child Health, London, UK); J.B. Soriano (Program of Epidemiology and Clinical Research, CIMERA, Recinte Hospital Joan March, Illes Balears, Spain); W. Tomalak (Dept of Physiopathology of Respiratory System, National Institute for TBC and Lung Disease, Rabka Branch, Poland); S.W. Turner (Dept of Child Health, University of Aberdeen, Aberdeen, UK); D. Vilozni (Pediatric Pulmonary Units of The Edmond and Lili Safra Children's Hospital, Sheba Medical Center Ramat-Gan and Meyer Children's Hospital of Haifa, Rambam Medical Center, Haifa, Israel); H. Vlachos (Dept of Pediatrics, Division of Respiratory Medicine, University of Sherbrooke, Quebec, Canada); S. West (Respiratory Function Laboratory, Westmead Hospital, Australia); and D. Zagami (Lung Function Laboratory, Gold Coast Hospital, Southport, QLD, Australia).

The following individuals/centres share datasets (occasionally more than one) with the Global Lungs Initiative, centres that formed the Asthma UK initiative are marked with an asterisk. O. Al-Rawas (Oman); M. Badier (France*); C. Beardsmore (UK*); H. Ben Saad (Tunisia); B. Brunekreef (the Netherlands); P. Burney (UK); W. Dejsomritrutai (Thailand); H. Eigen (USA*); B. Fallon (Australia); A.M. Fulambarker (USA); M. Gappa (Germany*); M. Gerbase (Switzerland); M. Golshan (Iran); C. Gore (Australia); G. Hall (Australia*); J.L. Hankinson (USA*); J. Henderson (UK); M.S.M. Ip (China); S. Karrasch (Germany); G. Kerby (USA*); J. Kühr (Germany); S. Kuster (Switzerland); A. Langhammer (Norway); S. Lum (UK*); A.L. Miller (USA); W. Nystad (Norway*); Y.M. Oh (South Korea); W-H. Pan (Taiwan); R. Perez-Padilla (Mexico); P. Piccioni (Italy*); F. Pistelli (Italy); Prasad KVV (India); P.H. Quanjer (the Netherlands); M. Rosenthal (UK*); S. Stanojevic (UK*); J.B. Soriano (Spain); F. Thomas (France); W. Tomalak (Poland*); Y. Trabelsi (Tunisia); S.W. Turner (UK*); D. Vilozni (Israel*); H. Vlachos (Canada*); R. Warshaw (USA); S. West (Australia); D. Zagami (Australia); and J.P. Zheng (China*).

We would like to thank M. Bottai and C. Schindler for undertaking an independent statistical review of this paper.

Footnotes

This article has supplementary material available from www.erj.ersjournals.com
Statement of Interest
None declared.

Received July 13, 2010.
Accepted August 12, 2010.

©ERS 2011

REFERENCES

↵
1. Pellegrino R,
2. Viegi G,
3. Brusasco V,
4. et al
. Interpretative strategies for lung function tests. Eur Respir J 2005; 26: 948–968.
OpenUrl FREE Full Text
1. Global Initiative for Chronic Obstructive Lung Disease
. Global Strategy for the Diagnosis, Management and Prevention of Chronic Obstructive Lung Disease. 2009. www.goldcopd.com/Guidelineitem.asp?l1=2&l2=1&intId=2003 Date last accessed: January 4, 2010.
1. Celli BR,
2. MacNee W
. Standards for the diagnosis and treatment of patients with COPD: a summary of the ATS/ERS position paper. Eur Respir J 2004; 23: 932–946.
OpenUrl FREE Full Text
BTS guidelines for the management of chronic obstructive pulmonary disease. The COPD Guidelines Group of the Standards of Care Committee of the BTS. Thorax 1997; 52: Suppl. 5, S1–S28.
OpenUrl PubMed
↵
1. National Institute for Health and Clinical Excellence
. Clinical Guideline 12. Chronic obstructive pulmonary disease, 2004. http://guidance.nice.org.uk/CG12 Date last updated: August 31, 2010.
↵
1. Global Initiative for Asthma
. Global Strategy for Asthma Management and Prevention. 2008. www.ginasthma.com/Guidelineitem.asp??l1=2&l2=1&intId=1561 Date last accessed: July 10, 2010.
Spirexpert. Become an expert in spirometry www.spirxpert.com/GOLD.html.
↵
Lung function testing: selection of reference values and interpretative strategies. American Thoracic Society. Am Rev Respir Dis 1991; 144: 1202–1218.
OpenUrl CrossRef PubMed Web of Science
1. Beydon N,
2. Davis SD,
3. Lombardi E,
4. et al
. An official American Thoracic Society /European Respiratory Society statement: pulmonary function testing in preschool children. Am J Respir Crit Care Med 2007; 175: 1304–1345.
OpenUrl CrossRef PubMed Web of Science
1. Pellegrino R,
2. Viegi G,
3. Brusasco V,
4. et al
. Interpretative strategies for lung function tests. Eur Respir J 2005; 26: 948–968.
OpenUrl FREE Full Text
↵
1. Swanney MP,
2. Ruppel G,
3. Enright PL,
4. et al
. Using the lower limit of normal for the FEV₁/FVC ratio reduces the misclassification of airway obstruction. Thorax 2008; 63: 1046–1051.
OpenUrl Abstract/FREE Full Text
↵
1. Miller MR,
2. Quanjer PH,
3. Swanney MP,
4. et al
. Interpreting lung function data using 80 percent of predicted and fixed thresholds misclassifies over 20% of patients. Chest 2011; 139: 52–59.
OpenUrl CrossRef PubMed Web of Science
↵
1. Quanjer PH,
2. Borsboom GJJM,
3. Brunekreef B,
4. et al
. Spirometric reference values for white European children and adolescents: Polgar revisited. Pediat Pulm 1995; 19: 135–142.
OpenUrl CrossRef
↵
1. Quanjer PH,
2. Borsboom GJJM,
3. Kivastik J,
4. et al
. Cross-sectional and longitudinal spirometry in children and adolescents. Interpretative strategies. Am J Respir Crit Care Med 2008; 178: 1262–1270.
OpenUrl CrossRef PubMed Web of Science
↵
1. Stanojevic S,
2. Wade A,
3. Stocks J,
4. et al
. Reference ranges for spirometry across all ages. A new approach. Am J Respir Crit Care Med 2008; 177: 253–260.
OpenUrl CrossRef PubMed Web of Science
↵
1. Polgar G,
2. Promadhat V
. Pulmonary Function Testing in Children: Techniques and Standards. Philadelphia, WB Saunders, 1971.
Quanjer PhH. Standardized lung function testing. Report Working Party Standardization of Lung Function Tests, European Community for Steel and Coal. Bull Eur Physiopathol Respir 1983; 19: Suppl. 5, 45–51.
OpenUrl
↵
1. Stocks J,
2. Quanjer PH
. Reference values for residual volume, functional residual capacity and total lung capacity. Eur Respir J 1995; 8: 492–506.
OpenUrl CrossRef PubMed Web of Science
1. Baur X,
2. Isringhausen-Bley S,
3. Degens P
. Comparison of lung function reference values. Int Arch Occup Environ Health 1999; 72: 69–83.
OpenUrl CrossRef PubMed Web of Science
↵
1. Rigby RA,
2. Stasinopoulos DM
. Generalized additive models for location, scale and shape. Applied Stats 2005; 54: 507–544.
OpenUrl
↵
GAMLSS: Generalized Additive Models for Location Scale and Shape. http://gamlss.org/.
↵
The R Project for Statistical Computing http://www.R-project.org.
↵
1. Cole TJ,
2. Stanojevic S,
3. Stocks J,
4. et al
. Age- and size-related reference ranges: a case study of spirometry through childhood and adulthood. Statist Med 2009; 28: 880–898.
OpenUrl CrossRef
↵
1. Stanojevic S,
2. Wade A,
3. Stocks J
. Reference values for lung function: past, present and future. Eur Respir J 2010; 36: 12–19.
OpenUrl Abstract/FREE Full Text
1. Stanojevic S,
2. Wade A,
3. Cole TJ,
4. et al
. Spirometry centile charts for young Caucasian children: The Asthma UK Collaborative Initiative. Am J Respir Crit Care Med 2009; 180: 547–552.
OpenUrl CrossRef PubMed Web of Science
1. Quanjer PH,
2. Stanojevic S,
3. Stocks J,
4. et al
. Changes in the FEV₁/FVC ratio during childhood and adolescence: an intercontinental study. Eur Respir J 2010; 36: 1391–1399.
OpenUrl Abstract/FREE Full Text
1. Tanner JM,
2. Hayashi T,
3. Preece MA,
4. et al
. Increase in length of leg relative to trunk in Japanese children and adults from 1957 to 1977: comparison with British and with Japanese Americans. Ann Hum Biol 1982; 9: 411–423.
OpenUrl CrossRef PubMed Web of Science
1. Ip MS,
2. Karlberg EM,
3. Karlberg JP,
4. et al
. Lung function reference values in Chinese children and adolescents in Hong Kong: I. Spirometric values and comparison with other populations. Am J Respir Crit Care Med 2000; 162: 424–429.
OpenUrl PubMed Web of Science
↵
1. Glindmeyer HW,
2. Diem JE,
3. Jones RN,
4. et al
. Non-comparability of longitudinally and cross-sectionally determined annual change in spirometry. Am Rev Respir Dis 1982; 125: 544–548.
OpenUrl PubMed Web of Science
↵
1. Burrows B,
2. Lebowitz MD,
3. Casmilli AE,
4. et al
. Longitudinal changes in forced expiratory volume in one second in adults. Methodologic considerations and findings in healthy nonsmokers. Am Rev Respir Dis 1986; 133: 974–980.
OpenUrl PubMed Web of Science
↵
1. Ware JH,
2. Dockery DW,
3. Louis TA,
4. et al
. Longitudinal and cross-sectional estimates of pulmonary function decline in never-smoking adults. Am J Epidemiol 1990; 132: 685–700.
OpenUrl Abstract/FREE Full Text
↵
1. Van Pelt W,
2. Borsboom GJJM,
3. Rijcken B,
4. et al
. Discrepancies between longitudinal and cross-sectional change in ventilatory function in 12 years of follow-up. Am J Respir Crit Care Med 1994; 149: 1218–1226.
OpenUrl PubMed Web of Science
↵
1. Xu X,
2. Laird N,
3. Dockery DW,
4. et al
. Age, period, and cohort effects on pulmonary function in a 24-year longitudinal study. Am J Epidemiol 1995; 141: 554–566.
OpenUrl Abstract/FREE Full Text
1. Ip MS,
2. Karlberg EM,
3. Chan KN,
4. et al
. Lung function reference values in Chinese children and adolescents in Hong Kong: II. Prediction equations for plethysmographic lung volumes. Am J Respir Crit Care Med 2000; 162: 430–435.
OpenUrl PubMed Web of Science
↵
1. Quanjer PhH.,
2. Tammeling GJ,
3. Cotes JE,
4. et al
. Lung volumes and forced ventilatory flows. Report Working Party Standardization of Lung Function Tests, European Community for Steel and Coal. Eur Respir J 1993; 6: Suppl. 16, 5–40.
OpenUrl FREE Full Text
↵
1. Jensen RL,
2. Crapo RO,
3. Flint AK,
4. et al
. Problems in selecting representative reference values for spirometry. Am J Respir Crit Care Med 2002; 165: A200.
OpenUrl
↵
Lung Functionin Growth and Aging: aunited worldwide approach. www.lungfunction.org. Date last updated: October 13, 2010.
1. Falaschetti E,
2. Laiho J,
3. Primatesta P,
4. et al
. Prediction equations for normal and low lung function from the Health Survey for England. Eur Respir J 2004; 23: 456–463.
OpenUrl Abstract/FREE Full Text
↵
1. Steyerberg EW,
2. Bleeker SE,
3. Moll HA,
4. et al
. Internal and external validation of predictive models: a simulation study of bias and precision in small samples. J Clin Epidemiol 2003; 56: 441–477.
OpenUrl CrossRef PubMed Web of Science
1. Peek N,
2. Arts D,
3. Bosman R,
4. et al
. External validation of prognostic models for critically ill patients required substantial sample sizes. J Clin Epidemiol 2007; 60: 491–501.
OpenUrl PubMed Web of Science
1. Knofczynski GT,
2. Mundfrom D
. Sample sizes when using multiple linear regression for prediction. Educational and Psychological Measurement 2008; 68: 431–442.
OpenUrl Abstract/FREE Full Text
Standardization of spirometry: 1987 update. Official Statement of the American Thoracic Sociey. Am Rev Respir Dis 1987; 136: 1285–1298.
OpenUrl CrossRef PubMed Web of Science
Standardization of spirometry: 1994 Update. American Thoracic Society. Am J Respir Crit Care Med 1995; 152: 1107–1136.
OpenUrl CrossRef PubMed Web of Science
1. Miller MR,
2. Hankinson J,
3. Brusasco V,
4. et al
. Standardization of spirometry. Eur Respir J 2005; 26: 319–338.
OpenUrl Abstract/FREE Full Text
1. Hankinson JL,
2. Odencrantz JR,
3. Fedan KB
. Spirometric reference values from a sample of the general US population. Am J Respir Crit Care Med 1995; 152: 179–187.
OpenUrl CrossRef PubMed Web of Science

View this article with LENS

Vol 37 Issue 3 Table of Contents

Citation Tools

Full Text (PDF)

Subjects

Lung structure and function

Original Article

Show more Original Article

Lung Function

Show more Lung Function

[1] ↵
Pellegrino R,
Viegi G,
Brusasco V,
et al
. Interpretative strategies for lung function tests. Eur Respir J 2005; 26: 948–968.
OpenUrl FREE Full Text

[2] Pellegrino R,

[3] Viegi G,

[4] Brusasco V,

[5] et al

[6] Global Initiative for Chronic Obstructive Lung Disease
. Global Strategy for the Diagnosis, Management and Prevention of Chronic Obstructive Lung Disease. 2009. www.goldcopd.com/Guidelineitem.asp?l1=2&l2=1&intId=2003 Date last accessed: January 4, 2010.

[7] Global Initiative for Chronic Obstructive Lung Disease

[8] Celli BR,
MacNee W
. Standards for the diagnosis and treatment of patients with COPD: a summary of the ATS/ERS position paper. Eur Respir J 2004; 23: 932–946.
OpenUrl FREE Full Text

[9] Celli BR,

[10] MacNee W

[11] BTS guidelines for the management of chronic obstructive pulmonary disease. The COPD Guidelines Group of the Standards of Care Committee of the BTS. Thorax 1997; 52: Suppl. 5, S1–S28.
OpenUrl PubMed

[12] ↵
National Institute for Health and Clinical Excellence
. Clinical Guideline 12. Chronic obstructive pulmonary disease, 2004. http://guidance.nice.org.uk/CG12 Date last updated: August 31, 2010.

[13] National Institute for Health and Clinical Excellence

[14] ↵
Global Initiative for Asthma
. Global Strategy for Asthma Management and Prevention. 2008. www.ginasthma.com/Guidelineitem.asp??l1=2&l2=1&intId=1561 Date last accessed: July 10, 2010.

[15] Global Initiative for Asthma

[16] Spirexpert. Become an expert in spirometry www.spirxpert.com/GOLD.html.

[17] ↵
Lung function testing: selection of reference values and interpretative strategies. American Thoracic Society. Am Rev Respir Dis 1991; 144: 1202–1218.
OpenUrl CrossRef PubMed Web of Science

[18] Beydon N,
Davis SD,
Lombardi E,
et al
. An official American Thoracic Society /European Respiratory Society statement: pulmonary function testing in preschool children. Am J Respir Crit Care Med 2007; 175: 1304–1345.
OpenUrl CrossRef PubMed Web of Science

[19] Beydon N,

[20] Davis SD,

[21] Lombardi E,

[22] et al

[23] Pellegrino R,
Viegi G,
Brusasco V,
et al
. Interpretative strategies for lung function tests. Eur Respir J 2005; 26: 948–968.
OpenUrl FREE Full Text

[24] Pellegrino R,

[25] Viegi G,

[26] Brusasco V,

[27] et al

[28] ↵
Swanney MP,
Ruppel G,
Enright PL,
et al
. Using the lower limit of normal for the FEV₁/FVC ratio reduces the misclassification of airway obstruction. Thorax 2008; 63: 1046–1051.
OpenUrl Abstract/FREE Full Text

[29] Swanney MP,

[30] Ruppel G,

[31] Enright PL,

[32] et al

[33] ↵
Miller MR,
Quanjer PH,
Swanney MP,
et al
. Interpreting lung function data using 80 percent of predicted and fixed thresholds misclassifies over 20% of patients. Chest 2011; 139: 52–59.
OpenUrl CrossRef PubMed Web of Science

[34] Miller MR,

[35] Quanjer PH,

[36] Swanney MP,

[37] et al

[38] ↵
Quanjer PH,
Borsboom GJJM,
Brunekreef B,
et al
. Spirometric reference values for white European children and adolescents: Polgar revisited. Pediat Pulm 1995; 19: 135–142.
OpenUrl CrossRef

[39] Quanjer PH,

[40] Borsboom GJJM,

[41] Brunekreef B,

[42] et al

[43] ↵
Quanjer PH,
Borsboom GJJM,
Kivastik J,
et al
. Cross-sectional and longitudinal spirometry in children and adolescents. Interpretative strategies. Am J Respir Crit Care Med 2008; 178: 1262–1270.
OpenUrl CrossRef PubMed Web of Science

[44] Quanjer PH,

[45] Borsboom GJJM,

[46] Kivastik J,

[47] et al

[48] ↵
Stanojevic S,
Wade A,
Stocks J,
et al
. Reference ranges for spirometry across all ages. A new approach. Am J Respir Crit Care Med 2008; 177: 253–260.
OpenUrl CrossRef PubMed Web of Science

[49] Stanojevic S,

[50] Wade A,

[51] Stocks J,

[52] et al

[53] ↵
Polgar G,
Promadhat V
. Pulmonary Function Testing in Children: Techniques and Standards. Philadelphia, WB Saunders, 1971.

[54] Polgar G,

[55] Promadhat V

[56] Quanjer PhH. Standardized lung function testing. Report Working Party Standardization of Lung Function Tests, European Community for Steel and Coal. Bull Eur Physiopathol Respir 1983; 19: Suppl. 5, 45–51.
OpenUrl

[57] ↵
Stocks J,
Quanjer PH
. Reference values for residual volume, functional residual capacity and total lung capacity. Eur Respir J 1995; 8: 492–506.
OpenUrl CrossRef PubMed Web of Science

[58] Stocks J,

[59] Quanjer PH

[60] Baur X,
Isringhausen-Bley S,
Degens P
. Comparison of lung function reference values. Int Arch Occup Environ Health 1999; 72: 69–83.
OpenUrl CrossRef PubMed Web of Science

[61] Baur X,

[62] Isringhausen-Bley S,

[63] Degens P

[64] ↵
Rigby RA,
Stasinopoulos DM
. Generalized additive models for location, scale and shape. Applied Stats 2005; 54: 507–544.
OpenUrl

[65] Rigby RA,

[66] Stasinopoulos DM

[67] ↵
GAMLSS: Generalized Additive Models for Location Scale and Shape. http://gamlss.org/.

[68] ↵
The R Project for Statistical Computing http://www.R-project.org.

[69] ↵
Cole TJ,
Stanojevic S,
Stocks J,
et al
. Age- and size-related reference ranges: a case study of spirometry through childhood and adulthood. Statist Med 2009; 28: 880–898.
OpenUrl CrossRef

[70] Cole TJ,

[71] Stanojevic S,

[72] Stocks J,

[73] et al

[74] ↵
Stanojevic S,
Wade A,
Stocks J
. Reference values for lung function: past, present and future. Eur Respir J 2010; 36: 12–19.
OpenUrl Abstract/FREE Full Text

[75] Stanojevic S,

[76] Wade A,

[77] Stocks J

[78] Stanojevic S,
Wade A,
Cole TJ,
et al
. Spirometry centile charts for young Caucasian children: The Asthma UK Collaborative Initiative. Am J Respir Crit Care Med 2009; 180: 547–552.
OpenUrl CrossRef PubMed Web of Science

[79] Stanojevic S,

[80] Wade A,

[81] Cole TJ,

[82] et al

[83] Quanjer PH,
Stanojevic S,
Stocks J,
et al
. Changes in the FEV₁/FVC ratio during childhood and adolescence: an intercontinental study. Eur Respir J 2010; 36: 1391–1399.
OpenUrl Abstract/FREE Full Text

[84] Quanjer PH,

[85] Stanojevic S,

[86] Stocks J,

[87] et al

[88] Tanner JM,
Hayashi T,
Preece MA,
et al
. Increase in length of leg relative to trunk in Japanese children and adults from 1957 to 1977: comparison with British and with Japanese Americans. Ann Hum Biol 1982; 9: 411–423.
OpenUrl CrossRef PubMed Web of Science

[89] Tanner JM,

[90] Hayashi T,

[91] Preece MA,

[92] et al

[93] Ip MS,
Karlberg EM,
Karlberg JP,
et al
. Lung function reference values in Chinese children and adolescents in Hong Kong: I. Spirometric values and comparison with other populations. Am J Respir Crit Care Med 2000; 162: 424–429.
OpenUrl PubMed Web of Science

[94] Ip MS,

[95] Karlberg EM,

[96] Karlberg JP,

[97] et al

[98] ↵
Glindmeyer HW,
Diem JE,
Jones RN,
et al
. Non-comparability of longitudinally and cross-sectionally determined annual change in spirometry. Am Rev Respir Dis 1982; 125: 544–548.
OpenUrl PubMed Web of Science

[99] Glindmeyer HW,

[100] Diem JE,

[101] Jones RN,

[102] et al

[103] ↵
Burrows B,
Lebowitz MD,
Casmilli AE,
et al
. Longitudinal changes in forced expiratory volume in one second in adults. Methodologic considerations and findings in healthy nonsmokers. Am Rev Respir Dis 1986; 133: 974–980.
OpenUrl PubMed Web of Science

[104] Burrows B,

[105] Lebowitz MD,

[106] Casmilli AE,

[107] et al

[108] ↵
Ware JH,
Dockery DW,
Louis TA,
et al
. Longitudinal and cross-sectional estimates of pulmonary function decline in never-smoking adults. Am J Epidemiol 1990; 132: 685–700.
OpenUrl Abstract/FREE Full Text

[109] Ware JH,

[110] Dockery DW,

[111] Louis TA,

[112] et al

[113] ↵
Van Pelt W,
Borsboom GJJM,
Rijcken B,
et al
. Discrepancies between longitudinal and cross-sectional change in ventilatory function in 12 years of follow-up. Am J Respir Crit Care Med 1994; 149: 1218–1226.
OpenUrl PubMed Web of Science

[114] Van Pelt W,

[115] Borsboom GJJM,

[116] Rijcken B,

[117] et al

[118] ↵
Xu X,
Laird N,
Dockery DW,
et al
. Age, period, and cohort effects on pulmonary function in a 24-year longitudinal study. Am J Epidemiol 1995; 141: 554–566.
OpenUrl Abstract/FREE Full Text

[119] Xu X,

[120] Laird N,

[121] Dockery DW,

[122] et al

[123] Ip MS,
Karlberg EM,
Chan KN,
et al
. Lung function reference values in Chinese children and adolescents in Hong Kong: II. Prediction equations for plethysmographic lung volumes. Am J Respir Crit Care Med 2000; 162: 430–435.
OpenUrl PubMed Web of Science

[124] Ip MS,

[125] Karlberg EM,

[126] Chan KN,

[127] et al

[128] ↵
Quanjer PhH.,
Tammeling GJ,
Cotes JE,
et al
. Lung volumes and forced ventilatory flows. Report Working Party Standardization of Lung Function Tests, European Community for Steel and Coal. Eur Respir J 1993; 6: Suppl. 16, 5–40.
OpenUrl FREE Full Text

[129] Quanjer PhH.,

[130] Tammeling GJ,

[131] Cotes JE,

[132] et al

[133] ↵
Jensen RL,
Crapo RO,
Flint AK,
et al
. Problems in selecting representative reference values for spirometry. Am J Respir Crit Care Med 2002; 165: A200.
OpenUrl

[134] Jensen RL,

[135] Crapo RO,

[136] Flint AK,

[137] et al

[138] ↵
Lung Functionin Growth and Aging: aunited worldwide approach. www.lungfunction.org. Date last updated: October 13, 2010.

[139] Falaschetti E,
Laiho J,
Primatesta P,
et al
. Prediction equations for normal and low lung function from the Health Survey for England. Eur Respir J 2004; 23: 456–463.
OpenUrl Abstract/FREE Full Text

[140] Falaschetti E,

[141] Laiho J,

[142] Primatesta P,

[143] et al

[144] ↵
Steyerberg EW,
Bleeker SE,
Moll HA,
et al
. Internal and external validation of predictive models: a simulation study of bias and precision in small samples. J Clin Epidemiol 2003; 56: 441–477.
OpenUrl CrossRef PubMed Web of Science

[145] Steyerberg EW,

[146] Bleeker SE,

[147] Moll HA,

[148] et al

[149] Peek N,
Arts D,
Bosman R,
et al
. External validation of prognostic models for critically ill patients required substantial sample sizes. J Clin Epidemiol 2007; 60: 491–501.
OpenUrl PubMed Web of Science

[150] Peek N,

[151] Arts D,

[152] Bosman R,

[153] et al

[154] Knofczynski GT,
Mundfrom D
. Sample sizes when using multiple linear regression for prediction. Educational and Psychological Measurement 2008; 68: 431–442.
OpenUrl Abstract/FREE Full Text

[155] Knofczynski GT,

[156] Mundfrom D

[157] Standardization of spirometry: 1987 update. Official Statement of the American Thoracic Sociey. Am Rev Respir Dis 1987; 136: 1285–1298.
OpenUrl CrossRef PubMed Web of Science

[158] Standardization of spirometry: 1994 Update. American Thoracic Society. Am J Respir Crit Care Med 1995; 152: 1107–1136.
OpenUrl CrossRef PubMed Web of Science

[159] Miller MR,
Hankinson J,
Brusasco V,
et al
. Standardization of spirometry. Eur Respir J 2005; 26: 319–338.
OpenUrl Abstract/FREE Full Text

[160] Miller MR,

[161] Hankinson J,

[162] Brusasco V,

[163] et al

[164] Hankinson JL,
Odencrantz JR,
Fedan KB
. Spirometric reference values from a sample of the general US population. Am J Respir Crit Care Med 1995; 152: 179–187.
OpenUrl CrossRef PubMed Web of Science

[165] Hankinson JL,

[166] Odencrantz JR,

[167] Fedan KB

Main menu

User menu

Search

Influence of secular trends and sample size on reference equations for lung function tests

Abstract

MATERIAL AND METHODS

RESULTS

Difference in mean z-score between centres

Secular trends

Differences in scatter between centres

The effect of sample size on predicted values

What size of sample is needed to validate equations?

DISCUSSION

Conclusions

Acknowledgments

Footnotes

REFERENCES

Citation Manager Formats

Subjects

More in this TOC Section

Original Article

Lung Function

Related Articles

Contact us

Main menu

User menu

Search

Influence of secular trends and sample size on reference equations for lung function tests

Abstract

MATERIAL AND METHODS

RESULTS

Difference in mean z-score between centres

Secular trends

Differences in scatter between centres

The effect of sample size on predicted values

What size of sample is needed to validate equations?

DISCUSSION

Conclusions

Acknowledgments

Footnotes

REFERENCES

Citation Manager Formats

Jump To

Subjects

More in this TOC Section

Original Article

Lung Function

Related Articles

Contact us