Abstract
High-resolution computed tomography (HRCT) may be useful to monitor lung disease in children with common variable immunodeficiency disorder (CVID). We evaluated interobserver agreement and correlation with pulmonary function tests (PFTs) for automated quantification and visual scoring of air trapping and airway wall thickening on HRCT in paediatric CVID patients.
In a cohort of 51 children with CVID, HRCT was analysed visually and automated for presence of air trapping and airway wall thickening. PFTs were expressed as % predicted. Disease duration, physician-diagnosed pneumonias and antibiotic prophylaxis were recorded.
Interobserver agreement for automated airway wall thickening was good with an intra-class correlation coefficient of 0.88, compared with 0.51 for visual scoring. Presence of air trapping on HRCT correlated significantly with PFTs and disease duration, but was not associated with previous pneumonias. Airway wall thickening did not correlate significantly with PFTs or disease duration and was not associated with previous pneumonias or prophylactic antibiotic use.
In children with CVID disorders, HRCT air trapping measurements are significantly correlated with PFTs and disease duration. Quantitative air trapping is a feasible and promising technique for small airway disease quantification that may be applied to monitor (silent) disease progression in CVID.
Common variable immunodeficiency (CVID) is a heterogeneous immunodeficiency syndrome characterised by B-cell dysfunction, resulting in hypogammaglobulinaemia. Because of the low levels of immunoglobulins (Ig), CVID patients are prone to developing recurrent lower respiratory tract infections [1, 2]. It has been reported that 90% of CVID patients suffer from one or more episodes of lower respiratory tract infections throughout their life [3], and structural pulmonary high-resolution computed tomography (HRCT) abnormalities are reported in up to 93% of patients [1, 2]. For the prevention of irreversible airway damage, CVID patients are treated with Ig replacement therapy and antibiotics as a prophylaxis or when needed for respiratory infections. Despite adequate treatment, progression of pulmonary disease occurs in a considerable number of patients and, recently, an overall mortality rate of 25% was reported 15 yrs after diagnosis [4, 5]. Because pulmonary complications are commonly seen and therapeutic options are available, it is important to monitor the presence and progression of lung disease in CVID patients [2]; this can be done clinically with laboratory tests, cultures, pulmonary function tests (PFTs) and imaging techniques. HRCT can demonstrate disease in asymptomatic CVID patients. This is important, as disease can progress without any notable episode of pneumonia, described as “silent progression” [6, 7]. Recently, a CVID-related lung disease HRCT score was reported, which aimed to quantify the structural disease component [2]. However, visual scores are time-consuming and associated with interobserver variation. The interobserver variation was substantial, especially for visual airway wall thickening scores (intra-class correlation coefficient 0.51). Validated software is available to quantify airway wall dimensions [8, 9] and software initially developed for emphysema quantification [10–12] could be used to quantify end-expiratory air trapping.
The aims of the current study were to evaluate the relationship of visual scores and quantitative measures of HRCT-diagnosed air trapping and airway wall thickening with PFTs in a cohort of children with CVID(-like) disease, and to determine whether observer agreement for airway wall measurements can be improved by using automated quantitative techniques.
MATERIAL AND METHODS
Study population
Children with clinically important humoral immunodeficiencies underwent structured follow-up at our outpatient paediatric clinic (University Medical Center Utrecht and Wilhelmina Children’s Hospital, Utrecht, the Netherlands). Follow-up included pulmonary evaluation with HRCT and PFTs. We evaluated 54 children with CVID(-like) disease who were treated with Ig replacement therapy and showed significant clinical improvement after initiation of therapy. Three patients were excluded from the study because the raw HRCT data were not stored properly in one and software failed in two, thus the study population consisted of 51 children. These 51 HRCT scans were performed between June 2008 and June 2009 and were all obtained in clinically stable patients, otherwise the scan was postponed. 33 (65%) children were diagnosed with CVID according to the European Society for Immunodeficiencies (ESID)/Pan-American Group for Immunodeficiency (PAGID) criteria [13], while 18 (35%) did not meet all of the ESID criteria for CVID. These 18 patients were defined as CVID-like (symptomatic selective antibody deficiency in combination with IgA deficiency and/or IgG subclass deficiency), but showed clinical and immunological phenotypes indistinguishable from CVID patients. For antibiotic prophylaxis, co-trimoxazole was used in each protocol. This retrospective investigation was approved by the ethical review board and informed consent was waived.
HRCT scanning protocol
Chest HRCT scanning was performed, conforming to a dedicated low-dose volumetric protocol, on a 16-detector-row computed tomography (CT) scanner (Brilliance-16; Philips, Cleveland, OH, USA). Scans were obtained during both inspiration and expiration using a breath-hold instruction. Inspiratory scans were acquired in a caudocranial direction with a collimation of 16×0.75 mm, pitch 0.9, rotation time 0.5 s, 90 kVp and milli-amperage per second depending on body weight (range 16–60 mA·s−1). Expiratory scans were acquired in a caudocranial direction with a collimation of 16×0.75 mm, pitch 1.2, rotation time 0.4 s, 90 kVp and 11 mA·s−1. The expiratory scan was obtained at end expiration.
HRCT measurements
In two patients the expiration HRCT scan was obtained at insufficient exhalation (visual judgement); therefore, air trapping scores and measurements were reported for 49 HRCT scans. The visual scores for airway wall thickening and air trapping have been described previously [2]. Two blinded independent observers scored the extent of air trapping on a scale from 0 (absent) to 5 (>80% of the volume of a lobe affected), the extent of airway wall thickening on a scale from 0 (absent) to 3 (>67% of the volume of a lobe affected), and the average and maximum severity of severe airway wall thickening per lobe. Airway wall thickening severity was defined as mild (airway wall thickness to accompanying pulmonary artery ratio 0.33/0.5), moderate (airway wall thickness to artery ratio 0.5/1) and severe (airway wall thickness to artery ratio >1). Abnormalities were scored for each of the six lobes including the lingula. The range for air trapping was 0 (absent in all lobes) to 30 (>80% involvement of all lobes). For airway wall thickening for each lobe, the most severe score (maximum 3) was added to the average score (maximum 3), and the sum was multiplied by the extent score (maximum 3). This led to a range of 0 (no airway wall thickening) to 108 (all airways severely thickened). The HRCT scores were expressed on a scale of 0–100%. The presence of ground-glass opacities was recorded.
Validated custom software was used for the automated measurements (EmphylxJ; University of British Columbia, Vancouver, BC, Canada) [8, 9, 11, 12]. Airway wall area (WA) and lumen area (LA) dimensions were quantified for the right upper lobe apical bronchus (RB1) and the area of the right lung at this level was measured (fig. 1). The airway quantification of RB1 was performed by two independent observers in order to calculate interobserver agreement. Wall area was expressed as a percentage of total airway area: 100×WA/(WA+LA) = WA%. As WA% is not constant with age, it was corrected for age by using the formula: WA%age = WA% + (18 − age×0.72). This formula was obtained from airway measurements published previously [14]. WA was also normalised to body size by dividing WA by body surface area (using the formula of Mosteller [15], WABSA) and by right lung area (WAlung area).
Example of automated airway measurement technique of right upper lobe apical bronchus (RB1) on an inspiratory high-resolution computed tomography slice. a) Right lung area at RB1 is identified. b, c) Enlarged area of the area highlighted by the box in a. The airway wall is then automatically defined and can be adjusted manually if required (c).
For airway wall quantification, all perpendicular airways at every tenth CT slice were measured. We obtained perimeter of the inner airway lumen (Pi) and WA. √WA was plotted on the vertical axis against Pi, resulting in a linear relationship. The linear regression formula was obtained for each patient and, by using this formula, the WA at a Pi of 10 mm was calculated (WAPi10).
For air trapping quantification, the lung parenchyma was segmented at every tenth expiratory HRCT slice and the total number of voxels and voxels < -850 Hounsfield units (HU) were quantified (fig. 2). This cut-off value has been described previously [16]. The number of voxels below the cut-off was expressed as a percentage of the total number of voxels: ATHU-850 = 100×number of voxels < -850 HU/total number of voxels. The HU at the 15th percentile was also calculated (AT15).
Demonstration of the air trapping quantification technique on an expiratory high-resolution computed tomography (HRCT) slice. a) The original expiratory HRCT slice demonstrates differences in density reflecting air trapping. b) The voxels with a density below a certain cut-off value, in this example < -850 Hounsfield units, are highlighted (grey shaded areas) and quantified.
Pulmonary function tests
Two (4%) children were unable to perform spirometry and four (8%) could not perform plethysmography. Spirometric measurements and body plethysmographic measurements were obtained according to the American Thoracic Society criteria [17, 18]. Spirometric measurements included forced vital capacity (FVC), forced expiratory volume in 1 s (FEV1) and maximum mid-expiratory flow at 25–75% of FVC (MMEF). Body plethysmography included residual volume (RV) and total lung capacity (TLC). Measurements were expressed as percentage of predicted values. Reference formulas from a Dutch cohort of healthy children were used [19]. Two ratios were calculated and expressed as a percentage: RV/TLC and FEV1/FVC.
Statistical analysis
Interobserver agreement for RB1 WA%, airway wall thickening score and air trapping score [2] was evaluated using scatter plots with a line of identity, and intra-class correlation coefficients were calculated. This correlation took account the distance to the line of identity. An intra-class correlation >0.8 represented good agreement. The relationship between PFTs, disease duration and HRCT parameters was visualised using scatter plots and Spearman rank correlations were calculated. Unpaired sample t-tests were used to determine whether HRCT measurements differed between patients who experienced physician-diagnosed pneumonia and those who had not, or those who used antibiotic prophylaxis. Analyses were performed using SPSS 15.0 (SPSS Inc., Chicago, IL, USA). Statistical significance was set at p<0.05. Data are reported as mean±sd and (range), unless otherwise stated.
RESULTS
Patient characteristics and pulmonary function
The mean±sd (range) age of the 51 children (38 males and 13 females) at the time of HRCT was 13.1±3.5 (6–18) yrs. Mean disease duration was 6.5±3.5 (0.1–13) yrs. 27 (53%) out of the 51 children were diagnosed with one or more pneumonia before the HRCT was performed. A total of 22 (43%) children were using regular antibiotic prophylaxis prior to the HRCT. On average, children with CVID had a preserved lung function with an FEV1 of 96.3±12% (66.6–129.1%). Further details are presented in table 1.
HRCT scores and measurements
The interobserver agreement for the automated RB1 WA% measurement was good with an intra-class correlation coefficient of 0.88. The interobserver agreement for the visual scores has been previously reported and was 0.51 for airway wall thickening and 0.78 for air trapping [2]. WA%age was 72.5±7.4% (54.9–92.4%). WAPi10 was 4.3±0.6 (3.5–6.0) mm. Ground-glass opacities were present in seven (14%) out of the 49 children (table 1).
Correlation between HRCT, PFTs, disease duration and antibiotic use
Visual air trapping score correlated significantly with FVC, FEV1, MMEF and FEV1/FVC%, but not with RV/TLC% and disease duration (tables 2 and 3). ATHU-850 correlated significantly with MMEF and, when patients with ground-glass attenuation on HRCT (n=7, 14%) were excluded, this correlation became stronger and also significant for AT15 (table 2). Disease duration was also significantly associated with quantitative air trapping measures (table 3; fig. 3).
Scatter plots illustrating significant correlations between visual scores for air trapping and a) maximum mid-expiratory flow at 25–75% of forced vital capacity (MMEF) and b) forced expiratory volume in 1 s (FEV1). c) Quantitative measurements of air trapping for cut-off value of -850 HU in patients with no ground-glass attenuation on high-resolution computed tomography and MMEF. Correlation coefficients are presented in table 2. % pred: % predicted.
For airway wall thickening, neither visual scoring nor automated quantitative measures were associated with decreased pulmonary function (table 2) or longer disease duration (table 3). Of note, the significant correlations of FEV1, MMEF and RV/TLC% with quantitative airway measurements were in the opposite direction, i.e. a worse lung function was associated with a lower wall thickness. HRCT measurements did not differ between patients who experienced physician-diagnosed pneumonia and those who had not, or those who used antibiotic prophylaxis (table 4).
DISCUSSION
Airway disease is an important complication of CVID and HRCT and PFTs are gaining important complementary roles in monitoring it. We showed that in children with CVID disorders, HRCT air trapping measurements are significantly correlated with PFTs and disease duration, and that interobserver agreement for HRCT airway wall thickening assessment can be improved by using automated quantitative techniques compared to visual scores. Thus, automated quantitative HRCT, especially for air trapping, may prove useful for monitoring disease progression in CVID patients.
For air trapping, we demonstrated that visual, as well as quantitative, HRCT measurements correlate significantly with PFTs and disease duration. For visual scoring of air trapping, previous studies have also found a significant correlation with PFTs in CVID patients [20]. These data and our data indicate that HRCT-diagnosed air trapping is related to relevant airway obstruction in these patients. The advantage of our study is the use of quantitative techniques of air trapping that have not previously been employed in CVID. Using this technique, we demonstrated that the presence of air trapping correlates the strongest with MMEF, a functional measure of small airways disease. A previous study by Thickett et al. [21] suggests that airway disease in CVID, as in other diseases such as cystic fibrosis, may start in the small airways. It is likely, that although we have no pathological verification, quantitative air trapping is a sign of small airway disease that occurs in childhood and increases with longer duration of the disease. The correlations with PFTs are stronger for visual air trapping scores than for quantitative measures. Several reasons could account for this. First, the presence of ground-glass opacity apparently had an effect on the quantitative measurements. It may be that more sophisticated air trapping measures that combine inspiratory and expiratory CT, such as the relative volume change, can be used to overcome this problem [22]. Secondly, differences in depth of expiration introduces noise; it is possible that human observers detect an air trapping pattern at a variety of expiration levels while computerised analysis is more dependent on a stable expiration between patients. For that reason, spirometer gating or more sophisticated quantification techniques might be advantageous [23, 24]. However, quantitative air trapping measurements correlated more strongly with disease duration than the visual scores. Based on these results, we can neither advocate visual nor quantitative HRCT air trapping measurements for disease assessment in CVID, although the quantitative measurements have the advantage of being fully automated.
For airway wall thickening, we demonstrated that neither visual nor quantitative airway wall measurements correlated with pulmonary function or disease duration, although this does not necessarily exclude the possibility of monitoring disease progression. Quantitative airway wall measurements had not previously been employed in CVID. We tried several strategies to “normalise” the airway dimensions by age, body size and lung size for RB1 and we calculated WA at a lumen perimeter of 10 mm for all perpendicular airways as a normalisation. Nevertheless, we were unable to demonstrate a significant correlation with lung function or disease duration and, therefore, cannot recommend a specific automated measurement or calculation of airway wall thickening. We also could not explain this lack of correlation by use of prophylactic co-trimoxazole in some patients. Other studies have reported a significant correlation between airway wall thickening and pulmonary function in CVID [25], and it is generally accepted that recurrent pulmonary infections and inflammation typically cause bronchial wall thickening. An explanation for the discrepancy with our results might be that we performed HRCT in clinically stable patients, while in most other CVID studies, HRCT scans were performed for a clinical indication. It may also be that in children with CVID, mainly peripheral airway walls are involved. With routine CT technology, these airways cannot be directly visualised and for visible airways the quantification is most accurate in airways with a diameter >2 mm as airway wall dimensions are overestimated in smaller visible airways with most software. It may well be that the airway walls of the larger airways that we could quantify in our study are spared in stable CVID children. We have no clear explanation for the significant correlations of FEV1, MMEF and RV/TLC% with quantitative airway wall measurements in the unexpected direction, and we were unable to identify any CVID-specific pathological studies of airway abnormalities that could clarify this result. In conclusion, although large airway wall thickening appears to be an acute disease marker in CVID, it appears that it is not a good disease marker in stable children with CVID, where airway wall disease is probably limited to the small airways; although, it cannot be excluded that longitudinal studies of airway wall thickening will be able to detect disease progression in large airway walls of stable CVID children.
We evaluated the reproducibility of HRCT measurements of air trapping and airway wall thickening. It has previously been described [2] that air trapping and airway wall thickening are common findings on visually judged HRCT scans of CVID patients, although, especially for airway wall thickening, interobserver variation was considerable. Herein, we showed that quantitative, automated measurements of airway wall thickness substantially improve interobserver agreement. We also demonstrated that fully automated quantification of air trapping (without interobserver variation) is possible and can be achieved by using a simple cut-off at -850 HU or by assessing the HU value at the 15th percentile on the expiratory HRCT. This gain in interobserver agreement by using quantitative techniques is promising for the long-term monitoring of airway abnormalities in patients with CVID. It has been shown that quantitative airway wall measurements in other diseases, such as cystic fibrosis, can detect disease progression [26], and it is plausible that longitudinal studies could prove HRCT to be able to detect (silent) disease progression in CVID patients as well, and eventually may improve clinical management in these patients.
In the present study, we did not quantify bronchiectasis, which would be another interesting disease measure in these children, as bronchiectasis already occur in childhood [2]. For quantification of bronchiectasis the lumen area must, however, be normalised by using the area of the accompanying pulmonary artery (ratio of airway lumen area to pulmonary artery area). To our knowledge, no commercially available software exists for pulmonary artery area measurements. Another possibility for bronchiectasis quantification is the measurement of airway lumen tapering, but for such measurements, three-dimensional software is required.
Our study has some limitations. First, given our cross-sectional study design, we cannot determine whether HRCT can detect (silent) disease progression in patients with stable PFTs and absence of symptoms. As longitudinal studies are required to assess this relationship, we plan to re-evaluate our cohort after 2 yrs. Secondly, although inadequate expiration may affect HRCT measurements, we did not control the level of inspiration and expiration by spirometry-gated HRCT, for example. Nevertheless, since our aim was for a clinically applicable test and spirometer-gated scanning is not widely available, its absence strengthens the generalisability of our results. Thirdly, no pathological correlate was available for our HRCT measurements and, to our knowledge, CVID-specific pathology studies of airway abnormalities have not been published to date. Fourthly, we could not find a significant difference in HRCT measurements between children who had and those who had not experienced a physician-diagnosed pneumonia. This may be due to the retrospective evaluation of these pneumonias from the patient files, leading to pneumonias that were not recorded in our study for several children. Finally, our study lacks control HRCT scans of “healthy” children and, therefore, we are unable to determine a specific cut-off level for pathology. However, at this stage, our choice for quantitative or visual HRCT analysis for future research would be based on interobserver agreement and correlation with lung function.
In conclusion, we report that automated quantitative HRCT measurements improve interobserver agreement for airway wall measurements compared with visual scores in a cohort of paediatric CVID patients. Whether wall measurements in larger airways will prove useful in stable CVID children is questionable as these measurements are not significantly associated with PFTs, disease duration or number of pneumonias. Conversely, our study cannot exclude that quantitative airway wall measurements can detect disease progression or acute disease changes. Our quantitative air trapping data suggest that, in stable children with CVID, mainly the walls of small airways are involved and quantitative air trapping is a feasible and promising technique for small airways disease quantification, which may be applied to monitor (silent) disease progression in CVID as early detection of complications is of crucial importance to prevent long-term morbidity, especially in paediatric patients.
Footnotes
Statement of Interest
A statement of interest for A.A.J.M. van de Ven can be found at www.erj.ersjournals.com/site/misc/statements.xhtml
- Received November 7, 2010.
- Accepted May 25, 2011.
- ©ERS 2011