Copyright ©ERS Journals Ltd 2006 Airway dimensions measured from micro-computed tomography and high-resolution computed tomography1 The Woolcock Institute of Medical Research, and 2 The Cooperative Research Centre for Asthma, and 3 University of Sydney, and 4 The Key Centre for Microscopy and Microanalysis, University of Sydney, Sydney, and 5 Dept of Radiology, Royal Prince Alfred Hospital, Camperdown, and 6 Dept of Respiratory Medicine, Royal North Shore Hospital, St. Leonards, Australia. CORRESPONDENCE: G. G. King, The Woolcock Institute of Medical Research, University of Sydney, Sydney 2006, Australia. Fax: 61 299066391. E-mail: ggk{at}woolcock.org.au Keywords: Computer-assisted image processing, imaging, phantoms, validation studies
Received: February 3, 2005
Volume averaging results in both over- and underestimation of airway dimensions when they are measured by high-resolution computed tomography (HRCT). The current authors calibrated computerised measurements of airway dimensions from HRCT against a novel three-dimensional micro-computed tomography (CT) standard, which has a 50-fold greater resolution, as well as against traditional morphometry. Inflation-fixed porcine lung cubes were scanned by HRCT and micro-CT. A total of 59 lumen area (Ai), 30 wall area (Aaw) and 11 lumen volume (Vi) measurements were made. Ai was measured from the cut surface of 11 airways by morphometry. Airways in scanned images were matched using branching points. After calibration, the errors of Ai, Aaw and Vi HRCT measurements were determined. The current authors found a systematic, size-dependent underestimation of Ai and overestimation of Aaw from HRCT measurements. This was used to calibrate an HRCT measurement algorithm. The 95% limits of agreement of subsequent measurements were ±3.2 mm2 for Ai, ±4.3 mm2 for Aaw, and ±11.2 mm3 for Vi with no systematic error. Morphometric measurements agreed with micro-CT (±2.5 mm2) without systematic error. In conclusion, micro-computed tomography image data from inflation-fixed airways can be used as calibration standards for three-dimensional lumen volume measurements from high-resolution computed tomography, while morphometry is acceptable for two-dimensional measurements. The image dataset could be used to validate other developmental three-dimensional segmentation algorithms. Due to multidetector technology, volumetric high-resolution computed tomography (HRCT) image data are routinely acquired, and have been used for quantitative image analysis of airways and lung tissue in many studies 19. Direct measurements of airway structures can be made from HRCT image data, making it useful for studying asthma, cystic fibrosis and chronic obstructive pulmonary disease. Airway dimensions have been measured from HRCT image data using customised computer-analysis algorithms that have been validated using airway phantoms of known dimensions constructed from nonanimal materials, such as plexiglass and sweet potato 4, 8, 1012. In these studies, planimetric measurements were made of a cut surface of the phantom. It was therefore reasonably assumed that the dimensions would be completely uniform along the length of the airway phantom and that orientation in the z-axis of the scanning plane could be accurately controlled.
Validation of airway measurement algorithms advanced further with the use of explanted animal lungs as calibration standards. Porcine lungs wet fixed with formalin 13 have been used and the current authors have previously used formalin inflation-fixed porcine lungs 8 for validation. The use of liquid-fixed resected human lung tissue for validation has also recently been described 13. Explanted lung preparations are more suitable than artificial phantoms for validation because they closely approximate the range of tissue densities and dimensions, and the complex structure of lung tissues in vivo. This includes features such as mucosal folding and irregularities in airways, which are not present in artificial phantoms. This difference compared with artificial material phantoms is, in turn, associated with different estimates of measurement errors for HRCT 8. HRCT measurements are affected by volume averaging, particularly in the scanner's z-axis. This is due to the complexity of the structures, range of tissue densities, inability to control airway orientation in the z-axis during scanning, and the fact that the z-axis is the longest dimension of the voxel (fig. 1
Micro-computed tomography (CT) is currently used for studying the 3D structure of a wide range of small objects in microscopic detail with image data consisting of cubic voxels as small as 2x2x2 µm, but its use in the lung has not been widely explored. Micro-CT resolution is almost two orders of magnitude greater than HRCT, in which voxel size is typically 0.5x0.5x1 mm. The resolution of micro-CT should allow airway measurements of similar or greater accuracy to morphometry, but in three dimensions, which would allow validation of 3D segmentation algorithms used to measure airway dimensions in vivo (fig. 2
To validate micro-CT as the calibration standard for an HRCT airway segmentation algorithm, micro-CT diameter measurements of plastic and wooden cylindrical rods of various sizes were compared with calliper measurements. Ai measurements from a single micro-CT image of the cut surface of inflation-fixed porcine lung were compared with the current gold standard, tissue morphometry 8, 14. Micro-CT measurements of Ai and Aaw were used to calibrate a segmentation algorithm for unbiased Ai and Aaw measurements from HRCT and the precision (95% limits of agreement) of Ai, Aaw and Vi measurements of the calibrated algorithm. Finally, the degree of agreement between calibrated HRCT measurements of Ai from the cut surface of the lung and morphometric measurements was examined (fig. 3
Lung preparation and fixation Lungs were fixed in inflation with formalin 8 using modification of a method described by Weibel and Vidone 15. The lungs were obtained from a local butcher and were fixed 1224 h after being removed from the animal. Formalin steam was passed into the lung and negative pressure was applied to the exterior of the lung to fix the lung in inflation at an approximate inflation pressure of 25 cmH2O (fig. 4 2 h, depending on the size of the specimen.
After fixation and cooling to room temperature, the lung was scanned by HRCT and airways of interest were identified. The lung was then cut into 2-cm cubes in a plane approximately perpendicular to the z-axis of the airways of interest. The cut surfaces of the lung cubes were photographed. Finally, the 2-cm lung cubes were wrapped in plastic wrap and coated in paraffin for protection (Paraplast; Tyco Healthcare, Mansfield, MA, USA).
Digital imaging of the lung
Image analysis Ai and Aaw measurements were made directly from the HRCT image data (DICOM format) using an in-house, custom-written software program. The HRCT lung images were displayed on-screen and the airway of interest was identified manually. The centroid of the airway was automatically identified, and 20 radial spikes were created running out from it. The inner and outer edges of the airway wall were defined as the point along the radial spikes at which the greatest rate of change in density occurred, i.e. the transition from air to soft tissue. Ai and Aaw were measured from micro-CT bitmap images using the same method as for morphometry, tracing the inner and outer edges of the airway wall using the ImageJ software. The lumen and wall areas were calculated as the product of the pixel size (19x19 µm2) and the number of voxels within the traced border. A single operator (J.R. Dame Carroll) traced each airway border twice, and the average of the two measurements was used for analysis. For comparisons of micro-CT and morphometry, all airways that were visible in both the micro-CT FOV and the digital photographs of the cut surface of the lung specimens were measured. This resulted in a range of airway sizes comparable to those measured by the current authors in a previous HRCT study of asthmatic and normal airways 16. For comparisons between HRCT and micro-CT, airways were matched using airway branch points as anatomical markers. Since the HRCT slice thickness of 1.25 mm was >60 times thicker than the micro-CT slice thickness of 19 µm, every 30th micro-CT image was used to give an effective slice thickness of 570 µm. To calibrate Ai measurements from HRCT, the average of three consecutive Ai and Aaw measurements from micro-CT images which covered a thickness of 1.14 mm, was compared with each corresponding HRCT slice of thickness 1.25 mm. Micro-CT and HRCT measurements of Vi were calculated as the sum of all lumen voxels in the airway. Voxel dimensions for micro-CT were 19x19x570 µm, since only every 30th image was used (30 slicesx19µm = 570µm). The voxel dimensions for HRCT were 0.35x0.35x1.25 mm, since slice thickness was 1.25 mm.
Statistical analysis
Intra-observer variation
Agreement between imaging methods
Calibration of HRCT measurements
Calliper measurements The measurements made by micro-CT were validated against 72 calliper measurements of cylindrical rods, made of artificial materials, with diameters between 0.8 mm and 9.8 mm. The intra-operator repeatability of the calliper measurements of the rods was ±0.21 mm and was independent of diameter. Consequently, the percentage error was greater for smaller rods. The mean difference between micro-CT and calliper measurements of diameter was 0.03±0.03 mm (p = 0.14) and the 95% limits of agreement were ±0.285 mm. Thus, there was no systematic error in micro-CT measurements.
Morphometry versus micro-CT of airways
Micro-CT versus HRCT of airways A total of 59 Ai measurements from micro-CT and HRCT were compared. The mean Ai from micro-CT was 24.1±5.7 mm2 (idealised diameter 5.1±0.5 mm) and ranged 1.780.9 mm2. There was a systematic and size-dependent underestimation of Ai when measured using HRCT. This was linear in nature (fig. 6a
HRCT Ai = 0.7224xmicro-CT Ai1.7241(1)
After this equation, or calibration factor, was applied to all HRCT Ai measurements, the 95% limits of agreement of Ai measurement were ±3.2 mm2 (fig. 6c
A total of 30 airway wall area measurements from micro-CT and HRCT were compared (mean Ai 29.4±9.6 mm2, idealised diameter 6.5±1.0 mm). Airways with Aaw <10 mm2 could not be segmented for wall area reliably with the current authors segmentation algorithm, and were therefore excluded from the analysis. The excluded airways had a corresponding Ai <5 mm2, and an idealised diameter of <2.5 mm. The overestimation of Aaw when measured using HRCT was size dependent and linear (fig. 7a
HRCT Aaw = 0.8872xmicro-CT Aaw+8.8611 (2)
After calibration of the HRCT Aaw measurements, the 95% limits of agreement of Aaw measurements were ±4.3 mm2 (fig. 7c
Airway volume was measured using the calibrated segmentation algorithm for all airways that had at least three consecutive HRCT images in which segmentation could be done. Airway lumen volume was calculated as the product of voxel volume (0.35x0.35x1.25 mm) and the number of voxels within the lumen from all contiguous slices along the airway volume. Volume measurements from HRCT images from 11 airways using the calibrated measurements agreed closely with micro-CT Vi measurements with no systematic error or size dependence (fig. 8a
In this study, 2D and 3D measurements of the airway lumen were obtained from micro-CT images of explanted lung tissue to serve as calibration standards. An inflation fixation was used that kept the tissue at a similar density to what it would have been in vivo, before morphometry, HRCT scanning and micro-CT scanning of the same tissue was undertaken. By using micro-CT, the previous limitation of having only 2D calibration standard measurements (i.e. lumen area) of explanted airways was overcome. The current authors Ai segmentation method for HRCT data resulted in significant underestimation when compared to the calibration standard measurements. The measurement error was predictable, being size dependent in a linear fashion. The current authors segmentation algorithm for HRCT was also found to result in a linear, size-dependent overestimation of Aaw. The calibration of the computerised segmentation algorithm resulted in Ai and Aaw measurements from HRCT that were free of systematic errors and allowed determination of the overall precision of the current authors measurement tool. The close agreement and lack of systematic error between micro-CT and morphometric measurements of Ai suggest that micro-CT measurements do not suffer any significant volume averaging. Validation of the current authors ray-casting segmentation for HRCT against micro-CT produced accurate Vi measurements with relatively small and unbiased errors. There are several possible sources of errors in airway measurements from HRCT image data. These include volume averaging (which can be exacerbated by more acute angles of airway orientation relative to the scanning plane); the scanners point spread functions and reconstruction algorithms; and the lumen segmentation methods 19. It is possible to measure the angles of orientation of airways from volumetric HRCT data to correct for volume averaging 20, potentially reducing variation in HRCT measurements. The present authors were unable to do this with the current iteration of their analysis algorithm, but it needs to be addressed in future studies. It is likely that these sources of error may differ between manufacturers and models of HRCT machines. Therefore, it may be useful to examine the variations in CT image data arising from differences in clinical CT scanners, as well as those arising from differences in reconstruction algorithms. The magnitude of the variation is not known, nor is it known whether it is clinically significant, although it is common for serial chest CT scans for monitoring disease to be carried out using different scanners. The manual outlining of airway lumen borders on micro-CT image data might have been an additional source of error, but the present authors think this is likely to be very small. This is because the very small errors in measurements of diameter obtained from the cylindrical rods are an order of magnitude smaller than the errors introduced through HRCT image acquisition and analysis. Measurements of Ai, Aaw and Vi made from micro-CT were unaffected by window settings because of the great spatial resolution and the great contrast at the lumen interface. It is interesting that the 95% limits of agreement between micro-CT and surface morphometry (±2.5 mm2) were similar to the agreement between micro-CT and HRCT (±3.2 mm2). The disagreement between micro-CT and surface morphometry may be due to the reconstruction plane of micro-CT being slightly out of alignment with the cut surface of the lung which was photographed. The implication is that similar errors are likely to have occurred when comparing micro-CT with HRCT and this also suggests that the comparison method could be improved in the future by using a 3D registration process to remove this potential source of error. It should also be noted that in some cases, the airways tended to taper towards the surface of the lung cubes, with the airway reaching its largest diameter toward the centre of the lung cube. These airways were not included in the comparison between micro-CT and morphometry, because they were not visible on the cut surface of the lung.
Volume averaging is the factor most likely to account for the bulk of the errors in Ai measurements. The effects of volume averaging on apparent Ai are likely to be greatest in the z-axis in HRCT, because the z-dimension of the HRCT voxel (1.25 mm) is almost four times as long as the x- and y-dimensions (0.35 mm). This is relevant because airways are usually not parallel to the z-axis in human HRCT studies, nor were they in the current study. As a result, the effects of volume averaging and, hence, the magnitude of the underestimation of lumen area are greater as airway size decreases and the angle of orientation increases (fig. 1
The current authors found that the lower limit of airway size for which lumen measurements could be made with accuracy, in terms of percentage errors, was The current authors used one airway segmentation method for HRCT, although there are many airway analysis algorithms, all of which are based on slightly different segmentation criteria 1, 38, 21. In the present study, the point at which the density change over two pixels was maximal along the radial spike was used, while others have used the more common "full width at half-max" method. The former method is likely to result in a greater underestimation of the lumen, but it was chosen for the current study because it appeared to segment the lumen reliably even in small airways. Another possible factor is the number of radial spikes used in the segmentation. The ideal number may vary depending on the edge-detection method and is worthy of further study. Such systematic errors relating to a particular methodology of finding the airway lumen edge can be corrected by comparison with calibration standards generated by studies, such as the present one. The precision of different segmentation methods can then be meaningfully compared. The data set of HRCT airway images could be used in the future to compare, and possibly standardise, measurements made using the different segmentation algorithms. Given the increasing likelihood that computerised segmentation will be used routinely in clinical practice to measure airway dimensions, documentation of the differences in Ai and Aaw measurements associated with different segmentation methods needs to be done using calibration standards in follow-up studies. The formalin steam fixation method did not significantly alter lung density, which makes this validation method likely to be more accurate and relevant to in vivo lung imaging than validation with artificial material phantoms. Lung densities pre- and post-fixation were 0.34 g·mL-1 and 0.38 g·mL-1, respectively. The tissue density of the formalin-fixed pig lung was compared with human in vivo lung as measured by HRCT and found to be similar. The lung cubes were very delicate and needed to be handled extremely gently as they were moist and not completely stiff. A very sharp and thin knife was used to slice the lung, but a slight distortion of the parenchyma at the site of slicing was noticed in the lung cubes. Since all of the analysis was performed after slicing, the effect of the small distortions at the sliced edge was constant across imaging techniques and not considered an important issue. Wet fixation (a method commonly used for excised lungs, in which formalin solution is pumped into the lung or lobe) was considered unsuitable for this validation because it results in fluid-filled alveoli, thereby altering mean lung density, as well as introducing additional fluid into the airway lining. Dry fixation was similarly considered to be unsuitable because it causes desiccation of the lung. Therefore, the calibration standard described in the present study gives a more "real life" measure of the artefacts in airway appearance and measurement that would occur in HRCT scans of human subjects. Another strength of the present study is that the micro-CT, morphometry and HRCT were performed after fixation, which enabled the current authors to determine measurement errors that were associated with image analysis, since there could be no errors due to fixation artefacts. HRCT provides the unique opportunity to monitor the response of airways to treatment or other stimuli by making direct measurements on airways in vivo over time. It is essential that airway measurements are accurate and that the accuracy of the measurements is independent of airway size, since an airway that is reduced in size owing to moderate or severe increases in smooth muscle tone, inflammation or wall thickening, may experience large changes in lumen size if the inflammation and bronchoconstriction resolve. If the accuracy of the measurements is largely size dependent, then the magnitude of the change in airway dimensions will be masked. It is widely accepted that HRCT measurements of lumen area tend to underestimate actual lumen area in a size-dependent fashion such that the absolute error increases as airway size increases, and HRCT measurements of airway wall tend to overestimate actual airway wall dimensions in the same size-dependent fashion 8. Therefore, if a patient was scanned during an acute exacerbation of asthma while the airways were inflamed and constricted, then scanned again after recovery and treatment with anti-inflammatory medication, the errors associated with the two scans of the same airway would differ greatly. This process also works in reverse, such that the underestimation of airway lumen size before bronchoconstriction will be larger than the underestimation error associated with the same airway measured after constriction, so the absolute change in lumen area would itself be underestimated. In the present study, a software standard, not a hardware standard, has been developed. A data set of micro-CT and HRCT images has been created. This can be provided to other researchers to be analysed using their own segmentation software. This is not a hardware comparison tool for CT scanners. In order to make this method applicable to other scanners, a better, longer-lasting method for preserving the lung tissue samples after they are fixed needs to be developed so that the lung cubes can be sent to different research sites to be scanned by different scanners. The paraffin wax used to seal the cubes in the present study is air permeable, and the current authors found that after a number of weeks the tissue had shrunk noticeably when the cubes were scanned again. In summary, the current authors have used micro-computed tomography as the standard to calibrate and then measure the accuracy of a computerised segmentation algorithm for measuring lumen area, wall area and lumen volume using high-resolution computed tomography. The importance of this work is in providing a three-dimensional standard for calibration, which allows more accurate in vivo measurements to be made of airway dimensions from using high-resolution computed tomography. This will allow clinicians and researchers to assess airway pathophysiology more effectively. The micro-computed tomography method is more technically complex and only available in large academic centres, but clearly its value is as a three-dimensional calibration standard which allows comparison of different models of CT scanner, reconstruction methods and segmentation algorithms. The current authors segmentation algorithm, which utilised a radial-spike method, was shown to underestimate lumen area and overestimate wall area in a size-dependent manner, which is in keeping with results of previous studies by the current authors and others. Having an unbiased three-dimensional method will allow researchers to obtain important data on airway lengths, lumen areas and wall areas that are useful, for instance, in modelling studies and in longitudinal treatment studies. The close agreement between morphometry and micro-computed tomography measurements suggests that the two methods are comparable for use as two-dimensional standards. The ultimate goal is that a standard calibration method can be used between research groups to address differences in airway measurements due to different hardware, scanning methodology and reconstruction, and segmentation algorithms.
This article has been cited by other articles:
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||