Abstract
The aim of the present study was to apply hyperpolarised (HP) 3He magnetic resonance imaging (MRI) to identify patients with chronic obstructive pulmonary disease (COPD) and α1-antitrypsin deficiency (α1-ATD) from healthy volunteers and compare HP 3He MRI findings with high-resolution computed tomography (HRCT) in a multicentre study. Quantitative measurements of HP 3He MRI (apparent diffusion coefficient (ADC)) and HRCT (mean lung density (MLD)) were correlated with pulmonary function tests.
A prospective three centre study enrolled 122 subjects with COPD (either acquired or genetic) and age-matched never-smokers. All diagnostic studies were completed in 94 subjects (52 with COPD; 13 with α1-ATD; 29 healthy subjects; 63 males; and 31 females; median age 62 yrs). The consensus assessment of radiologists, blinded for other test results, estimated nonventilated lung volume (HP 3He MRI) and percentage diseased lung (HRCT). Quantitative evaluation of all data for each centre consisted of ADC (HP 3He MRI) and MLD measurements (HRCT), and correlation with forced expiratory volume in 1 s (FEV1)/forced vital capacity (FVC) indicating airway obstruction, and the diffusing capacity of the lung for carbon monoxide (DL,CO) indicating alveolar destruction.
Using lung function tests as a reference, regional analysis of HP 3He MRI and HRCT correctly categorised normal volunteers in 100% and 97%, COPD in 42% and 69% and α1-ATD in 69% and 85% of cases, respectively. Direct comparison of HP 3He MRI and CT revealed 23% of subjects with moderate/severe structural abnormalities had only mild ventilation defects. In comparison with lung function tests, ADC was more effective in separating COPD patients from healthy subjects than MLD (p<0.001 versus 0.038). ADC measurements showed better correlation with DL,CO than MLD (r = 0.59 versus 0.29).
Hyperpolarised 3He MRI correctly categorised patients with COPD and normal volunteers. It offers additional functional information, without the use of ionising radiation whereas HRCT gives better morphological information. We showed the feasibility of a multicentre study using different magnetic resonance systems.
- Apparent diffusion coefficient
- chronic obstructive lung disease
- emphysema
- high-resolution computed tomography
- hyperpolarised helium-3 magnetic resonance imaging
Chronic obstructive lung disease (COPD) is characterised by irreversible airflow obstruction 1. The prevalence of COPD is increasing, largely as a result of the combined effects of smoking on an ageing population, and this realisation has only become more urgent over the past decade 2, 3. Furthermore, the prevalence of this disease is affecting younger people, leading to early disability, high healthcare costs and loss of economic contributions 3, 4. The impact of COPD on society as a whole is therefore highly significant 4 and it currently is the costliest disease within the western world, costing the UK National Health Service £2,576 million (\#8364;3.7 million) in 2000 5.
Treatment options are relatively limited. The foremost drive has been to reduce smoking, and prevention of the disease is an important step forward. However, the effects of such behavioural changes take >25 yrs to become visible; we are currently faced with the smoking history of patients, who are largely over the age of 45 yrs 3, 5. Bronchodilatory and anti-inflammatory inhaled drugs are of limited value in lungs that are already severely damaged or destroyed. Lung volume reduction surgery has been shown to improve the quality of life of patients but long-term survival effects have been disappointing and significant mortality and morbidity has been reported 6, 7. In the National Emphysema Treatment Trial, it was shown that selection of patients was possible using relatively simple high-resolution computed tomography (HRCT) based methods to demonstrate upper lobe predominance of disease 8. Newer, less invasive methods of emphysema treatment via bronchoscopy are being tested 9, 10.
In order for COPD patients to receive better treatment, both early identification and the linking of imaging with lung function are generally considered important. It is clear that these two diagnostic pathways offer different, complementary results. In clinical practice, most physicians rely on lung function tests, even though these are global measures and probably not sensitive enough to detect minor changes that may occur due to the disease process or in response to (novel) therapies. HRCT has been used to better define and quantify both the distribution and extent of emphysema 8, 11–15. Indeed, quantification of emphysema has been shown to be quite accurate and reproducible, allowing use in the assessment of disease extent and possibly treatment response 13–15. However, there are concerns over the use of ionising radiation, particularly if scanning is to be performed at a younger age for early detection purposes or where longitudinal follow-up studies are required in chronic disease states, and therefore there is a potential application for magnetic resonance (MR)-based techniques.
Proton magnetic resonance imaging (MRI) has (limited) capabilities in imaging COPD but novel techniques are being tested that have additional contrast capabilities and these have shown promising results 16–20. Hyperpolarised (HP) 3He MRI has been developed over the past decade and has increasingly been introduced into clinical studies to assess the diagnostic performance in emphysema 21–25. Although various techniques are under development, focus thus far has been on demonstration of qualitative or semi-quantitative ventilation distribution using a single breath-hold of a mixture of HP 3He and nitrogen gas and on assessment of “alveolar size” by measuring gas diffusivity (apparent diffusion coefficient (ADC)). In particular, the ADC measurements have shown to be sensitive for detection of micro-structural changes related to the most distal airways 26–29, including gravity effects 30, lung development 31 and ageing 32. Most studies, however, have been relatively small and multi-centre studies have been rare 29. Furthermore, the availability of HP 3He gas has hampered wider evaluation of the technology and introduction of this technique into the clinical arena.
This multi-centre study aimed to assess the diagnostic capabilities of HP 3He MRI in healthy volunteers and patients with acquired or genetic forms of COPD. A secondary aim was to compare the potentially useful quantitative parameters of HP 3He MRI (ADC) with HRCT (mean lung density (MLD)) and correlation of these parameters with results from pulmonary function tests (PFT; including forced expiratory volume in 1 s (FEV1)/forced vital capacity (FVC) and diffusing capacity of the lung for carbon monoxide (DL,CO)). Finally, this study aimed to demonstrate the feasibility of performing multicentre studies with distant sites and a central 3He gas production facility.
MATERIALS AND METHODS
Study design
The study was performed in three centres, using a central gas production facility at the Institut für Physik (Mainz, Germany). Highly polarised 3He gas was produced, shipped to the imaging site by road and/or air, and used for imaging at polarisation levels ranging from 40% to 70% 33, 34. This facility has obtained a European Union manufacturing license and regulatory approval for distribution of 3He gas for human imaging since the completion of this study. Individual approval was obtained from local Institutional Review Boards, and all patients gave informed consent prior to enrolment into the study.
All patient data were recorded at a central data processing centre. This allowed management of case−control subjects, according to age and sex distribution of target subjects with normal controls. All diagnostic tests were required to be performed within 1 week to reduce the potential influence of COPD exacerbations or other respiratory illness.
Study population
This prospective case−control study aimed to recruit normal volunteers and subjects with COPD with an additional special subgroup of subjects with α1-antitrypsin deficiency (α1-ATD). COPD was defined according to the guidelines of the European Respiratory Society (ERS) as subjects with chronic chest symptoms with FEV1/FVC <70% predicted, FEV1 <80% and reversibility of <12% and <200 mL, and asthmatics were specifically excluded from this study based on history or reversibility as above 34. Subjects were restricted by age, only allowing participants who were >50 yrs. Smoking history did not serve as an inclusion criterion for patients. Normal volunteers were never-smokers (defined as fewer than 5 pack-yrs and stopped ≥20 yrs prior to recruitment into the study), >50 yrs with FEV1 >80% predicted and normal HRCT. Normal volunteers were recruited to match the age and sex of the patient group. Patients who underwent chest CT for other reasons and met the criteria for normal volunteers were also eligible for this group. Exclusion criteria were: pregnancy or breast feeding, myocardial infarction or stroke within the past 6 months, a history of pulmonary infection within the prior 6 weeks and contraindication to MRI (pacemaker, ferromagnetic implants, etc.) as well as reversibility of airflow obstruction.
PFTs
PFTs were performed using standard spirometry and body plethysmography according to the recommendations of the ERS 35. Patients underwent flow−volume reversibility testing 15 min after inhalation of 1.0 mg terbutaline. An increase in FEV1 by >12% and >200 mL was considered significant reversibility of airflow obstruction and resulted in exclusion from the study. An extensive series of measurement values was obtained, but we focused on the following main parameters: FEV1/FVC as an index of airway obstruction, and DL,CO as an index of alveolar destruction. The latter was available in 83 of these 94 subjects.
COPD was categorised as follows. 1) No COPD as normal spirometry with or without chronic chest symptoms; 2) mild COPD as FEV1/FVC <70%, FEV1 >80% pred with or without chronic chest symptoms (representing Global Initiative for Chronic Obstructive Lung Disease (GOLD) stage I); 3) moderate COPD as FEV1/FVC <70%, FEV1 between 30% and 80% pred with or without chest symptoms (representing GOLD stages II and III); 4) severe COPD as FEV1/FVC <70%, FEV1 <30% pred or presence of respiratory failure or clinical signs of right heart failure (representing GOLD stage IV) 1.
MRI
Ventilation distribution was evaluated using HP 3He gas density distribution MRI using three 1.5 T MRI systems (two Siemens Magnetom Vision (Erlangton, Germany) and one Philips Eclipse (Cleveland, OH, USA)) in the three study centres (Copenhagen (Denmark), Mainz (Germany) and Sheffield (UK), respectively) tuned to the 3He Larmor frequency of 48 MHz using specifically designed radio frequency coils (two Fraunhofer Gesellschaft (St Ingbert, Germany) and one Medical Advances (Milwaukee, WI, USA)). Patients were scanned in the supine position at breath-hold following the administration of 200–300 mL HP 3He gas through a face mask under control of a microprocessor controlled delivery device developed at one of the participating institutions 36. The 3He bolus was followed by additional volume of room air to circumvent anoxia and to reach full inspiration. The following sequences for morphologic 3-He imaging were used on the Philips Eclipse 1.5T MRI system (Sheffield): 18 s breath-hold sequence (field of view (FOV) 42 cm, 128×128 matrix, time to echo (TE)/repetition time (TR)/flip angle of 2.5 ms/7 ms/9°, 19 coronal slices with a slice thickness of 10 mm). The Copenhagen and Mainz sites used Siemens Vision 1.5T systems: 12.5 s breath-hold (FOV 34×34 cm, 81×128 raw data matrix, TE/TR/flip angle of 4.2 ms/11 ms/<10°, 14 coronal slices with a slice thickness of 10 mm).
Diffusion imaging was performed to evaluate small airspace morphology using the following sequence (with some adaptation according to MR systems): a 12 s breath-hold sequence (FOV 47×47 cm, raw data matrix 64×128, TE/TR/flip angle 10.7 ms/16.1 ms/3°, three transverse slices with a slice thickness of 20 mm at the level of the carina, 3 cm above and 5 cm below). The diffusion gradient has the following parameters: τ = 300 μs, δ = Δ = 2,300 μs, and Gdiff = 12 mT·m−1, resulting in a b-value of b = 3.89 s·cm−2. The diffusion gradient was switched on along the x-, y- and z-axis to obtain ADC values in all three directions. From the images, a map of the mean ADC in the axial plane was calculated as described previously 28. A signal-to-noise ratio threshold of >5 was accepted for inclusion into the study.
HRCT
For CT, a high-resolution algorithm was applied and 1-mm axial slices at 10-mm intervals were reconstructed during full inspiration and full expiration. In normal volunteers, only three slices were acquired (3 cm above the carina, at the carina and 5 cm below the carina), while the entire chest was imaged in COPD and α1-ATD groups.
Image analysis
All data were de-identified and sent to a central image database at one of the centres, where entry criteria were verified. These images were presented to a panel of at least three chest radiologists from the three involved sites during four reading sessions. The radiologists were blinded to clinical details and evaluation was by consensus. The cases were read digitally in repetitive series of five randomly selected HP 3He MRI investigations followed by five randomly selected HRCT studies at preset window setting (W/L 1000/-800).
For both HP 3He MRI and HRCT the severity of ventilation impairment was visually estimated as a percentage of nonventilated (HP 3He MRI) or diseased lung (HRCT), respectively. For HP 3He MRI, nonventilated lung was estimated as the difference between the ventilated lung, which is clearly demonstrated by HP 3He MRI, and the whole volume which is estimated from the presumed shape of the lungs within the thorax based on anatomical knowledge by the radiologists. For HRCT, diseased lung was defined as areas showing visually determined airway wall thickening and/or emphysematous destruction compared with the whole lung volume with an estimation of affected lung segments. The HRCT disease extent was estimated based on the available slices. A real volumetric assessment was not possible due to restrictions of the imaging protocol. A semi-quantitative score was adopted for this process, ranging from normal to severe, which was applied for HP 3He MRI and HRCT: 1) normal <10%; 2) mild 11–30%; 3) moderate 31–60%; and 4) severe >60%.
In addition, quantitative evaluations were performed. For HP 3He MRI, the mean ADC of the whole lung was calculated from the diffusion imaging, which was successfully obtained from 84 subjects 28. At HRCT, MLD was determined from 92 subjects, using previously described methods 37.
Statistical analysis
Visual analysis
The confirmatory analysis of the trial intended to assess the agreement between the parallel 3He MRI and the CT findings based on respective binary classifications into “none or mild COPD” versus “moderate or severe COPD” findings. The descriptive agreement analysis therefore derived the relative frequencies of concordant and discordant parallel classifications; the significance evaluation of the latter was based on a McNemar test to evaluate the order of discordant findings. Furthermore Cohen's κ coefficient (with asymptotic 95% CI) was estimated to quantify the parallel classifications' agreement 38.
An extensive exploratory analysis then contrasted the underlying four stage visual 3He MRI and CT evaluations in terms of relative frequencies; in addition the latter were contrasted with the patients' status as indicated by lung function assessments.
Quantitative analysis
A second stage of this exploratory analysis concentrated on the respective continuous parameterisation of lung function assessments, 3He MRI and CT. Medians and quartiles for ADC and MLD were used for the description of these continuous endpoints; stratification for patients' health status and binary ratings were also based on medians and quartiles, graphical evaluations were based on nonparametric box whisker plots, accordingly.
Exploratory significance tests were summarized in terms of p-values, where p<0.05 was declared as an indication of locally statistical significance. The comparison of patient samples along continuous endpoints was based on global Kruskal−Wallis tests and pair-wise two sample Wilcoxon tests. The comparison of samples along categorical endpoints was based on exact Fisher tests.
The pair-wise correlation between continuous endpoints was estimated by means of partial Spearman correlations controlling for patients' health status.
All numerical and graphical analyses were performed using SPSS software (release 12.0 for Windows; SPSS, Chicago, IL, USA).
Sample size considerations
The overall investigation was intended to evaluate a fraction of 30% discordant rating patterns between the binary 3He MRI and CT visual evaluations by means of a two-sided McNemar tests at the significance level 5%. To achieve a minimum power of 80% under the assumption of a rate difference of ≥15% in positive findings between MRI and CT, a minimum sample size of 98 patients must be analysed.
RESULTS
A total of 122 patients were enrolled into the study. Six patients were excluded as they did not comply with inclusion criteria on review. Three patients withdrew their consent during the course of the study. MRI was not completed in 15 participants. This was due to patient-related factors in 10 cases (five had claustrophobia, two were obese and three were in an unstable condition/dyspnoea) and technical failure in five cases. In three MRI exams, the obtained signal-to-noise ratio was considered insufficient to allow for evaluation (inadequate study). In one case, CT was only available as a hardcopy, which neither allowed for sufficient image quality nor allowed for consistent image windowing. Finally, the data of 94 subjects were eligible for further evaluation with full analysis of MRI, CT and PFT. The study population consisted of 52 patients with COPD (two (4%) nonsmokers), 13 with α1-ATD (12 (92%) nonsmokers), and 29 healthy subjects (all nonsmokers). There were 63 males and 31 females, with a median age of 62 yrs (range 50–79 yrs). In 95% of subjects, the imaging procedures were performed on the same day, while PFT testing was performed within one day in nearly all subjects.
No severe adverse reactions were observed and no medical interventions were required in any of the subjects studies. During the MRI investigation, pulse oximetry measurements did not show any critical decrease.
Demographics of the study cohort, PFT results and their distribution across the three study centres are summarised in tables 1⇓ and 2⇓. The tables also provides a comparison of the data between the recruited cohort of 116 subjects and the sample actually eligible for further evaluation (n = 94). As tables 1⇓ and 2⇓ illustrate, the necessary exclusion of subjects was not found to introduce a selection bias in the analysis cohort. Furthermore, the cofactors' distributions among the three study sites centres did not differ significantly or relevantly. However, as an effect of easier access to HP 3He gas, the radiology centre closest to the gas production centre recruited a slightly larger number of patients and healthy subjects than the other centres, as the logistics and organisation of imaging was slightly more flexible.
Sociodemographic characteristics and lung function categorisation of 116 subjects, who all met the inclusion criteria, across study sites
Sociodemographic characteristics and lung function of the cohort of 94 subjects, who were eligible for further evaluation, across study sites
COPD categorisation according to GOLD criteria revealed that 28 of the 29 healthy nonsmokers had normal lung function, whereas by our definition one subject had mild COPD. Of the 52 COPD patients, three (6%) had mild, 39 (75%) moderate and 10 (19%) severe COPD, and of the 13 α1-ATD patients one (8%) had mild, 10 (77%) moderate and two (15%) severe COPD.
Visual analysis
From HP 3He MRI, 20 (69%) of the healthy nonsmokers had normal ventilation distribution, and nine (31%) had mild ventilation defects. Of the COPD and α1-ATD patients, this distribution was one (2%) and 0 (0%) normal, 29 (55%) and four (31%) mild, 18 (35%) and eight (61%) moderate and four (8%) and one (8%) severe ventilation impairment, respectively.
From HRCT, 20 (69%) of the healthy nonsmokers had normal, eight (28%) mild and one (3%) moderate pathological findings. For the COPD group, two (4%), 14 (27%), 23 (44%) and 13 (25%) had normal, mild, moderate and severe HRCT abnormalities, while for α1-ATD patients these figures were 0 (0%), two (15%), six (46%) and five (39%), respectively.
For further evaluation, subjects were categorised in two groups as having normal or GOLD category 1 versus clearly impaired results (GOLD categories 2, 3 or 4) for PFT and normal or minimal versus significant abnormalities on HP 3He MRI and HRCT. Table 3⇓ demonstrates the resulting fraction of subjects with clearly pathological findings in the respective diagnostic tests. PFTs were used to define the pulmonary function status and classification of patients, and are therefore the reference method. The direct comparison of HP 3-He MRI with HRCT demonstrated that both were capable of identifying healthy subjects very well (fig. 1⇓), but performed poorly in recognising patients as having pathological results when PFTs were used as the reference. At the same time HRCT appeared slightly better than HP 3He MRI in its ability to classify normal and diseased patients (figs 1⇓ and 2⇓). To elucidate the role of both modalities a direct comparison was performed. Significantly more (McNemar p = 0.002) HP 3He MRI (67%) studies were categorised as “none or mild” disease based on the amount of ventilatory abnormalities with corresponding CT findings categorised as none or mild in 44% and moderate or severe in 23%; conversely both tests demonstrated moderate or severe disease in 28% and CT was relatively normal in 5% of patients with moderate or severe 3He ventilation defects. Accordingly the overall agreement analysis between CT and MRI findings resulted in a κ estimate of 0.43 (95% CI 0.38–0.48) indicating moderate agreement between the binary ratings. A further example of direct comparison between 3He MRI and CT is demonstrated in figure 3⇓ and 4⇓.
62-yr-old healthy male subject with normal lung function (forced expiratory volume in 1 s 114%). a) Normal hyperpolarised 3He magnetic resonance imaging (9% nonventilated lung) and b) normal high-resolution computed tomography (8% diseased lung).
69-yr-old healthy male subject with normal lung function (forced expiratory volume in 1 s 88%). a) Hyperpolarised 3He magnetic resonance imaging grade mild (20% nonventilated lung) and b) high-resolution computed tomography grade mild (15% diseased lung).
62-yr-old female patient with moderate α1-antitrypsin deficiency (forced expiratory volume in 1 s 43%). a) Hyperpolarised 3He magnetic resonance imaging grade mild (25% nonventilated lung) and b) high-resolution computed tomography grade severe (65% diseased lung). This disparity suggests that ventilation is still taking place in extensive morphologically destroyed lung.
65-yr-old male patient with chronic obstructive pulmonary disease grade severe (forced expiratory volume in 1 s 23%). a) Hyperpolarised 3He magnetic resonance imaging grade moderate (45% nonventilated lung) and b) high-resolution computed tomography grade moderate (50% diseased lung).
Categorisation of the subjects with clear pathological findings in the three diagnostic tests' groups as percentage of the chronic obstructive pulmonary disease (COPD), α1-antitrypsin deficiency (α1-ATD) and healthy cohorts
Tables 4⇓ and 5⇓ demonstrate the correlation between CT and MRI as an overall assessment in subjects with COPD (table 4⇓) and in normal subjects (table 5⇓). Clearly, there is a shift towards normal as expected, but also for both categories the actual agreement between the two imaging modalities is 18 (35%) out of 52 in the COPD group and 19 (66%) out of 29 in the healthy group. Upon closer inspection, 8 (15%) out of 52 and 1 (3%) out of 29 subjects have a disagreement by greater than one category, respectively.
Cross correlation of computed tomography(CT) and hyperpolarised (HP) 3He magnetic resonance imaging (MRI) for the 52 subjects with chronic obstructive pulmonary disease demonstrating imperfect agreement between the two imaging modalities
Cross correlation of computed tomography(CT) and hyperpolarised (HP) 3He magnetic resonance imaging (MRI) for the 29 healthy subjects demonstrating imperfect agreement between the two imaging modalities
Quantitative analysis
The second stage of the evaluation was based on quantitative readouts of both imaging modalities as figure 5⇓ illustrates the ADC measurement from HP 3He MRI, and MLD from HRCT. ADC measurements provided a statistically significant differentiation (Wilcoxon p<0.001) in healthy subjects (median 0.17 cm2·s−1) versus COPD patients (median 0.28 cm2·s−1) and α1-ATD patients (median 0.29 cm2·s−1). There was no significant difference between COPD and α1-ATD patients. However, the MLD distributions were similar in healthy subjects (median MLD -855 HU) and COPD patients (median MLD -865 HU; Wilcoxon p = 0.038) from a clinical perspective (fig. 5b⇓); conversely α1-ATD patients showed lower values (median MLD -892 HU; Wilcoxon p<0.001 versus healthy and COPD subjects). After adjustment for PFT status, partial Spearman correlation coefficients demonstrated moderate correlations between FEV1/FVC as an indicator of airway obstruction with both ADC and MLD (r = 0.50 and 0.52, respectively), and between DL,CO as an indicator of alveolar destruction with ADC and MLD (r = 0.59 and 0.29, respectively).
Box whisker plots for the distributions of a) the apparent diffusion coefficient (ADC) and b) the mean lung density stratified for the study subjects' health status. Horizontal lines indicate medians and quartiles, vertical lines indicate maximum and minimum observations. COPD: chronic obstructive pulmonary disease; α1-ATD: α1-antitrypsin deficiency. •: statistical outliers.
DISCUSSION
The results of this multicentre study, the largest of its kind to date, showed the feasibility of performing this rather complex study in a multinational, multicentre fashion using a central production facility. This obviously is an important step forward towards the application of 3He MRI.
The study demonstrates that there is a significant difference in information obtained from PFT and HP 3He MRI compared with HRCT. In particular, it appears that ADC measurements using HP 3He MRI are more closely correlated to functional information (DL,CO) obtained through routine PFTs and CO diffusion capacity assessments. In addition, there are significant differences between ventilation distribution images and HRCT images. These observations could be explained by the fundamental differences between HRCT and HP 3He MRI imaging. The latter relies on gas inflow into the lungs for visualisation of the airways and the size of the (distal) pulmonary airspaces 39 and therefore is closer in nature to the breathing manoeuvres used in PFTs. As the visual assessment of both HP 3He MRI and CT was performed using categorical scores, and not continuous variables, comparison is by definition a little more difficult. However, quantitative assessment using ADC and MLD, can be evaluated in greater detail because they are continuous variables.
Although not affecting the direct comparison between HRCT and MRI, one needs to also consider that these imaging modalities allow for regional information. However, as a comparison with PFTs were necessary (as the reference method), this regional information was lost as an averaging process was needed to allow for statistical comparison. Healthy lung parenchyma compensates (at least in part) for lack of parenchymal function in diseased lung states, usually by compensatory hyperinflation and/or redirection of perfusion to enhance ventilation−perfusion matching and gas exchange mechanism. Regional functional HP 3He MRI is able to distinguish some of the compensatory mechanisms, demonstrating redistribution of ventilation in the presence of ventilation defects. Thus, the dichotomy that is forced onto these tests by comparison with PFTs and biostatistical sample size calculation, as well as the arbitrary classification of level of disease based on the visual estimates, are likely resulting in under and over reporting of abnormalities in the subgroups that were studied using imaging methods. Thus, when combining the differences between the imaging and PFTs, one may argue that PFTs are (by their very nature) a sub-standard reference method in the assessment of novel imaging techniques of the lung.
This study aimed to compare two imaging modalities in a cohort of patients, who were grouped and defined according to PFTs. A potential problem is that the imaging tests may be more sensitive to (regional) changes in the lung than the global results obtained by PFT; this discrepancy obviously means that a comparison will be skewed from the outset. Nevertheless, there is significant correlation for both methods, thus allowing for both tests to be of use in the assessment and distribution analysis of emphysema. This information may become relevant for the planning and guidance of regional emphysema treatment, such as interventional bronchoscopic valve placement and other new management strategies 10. Furthermore, new pharmacological treatment trials may benefit from regional functional tools, as these appear to be more sensitive and may therefore reduce the sample size as outcome may be based on these surrogate measures.
There is some validity to separate normal/mild COPD from moderate/severe COPD as an outcome parameter. First, the GOLD criteria have been shown to be reproducible in multiple centres, and they have been widely adapted. Secondly, treatment will very likely be different, with medical therapy predominantly of use in earlier disease, while surgical or other interventional treatment will be sought in more severe emphysema. Therefore, any imaging test should be capable of identifying those that require medical treatment versus those that need surgical or interventional treatment. However, one needs to realise that the chosen analysis with global values was required in order to compare HP 3He MRI and HRCT with the accepted reference standard (PFTs). This limitation may not be as critical once these tests are applied for clinical practice or for assessment of treatment response, where they can serve as independent (and regional) outcome measures. In addition, the trade-off between sensitivity versus specificity is largely driven by the potential for treatment in those patients that are diagnosed early. Thus, as long as there is no effective early treatment available, this is a decision that cannot currently be made.
The correlation between HRCT and ventilation images was probably influenced by the way in which the visual assessment was designed. It has become clear that visual assessment can be less accurate than computer-based assessment, both for MRI 40 and for CT 13–15. The study protocol did not intend for such an analysis to take place. In retrospect, the additional performance of a proton MRI sequence prior to HP 3He MR imaging would have enabled the performance of a later proposed method 40. This method uses a proton MRI sequence as a mask for the subsequent subtraction of signal from HP 3He MRI, and would have been able to demonstrate signal-based ventilated lung volumes bases on absolute pixel values 40. In addition, with the advancement of CT scanner technology and the subsequent changes in CT protocols, it is now possible to allow full volumetric lung assessment with histogram analysis and determination of extent of emphysema based on various density cut-off values 15. A recent study in an emphysema-induced canine model compared volumetric CT with 3 mm slice thickness and 3He diffusion MRI and demonstrated very good correlation between the two modalities 41. A further study in asthmatic patients also demonstrated good correlation between MDCT findings of airway morphology and expiratory air trapping and the presence of ventilation defects 42. This discrepancy of our relative lack of agreement compared with this literature may well be related to the fact that we used traditional HRCT methods (with axial 1-mm slices and 1-cm gaps), thus reducing the ability to obtain better comparative images.
A second issue is that we were forced to compare axial CT slices with coronal MRI slices, again due to the traditional HRCT methodology of our study. It was feasible to obtain axial reconstructions of MRI (as these were volumetrically acquired images), but this comparison was not perfect. Nevertheless, by trying to score pathological findings based on segmental involvement, a reasonable correlation would have been expected if the two tests were demonstrating the same pathology.
The apparent diffusion coefficient appears to be most closely related to functional parameters, in particular DL,CO. This is not unexpected, as it is a measurement of microstructure of the lung, which is totally dependent on ventilation taking place to the regions under review. As a result, mean ADC will probably underestimate the amount of destroyed lung tissue, as inflow obstruction in COPD will prevent ADC measurement, and some of the most destroyed emphysematous lung areas will therefore be excluded from the analysis. In spite of this limitation, the ADC was more closely related to PFTs than HRCT. Ongoing analysis is taking place to assess regional differences between the two imaging methods. One should, however, note that the ADC depends on the level of inspiration which was not spirometrically controlled in the present study. A further point of note is that the measured 3He ADC values are dependent upon pulse sequence timing of the gradients and their associated diffusion weighted b-values 43. In this study relatively strong b-values b = 3.89 s·cm−2 were used to achieve a strong diffusion weighting and, thus, a high sensitivity for regional differences in the ADC values. This approach, however, results in a complete loss of MRI signal in the trachea and grossly enlarged alveolar spaces. In the ADC pulse sequence, diffusion gradients were applied subsequently on all axes. An earlier study which evaluated a subset of the volunteer data presented here has shown that in normal patients no differences are observed between the ADC values obtained in the three directions 29. Moreover, it has been shown that no differences between the ADC values are observed if an identical pulse sequence is used on the same MRI scanner model at two different study centres.
In the majority of other studies published in COPD with 3He MRI 27, 32 a lower b-value of b = 1.6 s·cm−2 on a single axis was used. Care should be taken when making a direct comparison of our ADC results with these smaller studies as the higher b-values used here may give a lower ADC 44. In future studies some international consensus on standardisation of b-value for ADC measurement would be worthwhile. Nevertheless for inter patient comparison in this multicentre study the b-values used and ADC values obtained are consistent between sites.
Although this study underwent an a priori statistical sample size calculation to achieve sufficient power of the overall evaluation, it must be emphasised, that some of the above results are due to exploratory analyses and therefore may suffer from loss in statistical power. During the planning phase of the trial, the investigation was not designed to thoroughly stratify results for the patients' disease status (smokers, COPD or α1-ATD), but should rather derive an “overall” agreement statement between CT and 3He-MRI based diagnostic ratings. As a consequence, the overall trial's sample size was designed for an overall agreement analysis at an 80% statistical power level without any multiplicity consideration. However, during initial descriptive analyses, severe sub-sample heterogeneity according to the patients' disease status was observed. A stratification of all clinically relevant results was therefore found necessary and was provided throughout the evaluation. In fact, the resulting sub-sample evaluations will suffer from a loss in statistical power, which cannot be corrected for by formal multiplicity correction, but could have only been taken into account for by interim sample size corrections. As these findings were due to descriptive, in some sense exploratory evaluations, the authors decided to display the numerical data in a corresponding manner. For example, tables 2⇑ and 3⇑ provide only stratified point estimates instead of formally multiplicity adjusted confidence intervals, which again would have afforded larger sample sizes in the first place. Should a study be aimed at a four-class comparison of these diagnostic tests, a study population of >200 subjects would be necessary and this was too much to achieve in the setting of this multicentre study using this new technology.
Finally, although not formally assessed in the present study, a possible issue that needs further evaluation is the repeatability of 3He MRI, not only in normal subjects, but also in those with asthma and COPD. At least two recent studies have directly assessed this issue. One study in asthmatic patients demonstrated that ventilation defects in asthma patients tended to be quite persistent in spite of optimal treatment, even though a decrease in the extent of the ventilation defects was observed in most patients 45. A further study in patients with COPD demonstrated excellent same-day and 1-week follow-up repeated measurements for ADC measurements, whereas measurements aimed at ventilation distribution were less reproducible 46.
In conclusion, the current study findings suggest that HRCT is capable of demonstrating the morphological (airways) characteristics of COPD (such as bronchial wall thickening and bronchiectasis). However, HP 3He MRI appears slightly more closely related to functional measures, and complements HRCT with its capability to quantify ventilation within the lung regions, as well as apply diffusion measurements to allow for micro-structural changes on a spatial scale smaller than the resolution of HRCT. The techniques thus offer quite different, complementary information, which each may be useful for treatment planning in different contexts. The exact roles of these two imaging techniques need to be established using management studies.
Support statement
Funding for the present study was provided by the European Union, Framework 5, Polarized Helium Imaging of the Lung project (IST-2000-31559).
Statement of interest
Statements of interest for E.J.R. van Beek and H-U. Kauczor can be found at www.erj.ersjournals.com/misc/statements.dtl
Acknowledgments
The following people participated in the conduct of this study. E. Otten, W. Heil and F. Filbir (Johannes Gutenberg University, Institut für Physik) and F. Knitz and N. Weiler (Dept of Anesthesiology, Mainz, Germany); L. Hanson, K. Markenroth, J. Mortensen, A. Dirksen, A. Kiil Berthelsen, E. Frausing and J. Vestbo (Danish Research Centre for Magnetic Resonance, Copenhagen University Hospital, Hvidovre, Denmark); G. Mills (Depts of Anesthesiology), Rod Lawson (Respiratory Medicine) S. Fleming, S. Fichele, N. Woodhouse, A. Swift, M. Paley and P. Griffiths (Academic Radiology, University of Sheffield, Sheffield, UK)
- Received September 9, 2008.
- Accepted June 11, 2009.
- © ERS Journals Ltd