Abstract
Introduction Pulmonary sarcoidosis is a rare heterogeneous lung disease of unknown aetiology, with limited treatment options. Phenotyping relies on clinical testing including visual scoring of chest radiographs. Objective radiomic measures from high-resolution computed tomography (HRCT) may provide additional information to assess disease status. As the first radiomics analysis in sarcoidosis, we investigate the potential of radiomic measures as biomarkers for sarcoidosis, by assessing 1) differences in HRCT between sarcoidosis subjects and healthy controls, 2) associations between radiomic measures and spirometry, and 3) trends between Scadding stages.
Methods Radiomic features were computed on HRCT in three anatomical planes. Linear regression compared global radiomic features between sarcoidosis subjects (n=73) and healthy controls (n=78), and identified associations with spirometry. Spatial differences in associations across the lung were investigated using functional data analysis. A subanalysis compared radiomic features between Scadding stages.
Results Global radiomic measures differed significantly between sarcoidosis subjects and controls (p<0.001 for skewness, kurtosis, fractal dimension and Geary's C), with differences in spatial radiomics most apparent in superior and lateral regions. In sarcoidosis subjects, there were significant associations between radiomic measures and spirometry, with a large association found between Geary's C and forced vital capacity (FVC) (p=0.008). Global radiomic measures differed significantly between Scadding stages (p<0.032), albeit nonlinearly, with stage IV having more extreme radiomic values. Radiomics explained 71.1% of the variability in FVC compared with 51.4% by Scadding staging alone.
Conclusions Radiomic HRCT measures objectively differentiate disease abnormalities, associate with lung function and identify trends in Scadding stage, showing promise as quantitative biomarkers for pulmonary sarcoidosis.
Abstract
Radiomic measures identify pulmonary parenchymal abnormalities in sarcoidosis and are highly associated with lung function, suggesting that radiomics could enhance visual reads and result in improved patient profiling, disease staging and monitoring. http://bit.ly/2HMLaKm
Introduction
Pulmonary sarcoidosis is a rare heterogeneous disease of unknown aetiology characterised by the formation of granulomas in the lungs, as well as other organs. Compromised lung function is common with pulmonary involvement [1], which can limit daily activities, and frequently indicates the need for therapy to reduce or reverse abnormalities. Spontaneous remission has been reported to occur in up to two-thirds of patients in certain populations [2], yet mortality rates due to respiratory failure are rising in the USA [3].
Assessment of chest radiography, in addition to lung function, is used to clinically monitor sarcoidosis, with chest radiographs showing abnormalities in 90% of cases [4]. The Scadding system of staging, based on chest radiographs, is a common visual classification for sarcoidosis, dividing the radiographic manifestations into five stages: stage 0 (no radiograph abnormalities), stage I (bilateral hilar lymphadenopathy (BHL)), stage II (pulmonary infiltration with BHL), stage III (pulmonary infiltration without BHL) and stage IV (pulmonary fibrosis with volume loss). Despite the numerical nomenclature, there is no sequential ordering of Scadding stages and subjects may demonstrate abnormalities in multiple stages at various times. Additional chest radiographic patterns may be observed that are not well characterised by Scadding stage, such as small nodular opacities along the bronchovascular bundle, focal consolidation, consolidation of small nodular opacities (conglomerate masses), ground glass and fibrosis, among others [2].
Recently, high-resolution computed tomography (HRCT) scans have been used to characterise pulmonary sarcoidosis manifestations [5–7]. Despite limited standard visual assessment tools for HRCT sarcoidosis manifestations [8–10], HRCT findings are still characterised visually, which is time consuming, dependent on the expertise of the reader and subject to poor inter-rater reliability [11]. Quantitative assessment of HRCT in sarcoidosis may provide a more rapid, objective and sensitive quantification of the various abnormalities that present in sarcoidosis.
Radiomics, an emerging field in which large numbers of quantitative imaging features are computed from medical images, has proved useful for developing quantitative biomarkers in emphysema [12–15], interstitial lung disease [16–18] and lung cancer [19–23]. To the best of our knowledge, radiomic measures have yet to be explored in sarcoidosis. Given the usefulness of radiomics in other diseases, we hypothesise that radiomic features will also prove useful as potential imaging biomarkers for sarcoidosis.
Here, we perform an early-stage investigation of radiomic measures as potential biomarkers in sarcoidosis, where “early stage” references the biomarker study pipeline suggested by Pepe et al. [24]. Our goal is to investigate whether specific radiomic features on lung HRCT differ between sarcoidosis subjects and healthy controls (figure 1 and table 1). In particular, we compute global and spatially varying radiomic features, and assess how these measures differentiate sarcoidosis subjects from healthy controls and between Scadding stages. Lastly, we relate radiomics and lung function to show the potential of radiomic measures as biomarkers for sarcoidosis.
Methods
Study populations and data acquisition
Supplementary section E1 includes a detailed description of the study populations and imaging acquisition. In brief, the sarcoidosis population (n=79) was recruited at National Jewish Health (Denver, CO, USA) as part of the National Heart, Lung, and Blood Institute-funded Genomic Research in Alpha-1 Antitrypsin Deficiency and Sarcoidosis (GRADS) study [25]. A nonsmoking, healthy control population (n=108) was obtained from the COPDGene study [26]. Subjects in GRADS were 18–85 years old with a confirmed diagnosis of sarcoidosis via biopsy or manifestations consistent with acute sarcoidosis (Löfgren's syndrome) [25]. Subjects in the COPDGene study were 45–80 years old with no history of lung disease and normal post-bronchodilator (BD) spirometry [27]. In both GRADS and COPDGene, pulmonary function testing was performed (including pre-BD forced expiratory volume in 1 s (FEV1) and forced vital capacity (FVC)) and a chest HRCT ordered according to the same imaging protocol [26], except for a difference in tube current average (GRADS: 180–330 mA varied by body mass index (BMI); COPDGene: 400 mA). According to Mackin et al. [28], we do not believe the difference in tube current will affect our results. 73 of the GRADS subjects had HRCT. Only COPDGene subjects scanned on machines from the same manufacturer (Siemens) as the GRADS subjects were analysed in this study (n=78). All subjects provided signed informed consent through either the GRADS or COPDGene studies.
Radiomic measures
Image data pre-processing and radiomics are described in supplementary sections E2 and E3. In brief, lung segmentation and radiomic calculations were performed using the lungct R package (https://github.com/ryansar/lungct). Five radiomic measures (skewness, kurtosis, fractal dimension [29], Moran's I [30] and Geary's C [31]) were pre-selected for evaluation in our study based on their statistical properties and success in previous studies for idiopathic pulmonary fibrosis and lung nodules [20, 32, 33]. The five radiomic measures were computed on every slice for every subject in each of the two lungs and three anatomical planes. To summarise the distribution of the radiomic features obtained for each subject, lung and plane, the median value was computed. Detailed descriptions of each measure along with directional clinical hypotheses for each measure can be found in figure 1 and table 1.
Statistical analysis
A detailed statistical analysis can be found in supplementary section E4. Descriptive statistics (e.g. mean and standard deviation for continuous variables; frequencies for categorical variables) were used to summarise demographic, radiomic and spirometry data (table 2). Linear regression models were used to test for differences in global radiomic measures between sarcoidosis subjects and controls, adjusted for age, sex and BMI (table 3, and supplementary tables E1 and E2). To evaluate whether disease status (i.e. sarcoidosis versus control) alters associations between lung function and global radiomic measures, linear regressions were fitted with an interaction between disease group and a global radiomic measure (figure 2, and supplementary tables E3 and E4). In secondary analyses, functional regressions using penalised smoothing splines [34] were fitted to determine whether there was a spatially varying association between subjects with and without sarcoidosis and a radiomic measure, i.e. whether certain regions of the lung were more different between sarcoidosis subjects and controls than other regions (figures 3 and 4, and supplementary figures E1 and E2). The analyses were repeated in sarcoidosis subjects to investigate potential trends in radiomic and lung function measures across Scadding stages. Results were considered significant at p<0.05.
Results
For brevity, the results presented here are for the left lung axial orientation, unless otherwise noted. In general, similar patterns were seen for the right lung, and coronal and sagittal anatomical planes (see supplementary material for more information).
Differences in nonradiomic characteristics between sarcoidosis subjects and controls, and across Scadding stages in sarcoidosis
Table 2 shows the characteristics of the subject population in this study. Compared with controls (n=78), sarcoidosis subjects (n=73) were younger (mean±sd 54.1±8.6 versus 64.5±8.3 years; p<0.001), with a higher BMI (mean±sd 30.1±7.0 versus 28.0±4.8 kg·m−2; p=0.029), larger population of White (84.9% versus 94.1%; p=0.002) and a meaningfully, but not significantly, higher percentage of males (49.3% versus 35.9%; p=0.133). Adjusted for sex, race, ethnicity, age and height, sarcoidosis subjects had significantly lower pre-BD FEV1 and FVC (p<0.001 for both) compared with controls, but a similar FEV1/FVC (p=0.465).
Subject characteristics were similar across the Scadding stages (p>0.05) (table 2), with differences observed in BMI and FEV1/FVC (p=0.040 and p<0.001, respectively). Stage IV had the lowest FEV1/FVC, while others had FEV1/FVC within a normal range.
Differences in global radiomic measures were observed between sarcoidosis subjects and controls
The mean±se of the global skewness was positive, but lower in sarcoidosis subjects compared with controls (3.17±0.09 versus 3.64±0.06; p<0.001), as was the global kurtosis (12.7±0.7 versus 16.7±0.6; p<0.001), global fractal dimension (2.404±0.004 versus 2.429±0.003; p<0.001) and global Geary's C (0.212±0.002 versus 0.226±0.002); the global Moran's I (mean±se 0.702±0.005 versus 0.691±0.004) was higher in sarcoidosis subjects (p<0.001) (table 3, and supplementary tables E1 and E2).
Associations were apparent between global radiomic measures and lung function, and differed between sarcoidosis subjects and controls
Kurtosis
The relationship between kurtosis and pre-BD FEV1 (figure 2a) differed between sarcoidosis subjects and controls (p=0.032); in sarcoidosis subjects there was a positive association between kurtosis and FEV1 (β±se 0.053±0.012 L; p<0.001); however, in controls there was no significant association between kurtosis and FEV1 (p=0.189). The relationship between kurtosis and FVC and FEV1/FVC was positive (figure 2b and c), and did not differ between sarcoidosis subjects and controls (p>0.087).
Geary's C
The association between Geary's C and pre-BD FEV1 (figure 2d), pre-BD FVC (figure 2e) and FEV1/FVC (figure 2f) differed between sarcoidosis subjects and controls (p<0.001, p=0.002 and p<0.001, respectively); in sarcoidosis subjects there were significant positive associations between Geary's C and FEV1, FVC and FEV1/FVC (FEV1: β±se 15.7±3.6 L; p<0.001; FVC: β±se 21.4±4.6 L; p=0.008; FEV1/FVC: β±se 2.11±0.42; p<0.001); however, in controls there were significant negative associations between Geary's C and FEV1 and FEV1/FVC (FEV1: β±se −10.9±4.5 L; p=0.016; FEV1/FVC: β±se −1.06±0.51; p=0.040).
Differences in spatial radiomic measures were noted between sarcoidosis subjects and controls
Radiomic measures differed between sarcoidosis subjects and controls throughout much of the lung across all three planes, as exhibited by separation of confidence bands in figure 3 and t-statistics greater than 1.96 in figure 4. Skewness (top row in figure 3, yellow in figure 4) and kurtosis (second row in figure 3, light green in figure 4) were significantly lower in sarcoidosis subjects compared with controls, with the largest significant differences observed in the superior and middle sagittal lung regions; smaller, but still significant differences were observed in the coronal plane. Fractal dimension (third row in figure 3, teal in figure 4) and Moran's I (fourth row in figure 3, dark blue in figure 4) were significantly higher, and Geary's C (bottom row in figure 3, dark purple in figure 4) significantly lower, in the sarcoidosis subjects, with the most significant differences in the superior and lateral lung regions; there were also large differences in fractal dimension in the coronal plane. See supplementary figures E1 and E2 for right lung results.
Radiomic measures of HRCT differed according to Scadding stage classifications
Global radiomic measures differed when patients were classified by Scadding stage, across all anatomical orientations, with the significant differences driven by stage IV (table 3). Not surprisingly, radiomic measures did not follow a sequential increasing or decreasing pattern across Scadding stages. In addition, the various radiomic measures did not follow the same patterns across Scadding stages in terms of which stages have higher (or lower) mean values compared with others. The means of the global skewness, kurtosis and fractal dimension were highest in stage I subjects and lowest in stage IV subjects (p=0.027, p=0.032 and p<0.001, respectively). For the global Moran's I, the mean±se was lowest in stage 0 subjects (0.667±0.013) and highest in stage IV subjects (0.729±0.008) (p<0.001). The mean±se Geary's C was highest in stage III subjects (0.229±0.005) and lowest in stage IV subjects (0.195±0.002) (p<0.001).
Importantly, in sarcoidosis subjects, global radiomic measures explained more variation in the lung function measures than Scadding stage measures. For pre-BD FEV1, Scadding stage along with sex, age and BMI explained 44.5% of the variability of pre-BD FEV1 compared with 67.7% with global radiomic measures and sex, age and BMI. Scadding stage along with sex, age and BMI explained 51.4% of the variability in pre-BD FVC, whereas global radiomic measures along with sex, age and BMI explained 71.1% of the variation in pre-BD FVC. For FEV1/FVC, Scadding stage, sex, age and BMI explained 26.9% compared with 40.2% of the variation explained with global radiomic measures, sex, age and BMI.
Discussion
To the best of our knowledge, this is the first radiomics analysis in sarcoidosis. We show the potential of radiomic measures as biomarkers for sarcoidosis, by 1) detecting differences in HRCT between sarcoidosis subjects and healthy controls, 2) finding associations between radiomic measures and lung function, and 3) identifying trends between Scadding stages. This study can be classified as phase II evidence in the developmental evidence of radiomic measures as biomarkers for sarcoidosis treatment and disease course prediction [24].
In our findings, global radiomic measures differed significantly between sarcoidosis subjects and controls in both the left and right lungs in each anatomical plane. The distribution of Hounsfield units on HRCT from sarcoidosis subjects is more normally distributed (i.e. less skew and kurtosis) compared with controls. This is due to increased opacification (i.e. whiter areas) on HRCT scans of sarcoidosis subjects, likely caused by parenchymal abnormalities, although it is not clear from our findings which abnormalities are contributing the most. These radiomic findings are consistent with findings via visual assessment that note increased opacification and parenchymal abnormalities on HRCT scans from sarcoidosis subjects [2]. Moreover, the significant differences in our second-order radiomic features (fractal dimension, Moran's I and Geary's C) show adjacent pixels are more similar (in terms of Hounsfield units) on HRCT scans from sarcoidosis subjects compared with controls. This is likely a result of nodule conglomeration and/or fibrosis on HRCT scans of sarcoidosis subjects, which is consistent with findings via visual assessment [2]. Furthermore, the spatial radiomic measures displayed significant geographic variation across the lung between sarcoidosis subjects and controls, with differences most apparent in the superior, mid-to-outer sagittal regions. These spatial findings suggest an upper and lateral lobe predominance of radiographic abnormalities in sarcoidosis, which is consistent with findings on visual assessment [35]. Thus, our radiomic findings are consistent with known visual abnormalities for pulmonary sarcoidosis. It is reassuring that these results support rather than conflict with visual scoring. A next step in our work will be to assess correlations between visual assessment and radiomic measures, and to use the two together to provide an integrative radiological assessment of pulmonary sarcoidosis.
Global radiomic measures were also significantly associated with spirometry. The association between radiomic measures and lung function was in general stronger among the sarcoidosis population. In sarcoidosis subjects, lower skewness and kurtosis, likely caused by more parenchymal abnormalities, were associated with lower spirometry. Moreover, lower fractal dimension, higher Moran's I and lower Geary's C, likely caused by nodule conglomeration and/or fibrosis, were associated with lower spirometry. It is particularly interesting that there are strong associations between radiomic measures and lung function considering that previous work measuring associations between lung function and visual scoring on HRCT scans from sarcoidosis subjects has not shown a consistent association [2]. The associations between radiomic measures and spirometry provide further evidence of the potential clinical utility of radiomic analyses in assessment of pulmonary sarcoidosis.
Based on Scadding stage classifications, we found that global radiomics differed significantly in both the left and right lungs in each anatomical plane, with much of the significant differences driven by Scadding stage IV. For skewness, kurtosis, fractal dimension and Geary's C, the most significant pairwise differences were between stage I and IV; for Moran's I, between stage 0 and IV. Not surprisingly, radiomic measures did not follow a sequential increasing or decreasing pattern across Scadding stages. In addition, the various radiomic measures did not follow the same patterns across Scadding stages in terms of which stages have higher (or lower) mean values compared with others. These findings are consistent with our knowledge regarding the nonsequential ordering of Scadding stages [2], and the potential for HRCT to provide more sensitive and specific information with regard to parenchymal abnormalities in pulmonary sarcoidosis. Additionally, these radiomic features appear to be sensitive to parenchymal abnormalities noted in Scadding stage IV, such as fibrosis and/or upper lobe volume loss with hilar retraction. Furthermore, we found that radiomic measures are a better predictor of spirometry than Scadding stage, which is consistent with other studies that note the poor predictive ability of Scadding stage with regard to lung function [36, 37].
Further investigation of the utility of radiomic measures as quantitative biomarkers for pulmonary disease, including lung function and other clinical characteristics, is warranted, especially given the ability to generate these measures in an automated fashion. A major benefit of a radiomic analysis is the automated computational efficiency and reproducibility, increasing the potential use of these methods in clinical settings. We analysed 151 segmented scans at ∼3 min per scan, which is arguably faster than visual assessment. Radiomic algorithms could be programmed into scanners to be available along with visual reads to further enhance a patient's image profile for sarcoidosis staging and disease monitoring.
Our study is not without limitations. This investigation was performed on a modest sample size. Repeating this analysis in a larger sarcoidosis population will be important to verify generalisability and reproducibility. Based on the way the COPDGene cohort was constructed, our control population was only nonsmokers. We do not believe that this introduced significant biases given the small proportion of smokers in the sarcoidosis population and because a sensitivity analysis removing the sarcoidosis subjects who smoked showed qualitatively similar results to the main results. Although the HRCT scans from both the sarcoidosis subjects and controls were obtained under very similar protocols, we note that a weakness of this study is that the sarcoidosis subjects and controls were scanned on various different Siemens HRCT machines with different tube currents. This potentially introduces biases due to scanner and protocol differences.
To conclude, our work identifies the usefulness of radiomics on HRCT by efficiently and objectively identifying pulmonary parenchymal abnormalities in sarcoidosis subjects. We also highlight the significant association between radiomics and lung function, particularly among those with sarcoidosis, suggesting that the radiomic measures that we evaluated have functional implications. This work shows exciting promise for radiomic measures as biomarkers for disease in sarcoidosis, which should be further evaluated for application in the clinic and/or research setting. We are hopeful that future research with radiomics on HRCT will result in a better understanding of disease progression, classification and treatment options for sarcoidosis subjects.
Supplementary material
Supplementary Material
Please note: supplementary material is not edited by the Editorial Office, and is uploaded as it has been supplied by the author.
ERJ-00371-2019_Online_Data_Supplement_Revised ERJ-00371-2019_Online_Data_Supplement_Revised
ERJ-00371-2019_figureE1 ERJ-00371-2019_figureE1
ERJ-00371-2019_figureE2 ERJ-00371-2019_figureE2
Shareable PDF
Supplementary Material
This one-page PDF can be shared freely online.
Shareable PDF ERJ-00371-2019.Shareable
Acknowledgements
This research used data generated by the COPDGene study, which was supported by NIH grants U01HL089856 and U01HL089897. The COPDGene project is also supported by the COPD Foundation through contributions made by an Industry Advisory Board comprised of Pfizer, AstraZeneca, Boehringer Ingelheim, Novartis, and Sunovion.
Footnotes
This article has supplementary material available from erj.ersjournals.com
Conflict of interest: S.M. Ryan has nothing to disclose.
Conflict of interest: T.E. Fingerlin has nothing to disclose.
Conflict of interest: M. Mroz has nothing to disclose.
Conflict of interest: B. Barkes has nothing to disclose.
Conflict of interest: N. Hamzeh has nothing to disclose.
Conflict of interest: L.A. Maier reports grants from NIH/NHLBI during the conduct of the study, and grants from NIH/NHLBI, aTYR and Mallinckrodt ARD, Inc., outside the submitted work.
Conflict of interest: N.E. Carlson has nothing to disclose.
Support statement: Funding was provided from the US Dept of Health and Human Services, National Institute of Health, National Heart, Lung, and Blood Institute: R01 HL089856, R01 HL114587, R01 HL142049, U01 HL112695 and U01 HL112707. Funding information for this article has been deposited with the Crossref Funder Registry.
An earlier version of this manuscript was posted to ArXiv in June 2018 (arXiv:1806.10281) (https://arxiv.org/abs/1806.10281). Compared with that version, the present manuscript has substantial improvements to its methods, results and presentation.
- Received February 21, 2019.
- Accepted May 24, 2019.
- Copyright ©ERS 2019