Radiomic Profiling of Lung CT in a Cohort of Sarcoidosis Cases
==============================================================

* Nichole E Carlson
* William Lippitt
* Sarah M Ryan
* Margaret Mroz
* Zachary Buchalski
* Briana Barkes
* Shu-Yi Liao
* Lisa A Maier
* Tasha E Fingerlin

## Abstract

**Question Addressed** High resolution computed tomography (HRCT) of the chest is increasingly used in clinical practice for sarcoidosis. Visual assessment of chest HRCTs in patients with sarcoidosis has high inter- and intra-rater variation. Radiomics offers a reproducible quantitative assessment of HRCT lung parenchyma and could be useful as an additional summary measure of disease. We develop radiomic profiles on HRCT and map them to clinical and patient reported outcomes.

**Patients and Methods** Three-dimensional radiomic features were calculated on chest HRCT for the left and right lung from sarcoidosis cases enrolled in the Genomic Research in Alpha-1 Antitrypsin Deficiency and Sarcoidosis study (N=321). Sparse K-means was used to group sarcoidosis cases using their radiomic profiles. Linear regression investigated how pulmonary function tests and patient reported outcomes differed between groups. Resampling approaches were used to validate the robustness of the findings.

**Results** Five groups were identified. The new radiomic-based grouping was associated with Scadding stage (p<0.001), but each radiomic group had patients from all Scadding stages. All pulmonary function testing measures significantly differed between radiomic groups (p<0.001). Radiomic group remained significantly associated with pulmonary function even after adjusting for Scadding stage (p<0.0001). Individual radiomic measures explained 9-18% of the variation in pulmonary function testing. Only two patient reported outcomes (shortness of breath and physical health) differed by radiomic group (p<0.013).
**Answer to Question** Radiomic quantification of sarcoidosis identifies subgroups associated with pulmonary function and patient reported outcomes. These associations provide additional evidence that radiomics may be useful for quantifying new disease phenotypes.

Key words

* quantitative imaging
* texture analysis
* pulmonary function
* phenotyping
* patient outcomes

## Introduction

Sarcoidosis is a granulomatous interstitial lung disease that affects approximately 110 thousand individuals in the United States (1). Diagnosis typically occurs between 30 and 50 years of age, resulting in a decrease in quality of life and productivity (2). Pulmonary disease occurs in over 90% of those with sarcoidosis (3), with significant morbidity and mortality. Currently, visual assessment of chest radiography (CXR) is used to quantify lung abnormalities, standardized via the Scadding staging system (4), which groups CXR findings into five stages from 0 to 4. Substantial variation exists in the radiographic patterns within each Scadding stage. Scadding stage has limited utility in predicting prognosis, even at the extremes of the scale (5).

Chest high resolution computed tomography (HRCT) is increasingly used in clinical practice to monitor sarcoidosis, as it offers more detailed visualization of parenchymal abnormalities than CXR (4, 6–8). As with CXR, visual assessment of chest HRCT is used to evaluate abnormalities, although there are few standardized scoring metrics (4, 9, 10). This is due in part to the diverse and heterogeneous patterns present on chest HRCT in sarcoidosis, often with multiple patterns noted on one CT. These complexities result in high inter- and intra-rater variation (11). More automated systems that quantify sarcoidosis chest HRCT could decrease these sources of variation.

Radiomics is a field of study in which large numbers of quantitative features are extracted from medical images (12).
A radiomics panel computes summary measures of the distribution of the Hounsfield units (HU) along with summary measures of the spatial relationships of neighbouring voxels (13). The result is a panel of quantitative measures characterizing the texture of the image. Radiomic analysis has proved useful for quantifying HRCT in emphysema (7), idiopathic pulmonary fibrosis (14, 15), interstitial lung disease (16, 17), diffuse lung disease (18) and cancer (19). Ryan et al. (20) showed the potential utility of radiomics in sarcoidosis, comparing radiomic and other textural based measures between sarcoidosis patients and controls. It remains unclear whether radiomics also has the potential to differentiate varied phenotypes *within* sarcoidosis patients. Radiomic profiles within sarcoidosis patients that also correlate with pulmonary function or patient reported outcomes would suggest that radiomics may serve as a useful measure to track change in the lung parenchyma over time.

The goal of this research study is to develop a radiomic profile of sarcoidosis chest HRCT, applying statistical clustering techniques to a large, phenotypically diverse population of sarcoidosis cases from the Genomic Research in Alpha-1 Antitrypsin Deficiency and Sarcoidosis (GRADS) study (21), and to investigate the clinical utility of the clusters by quantifying their association with pulmonary function testing (PFT) and patient reported outcomes (PROs). PFTs and PROs were chosen to capture both clinician and patient focused measures of disease (26, 27).

## Methods

### The study design and participants

The sarcoidosis population was recruited in the multicentre NHLBI-funded GRADS study (N=368) (21). This investigation has GRADS ancillary study approval and all participants provided informed consent (IRB approval HS-2779 and HS-2780; N=365). More details of this cohort can be found in the online supplement.
Pulmonary function testing included pre-bronchodilator (BD) forced expiratory volume at one second (FEV1), forced vital capacity (FVC), the ratio of FEV1 to FVC, and diffusing capacity of the lungs for carbon monoxide (DLCO). The PROs included the gastroesophageal reflux disease questionnaire (GERDQ) (24), the University of California San Diego Shortness of Breath Questionnaire (SOBQ) (25), two measures of fatigue (the Fatigue Assessment Scale [FAS] (26) and the Patient-Reported Outcomes Measurement Information System fatigue profile [PROMIS] (27)), the Cognitive Failure Questionnaire (CFQ) (28), and the physical and mental subscales of the SF-12 (33, 34). The final analysis dataset included N=321 patients who each had both an analysable CT and clinical data (online supplement Figure 1).

### Radiomic Analysis

Details of the imaging acquisition and lung segmentation can be found in the online supplement. Radiomic features were calculated on the left and right lungs using RIA_lung from the lungct R package. These features include 44 first-order features and 239 GLCM features for each lung, for a total of 566 features. To calculate the GLCM features, the Hounsfield units (HU) from each HRCT scan were first discretized into 16 bins with equal relative frequencies; then, the features were calculated in each of 26 directions, assuming a voxel distance of one; finally, the features from all directions were summarized by their mean. The exact computation for each radiomic measure can be found elsewhere (31). The ComBat function ([https://github.com/Jfortin1/ComBatHarmonization](https://github.com/Jfortin1/ComBatHarmonization)) was used to harmonize the radiomic measures across scanners (32–34), while preserving the biological variability in the following covariates: age, height, BMI, sex, race and clinical phenotype.
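The GLCM steps described above (equal-frequency discretization, co-occurrence counting in 26 directions at a voxel distance of one, and averaging over directions) can be illustrated with a minimal Python sketch. This is not the RIA_lung implementation. The function names are hypothetical, GLCM contrast is used only as a stand-in for the 239 GLCM features, and the sketch assumes every voxel in the input volume belongs to the lung (no segmentation mask).

```python
from bisect import bisect_right
from itertools import product

# All 26 unit offsets to neighbouring voxels in 3-D (the zero offset is excluded).
DIRECTIONS = [d for d in product((-1, 0, 1), repeat=3) if d != (0, 0, 0)]


def equal_frequency_bins(values, n_bins):
    """Map each value to a bin index so that bins hold roughly equal counts."""
    ordered = sorted(values)
    n = len(ordered)
    cuts = [ordered[(k * n) // n_bins] for k in range(1, n_bins)]
    return [bisect_right(cuts, v) for v in values]


def mean_glcm_contrast(volume, n_bins=16):
    """Average GLCM contrast over 26 directions for a volume indexed [z][y][x].

    Assumption: all voxels are lung voxels; contrast is an illustrative feature.
    """
    nz, ny, nx = len(volume), len(volume[0]), len(volume[0][0])
    flat = [v for plane in volume for row in plane for v in row]
    levels = equal_frequency_bins(flat, n_bins)

    per_direction = []
    for dz, dy, dx in DIRECTIONS:
        # Co-occurrence counts of (level at voxel, level at its neighbour).
        glcm = [[0] * n_bins for _ in range(n_bins)]
        pairs = 0
        for z, y, x in product(range(nz), range(ny), range(nx)):
            z2, y2, x2 = z + dz, y + dy, x + dx
            if 0 <= z2 < nz and 0 <= y2 < ny and 0 <= x2 < nx:
                i = levels[(z * ny + y) * nx + x]
                j = levels[(z2 * ny + y2) * nx + x2]
                glcm[i][j] += 1
                pairs += 1
        if pairs:
            # Contrast: sum over (i, j) of (i - j)^2 * p(i, j).
            contrast = sum(
                (i - j) ** 2 * glcm[i][j] / pairs
                for i in range(n_bins)
                for j in range(n_bins)
            )
            per_direction.append(contrast)
    # A single-voxel volume has no neighbour pairs in any direction.
    return sum(per_direction) / len(per_direction) if per_direction else 0.0


if __name__ == "__main__":
    toy = [[[-900, -850], [-860, -300]], [[-880, -870], [-400, -250]]]
    print(mean_glcm_contrast(toy, n_bins=4))
```

Averaging over directions makes the summary insensitive to the orientation of a texture pattern, which is one plausible reason for the mean summary described in the text.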
As radiomic features are high-dimensional and redundant (Figure 1), we first used a decorrelation filter (see online supplement) to identify a subset of features for analysis. This reduced the number of variables from 566 to 97. We then used robust sparse k-means (35) to cluster patients (R package RSKC). For outlier robustness, the proportion of cases trimmed was set at 0.1; the optimal bound on feature weights was found to be 7.5, using the permutation approach implemented in the sparcl R package (36). We used a standardized Gap statistic to determine the optimal number of clusters using the cluster R package (37). A standardized Gap statistic is the usual Gap statistic divided by its standard deviation. The variables with the top five weights from the RSKC clustering algorithm were then selected for further investigation (discriminative features).

![Figure 1](https://www.medrxiv.org/content/medrxiv/early/2022/10/02/2022.10.01.22280365/F1.medium.gif)

Figure 1: Heat map of the correlation between different radiomic measures for the entire population.

### Statistical Analysis

Linear regression was used to assess the associations of cluster group and Scadding stage with each of the outcomes, controlling for age, sex, race, height, and BMI. We fitted one model with only cluster group and one with only Scadding stage, and then included both in the same model to determine whether the association with radiomics remained significant in the presence of Scadding stage. Each outcome was modelled separately using a complete case analysis for that outcome. Linear regression was also used to quantify the associations between the selected discriminative radiomic features and the outcomes, controlling for age, sex, race, height, and BMI.
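As an illustration of the nested-model comparison described in this section, the following Python sketch computes how much additional variation an added predictor explains beyond a base set of covariates. It is a toy under stated assumptions, not the authors' analysis code: the function names and data are hypothetical, ordinary least squares is solved through the normal equations, predictors are assumed not to be collinear, and a categorical variable such as cluster group would need to be supplied as indicator (dummy) columns.

```python
def solve_linear_system(matrix, rhs):
    """Solve matrix @ x = rhs by Gaussian elimination with partial pivoting."""
    n = len(rhs)
    aug = [row[:] + [rhs[i]] for i, row in enumerate(matrix)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        tail = sum(aug[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (aug[i][n] - tail) / aug[i][i]
    return x


def r_squared(predictor_columns, outcome):
    """R^2 of an ordinary least squares fit with an intercept."""
    n = len(outcome)
    design = [[1.0] + [col[i] for col in predictor_columns] for i in range(n)]
    p = len(design[0])
    xtx = [[sum(row[a] * row[b] for row in design) for b in range(p)] for a in range(p)]
    xty = [sum(design[i][a] * outcome[i] for i in range(n)) for a in range(p)]
    beta = solve_linear_system(xtx, xty)
    fitted = [sum(b * v for b, v in zip(beta, row)) for row in design]
    mean_y = sum(outcome) / n
    ss_res = sum((y - f) ** 2 for y, f in zip(outcome, fitted))
    ss_tot = sum((y - mean_y) ** 2 for y in outcome)
    return 1.0 - ss_res / ss_tot


def added_r_squared(base_columns, extra_columns, outcome):
    """Additional variation explained when extra_columns join the base model."""
    return r_squared(base_columns + extra_columns, outcome) - r_squared(base_columns, outcome)


if __name__ == "__main__":
    # Hypothetical toy data: one base covariate and one added predictor.
    covariate = [0.0, 1.0, 2.0, 3.0, 4.0, 5.0]
    feature = [1.0, 0.0, 1.0, 0.0, 1.0, 0.0]
    outcome = [1 + 2 * a + 3 * b for a, b in zip(covariate, feature)]
    print(added_r_squared([covariate], [feature], outcome))
```

The same comparison, run once with the radiomic predictors and once with Scadding stage as the added block, gives the two kinds of "additional variation explained" that the Results contrast.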
### Validation

The statistical analysis is an unsupervised learning problem, which makes traditional training and test validation approaches difficult because the true groupings are unknown. Instead, we investigated the effect of repeatedly applying our analysis pipeline under various conditions and with bootstrapped samples. The details of the validation can be found in the online supplement.

## Results

Tables 1 and 2 show the characteristics of the study population, which included N=321 sarcoidosis cases, including 147 (45.8%) males and 233 (72.6%) whites, with an average age of 53 (SD=10) years, average height of 67.0 in (SD=4.2) and average BMI of 30.6 kg/m2 (SD=6.5). Participants were spread across Scadding stages, with 43 (13.5%) in stage 0, 63 (19.8%) in stage 1, 92 (28.9%) in stage 2, 44 (13.8%) in stage 3 and 76 (23.9%) in stage 4. The average FVC was 2.62 L (SD=0.93), FEV1 was 3.57 L (SD=1.14), and DLCO 80.15 (SD=23.37). The population was predominantly of the non-obstructive type (72.7%).

[Table 1](http://medrxiv.org/content/early/2022/10/02/2022.10.01.22280365/T1): Patient demographics by radiomic grouping, ordered from least severe (0) to most severe (4) based on average FVC. Unless otherwise noted, values are mean (SD).

[Table 2](http://medrxiv.org/content/early/2022/10/02/2022.10.01.22280365/T2): Summary measures of pulmonary function testing and self-reported outcomes by radiomic grouping.

#### Radiomic Based Clustering of Sarcoidosis

Five groups (clusters) of patients were identified (Table 1). Ordered from highest FVC to lowest, 75 (23.4%) patients were in radiomic group 0, 57 (17.8%) in group 1, 51 (15.9%) in group 2, 58 (18.1%) in group 3 and 80 (24.9%) in group 4. The new radiomic-based grouping was associated with Scadding stage (p<0.001), but was not a direct reflection of Scadding stage (Table 1). PFT differed between radiomic groups (Figure 2; p<0.0001).
For FVC, radiomic groups 3 and 4 had between 0.5 and 0.7 L lower average FVC compared to radiomic groups 0, 1, and 2. For FEV1, radiomic groups 2 and 3 had an average FEV1 0.5 L lower than radiomic groups 0 and 1, which were similar. Radiomic group 4 had the lowest average FEV1, approximately 0.8 L lower than groups 0 and 1. DLCO had a similar pattern to FVC. Groups 2 and 4 demonstrated more obstructive disease compared to groups 0, 1, and 3 (57.4% and 38.0% vs. 13.5%-17.2%, respectively; p<0.001).

![Figure 2](https://www.medrxiv.org/content/medrxiv/early/2022/10/02/2022.10.01.22280365/F2.medium.gif)

Figure 2: Group differences in average PFT values from regression models with adjustment for demographics (circles) and then also Scadding stage (triangles). The significance of the radiomic group differences from group 0 is shown in green (p>0.05) and blue (p<0.05). The bottom left text in each panel is the overall p-value for the association between radiomic grouping and PFT outcome along with the R2. The bottom right text is the overall p-value for the association between radiomic grouping and PFT after additional adjustment for Scadding stage along with the R2.

Some PROs also differed between groups, but not consistently (Figure 3). Average shortness of breath (SOBQ) score differed between radiomic groups (p<0.0001). Radiomic group 4 had a higher average SOBQ score than radiomic groups 0 and 1, by approximately 17.5 units. In addition, average physical health (SF-12) was lowest for radiomic group 4 and increased linearly to radiomic group 0.
![Figure 3](https://www.medrxiv.org/content/medrxiv/early/2022/10/02/2022.10.01.22280365/F3.medium.gif)

Figure 3: Group differences in average PRO values from regression models with adjustment for demographics (circles) and then also Scadding stage (triangles). The significance of the radiomic group differences from group 0 is shown in green (p>0.05) and blue (p<0.05). The bottom left text in each panel is the overall p-value for the association between radiomic grouping and PRO outcome along with the R2. The bottom right text is the overall p-value for the association between radiomic grouping and PRO after additional adjustment for Scadding stage along with the R2.

The new radiomic groups remained significantly associated with PFT after adjusting for Scadding stage (Figure 2; p<0.0001). Scadding stage also remained significant for all PFT after controlling for radiomic group (p<0.0041; Figure 2). The radiomic group also remained associated with SOBQ score (p=0.032; Figure 3), while Scadding stage was no longer associated with SOBQ (p=0.072; Figure 3). None of the other PROs maintained a significant association with radiomic group after adjustment for Scadding stage (p>0.38).

The five most discriminatory radiomic features included kurtosis, which is a measure of the shape of the distribution of the HU from an image, as well as four summary measures from the gray level co-occurrence matrix (GLCM). The GLCM measures capture the spatial correlation and similarity of the HU in image voxels near each other. Images of cases with different values of kurtosis and two GLCM measures are shown in Figure 4, and the HU distributions for the maximum, median and minimum kurtosis levels in our population are shown in Figure 5. The images visibly show the increase in observable parenchymal abnormalities (left panel Figure 4) with lower kurtosis.
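To make the link between kurtosis and the shape of the HU distribution concrete, the sketch below computes kurtosis as the fourth standardized moment for two hypothetical HU samples: one sharply peaked around aerated lung, and one in which a block of moderately higher HU values broadens the distribution. The HU values are invented for illustration, the function name is hypothetical, and whether the radiomic feature used in this study subtracts 3 (excess kurtosis) is not stated in the text, so the plain fourth standardized moment is an assumption.

```python
def kurtosis(values):
    """Fourth standardized moment: m4 / m2^2 (not excess kurtosis)."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n
    m4 = sum((v - mean) ** 4 for v in values) / n
    return m4 / m2 ** 2


# Hypothetical HU samples (100 voxels each), not taken from the study data.
# Peaked sample: most voxels near -850 HU with a few dense outliers.
peaked = [-850] * 80 + [-900] * 9 + [-800] * 9 + [-200] * 2
# Broadened sample: 30 voxels shifted to a moderately higher density (-600 HU).
broadened = [-850] * 50 + [-900] * 9 + [-800] * 9 + [-600] * 30 + [-200] * 2

if __name__ == "__main__":
    print("peaked:", round(kurtosis(peaked), 1))
    print("broadened:", round(kurtosis(broadened), 1))
```

In this toy comparison the broadened sample has the lower kurtosis, because the added moderately dense voxels inflate the variance more than they add to the extreme tails.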
![Figure 4](https://www.medrxiv.org/content/medrxiv/early/2022/10/02/2022.10.01.22280365/F4.medium.gif)

Figure 4: CT images in axial orientation for three patients with minimum, median, and maximum values for GLCM Gaussian (left column), GLCM Inverse Gaussian, and kurtosis.

![Figure 5](https://www.medrxiv.org/content/medrxiv/early/2022/10/02/2022.10.01.22280365/F5.medium.gif)

Figure 5: Distribution of HU from the HRCT images with the maximum (32.2; solid), median (9.6; dashed) and minimum (0.4; dotted) kurtosis.

The discriminatory radiomic measures were jointly associated with FVC, FEV1, FEV1/FVC and DLCO (p<0.001; Table 3). The radiomic measures explained between 9 and 18% more variation in PFT than adjustment for demographics (age, race, sex, height and BMI) only. For comparison, Scadding stage explained between 0 and 11% more variation in PFT. Kurtosis was associated with FVC, FEV1/FVC and DLCO (p<0.0001). A 1-unit increase in kurtosis was associated with a 0.4 L (SE=0.07; p<0.0001) increase in FVC. Kurtosis was also positively associated with DLCO (1.59, SE=0.68, p=0.020). Kurtosis was negatively associated with FEV1/FVC: a 1-unit increase in kurtosis was associated with a 7% (SE=0.01, p<0.0001) decrease in the FEV1/FVC ratio.

[Table 3](http://medrxiv.org/content/early/2022/10/02/2022.10.01.22280365/T3): Results of the regression analysis of the five discriminatory radiomic measures for PFT.

#### Validation of Clustering

Across repeated applications of the analysis pipeline, pairwise adjusted Rand index (ARI) values ranged from 0.3 to 1 and peaked around 0.5 (online supplement Figure 2 left). The maximum ARI of fit clusters with Scadding stage was 0.07.
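For readers unfamiliar with the ARI, the following Python sketch shows how agreement between two clusterings can be computed, including the restriction to observations shared by two bootstrap samples. It is a minimal illustration, not the validation code from the online supplement; the function names and the dictionary-based representation (patient identifier mapped to cluster label) are assumptions made for the example.

```python
from collections import Counter
from math import comb


def adjusted_rand_index(labels_a, labels_b):
    """Adjusted Rand index between two labelings of the same observations."""
    n = len(labels_a)
    joint = Counter(zip(labels_a, labels_b))
    sum_joint = sum(comb(c, 2) for c in joint.values())
    sum_a = sum(comb(c, 2) for c in Counter(labels_a).values())
    sum_b = sum(comb(c, 2) for c in Counter(labels_b).values())
    expected = sum_a * sum_b / comb(n, 2)
    max_index = (sum_a + sum_b) / 2
    if max_index == expected:
        # Degenerate case, e.g. both labelings put everything in one cluster.
        return 1.0
    return (sum_joint - expected) / (max_index - expected)


def ari_on_overlap(clusters_1, clusters_2):
    """ARI using only the observations present in both clusterings.

    Each argument maps an observation identifier to its cluster label.
    """
    shared = sorted(set(clusters_1) & set(clusters_2))
    return adjusted_rand_index(
        [clusters_1[i] for i in shared],
        [clusters_2[i] for i in shared],
    )


if __name__ == "__main__":
    # Hypothetical cluster assignments from two bootstrap samples.
    run_1 = {"p1": 0, "p2": 0, "p3": 1, "p4": 1, "p5": 2}
    run_2 = {"p2": 1, "p3": 0, "p4": 0, "p5": 2, "p6": 2}
    print(ari_on_overlap(run_1, run_2))
```

Because the ARI depends only on which observations are grouped together, it is unaffected by arbitrary relabeling of clusters between runs, and values near 0 indicate agreement no better than chance.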
In the linear models, the maximum p-values for the significance of cluster in the demographic adjusted and demographic plus Scadding models were <0.0001 for both models. The corresponding proportions of p-values less than 0.01 were 100% for both models.

Bootstrap samples contained between 184 and 219 unique observations, with 203 on average. Pairs of bootstrap samples contained between 100 and 159 unique overlapping observations, with 129 on average, so about 63% of the unique observations in a sample were used in each ARI calculation. Pairwise ARI values computed from unique overlapping observations ranged from 0.2 to 1 and had a distribution peaking around 0.5 (online supplement Figure 2 right). The maximum ARI of fit clusters with Scadding stage was 0.17. For linear models fit to bootstrap samples, the maximum p-values for the significance of cluster in the demographic adjusted and demographic plus Scadding models were <0.0001 for both models. The corresponding proportions of p-values less than 0.01 were 100%.

Taken together, the validation analyses suggest fairly mild sensitivity of clustering to the analysis pipeline. There is more sensitivity to bootstrapped samples; however, cluster group remains a significant predictor of FVC across all sources of random perturbation in the pipeline.

## Discussion

Radiographic manifestations in sarcoidosis are protean. As a result, image characterization is traditionally done visually and is not standardized. The utility of visual assessment in sarcoidosis is limited by intra-observer and inter-observer variability. Radiomics is a more reproducible and computationally efficient approach to characterizing HRCT. We used radiomics to quantitatively characterize images from a large, phenotypically diverse cohort of sarcoidosis subjects and demonstrated that radiomics are associated with clinical and patient reported outcomes of disease.
Using a common unsupervised learning approach (35), we identified five clusters of cases based on radiomic characterization of their chest CT. These five radiomic-based groups differed significantly by PFTs and several PROs. Notably, each radiomic group included a range of Scadding stages, and radiomic group remained significantly associated with PFT after adjusting for Scadding stage. These data suggest that radiomics represents radiographic abnormalities that differ from Scadding stage.

Demographic characteristics of the individual explained a sizable amount of the variation observed in PFT (R2=50-60%). Radiomics also explained a significant amount of additional variation in PFT (8% to 18%). This additional amount of variation explained is consistent with other quantitative imaging approaches such as CALIPER (16) and those used for investigations of other lung conditions such as systemic sclerosis (38) and diffuse interstitial lung diseases (39).

Except in lung cancer research, much of the work on quantitative CT image analysis has focused only on the HU density (17, 38). Our radiomic panel summarized both the density and the spatial characteristics. We note that more GLCM measures than first-order densitometry measures appear as discriminative in the cluster analysis. This suggests that the texture (the spatial information) is perhaps more useful for differentiating the various parenchymal abnormalities seen in sarcoidosis.

Although the reasons for the association between decreased kurtosis and decreased FVC and DLCO remain speculative, the following explanation is plausible. As kurtosis decreases, more severe parenchymal abnormalities are present that impact the function of the lung, as is shown in Figure 4. Severe parenchymal abnormalities in sarcoidosis are observed as higher HU values.
In Figure 5 we observe a higher frequency of moderately high HU values, which dampens the peak in the HU distribution and leads to a lower kurtosis in this case. We also found a negative association between lung function and Gaussian weighted GLCM. Figure 4 highlights more abnormalities present with a higher Gaussian weighted GLCM value, as might be expected with this association.

The associations with PROs were less consistent, although important patterns emerged. Radiomic group 4 demonstrated decreased quality of life based on physical function in the SF-12 and worse shortness of breath on the SOBQ, findings that are consistent with the worse lung function in this group. Radiomic group explained substantially less variation in PROs (1 to 15%) compared to PFTs. While the tools we used to measure PROs are validated and used widely, they are not generally correlated with objective measures of lung function (40). As an example, fatigue is a known multi-factorial PRO in sarcoidosis and may impact a number of other PROs (22, 40). Fatigue can be associated with shortness of breath and the SOBQ but also with other organ involvement; these two measurements can be confounded by physical function and cardiac or even neurological involvement. Our finding that radiomic group is associated with decreased physical function and increased shortness of breath provides encouragement that more detailed characterization of lung abnormalities, such as the one we developed here, could contribute to a better understanding of the current disconnect between objective measures of lung function and patient experiences.

This study had several strengths. To our knowledge, this was the largest group of sarcoidosis patients with research grade HRCT available, allowing for a full 3-D based quantitative analysis.
In addition to the high-quality and robust clinical data, the GRADS study also provides a comprehensive and consistently collected set of patient reported outcomes, which reflect important aspects of treatment decision-making processes (23). The results were also internally validated. Taken together, the validation results suggest that our approach is not overfitting to these data.

This study is not without limitations. The GRADS study relied on enrolment from academic centres and could be skewed toward a population that was referred for worse disease or lived near one of the centres, which were primarily located in the Eastern US. As a result, the cohort is predominantly white and of higher socioeconomic status (SES) and thus does not fully represent the spectrum of disease. In addition, many subjects had had disease for a decade or longer and had been treated. While the same protocol was used across study sites for obtaining the CT images we studied, the scanners themselves differed. We performed harmonization of the scans to mitigate the potential for large systematic differences in scans due to scanner type. In addition, the distribution of PFT measures does not depend on scanner type, so we expect any differences in the radiomic measures due to scanner type to be non-differential as they relate to PFTs. Finally, the number of radiomic clusters (groups) we identified may not directly translate to other populations of sarcoidosis patients or even be the optimal solution for this group of patients. The optimization algorithm we used is common; however, there is no single way to choose the optimal number of clusters.

Radiomic and other quantitative imaging approaches have a major strength in that they can be computed by a time-efficient, reliable and reproducible automated procedure. The radiomics package in R took only 3 minutes to compute all the 3-D radiomics measures.
This means that radiomic profiles can be computed for large quantities of scans to help decide which scans the radiologist might prioritize for further visual or clinician evaluation.

In summary, this work provides evidence that in sarcoidosis radiomic quantification is useful for classifying abnormality of the lung along with pulmonary function and, to a lesser degree, patient reported outcomes. Future work should evaluate the potential of radiomics to capture small changes over time that are not easily detected by radiologic assessment, to further evaluate the potential of radiomics to serve as a clinically useful quantitative measure of sarcoidosis presentation and progression.

## Supporting information

Online Supplement [supplements/280365_file02.docx]

## Data Availability

Data are available via the process laid out by the GRADS consortium.

## Footnotes

* **Sources of Support:** This work was supported by the National Institutes of Health (R01 HL114587; R01 HL142049; U01 HL112695). Data from the GRADS study was used, which was funded by the NIH grant U01 HL112707 entitled “Sarcoidosis and A1AT Genomics and Informatics Centre”, as well as others (U01 HL112707, U01 HL112694, U01 HL112695, U01 HL112696, U01 HL112702, U01 HL112708, U01 HL112711, U01 HL112712). The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Heart, Lung, and Blood Institute or the National Institutes of Health.
* **Supplementary Material Statement:** This article has an online supplement, which is accessible from this issue’s table of contents online.
* Received October 1, 2022.
* Revision received October 1, 2022.
* Accepted October 2, 2022.
* © 2022, Posted by Cold Spring Harbor Laboratory

This pre-print is available under a Creative Commons License (Attribution-NonCommercial-NoDerivs 4.0 International), CC BY-NC-ND 4.0, as described at [http://creativecommons.org/licenses/by-nc-nd/4.0/](http://creativecommons.org/licenses/by-nc-nd/4.0/)

## References

1. Erdal BS, Clymer BD, Yildiz VO, Julian MW, Crouser ED. Unexpectedly high prevalence of sarcoidosis in a representative U.S. Metropolitan population. Respir Med 2012;106:893–899. doi:10.1016/j.rmed.2012.02.007
2. Cox CE, Donohue JF, Brown CD, Kataria YP, Judson MA. The Sarcoidosis Health Questionnaire: a new measure of health-related quality of life. Am J Respir Crit Care Med 2003;168:323–329. doi:10.1164/rccm.200211-1343OC
3. Wasfi YS, Rose CS, Murphy JR, Silveira LJ, Grutters JC, Inoue Y, Judson MA, Maier LA. A new tool to assess sarcoidosis severity. Chest 2006;129:1234–1245. doi:10.1378/chest.129.5.1234
4. Scadding JG. Prognosis of intrathoracic sarcoidosis in England. A review of 136 cases after five years’ observation. Br Med J 1961;2:1165–1172.
5. Oberstein A, von Zitzewitz H, Schweden F, Müller-Quernheim J. Non invasive evaluation of the inflammatory activity in sarcoidosis with high-resolution computed tomography. Sarcoidosis Vasc Diffuse Lung Dis Off J WASOG 1997;14:65–72.
6. Drent M, De Vries J, Lenters M, Lamers RJS, Rothkranz-Kos S, Wouters EFM, van Dieijen-Visser MP, Verschakelen JA. Sarcoidosis: assessment of disease severity using HRCT. Eur Radiol 2003;13:2462–2471. doi:10.1007/s00330-003-1965-x
7. Sluimer I, Schilham A, Prokop M, van Ginneken B. Computer analysis of computed tomography scans of the lung: a survey. IEEE Trans Med Imaging 2006;25:385–405. doi:10.1109/TMI.2005.862753
8. Keijsers RGM, Heuvel DAF van den, Grutters JC. Imaging the inflammatory activity of sarcoidosis. Eur Respir J 2013;41:743–751.
9. Moller DR. Negative clinical trials in sarcoidosis: failed therapies or flawed study design? Eur Respir J 2014;44:1123–1126.
10. Nunes H, Uzunhan Y, Gille T, Lamberto C, Valeyre D, Brillet P-Y. Imaging of sarcoidosis of the airways and lung parenchyma and correlation with lung function. Eur Respir J 2012;40:750–765.
11. Van den Heuvel DA, de Jong PA, Zanen P, van Es HW, van Heesewijk JP, Spee M, Grutters JC. Chest Computed Tomography-Based Scoring of Thoracic Sarcoidosis: Inter-rater Reliability of CT Abnormalities. Eur Radiol 2015;25:2558–2566.
12. Kumar V, Gu Y, Basu S, Berglund A, Eschrich SA, Schabath MB, Forster K, Aerts HJWL, Dekker A, Fenstermacher D, Goldgof DB, Hall LO, Lambin P, Balagurunathan Y, Gatenby RA, Gillies RJ. Radiomics: the process and the challenges. Magn Reson Imaging 2012;30:1234–1248.
doi:10.1016/j.mri.2012.06.010
13. Haralick RM. Statistical and structural approaches to texture. Proc IEEE 1979;67:786–804.
14. Park YS, Seo JB, Kim N, Chae EJ, Oh YM, Lee SD, Lee Y, Kang S-H. Texture-based quantification of pulmonary emphysema on high-resolution computed tomography: comparison with density-based quantification and correlation with pulmonary function test. Invest Radiol 2008;43:395–402. doi:10.1097/RLI.0b013e31816901c7
15. Humphries SM, Yagihashi K, Huckleberry J, Rho B-H, Schroeder JD, Strand M, Schwarz MI, Flaherty KR, Kazerooni EA, van Beek EJR, Lynch DA. Idiopathic Pulmonary Fibrosis: Data-driven Textural Analysis of Extent of Fibrosis at Baseline and 15-Month Follow-up. Radiology 2017;285:270–278.
16. Ungprasert P, Wilton KM, Ernste FC, Kalra S, Crowson CS, Rajagopalan S, Bartholmai BJ. Novel Assessment of Interstitial Lung Disease Using the “Computer-Aided Lung Informatics for Pathology Evaluation and Rating” (CALIPER) Software System in Idiopathic Inflammatory Myopathies. Lung 2017;195:545–552.
17. Ash SY, Harmouche R, Vallejo DLL, Villalba JA, Ostridge K, Gunville R, Come CE, Onieva Onieva J, Ross JC, Hunninghake GM, El-Chemaly SY, Doyle TJ, Nardelli P, Sanchez-Ferrero GV, Goldberg HJ, Rosas IO, San Jose Estepar R, Washko GR.
Densitometric and local histogram based analysis of computed tomography images in patients with idiopathic pulmonary fibrosis. Respir Res 2017;18:. 18. 18.Wang J, Li F, Doi K, Li Q. Computerized detection of diffuse lung disease in MDCT: the usefulness of statistical texture features. Phys Med Biol 2009;54:6881–6899. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=19864701&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F10%2F02%2F2022.10.01.22280365.atom) 19. 19.Lee G, Lee HY, Park H, Schiebler ML, van Beek Ejr, Ohno Y, Seo JB, Leung A. Radiomics and its emerging role in lung cancer research, imaging biomarkers and clinical management: State of the art. Eur J Radiol 2017;86:297–307. 20. 20.Ryan SM, Fingerlin TE, Mroz M, Barkes B, Hamzeh N, Maier LA, Carlson NE. Radiomic measures from chest high-resolution computed tomography associated with lung function in sarcoidosis. Eur Respir J 2019;54:. 21. 21.Moller DR, Koth LL, Maier LA, Morris A, Drake W, Rossman M, Leader JK, Collman RG, Hamzeh N, Sweiss NJ, Zhang Y, O’Neal S, Senior RM, Becich M, Hochheiser HS, Kaminski N, Wisniewski SR, Gibson KF, GRADS Sarcoidosis Study Group. Rationale and Design of the Genomic Research in Alpha-1 Antitrypsin Deficiency and Sarcoidosis (GRADS) Study. Sarcoidosis Protocol. Ann Am Thorac Soc 2015;12:1561–1571. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1513/AnnalsATS.201503-172OT&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F10%2F02%2F2022.10.01.22280365.atom) 22. 22.Kampstra NA, Grutters JC, Beek FT van, Culver DA, Baughman RP, Renzoni EA, Wuyts W, Kouranos V, Wijsenbeek MS, Biesma DH, Wees PJ van der, Nat PB van der. First patient-centred set of outcomes for pulmonary sarcoidosis: a multicentre initiative. BMJ Open Respir Res 2019;6:e000394. 
[Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NzoiYm1qcmVzcCI7czo1OiJyZXNpZCI7czoxMToiNi8xL2UwMDAzOTQiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyMi8xMC8wMi8yMDIyLjEwLjAxLjIyMjgwMzY1LmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 23. 23.Wijsenbeek MS, Culver DA. Treatment of Sarcoidosis. Clin Chest Med 2015;36:751–767. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ccm.2015.08.015&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26593147&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F10%2F02%2F2022.10.01.22280365.atom) 24. 24.Jones R, Junghard O, Dent J, Vakil N, Halling K, Wernersson B, Lind T. Development of the GerdQ, a tool for the diagnosis and management of gastro-oesophageal reflux disease in primary care. Aliment Pharmacol Ther 2009;30:1030– 1038. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1111/j.1365-2036.2009.04142.x&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=19737151&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F10%2F02%2F2022.10.01.22280365.atom) 25. 25.Eakin EG, Resnikoff PM, Prewitt LM, Ries AL, Kaplan RM. Validation of a New Dyspnea Measure: The UCSD Shortness of Breath Questionnaire. Chest 1998;113:619–624. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1378/chest.113.3.619&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=9515834&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F10%2F02%2F2022.10.01.22280365.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000072516000013&link_type=ISI) 26. 26.Michielsen HJ, De Vries J, Van Heck GL, Van de Vijver FJR, Sijtsma K. Examination of the Dimensionality of Fatigue: The Construction of the Fatigue Assessment Scale (FAS). Eur J Psychol Assess 2004;20:39–48. 
[CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1027/1015-5759.20.1.39&link_type=DOI) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000189385100004&link_type=ISI) 27. 27.Cella D, Riley W, Stone A, Rothrock N, Reeve B, Yount S, Amtmann D, Bode R, Buysse D, Choi S, Cook K, Devellis R, DeWalt D, Fries JF, Gershon R, Hahn EA, Lai J-S, Pilkonis P, Revicki D, Rose M, Weinfurt K, Hays R, PROMIS Cooperative Group. The Patient-Reported Outcomes Measurement Information System (PROMIS) developed and tested its first wave of adult self-reported health outcome item banks: 2005-2008. J Clin Epidemiol 2010;63:1179–1194. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.jclinepi.2010.04.011&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20685078&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F10%2F02%2F2022.10.01.22280365.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000282861600004&link_type=ISI) 28. 28.Broadbent DE, Cooper PF, FitzGerald P, Parkes KR. The Cognitive Failures Questionnaire (CFQ) and its correlates. Br J Clin Psychol 1982;21:1–16. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1111/j.2044-8260.1982.tb01421.x&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=7126941&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F10%2F02%2F2022.10.01.22280365.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=A1982NC05600001&link_type=ISI) 29. 29.Ware JE, Kosinski M, Keller SD. A 12-Item Short-Form Health Survey: Construction of Scales and Preliminary Tests of Reliability and Validity. Med Care 1996;34:220– 233. 
[CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1097/00005650-199603000-00003&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=8628042&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F10%2F02%2F2022.10.01.22280365.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=A1996TZ57700003&link_type=ISI) 30. 30.Ware J, Kosinski M, Turner-Bowker D, Gandek B. How to score SF-12 items. SF-12 V2 Score Version 2 SF-12 Health Surv 2002;29–38. 31. 31.Parmar C, Rios Velazquez E, Leijenaar R, Jermoumi M, Carvalho S, Mak RH, Mitra S, Shankar BU, Kikinis R, Haibe-Kains B, Lambin P, Aerts Hjwl. Robust Radiomics feature quantification using semiautomatic volumetric segmentation. PloS One 2014;9:e102107. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pone.0102107&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25025374&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F10%2F02%2F2022.10.01.22280365.atom) 32. 32.Fortin J-P, Parker D, Tunç B, Watanabe T, Elliott MA, Ruparel K, Roalf DR, Satterthwaite TD, Gur RC, Gur RE, Schultz RT, Verma R, Shinohara RT. Harmonization of multi-site diffusion tensor imaging data. NeuroImage 2017;161:149–170. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.neuroimage.2017.08.047&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F10%2F02%2F2022.10.01.22280365.atom) 33. 33.Fortin J-P, Cullen N, Sheline YI, Taylor WD, Aselcioglu I, Cook PA, Adams P, Cooper C, Fava M, McGrath PJ, McInnis M, Phillips ML, Trivedi MH, Weissman MM, Shinohara RT. Harmonization of cortical thickness measurements across scanners and sites. NeuroImage 2018;167:104–120. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F10%2F02%2F2022.10.01.22280365.atom) 34. 34.Johnson WE, Li C, Rabinovic A. 
Adjusting batch effects in microarray expression data using empirical Bayes methods. Biostat Oxf Engl 2007;8:118–127. 35. 35.Kondo Y, Salibian-Barrera M, Zamar R. RSKCL: An R Package for a Robust and Sparse K-Means Clustering Algorithm. J Stat Softw 2016;72:. 36. 36.Witten DM, Tibshirani R. A framework for feature selection in clustering. J Am Stat Assoc 2010;105:713–726. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1198/jasa.2010.tm09415&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20811510&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F10%2F02%2F2022.10.01.22280365.atom) 37. 37.Maechler M, Roussseeuw P, Struyf A, Hubert M, Hornik K. cluster: Cluster Analysis Basics and Extensions. Rpackage version 2.1.0. 2019; 38. 38.Camiciottoli G, Orlandi I, Bartolucci M, Meoni E, Nacci F, Diciotti S, Barcaroli C, Conforti ML, Pistolesi M, Matucci-Cerinic M, Mascalchi M. Lung CT Densitometry in Systemic Sclerosis: Correlation With Lung Function, Exercise Testing, and Quality of Life. Chest 2007;131:672–681. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1378/chest.06-1401&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=17356079&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F10%2F02%2F2022.10.01.22280365.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000245072900010&link_type=ISI) 39. 39.Shin KE, Chung MJ, Jung MP, Choe BK, Lee KS. Quantitative Computed Tomographic Indexes in Diffuse Interstitial Lung Disease: Correlation With Physiologic Tests and Computed Tomography Visual Scores. J Comput Assist Tomogr 2011;35:266–271. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1097/RCT.0b013e31820ccf18&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21412102&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F10%2F02%2F2022.10.01.22280365.atom) 40. 40.Thunold RF, Løkke A, Cohen AL, Hilberg O, Bendstrup E. 
Patient Reported Outcome Measures (PROMs) in Sarcoidosis. Sarcoidosis Vasc Diffuse Lung Dis 2017;34:2–17. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F10%2F02%2F2022.10.01.22280365.atom)