Multivariable risk modelling and survival analysis with machine learning in SARS-CoV-2 infection

Andrea Ciarmiello; Francesca Tutino; Elisabetta Giovannini; Amalia Milano; Matteo Barattini; Nikola Yosifov; Debora Calvi; Maurizio Setti; Massimiliano Sivori; Cinzia Sani; Andrea Bastreri; Raffaele Staffiere; Teseo Stefanini; Stefania Artioli; Giampiero Giovacchini

doi:10.1101/2023.06.22.23291773

Abstract

Aim We evaluated the performance of a machine learning model based on demographic variables, blood tests, pre-existing comorbidities, and CT-based radiomic features to predict critical outcome in patients with severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection.

Methods We retrospectively enrolled 694 SARS-CoV-2 positive patients. Clinical and demographic data were extracted from clinical records. Radiomic data were extracted from CT. Patients were randomized to the training (80%, n=556) or test (20%, n=138) dataset. The training set was used to define the association between severity of disease and comorbidities, laboratory tests, demographic and CT-based radiomic variables, and to implement a risk prediction model. Models were evaluated using the C statistic and Brier scores. The test set was used for external validation.

Results Patients who died (n=157) were predominantly male (66%) and over the age of 50, with median [range] C-reactive protein (CRP)=5 [1, 37] mg/dL, lactate dehydrogenase (LDH)=494 [141, 3,631] U/L, and D-dimer=6,006 [168, 152,015] ng/ml. Surviving patients (n=537) had median [range] CRP=3 [0, 27] mg/dL, LDH=484 [78, 3,745] U/L, and D-dimer=1,133 [96, 55,660] ng/ml. The strongest risk factors were D-dimer, age, and cardiovascular disease.

The model implemented using the variables identified by the LASSO Cox regression analysis classified 152 of the 157 (97%) non-survivors as high-risk individuals (odds ratio=54.2 [21.9, 134.4]). Median survival in this group (14 [12, 19] days) was not different from that observed in non-survivors (12 [10, 14] days).

Conclusions A machine learning model based on combined data available in the first days of hospitalization (demographics, CT-radiomics, comorbidities, and blood biomarkers) can identify SARS-CoV-2 patients at risk of serious illness and death.

Introduction

Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) disease has had a significant global health and economic impact and continues to be a major concern as new variants are identified and the number of infected people continues to be high [1–3].

The clinical phenotype is extremely heterogeneous: the infection can be asymptomatic or can progress through forms of increasing intensity up to severe disease, which is associated with a low survival rate [4].

The variability of clinical manifestations makes outcome prediction particularly difficult, which may become a major issue when the volume of patients is high and resources are limited, as occurs during a pandemic. Therefore, the identification of major risk factors and the implementation of an outcome prediction model could support treatment planning and optimal resource allocation.

To date, several studies have reported an association between mortality and the patient’s age, pre-existing comorbidities, some blood biomarkers, and the degree of lung involvement, mainly assessed on CT scans [4–6].

The data published so far have highlighted a greater frailty of older adults, who appear to have a higher rate of severe disease and mortality than younger patients [7].

There is broad agreement in the literature that comorbidities are present in approximately half of patients with SARS-CoV-2. According to Richardson et al., coronary heart disease, hypertension, diabetes and chronic obstructive lung disease are significantly associated with increased mortality [4, 8].

Several blood biomarkers have been associated with SARS-CoV-2. High D-dimer levels have been reported as predictors of mortality in hospitalized patients [9].

Similarly, blood biomarkers of inflammation, such as C-reactive protein (CRP), and of cell damage, such as lactate dehydrogenase (LDH), appear to be significantly increased in the most severe forms of the disease [10].

The degree of lung involvement, mainly assessed by computed tomography (CT), is a potential predictor of outcome [6, 11–15]. Data of potential clinical interest contained in medical images can be read by expert radiologists or extracted by means of dedicated software. The latter approach is known as radiomics. Recently, several studies have proposed radiomics and deep learning methods to distinguish normal lung parenchyma from that affected by SARS-CoV-2 pneumonia [16] or to predict patient diagnosis [17] and outcome [18]. These approaches provide high diagnostic performance, as evidenced by the area under the receiver operating characteristic curve (AUC ≥ 89%) [18, 19].

The current study is aimed at implementing and validating a mortality risk prediction model for SARS-CoV-2 based on demographic data, blood biomarkers, baseline comorbidities and radiomic CT data using machine learning methods.

Materials and Methods

Population

This retrospective study was based on clinical records of patients admitted to hospital services through the emergency department with fever, sore throat, dry cough, diarrhea, loss of taste or smell, chest pain, and/or shortness of breath or breathing difficulty between March 1, 2020 and December 31, 2020. CT images were retrieved from the hospital picture archiving and communication system (PACS). The regional review committee granted ethical approval (CER Liguria: 251/2020). Written informed consent for this study was waived. Data were de-identified to avoid any potential breach of patient privacy. Inclusion criteria were: i) positive RT-PCR assay for COVID-19; ii) at least one non-contrast chest CT. For patients with multiple RT-PCR tests or CT scans, the test closest to the time of initial presentation to the emergency department was used. Exclusion criteria were: i) incomplete clinical records; ii) evidence of artifacts affecting image quality. Of the 789 patients initially recruited from the institutional database, 95 (12%) did not meet these criteria and were excluded. Therefore, the study cohort consisted of 694 subjects with RT-PCR confirmed diagnosis of COVID-19 pneumonia, comprising 447 males and 247 females.

Overall survival (OS) was defined as the time from first hospital presentation to the date of death or censoring. Patients who were alive were censored at last follow-up, up to December 31, 2020. The hospital records were used to determine the status of patients.

CT imaging acquisition and interpretation

All patients underwent non-enhanced chest CT imaging. Images were acquired in supine position on Aquilion (Toshiba Medical Systems, Tokyo, Japan) and Optima CT660 (GE Healthcare, Milwaukee, WI) multi-detector CT scanners. The following acquisition parameters were used for all scans: tube voltage, 120 kVp; automatic tube current, 120-440 mAs; slice thickness, 5-7 mm; slice interval, 5 mm; rotation speed, 0.5-1.0 s; helical pitch, 1.0875:1 or 1.375:1. Images were reconstructed at 512×512 pixels with a section width of 0.625 mm.

All CT images were reviewed in the imaging laboratory of S. Andrea Hospital by two board-certified radiologists specifically skilled in thoracic imaging. CT images were classified according to the criteria proposed by the Radiological Society of North America (RSNA) [20]. Subjects were grouped into two classes: typical findings, which included the typical and indeterminate CT patterns, and atypical findings, which included the atypical and negative CT patterns, as defined by Simpson and colleagues [20].

Image analysis and texture features extraction

Lung images were segmented using the 3D Slicer software v4.11 [21]. All segmented images were reviewed by two certified radiologists to rule out those with segmentation errors. The Radiomics package was used to extract image features (https://github.com/mvallieres/radiomics/). CT images were evaluated with first- and second-order features describing the spatial distribution of voxel intensities. Texture features consisted of 3 histogram-based, 9 Gray-Level Co-occurrence Matrix (GLCM), 13 Gray-Level Run-Length Matrix (GLRLM), 13 Gray-Level Size Zone Matrix (GLSZM), and 5 Neighborhood Gray-Tone Difference Matrix (NGTDM) features [22, 23]. Therefore, the set of radiomic predictors used for this study included 43 features extracted from each CT lung image.
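To illustrate what a second-order texture feature measures, the following minimal Python sketch builds a gray-level co-occurrence matrix for a tiny 2D image and computes its correlation. It is a simplified example under stated assumptions (2D image, a single horizontal offset, hypothetical function names glcm and glcm_correlation) and is not the implementation of the Radiomics package applied to the segmented lung CT images in this study.

```python
def glcm(image, levels, dx=1, dy=0):
    """Symmetric, normalized co-occurrence matrix for one pixel offset."""
    counts = [[0.0] * levels for _ in range(levels)]
    rows, cols = len(image), len(image[0])
    for r in range(rows):
        for c in range(cols):
            r2, c2 = r + dy, c + dx
            if 0 <= r2 < rows and 0 <= c2 < cols:
                a, b = image[r][c], image[r2][c2]
                counts[a][b] += 1
                counts[b][a] += 1  # count both directions (symmetric matrix)
    total = sum(map(sum, counts))
    return [[v / total for v in row] for row in counts]


def glcm_correlation(p):
    """Correlation of the gray levels of neighbouring pixels."""
    levels = range(len(p))
    # For a symmetric matrix the row and column means and variances coincide.
    mu = sum(i * p[i][j] for i in levels for j in levels)
    var = sum((i - mu) ** 2 * p[i][j] for i in levels for j in levels)
    cov = sum((i - mu) * (j - mu) * p[i][j] for i in levels for j in levels)
    return cov / var


# Hypothetical two-level toy images (not study data).
stripes = [[0, 0, 0], [1, 1, 1]]        # horizontal neighbours always equal
checker = [[0, 1, 0, 1], [1, 0, 1, 0]]  # horizontal neighbours always differ

corr_stripes = glcm_correlation(glcm(stripes, levels=2))
corr_checker = glcm_correlation(glcm(checker, levels=2))
print(corr_stripes, corr_checker)
```

In this toy setting, a homogeneous pattern along the chosen direction gives a correlation of 1 and a strictly alternating pattern gives -1, which is the kind of heterogeneity information that texture features summarize.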

Feature selection and classification

Least absolute shrinkage and selection operator (LASSO) Cox regression analysis [24] was used to build the model for predicting overall patient survival from the clinical and radiomic features [25]. Regularized Cox regression was performed with the cv.glmnet function of the glmnet package [26] under R 4.1.3 (http://www.r-project.org). The strength of the LASSO penalty (L1-norm) is controlled by the tuning constant lambda (λ), which was selected by cross-validation as the value giving the best cross-validated performance. The cv.glmnet function provides the cross-validated mean C-index and its standard error estimate. The function also reports the value of λ giving the minimum mean cross-validated error (lambda.min) and the value of λ giving the most regularized model with a cross-validated error within 1 standard error of the minimum (lambda.1se) [25].
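The one-standard-error rule described above can be sketched in a few lines of Python. This is only an illustration of the selection logic for a higher-is-better metric such as the C-index; the function name select_lambdas and all numbers in cv_results are hypothetical and are not taken from the study or from glmnet.

```python
# Hypothetical cross-validation summary, one row per candidate lambda:
# (lambda, mean C-index across folds, standard error of the mean).
cv_results = [
    (0.200, 0.820, 0.015),
    (0.100, 0.855, 0.014),
    (0.050, 0.874, 0.013),
    (0.020, 0.881, 0.013),
    (0.005, 0.884, 0.012),
    (0.001, 0.879, 0.014),
]


def select_lambdas(results):
    """Return (lambda_best, lambda_1se) for a higher-is-better metric."""
    # Best lambda: the one with the highest mean cross-validated C-index.
    best_lam, best_mean, best_se = max(results, key=lambda r: r[1])
    # One-standard-error rule: the largest lambda (strongest penalty)
    # whose mean C-index is still within one SE of the best value.
    threshold = best_mean - best_se
    lam_1se = max(lam for lam, mean, _ in results if mean >= threshold)
    return best_lam, lam_1se


best, one_se = select_lambdas(cv_results)
print(f"lambda with best mean C-index: {best}")
print(f"lambda by the 1-SE rule:       {one_se}")
```

Choosing the larger λ trades a small loss in cross-validated performance for a sparser model, which is why fewer predictors survive at this value than at the optimum.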

Model design

Fifty-seven predictors were included in the initial LASSO selection. They included 2 demographic variables (age and gender), 3 laboratory tests (C-reactive protein, lactate dehydrogenase and D-dimer), 9 comorbidities (cancer, blood cancer, diabetes, obesity, hematologic disease, cardiovascular disease, cerebrovascular disease, chronic obstructive pulmonary disease) and 43 radiomic features. The predictive model was implemented with the demographic, metabolic and radiomic characteristics that survived the LASSO analysis, using the Cox multiple regression method.

To evaluate the model’s performance on new data not used for training, the cohort was randomly divided so that 80% of the sample formed the training set and 20% the validation set. The appropriate split proportions have been reported to depend on the sample size and the percentage of complete data. The split ratio used here has proven accurate in developing predictive models when the sample size is ≥ 100 and the percentage of cases with a complete data set used for the model estimate is greater than 85% [27].
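A reproducible random 80/20 split of this kind can be sketched as follows. The function name split_train_test, the seed value and the patient identifiers are assumptions made for illustration; the study performed the split in R and does not report these details.

```python
import random


def split_train_test(ids, test_fraction=0.2, seed=42):
    """Shuffle ids reproducibly and split them into train and test lists."""
    rng = random.Random(seed)  # arbitrary seed, assumed for reproducibility
    shuffled = list(ids)
    rng.shuffle(shuffled)
    n_test = int(len(shuffled) * test_fraction)  # floor of the test share
    return shuffled[n_test:], shuffled[:n_test]


patient_ids = range(1, 695)  # 694 hypothetical patient identifiers
train_ids, test_ids = split_train_test(patient_ids)
print(len(train_ids), len(test_ids))
```

With 694 identifiers, rounding the 20% share down gives 138 test and 556 training subjects, which matches the group sizes reported in the study.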

Model validation and calibration

The predictive ability of the fitted Cox model was evaluated with the calibrate and validate functions available in the rms package. This package provides a robust approach including cross-validation, bootstrap, randomization and resampling [28]. Calibration was used to evaluate the performance of the prediction model by comparing the predicted with the observed probabilities. To reduce overfitting and quantify optimism, the model was internally validated by computing an optimism-corrected C-statistic from 1000 bootstrap resamples. External validation was also performed using a test dataset split from the study sample and not used for model training. Model calibration and validation were based on the C-index and Brier score metrics.

After validation, each patient’s individual risk score was calculated using the ggrisk package. Subjects were grouped into high- and low-risk groups based on the median risk score. The ability of the risk score to estimate the probability of survival was assessed in the whole sample using Kaplan-Meier analysis with the ggsurvplot function and the log-rank test.
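For readers unfamiliar with the two metrics, the sketch below shows one common way to compute them. It assumes Harrell's definition of the C-index for right-censored data and a plain Brier score that ignores censoring; the rms functions used in the study may apply different conventions, and the function names and toy data are hypothetical.

```python
def harrell_c_index(times, events, risks):
    """Harrell's C for right-censored data (higher risk = earlier event)."""
    concordant = 0.0
    comparable = 0
    n = len(times)
    for i in range(n):
        if not events[i]:
            continue  # subject i must have an observed event
        for j in range(n):
            if times[i] < times[j]:  # j outlived i, so the pair is comparable
                comparable += 1
                if risks[i] > risks[j]:
                    concordant += 1.0
                elif risks[i] == risks[j]:
                    concordant += 0.5  # tied predictions count as half
    return concordant / comparable


def brier_score(outcomes, probabilities):
    """Mean squared difference between predicted probability and 0/1 outcome."""
    pairs = list(zip(outcomes, probabilities))
    return sum((p - y) ** 2 for y, p in pairs) / len(pairs)


# Hypothetical toy data (not study data).
times = [5, 8, 12, 20, 30]           # follow-up in days
events = [1, 1, 0, 1, 0]             # 1 = death observed, 0 = censored
risks = [0.9, 0.6, 0.7, 0.3, 0.1]    # predicted risk scores

c_index = harrell_c_index(times, events, risks)
brier = brier_score([1, 0, 1, 0], [0.9, 0.2, 0.6, 0.1])
print(round(c_index, 3), round(brier, 3))
```

A C-index of 0.5 corresponds to random ordering and 1.0 to perfect ordering of event times, whereas a lower Brier score indicates predicted probabilities closer to the observed outcomes.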
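The median split and the Kaplan-Meier estimate can be sketched as follows. This is an illustrative pure-Python version with a hypothetical toy cohort and hypothetical function names; the study used the ggrisk package and the ggsurvplot function in R.

```python
from statistics import median


def kaplan_meier(times, events):
    """Return [(time, survival probability)] at each distinct event time."""
    curve = []
    survival = 1.0
    for t in sorted(set(times)):
        at_risk = sum(1 for x in times if x >= t)
        deaths = sum(1 for x, e in zip(times, events) if x == t and e)
        if deaths:
            survival *= 1.0 - deaths / at_risk
            curve.append((t, survival))
    return curve


def median_survival(curve):
    """First time at which survival drops to 0.5 or below, else None."""
    for t, s in curve:
        if s <= 0.5:
            return t
    return None


# Hypothetical toy cohort: (risk score, follow-up in days, death observed).
cohort = [
    (0.05, 40, 0), (0.08, 35, 0), (0.10, 30, 1), (0.12, 45, 0), (0.15, 38, 0),
    (0.30, 6, 1), (0.45, 10, 1), (0.60, 14, 1), (0.75, 25, 0), (0.90, 28, 1),
]
cutoff = median(score for score, _, _ in cohort)
groups = {"low": [], "high": []}
for score, days, died in cohort:
    groups["high" if score > cutoff else "low"].append((days, died))

medians = {}
for name, rows in groups.items():
    curve = kaplan_meier([d for d, _ in rows], [e for _, e in rows])
    medians[name] = median_survival(curve)
print(medians)
```

When a group's survival curve never falls to 50%, as for the low-risk group in this toy cohort, the median survival is undefined and the function returns None.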

Statistics

R software (version 4.1.3, http://www.r-project.org) was used for data analysis and graphics. Continuous data were compared using independent t-tests, with degrees of freedom adjusted for inequality of variance where appropriate. Possible association of predictors with patient outcome was assessed with Wald’s test [29, 30]. LASSO logistic regression analysis was conducted using the glmnet package in R. The survival curves were generated using the Kaplan-Meier method implemented in the ggsurvplot function. The pROC and survivalROC packages were used to analyze ROC curves. Validation plots were produced with the rms (Regression Modeling Strategies) package. The chi-square test was used for categorical variables. Sensitivity (SS), specificity (SP) and odds ratio (OR), with their 95% confidence intervals (CIs), were calculated to estimate how strongly the model-predicted diagnosis was associated with clinical outcome. Two-tailed P values of less than 0.05 were considered statistically significant.

Results

A total of 694 patients were recruited and randomized to the training (80%, n = 556) and test (20%, n = 138) datasets. Patient characteristics are summarized in Table 1. The median age was 64 years (range: 20-107). The study sample consisted predominantly of males (64%). The majority of patients were resident in northeastern Italy. Median hospital stay was 11 days (range: 3-86). Patients had a median CRP of 3 mg/dL (range: 0.11-37), a median LDH of 48 U/L (range: 78-3745) and a median D-dimer of 1133 ng/mL (range: 96-152015). Deceased patients were predominantly male (66%) and older than 50 years. As expected, D-dimer, CRP and LDH were significantly increased in non-survivors compared with survivors (Table 1). Visual assessment of CT images according to RSNA guidelines [20] identified typical findings in 111 of 157 non-survivors and 299 of 537 survivors.

Table 1. Clinical characteristics of the SARS-CoV-2 population in alive and deceased subjects.

Table 2 shows the impact of pre-existing comorbidities on mortality in patients with SARS-CoV-2. In particular, cardiovascular and cerebrovascular diseases, cancer, haematological diseases and chronic obstructive pulmonary disease significantly increased the probability of death in the study sample.

Table 2. Diseases associated with a high risk of mortality in SARS-CoV-2 infection

Figure 1 shows the relevance of each predictor based on the Wald test obtained from the multivariable logistic regression used for modeling patient mortality. Predictors are sorted by decreasing importance and only those with a significance ≤ 0.05 are shown. The most important predictor was D-dimer, which was the most significant among the laboratory tests and the demographic variables used to define the prediction model. Important outcome predictors were also found among the textural features belonging to the GLOBAL, GLCM, GLSZM, GLRLM and NGTDM families. Among the comorbidities, cardiovascular disease had a significant impact on survival, ranking among the most significant predictors of mortality.

Figure 1 Predictors importance.

Relevance of predictors based on the Wald test obtained from multivariable logistic regression. Predictors are shown by decreasing statistical importance, and only those with a significance ≤ 0.05 are shown. D-dimer was the most important predictor. Important predictors were several textural features belonging to the GLOBAL, GLCM, GLSZM, GLRLM and NGTDM families. Among comorbidities, cardiovascular disease was the strongest risk factor.

Based on the results of the LASSO regression, a mixed model to predict survival in hospitalized SARS-CoV-2 patients was implemented. Prediction model optimization was based on the C-index using 10-fold cross-validation. The value of the tuning parameter λ giving a C-index within 1 standard error of the maximum was 0.053, corresponding to a C-index of 0.87 (standard error 0.013) (Figure 2A, B). Twelve of the 57 variables retained non-zero coefficients at this value of the tuning parameter (Figure 2C). Selected variables included age, D-dimer, LDH, 3 groups of comorbidities and 6 radiomic variables (Figure 2C).

Figure 2 Predictors of outcome.

Parameter tuning (A). C index (B). Variables that survived at the LASSO regression, including age, D-dimer, LDH, 3 comorbidities and 6 radiomic variables (C)

Internal validation showed high agreement between the predicted and observed survival curves (Figure 3 left panel). The unadjusted and bias-adjusted curves were similar and aligned with the dashed curve representing the best possible relationship between observed and predicted outcome, as estimated by the mean absolute error (MAE) of 0.03. Furthermore, the C statistic and Brier score used as measures of prediction accuracy were 0.893 and 0.068, respectively, confirming significant agreement between the estimates. A sample of 138 SARS-CoV-2 patients was randomly drawn from the study population and used as a test set for external validation. The mean absolute error estimated between the predicted and observed curves in the test set was 0.05 (Figure 3 right panel). The C-index and Brier score on this dataset were 0.886 and 0.050, respectively.

Figure 3 Calibration curves.

Internal validation (left panel) shows high agreement between the predicted and observed survival curves. The unadjusted and bias-adjusted curves were similar to the dashed curve representing the best possible relationship between observed and predicted outcome as estimated by the mean absolute error (MAE) of 0.03. In the test set for external validation (right panel), the MAE between the predicted and observed curves was 0.03

Using Cox regression on the predictors selected by LASSO, the individual risk score of the patients included in the training dataset was estimated. The median risk score of 0.18 was used as the cutoff to split patients into high- and low-risk groups. Kaplan-Meier (KM) survival analysis was subsequently performed to evaluate predictive accuracy and build a risk model for the survival of SARS-CoV-2 patients in both the training and test datasets.

In the training dataset, median survival was 12 days (95% CI; 10-14). Using the mixed model for risk prediction in the training dataset, survival in the high-risk group was significantly different from that in the low-risk group (log-rank; p<0.001), with a median survival (the time at which the survival probability falls to 50%) of 15 days (95% CI; 12-20) in the high-risk group (Figure 4A). By contrast, the low-risk group did not reach the 50% survival rate.

Figure 4 Survival curves

Training data set (A): the survival time of SARS-CoV-2 patients in the high-risk group was significantly different compared to the low-risk subjects, with a median of 15 days (95%CI; 12-20). The low-risk group did not achieve the 50% survival rate. Test dataset (B): the median survival of the high risk group was 12 days (95% CI; 7-27) and low-risk patients did not reach the median survival of 50%

The median survival observed in the test dataset was 9 days (7-16). In the test dataset, the observed trend was similar to that estimated in the training group, with a median survival of 12 days (95% CI; 7-27) in the high-risk group. As in the training sample, low-risk patients in the test dataset did not reach the median survival of 50% (log-rank p<0.001) (Figure 4B).

Table 3 shows the comparison between the observed outcome and the predicted risk. The prediction model classified 97% of non-survivors as high risk (true positives), while 64% of survivors were classified as low risk (true negatives). The risk of mortality was significantly higher in the high-risk group than in the low-risk group (odds ratio = 54.2 (95% CI; 21.9-134.4), p<0.0001).
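As a worked check of these figures, the sketch below computes sensitivity, specificity and the odds ratio with a Wald-type 95% confidence interval from a 2×2 table. Only the non-survivor counts (152 of 157 classified as high risk) are stated in the text; the survivor counts used here (193 high risk, 344 low risk) are an assumption, back-calculated from the reported 64% of 537 survivors, and the function name is hypothetical.

```python
import math


def two_by_two_metrics(tp, fn, fp, tn, z=1.96):
    """Sensitivity, specificity and odds ratio with a Wald-type 95% CI."""
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    odds_ratio = (tp * tn) / (fn * fp)
    # Standard error of log(OR) from the four cell counts (Woolf method).
    se_log_or = math.sqrt(1 / tp + 1 / fn + 1 / fp + 1 / tn)
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return sensitivity, specificity, odds_ratio, (lower, upper)


# Reported: 152 of 157 non-survivors were classified as high risk.
# ASSUMPTION: 193 high-risk and 344 low-risk survivors, derived from
# the reported 64% of 537 survivors classified as low risk.
sens, spec, oratio, (lo, hi) = two_by_two_metrics(tp=152, fn=5, fp=193, tn=344)
print(f"sensitivity={sens:.2f} specificity={spec:.2f}")
print(f"OR={oratio:.1f} (95% CI {lo:.1f}-{hi:.1f})")
```

Under these assumed counts the calculation gives a sensitivity of about 0.97, a specificity of about 0.64 and an odds ratio close to the reported 54.2 (21.9-134.4), which suggests that the reported interval is consistent with a Wald-type interval on the log odds ratio.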

Table 3. Bivariate analysis of predicted risk on disease outcome.

Discussion

Prediction of disease severity and progression in SARS-CoV-2 patients is relevant as early intervention is significantly associated with reduced mortality [31, 32]. In this study, we developed and validated a risk scoring model based on demographics, laboratory tests, and radiomic features to predict the disease progression and survival of hospitalized patients with SARS-CoV-2.

The model was implemented with 12 out of 57 variables selected using Lasso Cox regression and the C-index metric. The proposed model is highly predictive, identifying 97% of deceased patients as high-risk and 64% of surviving patients as low-risk. In addition, how well the model-predicted risk describes the observed sequence of events is summarized by the estimated C-index of 0.90.

The risk estimation model included age, laboratory tests (D-dimer, LDH), 3 comorbidities and 6 radiomic features. The variables used to estimate the risk of developing critical illness due to SARS-CoV-2 infection are generally available during the early stages of hospitalization. Risk estimation in this phase could support clinicians in planning the treatment strategy, either by allocating resources for more aggressive treatment or admission to intensive care units for higher-risk cases, or by choosing to watch and wait for low-risk cases.

Previous studies have reported the impact of age on SARS-CoV-2 mortality. A meta-analysis demonstrated a decisive effect of age on mortality [7]. A 60% higher risk of mortality was reported in subjects aged >80 years [7]. As expected, in our study the age of non-surviving patients was also significantly higher than that of survivors (median 80 years, 95%CI: 51-107 vs. 59 years, 95%CI: 20-94; p<0.001). Age was an important predictor of disease outcome and survived the LASSO regression, thus contributing to the prediction model.

D-dimer was associated with poor outcome of patients with SARS-CoV-2, presumably due to the increased likelihood of developing pulmonary embolism with D-dimer levels above 2590 μg/ml [33]. In agreement with recently published papers [34, 35], D-dimer was the variable with the strongest association with patient outcome in our sample, as suggested by measured levels of 6.006 vs 1.133 μg/ml for deceased patients and survivors, respectively.

Similarly, elevated lactate dehydrogenase (LDH) levels have been associated with worse outcomes in patients with viral infections [36, 37]. Deceased patients in our study had significantly higher LDH levels than survivors, and LDH was selected by LASSO regression for the survival prediction model.

Literature reports have documented that chronic comorbidities are associated with increased risk of poor prognosis and fatal outcome in SARS-CoV-2 infection [8]. Similarly, in our model, pre-existing comorbidities such as cardiovascular and cerebrovascular diseases, cancer, hematological diseases and chronic obstructive pulmonary disease were significant predictors of severity of disease and death. Among comorbidities, cardiovascular disease was the strongest predictor of mortality in our study sample, with a 4.42-fold higher risk of poor prognosis, in line with the findings of meta-analyses [38–41].

CT is the most widespread imaging modality and plays a key role in the diagnosis and assessment of prognosis in patients with SARS-CoV-2 [42]. However, CT findings (such as ground-glass opacities and consolidation) are not specific for SARS-CoV-2, as similar findings can also be found in other diseases, such as seasonal influenza, that are associated with a lower risk of death.

Innovative methods of quantitative image analysis (such as radiomics) can provide an operator independent semi-quantitative approach by describing spatial and temporal information derived from images (CT, MRI, PET/CT). Until now, radiomics has found application in different scenarios of medicine such as oncology and neurodegenerative disease [43, 44, 45]. Lately it has also been used to support the “digital biopsy,” a non-invasive tissue characterization technique.

Previous studies have reported the potential use of CT radiomic features to better characterize pulmonary involvement in patients with SARS-CoV-2. Spatial information measured with radiomic features can be used to support differential diagnosis between COVID-19 and non-COVID-19 disease [46] as well as in modeling risk of death and predicting survival [47].

In our study, six radiomic features (Global_Skewness, GLCM_Correlation, GLSZM_LZE, GLSZM_HGZE, GLSZM_LZHGE, NGTDM_Busyness) were selected to model a risk profile with significant discriminative capabilities for patient outcome. Indeed, selected variables were significantly associated with patient outcome in multivariate logistic regression (p<0.001). These features contribute to risk modeling by providing quantitative information on lung CT signal intensity and heterogeneity in SARS-CoV-2 patients.

A systematic review identified several prognostic models designed to support diagnosis and predict mortality among hospitalized SARS-CoV-2 patients [5]. Most of the studies reported predictive models implemented with CT images and/or clinical variables, combined differently depending on the available data.

Only a few studies included radiomics, demographics, comorbidities and laboratory tests together as potential predictor candidates. The main disadvantage of these studies is the small sample size, which exposes the results to a high risk of bias due to inappropriate evaluation of the predictive performance on external datasets and inappropriate handling of missing data.

Our study included 694 patients with complete radiomic and clinical datasets. The predictors needed to calculate the risk of developing serious disease are usually available within the first few hours of hospital admission. Based on these variables, the model is able to estimate the risk of mortality, identifying 97% of non-survivors in the study sample. The availability of this information could be useful for optimizing treatment planning according to the estimated risk during patient admission to hospital.

The major limitation of the study lies in the lack of an external validation carried out on a dataset obtained from another hospital. Although the external validation was performed on a test set not used for training, to build a robust model and to obtain reliable performance evaluation it would be advisable to validate the model on data from different sources.

The model is not available as a ready-to-use software package. The study was designed to define and validate a predictive risk model to be subsequently developed into an application usable in clinical practice. To this end, widely used open source statistical software was used, which should make it easier to transfer the method into clinical practice.

Conclusion

A predictive model of mortality was developed in a sample of 694 SARS-CoV-2 patients using demographic, CT-radiomic and laboratory data. The model was calibrated and validated by randomly splitting the sample into training and test datasets. The final model was implemented with a combination of 12 variables including age, D-dimer, LDH, pre-existing comorbidities such as cancer, cardiovascular and cerebrovascular disease, and 6 radiomic features. The model was able to correctly identify 97% of non-survivors. Identifying high-risk individuals with predictors usually available within the first few hours of hospital admission could be useful for a better allocation of available resources when the disease is widespread.

Acknowledgments

The authors would like to thank all study participants, including Dr. Manuele Sicuteri, head of the information and communication technology unit of the S. Andrea hospital, whose support in the data retrieving and organization was fundamental.

References

1. Wu Z, McGoogan JM. Characteristics of and Important Lessons From the Coronavirus Disease 2019 (COVID-19) Outbreak in China: Summary of a Report of 72 314 Cases From the Chinese Center for Disease Control and Prevention. JAMA. 2020;323(13):1239–42. doi: 10.1001/jama.2020.2648.
2. Lambrou AS, Shirk P, Steele MK, Paul P, Paden CR, Cadwell B, et al. Genomic Surveillance for SARS-CoV-2 Variants: Predominance of the Delta (B.1.617.2) and Omicron (B.1.1.529) Variants - United States, June 2021-January 2022. MMWR Morbidity and mortality weekly report. 2022;71(6):206–11. Epub 2022/02/11. doi: 10.15585/mmwr.mm7106a4. PubMed PMID: 35143464; PubMed Central PMCID: PMC8830620.
3. Colson P, Delerce J, Burel E, Dahan J, Jouffret A, Fenollar F, et al. Emergence in southern France of a new SARS-CoV-2 variant harbouring both N501Y and E484K substitutions in the spike protein. Archives of virology. 2022. Epub 2022/02/19. doi: 10.1007/s00705-022-05385-y. PubMed PMID: 35178586; PubMed Central PMCID: PMC8853869.
4. Richardson S, Hirsch JS, Narasimhan M, Crawford JM, McGinn T, Davidson KW, et al. Presenting Characteristics, Comorbidities, and Outcomes Among 5700 Patients Hospitalized With COVID-19 in the New York City Area. JAMA. 2020;323(20):2052–9. doi: 10.1001/jama.2020.6775.
5. Wynants L, Van Calster B, Collins GS, Riley RD, Heinze G, Schuit E, et al. Prediction models for diagnosis and prognosis of covid-19: systematic review and critical appraisal. BMJ. 2020;369:m1328. Epub 2020/04/09. doi: 10.1136/bmj.m1328. PubMed PMID: 32265220; PubMed Central PMCID: PMC7222643.
6. Esposito A, Palmisano A, Cao R, Rancoita P, Landoni G, Grippaldi D, et al. Quantitative assessment of lung involvement on chest CT at admission: Impact on hypoxia and outcome in COVID-19 patients. Clinical imaging. 2021;77:194–201. Epub 2021/05/14. doi: 10.1016/j.clinimag.2021.04.033. PubMed PMID: 33984670; PubMed Central PMCID: PMC8081746.
7. Bonanad C, Garcia-Blas S, Tarazona-Santabalbina F, Sanchis J, Bertomeu-Gonzalez V, Facila L, et al. The Effect of Age on Mortality in Patients With COVID-19: A Meta-Analysis With 611,583 Subjects. Journal of the American Medical Directors Association. 2020;21(7):915–8. Epub 2020/07/18. doi: 10.1016/j.jamda.2020.05.045. PubMed PMID: 32674819; PubMed Central PMCID: PMC7247470.
8. Zhou F, Yu T, Du R, Fan G, Liu Y, Liu Z, et al. Clinical course and risk factors for mortality of adult inpatients with COVID-19 in Wuhan, China: a retrospective cohort study. The Lancet. 2020;395(10229):1054–62. doi: 10.1016/S0140-6736(20)30566-3.
9. Huang I, Pranata R, Lim MA, Oehadian A, Alisjahbana B. C-reactive protein, procalcitonin, D-dimer, and ferritin in severe coronavirus disease-2019: a meta-analysis. Therapeutic Advances in Respiratory Disease. 2020;14:1753466620937175. doi: 10.1177/1753466620937175.
10. Ponti G, Maccaferri M, Ruini C, Tomasi A, Ozben T. Biomarkers associated with COVID-19 disease progression. Critical Reviews in Clinical Laboratory Sciences. 2020;57(6):389–99. doi: 10.1080/10408363.2020.1770685.
11.↵
Colombi D, Bodini FC, Petrini M, Maffi G, Morelli N, Milanese G, et al. Well-aerated Lung on Admitting Chest CT to Predict Adverse Outcome in COVID-19 Pneumonia. Radiology. 2020;296(2):E86–E96. Epub 2020/04/18. doi: 10.1148/radiol.2020201433. PubMed PMID: 32301647; PubMed Central PMCID: PMC7233411.
12.
Huang L, Han R, Ai T, Yu P, Kang H, Tao Q, et al. Serial Quantitative Chest CT Assessment of COVID-19: A Deep Learning Approach. Radiology Cardiothoracic imaging. 2020;2(2):e200075. Epub 2021/03/30. doi: 10.1148/ryct.2020200075. PubMed PMID: 33778562; PubMed Central PMCID: PMC7233442.
13.
Revel MP, Boussouar S, de Margerie-Mellon C, Saab I, Lapotre T, Mompoint D, et al. Study of Thoracic CT in COVID-19: The STOIC Project. Radiology. 2021;301(1):E361–E70. Epub 2021/06/30. doi: 10.1148/radiol.2021210384. PubMed PMID: 34184935; PubMed Central PMCID: PMC8267782.
14.
Zhan J, Li H, Yu H, Liu X, Zeng X, Peng D, et al. 2019 novel coronavirus (COVID-19) pneumonia: CT manifestations and pattern of evolution in 110 patients in Jiangxi, China. European radiology. 2021;31(2):1059–68. Epub 2020/08/28. doi: 10.1007/s00330-020-07201-0. PubMed PMID: 32852587; PubMed Central PMCID: PMC7450162.
15.
Zhao C, Xu Y, He Z, Tang J, Zhang Y, Han J, et al. Lung Segmentation and Automatic Detection of COVID-19 Using Radiomic Features from Chest CT Images. Pattern recognition. 2021:108071. Epub 2021/06/08. doi: 10.1016/j.patcog.2021.108071. PubMed PMID: 34092815; PubMed Central PMCID: PMC8169223.
16.
Jiao Z, Choi JW, Halsey K, Tran TML, Hsieh B, Wang D, et al. Prognostication of patients with COVID-19 using artificial intelligence based on chest x-rays and clinical data: a retrospective study. The Lancet Digital health. 2021;3(5):e286–e94. Epub 2021/03/29. doi: 10.1016/S2589-7500(21)00039-X. PubMed PMID: 33773969; PubMed Central PMCID: PMC7990487.
17.
Tan HB, Xiong F, Jiang YL, Huang WC, Wang Y, Li HH, et al. The study of automatic machine learning base on radiomics of non-focus area in the first chest CT of different clinical types of COVID-19 pneumonia. Scientific reports. 2020;10(1):18926. Epub 2020/11/05. doi: 10.1038/s41598-020-76141-y. PubMed PMID: 33144676; PubMed Central PMCID: PMC7641115.
18.
Shiri I, Sorouri M, Geramifar P, Nazari M, Abdollahi M, Salimi Y, et al. Machine learning-based prognostic modeling using clinical data and quantitative radiomic features from chest CT images in COVID-19 patients. Computers in biology and medicine. 2021;132:104304. Epub 2021/03/11. doi: 10.1016/j.compbiomed.2021.104304. PubMed PMID: 33691201; PubMed Central PMCID: PMC7925235.
19.
Guiot J, Vaidyanathan A, Deprez L, Zerka F, Danthine D, Frix AN, et al. Development and Validation of an Automated Radiomic CT Signature for Detecting COVID-19. Diagnostics. 2020;11(1). Epub 2021/01/06. doi: 10.3390/diagnostics11010041. PubMed PMID: 33396587; PubMed Central PMCID: PMC7823620.
20.
Simpson S, Kay FU, Abbara S, Bhalla S, Chung JH, Chung M, et al. Radiological Society of North America Expert Consensus Statement on Reporting Chest CT Findings Related to COVID-19. Endorsed by the Society of Thoracic Radiology, the American College of Radiology, and RSNA - Secondary Publication. Journal of thoracic imaging. 2020;35(4):219–27. Epub 2020/04/24. doi: 10.1097/RTI.0000000000000524. PubMed PMID: 32324653; PubMed Central PMCID: PMC7255403.
21.
Fedorov A, Beichel R, Kalpathy-Cramer J, Finet J, Fillion-Robin JC, Pujol S, et al. 3D Slicer as an image computing platform for the Quantitative Imaging Network. Magnetic resonance imaging. 2012;30(9):1323–41. Epub 2012/07/10. doi: 10.1016/j.mri.2012.05.001. PubMed PMID: 22770690; PubMed Central PMCID: PMC3466397.
22.
Peng H, Long F, Ding C. Feature selection based on mutual information: criteria of max-dependency, max-relevance, and min-redundancy. IEEE transactions on pattern analysis and machine intelligence. 2005;27(8):1226–38. Epub 2005/08/27. doi: 10.1109/TPAMI.2005.159. PubMed PMID: 16119262.
23.
Zwanenburg A, Vallieres M, Abdalah MA, Aerts H, Andrearczyk V, Apte A, et al. The Image Biomarker Standardization Initiative: Standardized Quantitative Radiomics for High-Throughput Image-based Phenotyping. Radiology. 2020;295(2):328–38. Epub 2020/03/11. doi: 10.1148/radiol.2020191145. PubMed PMID: 32154773; PubMed Central PMCID: PMC7193906.
24.
Tibshirani R. Regression Shrinkage and Selection via the Lasso. Journal of the Royal Statistical Society Series B (Methodological). 1996;58(1):267–88.
25.
Friedman J, Hastie T, Tibshirani R. Regularization Paths for Generalized Linear Models via Coordinate Descent. Journal of statistical software. 2010;33(1):1–22. Epub 2010/09/03. PubMed PMID: 20808728; PubMed Central PMCID: PMC2929880.
26.
glmnet. http://cran.r-project.org/web/packages/glmnet.
27.
Dobbin KK, Simon RM. Optimally splitting cases for training and testing high dimensional classifiers. BMC medical genomics. 2011;4:31. Epub 2011/04/12. doi: 10.1186/1755-8794-4-31. PubMed PMID: 21477282; PubMed Central PMCID: PMC3090739.
28.
Harrell FE Jr. Regression Modeling Strategies. Springer-Verlag; 2006.
29.
Harrell FE Jr. Regression Modeling Strategies: With Applications to Linear Models, Logistic Regression, and Survival Analysis. Springer; 2001.
30.
Steyerberg EW. Clinical Prediction Models: A Practical Approach to Development, Validation, and Updating: Springer International Publishing; 2019.
31.
Sun Q, Qiu H, Huang M, Yang Y. Lower mortality of COVID-19 by early recognition and intervention: experience from Jiangsu Province. Annals of intensive care. 2020;10(1):33. Epub 2020/03/20. doi: 10.1186/s13613-020-00650-2. PubMed PMID: 32189136; PubMed Central PMCID: PMC7080931.
32.
Goyal DK, Mansab F, Iqbal A, Bhatti S. Early intervention likely improves mortality in COVID-19 infection. Clinical medicine. 2020;20(3):248–50. Epub 2020/05/03. doi: 10.7861/clinmed.2020-0214. PubMed PMID: 32357975; PubMed Central PMCID: PMC7354047.
33.
Mouhat B, Besutti M, Bouiller K, Grillet F, Monnin C, Ecarnot F, et al. Elevated D-dimers and lack of anticoagulation predict PE in severe COVID-19 patients. The European respiratory journal. 2020;56(4). Epub 2020/09/11. doi: 10.1183/13993003.01811-2020. PubMed PMID: 32907890; PubMed Central PMCID: PMC7487272.
34.
Soni M, Gopalakrishnan R, Vaishya R, Prabu P. D-dimer level is a useful predictor for mortality in patients with COVID-19: Analysis of 483 cases. Diabetes & metabolic syndrome. 2020;14(6):2245–9. Epub 2021/01/06. doi: 10.1016/j.dsx.2020.11.007. PubMed PMID: 33395786; PubMed Central PMCID: PMC7670909.
35.
Poudel A, Poudel Y, Adhikari A, Aryal BB, Dangol D, Bajracharya T, et al. D-dimer as a biomarker for assessment of COVID-19 prognosis: D-dimer levels on admission and its role in predicting disease outcome in hospitalized patients with COVID-19. PloS one. 2021;16(8):e0256744. Epub 2021/08/27. doi: 10.1371/journal.pone.0256744. PubMed PMID: 34437642; PubMed Central PMCID: PMC8389366.
36.
Henry BM, Aggarwal G, Wong J, Benoit S, Vikse J, Plebani M, et al. Lactate dehydrogenase levels predict coronavirus disease 2019 (COVID-19) severity and mortality: A pooled analysis. The American journal of emergency medicine. 2020;38(9):1722–6. Epub 2020/08/02. doi: 10.1016/j.ajem.2020.05.073. PubMed PMID: 32738466; PubMed Central PMCID: PMC7251362.
37.
Tao RJ, Luo XL, Xu W, Mao B, Dai RX, Li CW, et al. Viral infection in community acquired pneumonia patients with fever: a prospective observational study. Journal of thoracic disease. 2018;10(7):4387–95. Epub 2018/09/04. doi: 10.21037/jtd.2018.06.33. PubMed PMID: 30174887; PubMed Central PMCID: PMC6105945.
38.
Li X, Guan B, Su T, Liu W, Chen M, Bin Waleed K, et al. Impact of cardiovascular disease and cardiac injury on in-hospital mortality in patients with COVID-19: a systematic review and meta-analysis. Heart. 2020;106(15):1142–7. Epub 2020/05/29. doi: 10.1136/heartjnl-2020-317062. PubMed PMID: 32461330; PubMed Central PMCID: PMC7295861.
39.
Borges do Nascimento IJ, Cacic N, Abdulazeem HM, von Groote TC, Jayarajah U, Weerasekara I, et al. Novel Coronavirus Infection (COVID-19) in Humans: A Scoping Review and Meta-Analysis. Journal of clinical medicine. 2020;9(4). Epub 2020/04/03. doi: 10.3390/jcm9040941. PubMed PMID: 32235486; PubMed Central PMCID: PMC7230636.
40.
Nishiga M, Wang DW, Han Y, Lewis DB, Wu JC. COVID-19 and cardiovascular disease: from basic mechanisms to clinical perspectives. Nature reviews Cardiology. 2020;17(9):543–58. Epub 2020/07/22. doi: 10.1038/s41569-020-0413-9. PubMed PMID: 32690910; PubMed Central PMCID: PMC7370876.
41.
Wang BX. Susceptibility and prognosis of COVID-19 patients with cardiovascular disease. Open heart. 2020;7(1). Epub 2020/06/27. doi: 10.1136/openhrt-2020-001310. PubMed PMID: 32587104; PubMed Central PMCID: PMC7319720.
42.
Ai T, Yang Z, Hou H, Zhan C, Chen C, Lv W, et al. Correlation of Chest CT and RT-PCR Testing for Coronavirus Disease 2019 (COVID-19) in China: A Report of 1014 Cases. Radiology. 2020;296(2):E32–E40. Epub 2020/02/27. doi: 10.1148/radiol.2020200642. PubMed PMID: 32101510; PubMed Central PMCID: PMC7233399.
43.
Sun R, Limkin EJ, Vakalopoulou M, Dercle L, Champiat S, Han SR, et al. A radiomics approach to assess tumour-infiltrating CD8 cells and response to anti-PD-1 or anti-PD-L1 immunotherapy: an imaging biomarker, retrospective multicohort study. The Lancet Oncology. 2018;19(9):1180–91. Epub 2018/08/19. doi: 10.1016/S1470-2045(18)30413-3. PubMed PMID: 30120041.
44.
Ciarmiello A, Giovannini E, Pastorino S, Ferrando O, Foppiano F, Mannironi A, et al. Machine Learning Model to Predict Diagnosis of Mild Cognitive Impairment by Using Radiomic and Amyloid Brain PET. Clinical nuclear medicine. 2023;48(1):1–7. Epub 2022/10/15. doi: 10.1097/RLU.0000000000004433. PubMed PMID: 36240660.
45.
Ciarmiello A, Giovannini E, Florimonte L, Bonatto E, Bareggi C, Milano A, et al. Machine learning radiomics for prediction of survival in non-small cell lung cancer patients studied with PET/CT and FDG. Annals of Oncology. 2021;32:01/095. Epub 2021/01/09. doi: 10.1016/j.annonc.2021.08.779.
46.
Hu Z, Yang Z, Lafata KJ, Yin FF, Wang C. A radiomics-boosted deep-learning model for COVID-19 and non-COVID-19 pneumonia classification using chest x-ray images. Medical physics. 2022;49(5):3213–22. Epub 2022/03/10. doi: 10.1002/mp.15582. PubMed PMID: 35263458; PubMed Central PMCID: PMC9088469.
47.
Shiri I, Salimi Y, Pakbin M, Hajianfar G, Avval AH, Sanaat A, et al. COVID-19 prognostic modeling using CT radiomic features and machine learning algorithms: Analysis of a multi-institutional dataset of 14,339 patients. Computers in biology and medicine. 2022;145:105467. Epub 2022/04/05. doi: 10.1016/j.compbiomed.2022.105467. PubMed PMID: 35378436; PubMed Central PMCID: PMC8964015.

Posted June 29, 2023.


[1] 1.↵
Wu Z, McGoogan JM. Characteristics of and Important Lessons From the Coronavirus Disease 2019 (COVID-19) Outbreak in China: Summary of a Report of 72 314 Cases From the Chinese Center for Disease Control and Prevention. Jama. 2020;323(13):1239–42. doi: 10.1001/jama.2020.2648.
OpenUrl CrossRef PubMed Google Scholar

[2] 2.
Lambrou AS, Shirk P, Steele MK, Paul P, Paden CR, Cadwell B, et al. Genomic Surveillance for SARS-CoV-2 Variants: Predominance of the Delta (B.1.617.2) and Omicron (B.1.1.529) Variants - United States, June 2021-January 2022. MMWR Morbidity and mortality weekly report. 2022;71(6):206–11. Epub 2022/02/11. doi: 10.15585/mmwr.mm7106a4. PubMed PMID: 35143464; PubMed Central PMCID: PMC8830620 Journal Editors form for disclosure of potential conflicts of interest. No potential conflicts of interest were disclosed.
OpenUrl CrossRef PubMed Google Scholar

[3] 3.↵
Colson P, Delerce J, Burel E, Dahan J, Jouffret A, Fenollar F, et al. Emergence in southern France of a new SARS-CoV-2 variant harbouring both N501Y and E484K substitutions in the spike protein. Archives of virology. 2022. Epub 2022/02/19. doi: 10.1007/s00705-022-05385-y. PubMed PMID: 35178586; PubMed Central PMCID: PMC8853869.
OpenUrl CrossRef PubMed Google Scholar

[4] 4.↵
Richardson S, Hirsch JS, Narasimhan M, Crawford JM, McGinn T, Davidson KW, et al. Presenting Characteristics, Comorbidities, and Outcomes Among 5700 Patients Hospitalized With COVID-19 in the New York City Area. Jama. 2020;323(20):2052–9. doi: 10.1001/jama.2020.6775.
OpenUrl CrossRef PubMed Google Scholar

[5] 5.↵
Wynants L, Van Calster B, Collins GS, Riley RD, Heinze G, Schuit E, et al. Prediction models for diagnosis and prognosis of covid-19: systematic review and critical appraisal. Bmj. 2020;369:m1328. Epub 2020/04/09. doi: 10.1136/bmj.m1328. PubMed PMID: 32265220; PubMed Central PMCID: PMC7222643 at www.icmje.org/coi_disclosure.pdf and declare: no support from any organisation for the submitted work; no competing interests with regards to the submitted work; LW discloses support from Research Foundation-Flanders (FWO); RDR reports personal fees as a statistics editor for The BMJ (since 2009), consultancy fees for Roche for giving meta-analysis teaching and advice in October 2018, and personal fees for delivering in-house training courses at Barts and The London School of Medicine and Dentistry, and also the Universities of Aberdeen, Exeter, and Leeds, all outside the submitted work.
OpenUrl Abstract/FREE Full Text Google Scholar

[6] 6.↵
Esposito A, Palmisano A, Cao R, Rancoita P, Landoni G, Grippaldi D, et al. Quantitative assessment of lung involvement on chest CT at admission: Impact on hypoxia and outcome in COVID-19 patients. Clinical imaging. 2021;77:194–201. Epub 2021/05/14. doi: 10.1016/j.clinimag.2021.04.033. PubMed PMID: 33984670; PubMed Central PMCID: PMC8081746.
OpenUrl CrossRef PubMed Google Scholar

[7] 7.↵
Bonanad C, Garcia-Blas S, Tarazona-Santabalbina F, Sanchis J, Bertomeu-Gonzalez V, Facila L, et al. The Effect of Age on Mortality in Patients With COVID-19: A Meta-Analysis With 611,583 Subjects. Journal of the American Medical Directors Association. 2020;21(7):915–8. Epub 2020/07/18. doi: 10.1016/j.jamda.2020.05.045. PubMed PMID: 32674819; PubMed Central PMCID: PMC7247470.
OpenUrl CrossRef PubMed Google Scholar

[8] 8.↵
Zhou F, Yu T, Du R, Fan G, Liu Y, Liu Z, et al. Clinical course and risk factors for mortality of adult inpatients with COVID-19 in Wuhan, China: a retrospective cohort study. The Lancet. 2020;395(10229):1054–62. doi: https://doi.org/10.1016/S0140-6736(20)30566-3.
OpenUrl Google Scholar

[9] 9.↵
Huang I, Pranata R, Lim MA, Oehadian A, Alisjahbana B. C-reactive protein, procalcitonin, D-dimer, and ferritin in severe coronavirus disease-2019: a meta-analysis. Therapeutic Advances in Respiratory Disease. 2020;14:1753466620937175. doi: 10.1177/1753466620937175.
OpenUrl CrossRef Google Scholar

[10] 10.↵
Ponti G, Maccaferri M, Ruini C, Tomasi A, Ozben T. Biomarkers associated with COVID-19 disease progression. Critical Reviews in Clinical Laboratory Sciences. 2020;57(6):389–99. doi: 10.1080/10408363.2020.1770685.
OpenUrl CrossRef PubMed Google Scholar

[11] 11.↵
Colombi D, Bodini FC, Petrini M, Maffi G, Morelli N, Milanese G, et al. Well-aerated Lung on Admitting Chest CT to Predict Adverse Outcome in COVID-19 Pneumonia. Radiology. 2020;296(2):E86–E96. Epub 2020/04/18. doi: 10.1148/radiol.2020201433. PubMed PMID: 32301647; PubMed Central PMCID: PMC7233411.
OpenUrl CrossRef PubMed Google Scholar

[12] 12.
Huang L, Han R, Ai T, Yu P, Kang H, Tao Q, et al. Serial Quantitative Chest CT Assessment of COVID-19: A Deep Learning Approach. Radiology Cardiothoracic imaging. 2020;2(2):e200075. Epub 2021/03/30. doi: 10.1148/ryct.2020200075. PubMed PMID: 33778562; PubMed Central PMCID: PMC7233442 R.H. disclosed no relevant relationships. T.A. disclosed no relevant relationships. P.Y. disclosed no relevant relationships. H.K. disclosed no relevant relationships. Q.T. disclosed no relevant relationships. L.X. disclosed no relevant relationships.
OpenUrl CrossRef PubMed Google Scholar

[13] 13.
Revel MP, Boussouar S, de Margerie-Mellon C, Saab I, Lapotre T, Mompoint D, et al. Study of Thoracic CT in COVID-19: The STOIC Project. Radiology. 2021;301(1):E361–E70. Epub 2021/06/30. doi: 10.1148/radiol.2021210384. PubMed PMID: 34184935; PubMed Central PMCID: PMC8267782.
OpenUrl CrossRef PubMed Google Scholar

[14] 14.
Zhan J, Li H, Yu H, Liu X, Zeng X, Peng D, et al. 2019 novel coronavirus (COVID-19) pneumonia: CT manifestations and pattern of evolution in 110 patients in Jiangxi, China. European radiology. 2021;31(2):1059–68. Epub 2020/08/28. doi: 10.1007/s00330-020-07201-0. PubMed PMID: 32852587; PubMed Central PMCID: PMC7450162.
OpenUrl CrossRef PubMed Google Scholar

[15] 15.↵
Zhao C, Xu Y, He Z, Tang J, Zhang Y, Han J, et al. Lung Segmentation and Automatic Detection of COVID-19 Using Radiomic Features from Chest CT Images. Pattern recognition. 2021:108071. Epub 2021/06/08. doi: 10.1016/j.patcog.2021.108071. PubMed PMID: 34092815; PubMed Central PMCID: PMC8169223.
OpenUrl CrossRef PubMed Google Scholar

[16] 16.↵
Jiao Z, Choi JW, Halsey K, Tran TML, Hsieh B, Wang D, et al. Prognostication of patients with COVID-19 using artificial intelligence based on chest x-rays and clinical data: a retrospective study. The Lancet Digital health. 2021;3(5):e286–e94. Epub 2021/03/29. doi: 10.1016/S2589-7500(21)00039-X. PubMed PMID: 33773969; PubMed Central PMCID: PMC7990487 Service, Radiological Society of North America, and National Cancer Institute of the National Institute of Health, during the conduct of the study. XF currently works in Carina Medical, a for-profit organisation that develops clinical products, outside of the submitted work. KC reports grants from National Institute of Biomedical Imaging and Bioengineering and National Cancer Institute of the National Institute of Health, during the conduct of the study. YF reports grants from National Institute of Health, during the conduct of the study. All other authors declare no competing interests.
OpenUrl CrossRef PubMed Google Scholar

[17] 17.↵
Tan HB, Xiong F, Jiang YL, Huang WC, Wang Y, Li HH, et al. The study of automatic machine learning base on radiomics of non-focus area in the first chest CT of different clinical types of COVID-19 pneumonia. Scientific reports. 2020;10(1):18926. Epub 2020/11/05. doi: 10.1038/s41598-020-76141-y. PubMed PMID: 33144676; PubMed Central PMCID: PMC7641115.
OpenUrl CrossRef PubMed Google Scholar

[18] 18.↵
Shiri I, Sorouri M, Geramifar P, Nazari M, Abdollahi M, Salimi Y, et al. Machine learning-based prognostic modeling using clinical data and quantitative radiomic features from chest CT images in COVID-19 patients. Computers in biology and medicine. 2021;132:104304. Epub 2021/03/11. doi: 10.1016/j.compbiomed.2021.104304. PubMed PMID: 33691201; PubMed Central PMCID: PMC7925235.
OpenUrl CrossRef PubMed Google Scholar

[19] 19.↵
Guiot J, Vaidyanathan A, Deprez L, Zerka F, Danthine D, Frix AN, et al. Development and Validation of an Automated Radiomic CT Signature for Detecting COVID-19. Diagnostics. 2020;11(1). Epub 2021/01/06. doi: 10.3390/diagnostics11010041. PubMed PMID: 33396587; PubMed Central PMCID: PMC7823620.
OpenUrl CrossRef PubMed Google Scholar

[20] 20.↵
Simpson S, Kay FU, Abbara S, Bhalla S, Chung JH, Chung M, et al. Radiological Society of North America Expert Consensus Statement on Reporting Chest CT Findings Related to COVID-19. Endorsed by the Society of Thoracic Radiology, the American College of Radiology, and RSNA - Secondary Publication. Journal of thoracic imaging. 2020;35(4):219–27. Epub 2020/04/24. doi: 10.1097/RTI.0000000000000524. PubMed PMID: 32324653; PubMed Central PMCID: PMC7255403.
OpenUrl CrossRef PubMed Google Scholar

[21] 21.↵
Fedorov A, Beichel R, Kalpathy-Cramer J, Finet J, Fillion-Robin JC, Pujol S, et al. 3D Slicer as an image computing platform for the Quantitative Imaging Network. Magnetic resonance imaging. 2012;30(9):1323–41. Epub 2012/07/10. doi: 10.1016/j.mri.2012.05.001. PubMed PMID: 22770690; PubMed Central PMCID: PMC3466397.
OpenUrl CrossRef PubMed Web of Science Google Scholar

[22] 22.↵
Peng H, Long F, Ding C. Feature selection based on mutual information: criteria of max-dependency, max-relevance, and min-redundancy. IEEE transactions on pattern analysis and machine intelligence. 2005;27(8):1226–38. Epub 2005/08/27. doi: 10.1109/TPAMI.2005.159. PubMed PMID: 16119262.
OpenUrl CrossRef PubMed Web of Science Google Scholar

[23] 23.↵
Zwanenburg A, Vallieres M, Abdalah MA, Aerts H, Andrearczyk V, Apte A, et al. The Image Biomarker Standardization Initiative: Standardized Quantitative Radiomics for High-Throughput Image-based Phenotyping. Radiology. 2020;295(2):328–38. Epub 2020/03/11. doi: 10.1148/radiol.2020191145. PubMed PMID: 32154773; PubMed Central PMCID: PMC7193906.
OpenUrl CrossRef PubMed Google Scholar

[24] 24.↵
Tibshirani R. Regression Shrinkage and Selection via the Lasso. Journal of the Royal Statistical Society Series B (Methodological). 1996;58(1):267–88.
OpenUrl CrossRef Web of Science Google Scholar

[25] 25.↵
Friedman J, Hastie T, Tibshirani R. Regularization Paths for Generalized Linear Models via Coordinate Descent. Journal of statistical software. 2010;33(1):1–22. Epub 2010/09/03. PubMed PMID: 20808728; PubMed Central PMCID: PMC2929880.
OpenUrl CrossRef PubMed Web of Science Google Scholar

[26] 26.↵
glmnet. http://cranr-projectorg/web/packages/glmnet,.

[27] 27.↵
Dobbin KK, Simon RM. Optimally splitting cases for training and testing high dimensional classifiers. BMC medical genomics. 2011;4:31. Epub 2011/04/12. doi: 10.1186/1755-8794-4-31. PubMed PMID: 21477282; PubMed Central PMCID: PMC3090739.
OpenUrl CrossRef PubMed Google Scholar

[28] 28.↵
Harrell FE, Jr.. Regression Modeling Strategies. Springer-Verlag. 2006.
Google Scholar

[29] 29.↵
Harrell FE, Jrl FEH. Regression Modeling Strategies: With Applications to Linear Models, Logistic Regression, and Survival Analysis: Springer; 2001.
Google Scholar

[30] 30.↵
Steyerberg EW. Clinical Prediction Models: A Practical Approach to Development, Validation, and Updating: Springer International Publishing; 2019.
Google Scholar

[31] 31.↵
Sun Q, Qiu H, Huang M, Yang Y. Lower mortality of COVID-19 by early recognition and intervention: experience from Jiangsu Province. Annals of intensive care. 2020;10(1):33. Epub 2020/03/20. doi: 10.1186/s13613-020-00650-2. PubMed PMID: 32189136; PubMed Central PMCID: PMC7080931 product mentioned in this paper.
OpenUrl CrossRef PubMed Google Scholar

[32] 32.↵
Goyal DK, Mansab F, Iqbal A, Bhatti S. Early intervention likely improves mortality in COVID-19 infection. Clinical medicine. 2020;20(3):248–50. Epub 2020/05/03. doi: 10.7861/clinmed.2020-0214. PubMed PMID: 32357975; PubMed Central PMCID: PMC7354047.
OpenUrl Abstract/FREE Full Text Google Scholar

[33] 33.↵
Mouhat B, Besutti M, Bouiller K, Grillet F, Monnin C, Ecarnot F, et al. Elevated D-dimers and lack of anticoagulation predict PE in severe COVID-19 patients. The European respiratory journal. 2020;56(4). Epub 2020/09/11. doi: 10.1183/13993003.01811-2020. PubMed PMID: 32907890; PubMed Central PMCID: PMC7487272 Besutti has nothing to disclose. Conflict of interest: K. Bouiller has nothing to disclose. Conflict of interest: F. Grillet has nothing to disclose. Conflict of interest: C. Monnin has nothing to disclose. Conflict of interest: F. Ecarnot has nothing to disclose. Conflict of interest: J. Behr has nothing to disclose. Conflict of interest: G. Capellier has nothing to disclose. Conflict of interest: T. Soumagne has nothing to disclose. Conflict of interest: S. Pili-Floury has nothing to disclose. Conflict of interest: G. Besch has nothing to disclose. Conflict of interest: G. Mourey has nothing to disclose. Conflict of interest: Q. Lepiller has nothing to disclose. Conflict of interest: C. Chirouze has nothing to disclose. Conflict of interest: F. Schiele has nothing to disclose. Conflict of interest: R. Chopard has nothing to disclose. Conflict of interest: N. Meneveau has nothing to disclose.
OpenUrl Abstract/FREE Full Text Google Scholar

[34] 34.↵
Soni M, Gopalakrishnan R, Vaishya R, Prabu P. D-dimer level is a useful predictor for mortality in patients with COVID-19: Analysis of 483 cases. Diabetes & metabolic syndrome. 2020;14(6):2245–9. Epub 2021/01/06. doi: 10.1016/j.dsx.2020.11.007. PubMed PMID: 33395786; PubMed Central PMCID: PMC7670909.
OpenUrl CrossRef PubMed Google Scholar

[35] 35.↵
Poudel A, Poudel Y, Adhikari A, Aryal BB, Dangol D, Bajracharya T, et al. D-dimer as a biomarker for assessment of COVID-19 prognosis: D-dimer levels on admission and its role in predicting disease outcome in hospitalized patients with COVID-19. PloS one. 2021;16(8):e0256744. Epub 2021/08/27. doi: 10.1371/journal.pone.0256744. PubMed PMID: 34437642; PubMed Central PMCID: PMC8389366.
OpenUrl CrossRef PubMed Google Scholar

[36] 36.↵
Henry BM, Aggarwal G, Wong J, Benoit S, Vikse J, Plebani M, et al. Lactate dehydrogenase levels predict coronavirus disease 2019 (COVID-19) severity and mortality: A pooled analysis. The American journal of emergency medicine. 2020;38(9):1722–6. Epub 2020/08/02. doi: 10.1016/j.ajem.2020.05.073. PubMed PMID: 32738466; PubMed Central PMCID: PMC7251362.
OpenUrl CrossRef PubMed Google Scholar

[37] 37.↵
Tao RJ, Luo XL, Xu W, Mao B, Dai RX, Li CW, et al. Viral infection in community acquired pneumonia patients with fever: a prospective observational study. Journal of thoracic disease. 2018;10(7):4387–95. Epub 2018/09/04. doi: 10.21037/jtd.2018.06.33. PubMed PMID: 30174887; PubMed Central PMCID: PMC6105945.
OpenUrl CrossRef PubMed Google Scholar

[38] 38.↵
Li X, Guan B, Su T, Liu W, Chen M, Bin Waleed K, et al. Impact of cardiovascular disease and cardiac injury on in-hospital mortality in patients with COVID-19: a systematic review and meta-analysis. Heart. 2020;106(15):1142–7. Epub 2020/05/29. doi: 10.1136/heartjnl-2020-317062. PubMed PMID: 32461330; PubMed Central PMCID: PMC7295861.
OpenUrl Abstract/FREE Full Text Google Scholar

[39] 39.
Borges do Nascimento IJ, Cacic N, Abdulazeem HM, von Groote TC, Jayarajah U, Weerasekara I, et al. Novel Coronavirus Infection (COVID-19) in Humans: A Scoping Review and Meta-Analysis. Journal of clinical medicine. 2020;9(4). Epub 2020/04/03. doi: 10.3390/jcm9040941. PubMed PMID: 32235486; PubMed Central PMCID: PMC7230636.
OpenUrl CrossRef PubMed Google Scholar

[40] 40.
Nishiga M, Wang DW, Han Y, Lewis DB, Wu JC. COVID-19 and cardiovascular disease: from basic mechanisms to clinical perspectives. Nature reviews Cardiology. 2020;17(9):543–58. Epub 2020/07/22. doi: 10.1038/s41569-020-0413-9. PubMed PMID: 32690910; PubMed Central PMCID: PMC7370876.

[41]
Wang BX. Susceptibility and prognosis of COVID-19 patients with cardiovascular disease. Open heart. 2020;7(1). Epub 2020/06/27. doi: 10.1136/openhrt-2020-001310. PubMed PMID: 32587104; PubMed Central PMCID: PMC7319720.

[42]
Ai T, Yang Z, Hou H, Zhan C, Chen C, Lv W, et al. Correlation of Chest CT and RT-PCR Testing for Coronavirus Disease 2019 (COVID-19) in China: A Report of 1014 Cases. Radiology. 2020;296(2):E32–E40. Epub 2020/02/27. doi: 10.1148/radiol.2020200642. PubMed PMID: 32101510; PubMed Central PMCID: PMC7233399.

[43]
Sun R, Limkin EJ, Vakalopoulou M, Dercle L, Champiat S, Han SR, et al. A radiomics approach to assess tumour-infiltrating CD8 cells and response to anti-PD-1 or anti-PD-L1 immunotherapy: an imaging biomarker, retrospective multicohort study. The Lancet Oncology. 2018;19(9):1180–91. Epub 2018/08/19. doi: 10.1016/S1470-2045(18)30413-3. PubMed PMID: 30120041.

[44]
Ciarmiello A, Giovannini E, Pastorino S, Ferrando O, Foppiano F, Mannironi A, et al. Machine Learning Model to Predict Diagnosis of Mild Cognitive Impairment by Using Radiomic and Amyloid Brain PET. Clinical nuclear medicine. 2023;48(1):1–7. Epub 2022/10/15. doi: 10.1097/RLU.0000000000004433. PubMed PMID: 36240660.

[45]
Ciarmiello A, Giovannini E, Florimonte L, Bonatto E, Bareggi C, Milano A, et al. Machine learning radiomics for prediction of survival in non-small cell lung cancer patients studied with PET/CT and FDG. Annals of Oncology. 2021;32:01/095. Epub 2021/01/09. doi: 10.1016/j.annonc.2021.08.779.

[46]
Hu Z, Yang Z, Lafata KJ, Yin FF, Wang C. A radiomics-boosted deep-learning model for COVID-19 and non-COVID-19 pneumonia classification using chest x-ray images. Medical physics. 2022;49(5):3213–22. Epub 2022/03/10. doi: 10.1002/mp.15582. PubMed PMID: 35263458; PubMed Central PMCID: PMC9088469.

[47]
Shiri I, Salimi Y, Pakbin M, Hajianfar G, Avval AH, Sanaat A, et al. COVID-19 prognostic modeling using CT radiomic features and machine learning algorithms: Analysis of a multi-institutional dataset of 14,339 patients. Computers in biology and medicine. 2022;145:105467. Epub 2022/04/05. doi: 10.1016/j.compbiomed.2022.105467. PubMed PMID: 35378436; PubMed Central PMCID: PMC8964015.
