PT - JOURNAL ARTICLE AU - Li, Matthew D. AU - Arun, Nishanth Thumbavanam AU - Gidwani, Mishka AU - Chang, Ken AU - Deng, Francis AU - Little, Brent P. AU - Mendoza, Dexter P. AU - Lang, Min AU - Lee, Susanna I. AU - O’Shea, Aileen AU - Parakh, Anushri AU - Singh, Praveer AU - Kalpathy-Cramer, Jayashree TI - Automated assessment of COVID-19 pulmonary disease severity on chest radiographs using convolutional Siamese neural networks AID - 10.1101/2020.05.20.20108159 DP - 2020 Jan 01 TA - medRxiv PG - 2020.05.20.20108159 4099 - http://medrxiv.org/content/early/2020/05/23/2020.05.20.20108159.short 4100 - http://medrxiv.org/content/early/2020/05/23/2020.05.20.20108159.full AB - Purpose To develop an automated measure of COVID-19 pulmonary disease severity on chest radiographs (CXRs), for longitudinal disease evaluation and clinical risk stratification.Materials and Methods A convolutional Siamese neural network-based algorithm was trained to output a measure of pulmonary disease severity on anterior-posterior CXRs (pulmonary x-ray severity (PXS) score), using weakly-supervised pretraining on ~160,000 images from CheXpert and transfer learning on 314 CXRs from patients with COVID-19. The algorithm was evaluated on internal and external test sets from different hospitals, containing 154 and 113 CXRs respectively. The PXS score was correlated with a radiographic severity score independently assigned by two thoracic radiologists and one in-training radiologist. For 92 internal test set patients with follow-up CXRs, the change in PXS score was compared to radiologist assessments of change. The association between PXS score and subsequent intubation or death was assessed.Results The PXS score correlated with the radiographic pulmonary disease severity score assigned to CXRs in the COVID-19 internal and external test sets (ρ=0.84 and ρ=0.78 respectively). The direction of change in PXS score in follow-up CXRs agreed with radiologist assessment (ρ=0.74). In patients not intubated on the admission CXR, the PXS score predicted subsequent intubation or death within three days of hospital admission (area under the receiver operator characteristic curve=0.80 (95%CI 0.75-0.85)).Conclusion A Siamese neural network-based severity score automatically measures COVID-19 pulmonary disease severity in chest radiographs, which can be scaled and rapidly deployed for clinical triage and workflow optimization.SUMMARY A convolutional Siamese neural network-based algorithm can calculate a continuous radiographic pulmonary disease severity score in COVID-19 patients, which can be used for longitudinal disease evaluation and clinical risk stratification.KEY RESULTSA Siamese neural network-based severity score correlates with radiologist-annotated pulmonary disease severity on chest radiographs from patients with COVID-19 (ρ=0.84 and ρ=0.78 in internal and external test sets respectively).The direction of change in the severity score in follow-up radiographs is concordant with radiologist assessment (ρ=0.74).The admission chest radiograph severity score can help predict subsequent intubation or death within three days of admission (receiver operator characteristic area under the curve=0.80).Competing Interest StatementThe authors have declared no competing interest.Funding StatementResearch reported in this publication was supported by a training grant from the National Institute of Biomedical Imaging and Bioengineering (NIBIB) of the National Institutes of Health under award number 5T32EB1680 and by the National Cancer Institute (NCI) of the National Institutes of Health under Award Number F30CA239407 to K. Chang. This study was supported by National Institutes of Health grants U01 CA154601, U24 CA180927, and U24 CA180918 to J. Kalpathy-Cramer. This research was carried out in whole or in part at the Athinoula A. Martinos Center for Biomedical Imaging at the Massachusetts General Hospital, using resources provided by the Center for Functional Neuroimaging Technologies, P41EB015896, a P41 Biotechnology Resource Grant supported by the National Institute of Biomedical Imaging and Bioengineering (NIBIB), National Institutes of Health. GPU computing resources were provided by the MGH and BWH Center for Clinical Data Science.Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesAll necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesThe CheXpert data is available through the Stanford ML Group (https://stanfordmlgroup.github.io/competitions/chexpert/). Patient data used in this study is not publicly available.AbbreviationsCOVID-19coronavirus disease 2019CXRchest radiographRT-PCRreverse transcriptase-polymerase chain reactionAPanterior-posteriormRALE scoremodified Radiographic Assessment of Lung Edema scorePXS scorepulmonary x-ray severity scoreAUCarea under the curve.