Federated Learning of Electronic Health Records Improves Mortality Prediction in Patients Hospitalized with COVID-19

Akhil Vaid; Suraj K Jaladanki; Jie Xu; Shelly Teng; Arvind Kumar; Samuel Lee; Sulaiman Somani; Ishan Paranjpe; Jessica K De Freitas; Tingyi Wanyan; Kipp W Johnson; Mesude Bicak; Eyal Klang; Young Joon Kwon; Anthony Costa; Shan Zhao; Riccardo Miotto; Alexander W Charney; Erwin Böttinger; Zahi A Fayad; Girish N Nadkarni; Fei Wang; Benjamin S Glicksberg

doi:10.1101/2020.08.11.20172809

ABSTRACT

Machine learning (ML) models require large datasets which may be siloed across different healthcare institutions. Using federated learning, a ML technique that avoids locally aggregating raw clinical data across multiple institutions, we predict mortality within seven days in hospitalized COVID-19 patients. Patient data was collected from Electronic Health Records (EHRs) from five hospitals within the Mount Sinai Health System (MSHS). Logistic Regression with L1 regularization (LASSO) and Multilayer Perceptron (MLP) models were trained using local data at each site, a pooled model with combined data from all five sites, and a federated model that only shared parameters with a central aggregator. Both the federated LASSO and federated MLP models performed better than their local model counterparts at four hospitals. The federated MLP model also outperformed the federated LASSO model at all hospitals. Federated learning shows promise in COVID-19 EHR data to develop robust predictive models without compromising patient privacy.

INTRODUCTION

Background and Significance

Coronavirus disease 2019 (COVID-19) has led to over 700,000 deaths worldwide and other devastating outcomes.[1] Clinical manifestations occur in heterogeneous patterns that are not yet well understood, creating an urgent need to understand factors that lead to negative outcomes to aid delivery of more tailored treatments. This task requires learning from large, diverse patient populations. However, pertinent patient data are siloed within hospitals worldwide. While many studies have produced significant findings in regards to COVID-19 outcomes using data from single hospitals, larger representation from additional populations is needed to derive generalizable outcomes, especially within the context of machine learning (ML).[2-11] Large-scale initiatives such as the National COVID Cohort Collaborative and the 4CE Consortium have joined globally distributed hospitals to examine association analyses, but meta-analysis does not allow joint machine learning from diverse patient data.[12-13]

In light of patient privacy, federated learning is emerging within the biomedical field as a promising strategy particularly in the context of COVID-19 where patient data are fragmented across health systems which are not representative of the entire population, leading to predictions that may not be generalizable.[14] Briefly, federated learning allows for decentralized refinement of independently built ML models. This is done via iterative exchange of model parameters to a central aggregator without sharing raw data.

A limited number of studies have assessed ML models within a federated learning framework for outcome prediction in COVID-19 but show promise. Kumar et al. built a blockchain-based federated learning schema and achieved enhanced sensitivity for COVID-19 detection from lung CT scans.[15] Xu et al. used deep learning to identify COVID-19 on CT scans from multiple hospitals in China [16] and found that models built on hospitals in one region did not generalize well to others, but were able to achieve considerable performance gains when using a federated learning approach. For more detailed background regarding COVID-19, ML in the context of COVID-19, challenges for multi-institutional collaborations, and federated learning, see Supplementary Materials.

While it has been proposed, to our knowledge there are no published studies that implement and assess the utility of federated learning to predict COVID-19 outcomes from Electronic Health Record (EHR) data.[17] In this work, we are the first to build federated learning models to predict mortality in patients diagnosed with COVID-19 within seven days of hospital admission using EHR data.

MATERIALS AND METHODS

Clinical Data Source and Study Population

Data from COVID-19 positive patients (n=4029) were derived from Epic EHR systems of five hospitals within the Mount Sinai Health System (MSHS) in New York City (NYC): Mount Sinai Brooklyn (MSB; n=611); Mount Sinai Hospital (MSH; n=1644); Mount Sinai Morningside (MSM); n=749); Mount Sinai Queens (MSQ; n=540); and Mount Sinai West (MSW; n=485). A schematic of all study inclusion criteria is shown in Figure 1A, further detailed in Supplementary Materials. Details on cross-hospital demographic and clinical comparisons are also in the Supplementary Materials.

Figure 1.

Study Design and Model Workflow.

(A) Criteria for patient inclusion in study. (B) Overview of local and pooled models. Local models only utilize data from the site itself while pooled models incorporate data from all sites. Both local and pooled MLP and LASSO models were utilized. (C) Overview of federated model. Parameters from a central aggregator are shared with each site, and sites do not have direct access to clinical data from others. After models are trained locally at a site, parameters with and without added noise are sent back to the central aggregator to update federated model parameters. A federated LASSO and federated MLP model were utilized.

Study Design

We performed multiple experiments as outlined in Figures 1B and 1C (see Model Development and Selection). First, we developed classifiers using and testing on local data from each hospital separately. We also built a federated learning model by averaging the parameters of models from each individual hospital. Finally, we combined all individual hospital data into a superset to develop a “pooled” model. Although the pooled model is not practically feasible, it represents an ideal framework.

Data included demographics, past medical history, vital signs, lab tests, and outcomes for all patients (Table 1, Supplementary Table 1). Due to varying prevalence across hospitals, we assessed multiple class balancing techniques (Supplementary Table 2). To simulate federated learning in practice, we also performed experiments with the addition of Gaussian noise (see Supplementary Materials).

View this table:

Table 1:

Demographics of Hospitalized COVID-19 Patients.

Demographics (age, gender, ethnicity, race, and past medical history) for all patients (N = 4029) included in study. Inter-hospital comparisons for categorical data were assessed with chi-square tests and numerical data using Kruskal-Wallis tests with Bonferroni-adjusted p-values reported. Values with fewer than ten patients per field are not provided to protect patient privacy.

To promote replicability, we placed our study within the Transparent Reporting of a Multivariable Prediction Model for Individual Prognosis or Diagnosis (TRIPOD) guidelines (Supplementary Table 3) and released our code under the GPLv3 license (Supplementary Materials). Software information can be found within the Supplementary Materials.

Definition of Outcomes

The primary outcome was mortality within seven days of admission.

Model Development and Selection

We generated two baseline conventional predictive models - a multilayer perceptron (MLP) and logistic regression with L1-regularization, or least absolute shrinkage and selection operator (LASSO). For consistency and to enable direct comparisons, each MLP model was built with the same architecture. We provide more information on model architecture and tuning in the Supplemental Materials. MLP and LASSO models were fit on all five hospitals.

Our primary model of interest was a federated learning model. In this model, training was performed in different sites, and parameters were shared to a central location, as depicted in Figure 1C. The process was as follows: a central aggregator initialized the federated model with random parameters. This model was sent to each site, then trained for one epoch. Next, model parameters were sent back to the central aggregator where federated averaging was performed. Updated parameters from the central aggregator were then sent back to each site, and this cycle was repeated for multiple epochs. Federated averaging scales the parameters of each site according to the number of available data points and sums all parameters by layer. Through this technique, federated models did not receive any raw data.

Experimental Evaluation

All models were trained and evaluated using 10-fold stratified cross validation. We utilized the models’ probability scores to calculate areas under the receiver operating characteristic curve (AUC-ROC) with averages across the 10 folds.

RESULTS

Intercohort Comparisons

EHR data consisted of patient demographics, past medical history (Table 1), and admission vitals and labs (Supplementary Table 1). We found significant differences across hospitals in the proportion of outcome, specifically mortality within seven days (p<0.001), which ranged from 5.6% to 24.2% (Table 1). There were also significant differences in gender (p=0.004), age (p<0.001), ethnicity (p<0.001), and race (p<0.001). We found significant differences in the majority of key clinical features such as past medical history like asthma (p<0.001) and relevant lab tests including white blood cell counts (p<0.001; Supplementary Table 1).

Classifier Training and Performance

LASSO and MLP models trained on data from each of the five MSHS hospitals (sites) separately (local), combined dataset (pooled), and via a federated learning framework (federated). All three training strategies for both models were evaluated on all sites (Figure 1B, Figure 1C). Results for model optimization (Supplementary Figure 2) and class balancing experiments (Supplementary Table 2) can be found in the Supplementary Materials. Final model hyperparameters are listed in Supplementary Table 4.

Learning Framework Comparisons

Performance of all LASSO and MLP models (local, pooled, federated) was assessed at each site (Table 2, Figure 3). Federated LASSO outperformed the site’s local LASSO at all sites except MSQ, with AUC-ROC ranging from 0.693 to 0.805. LASSO pooled outperformed federated LASSO at all five sites and had AUC-ROCs ranging from 0.736 to 0.822.

View this table:

Table 2:

Model Performance By Site.

Figure 2.

Federated Model Training.

Performance of (A) federated MLP and (B) federated LASSO models as measured by area under the receiver-operating characteristic (AUC-ROC) versus the number of training epochs. Binary Cross-Entropy Loss of (C) Federated MLP and (D) Federated LASSO versus the number of training epochs.

Figure 3.

Model Performance by Site.

Performance of all models (LASSO local, LASSO pooled, LASSO federated, MLP local, MLP pooled, MLP federated (no noise) by area under the receiver-operating characteristic (AUC- ROC) at (A) Mount Sinai Brooklyn (MSB) (n=611) (B) Mount Sinai West (MSW) (n=485), (C) Mount Sinai Morningside (MSM) (n=749), (D) Mount Sinai Hospital (MSH) (n=1644), and (E) Mount Sinai Queens (MSQ) (n=540). Averages of receiver-operating characteristic after 10-fold cross validation are shown. Average performance of each model across all five sites is presented in (F).

Performance of local, pooled, and federated LASSO and MLP models at each site as measured by area under the receiver-operating characteristic (AUC-ROC) with 95% confidence intervals. MLP pooled performed worse than local MLP at all hospitals except MSW, with MLP pooled AUC-ROCs varying from 0.754 to 0.829 while local MLP achieved AUC-ROCs ranging from 0.739 to 0.833. Federated MLP performed better than local MLP at all hospitals except MSB and performed better than the MLP pooled at all sites.

Cross-model Comparisons

Local MLP outperformed local LASSO at all hospitals. Federated MLP surpassed federated LASSO at all sites. While performance varied slightly among both classifiers in different experiments, federated MLP consistently outperformed all other tested models at all hospitals except MSB. Additional performance metrics (AUPRC, accuracy, sensitivity, specificity, and F1- score) are described in Supplementary Table 5.

Effect of Gaussian Noise

Gaussian noise introduced into federated MLP led to decreased performance at all sites except MSQ, where both had AUC-ROCs of 0.822. AUC-ROCs ranged from 0.796 to 0.834 in federated MLP without noise and 0.767-0.830 in federated MLP with noise (Supplemental Figure 1).

DISCUSSION

To our knowledge, this is the first study to evaluate the efficacy of applying federated learning to predict mortality for COVID-19 using clinical data from EHRs. The data from the five hospitals used in this study represent a demonstrative use-case for this experiment. With disparate patient characteristics per hospital in terms of demographics, outcomes, sample size, and lab values, this study reflects a real-world scenario where federated learning can be leveraged for hospitals with diverse patient populations.

The primary findings of this study demonstrate that federated MLP and LASSO models outperformed their respective local models at most sites. Federated LASSO did not outperform pooled LASSO at any hospital. Federated MLP outperformed local MLP and federated LASSO at all hospitals. Federated MLP also outperformed pooled MLP at all sites which might be due to the experimental condition where the same underlying architecture was used for all MLP models. While this framework allowed for consistency in comparisons across learning strategies, it may have led to improper tuning of the pooled models. Collectively, these results exhibit the potential of federated learning to overcome drawbacks of fragmented local models.

Future inclusion of more sites into the federated learning network could continue to bridge the gap between pooled models, especially for LASSO.

Our results illustrate scenarios in which federated models should either be approached with caution or favored. MSQ was the only hospital where federated LASSO performed slightly worse than the local model with a difference of 0.04 in AUC-ROC, which may be attributed to a low sample size of 540 patients and a high mortality prevalence of 23.0% compared to other sites. However, at MSW, local LASSO severely underperformed in comparison to federated LASSO, with a difference of 0.351. MSW had the lowest sample size of all hospitals at 485 and the lowest COVID-19 mortality prevalence of 5.6%. This pattern emphasizes the benefit of federated learning for sites with small sample sizes and large class imbalance.

We also assessed the impact of adding noise to federated MLP. Training metrics were similar to federated MLP without noise insertion (Supplemental Figure 2). An average difference of 0.013 in AUC-ROC across all hospitals in these two models demonstrates that noise may be inserted to increase data security without severely compromising federated model performance.

We note a few limitations of our study. First, data collection was limited to MSHS hospitals in NYC. This may limit model generalizability to hospitals in other regions. Also, this study focused on applying federated learning to predict outcomes based on patient EHR data in principle rather than creating an operational framework for immediate deployment. As such, there are various aspects of the federated learning process that this work does not address such as load balancing, convergence, and scaling. These models included only clinical data and could be enhanced by incorporating other modalities such as imaging or free-text. We only implemented two widely used classifiers within this framework, but other algorithms may perform better.

Finally, identical MLP architectures were used across all learning strategies for direct comparisons but could have been further optimized.

Future work will focus on accessibility and expanding analysis of federated models. We plan to release code written within common data model EHR formats to better facilitate scalability. We will study salient features of importance for federated models and analyze changes as data are added. Finally, we will integrate additional data types such as images to improve model performance. We aim to use this federated learning framework to predict other adverse outcomes in hospitalized COVID-19 patients such as acute kidney injury.

CONCLUSION

Federated learning is an effective method to share ML models without jeopardizing patient privacy. It is invaluable in the context of COVID-19, where limited data are segregated across various institutions. We demonstrated that federated learning models outperformed locally trained models to predict mortality. Federated learning offers a promising opportunity to construct more robust predictive models that can be leveraged in this pandemic.

CONTRIBUTORS

BSG, FW, and GSN conceived, designed, and supervised the study. AV collected the data and AV, SKJ, and JX were involved in data analysis. AV and SKJ were involved in interpreting the results. AV, SKJ, JX, ST, AK, SL, SS, IP, JKDF, TW, KPW, MB, EK, FK, AC, SZ, RM, AWC, EB, ZAF, FW and BSG drafted the initial manuscript. All authors approved provided critical comments, edited, and approved of the manuscript in its final form for submission.

COMPETING INTERESTS

None.

ETHICS APPROVAL

This study has been approved by the Institutional Review Board at the Icahn School of Medicine at Mount Sinai (IRB-20-03271).

FUNDING

This work was supported by U54 TR001433-05, National Center for Advancing Translational Sciences, National Institutes of Health.

ACKNOWLEDGMENTS

We thank the Clinical Data Science and Mount Sinai Data Warehouse teams for providing data. We appreciate all the nurses, physicians, and providers who contributed to the care of these patients. This work is for patients and their family members who were affected by this pandemic.

REFERENCES

1.↵
Dong E, Du H, Gardner L. An interactive web-based dashboard to track COVID-19 in real time. The Lancet Infectious Diseases 2020;20(5):533–34.
OpenUrl CrossRef PubMed Google Scholar
2.↵
Charney AW, Simons NW, Mouskas K, et al. Sampling the host response to SARS-CoV-2 in hospitals under siege. Nature Medicine 2020.
Google Scholar
3.
Clerkin KJ, Fried JA, Raikhelkar J, et al. COVID-19 and Cardiovascular Disease. Circulation 2020; 141 (20):1648–55.
OpenUrl PubMed Google Scholar
4.
Wu Z, McGoogan JM. Characteristics of and Important Lessons From the Coronavirus Disease 2019 (COVID-19) Outbreak in China: Summary of a Report of 72-314 Cases From the Chinese Center for Disease Control and Prevention. JAMA 2020;323(13):1239–42.
OpenUrl CrossRef PubMed Google Scholar
5.
Lauer SA, Grantz K, Bi Q, et al. The Incubation Period of Coronavirus Disease 2019 (COVID- 19) From Publicly Reported Confirmed Cases: Estimation and Application. Annals of Internal Medicine 2020;172(9):577–82.
OpenUrl CrossRef PubMed Google Scholar
6.
Chung M, Bernheim A, Mei X, et al. CT Imaging Features of 2019 Novel Coronavirus (2019- nCoV). Radiology 2020;295(1):202–07.
OpenUrl CrossRef PubMed Google Scholar
7.
Xu Z, Shi L, Wang Y, et al. Pathological findings of COVID-19 associated with acute respiratory distress syndrome. The Lancet Respiratory Medicine 2020;8(4):420–22.
OpenUrl Google Scholar
8.
Paranjpe I, Fuster V, Lala A, et al. Association of Treatment Dose Anticoagulation With InHospital Survival Among Hospitalized Patients With COVID-19. J Am Coll Cardiol 2020;76(1):122–24.
OpenUrl FREE Full Text Google Scholar
9.
Lala A, Johnson KW, Januzzi JL, et al. Prevalence and Impact of Myocardial Injury in Patients Hospitalized With COVID-19 Infection. J Am Coll Cardiol 2020;76(5):533–46.
OpenUrl FREE Full Text Google Scholar
10.
Sigel K, Swartz T, Golden E, et al. Covid-19 and People with HIV Infection: Outcomes for Hospitalized Patients in New York City. Clin Infect Dis 2020.
Google Scholar
11.↵
Mei X, Lee H-C, Diao K-y, et al. Artificial intelligence-enabled rapid diagnosis of patients with COVID-19. Nature Medicine 2020.
Google Scholar
12.↵
NCATS. National COVID Cohort Collaborative (N3C). 2020. https://ncats.nih.gov/n3c (accessed August 2020).
Google Scholar
13.↵
Brat GA, Weber GM, Gehlenborg N, et al. International Electronic Health Record-Derived COVID-19 Clinical Course Profiles: The 4CE Consortium. medRxiv 2020:2020.04.13.20059691.
Google Scholar
14.↵
Xu J, Wang F. Federated learning for healthcare informatics. arXiv preprint arXiv:1911.06270 2019.
Google Scholar
15.↵
Kumar R, Khan AA, Zhang S, et al. Blockchain-Federated-Learning and Deep Learning Models for COVID-19 detection using CT Imaging. arXiv preprint arXiv:2007.06537 2020.
Google Scholar
16.↵
Xu Y, Ma L, Yang F, et al. A collaborative online AI engine for CT-based COVID-19 diagnosis. medRxiv 2020:2020.05.10.20096073.
Google Scholar
17.↵
Raisaro JL, Marino F, Troncoso-Pastoriza J, et al. SCOR: A secure international informatics infrastructure to investigate COVID-19. Journal of the American Medical Informatics Association 2020.
Google Scholar

Posted August 14, 2020.

Download PDF

Author Declarations

Supplementary Material

Data/Code

Citation Tools

Get QR code

Tweet Widget

Subject Area

Health Informatics

Reviews and Context

Comment

TRIP Peer Reviews

Community Reviews

Automated Services

Blogs/Media

Author Videos

Subject Areas

All Articles

Addiction Medicine (417)
Allergy and Immunology (739)
Anesthesia (216)
Cardiovascular Medicine (3168)
Dentistry and Oral Medicine (354)
Dermatology (268)
Emergency Medicine (469)
Endocrinology (including Diabetes Mellitus and Metabolic Disease) (1125)
Epidemiology (13136)
Forensic Medicine (17)
Gastroenterology (878)
Genetic and Genomic Medicine (4962)
Geriatric Medicine (457)
Health Economics (761)
Health Informatics (3124)
Health Policy (1114)
Health Systems and Quality Improvement (1154)
Hematology (417)
HIV/AIDS (986)
Infectious Diseases (except HIV/AIDS) (14436)
Intensive Care and Critical Care Medicine (894)
Medical Education (462)
Medical Ethics (121)
Nephrology (511)
Neurology (4715)
Nursing (250)
Nutrition (699)
Obstetrics and Gynecology (856)
Occupational and Environmental Health (772)
Oncology (2431)
Ophthalmology (691)
Orthopedics (272)
Otolaryngology (335)
Pain Medicine (314)
Palliative Medicine (88)
Pathology (523)
Pediatrics (1263)
Pharmacology and Therapeutics (535)
Primary Care Research (535)
Psychiatry and Clinical Psychology (4053)
Public and Global Health (7281)
Radiology and Imaging (1629)
Rehabilitation Medicine and Physical Therapy (972)
Respiratory Medicine (952)
Rheumatology (468)
Sexual and Reproductive Health (485)
Sports Medicine (409)
Surgery (527)
Toxicology (66)
Transplantation (225)
Urology (195)

Comments

medRxiv aims to provide a venue for anyone to comment on a medRxiv preprint. Comments are moderated for offensive or irrelevant content (this can take ~24 h). Please avoid duplicate submissions and read our Comment Policy before commenting. The content of a comment is not endorsed by medRxiv.

medRxiv aims to inform readers about online discussion of this preprint occurring elsewhere. The content at the links below is not endorsed by either medRxiv or the preprint's authors.

Community reviews for this article:

There are no community reviews for this paper.

Automated Evaluations

Certain services provide automated analysis of preprints. Analyses invited by the authors are displayed at the top of this tab. Those done independently of authors are shown underneath . None of these analyses is endorsed by medRxiv.

Automated Evaluations:

There are no automated evaluations for this paper.

[1] 1.↵
Dong E, Du H, Gardner L. An interactive web-based dashboard to track COVID-19 in real time. The Lancet Infectious Diseases 2020;20(5):533–34.
OpenUrl CrossRef PubMed Google Scholar

[2] 2.↵
Charney AW, Simons NW, Mouskas K, et al. Sampling the host response to SARS-CoV-2 in hospitals under siege. Nature Medicine 2020.
Google Scholar

[3] 3.
Clerkin KJ, Fried JA, Raikhelkar J, et al. COVID-19 and Cardiovascular Disease. Circulation 2020; 141 (20):1648–55.
OpenUrl PubMed Google Scholar

[4] 4.
Wu Z, McGoogan JM. Characteristics of and Important Lessons From the Coronavirus Disease 2019 (COVID-19) Outbreak in China: Summary of a Report of 72-314 Cases From the Chinese Center for Disease Control and Prevention. JAMA 2020;323(13):1239–42.
OpenUrl CrossRef PubMed Google Scholar

[5] 5.
Lauer SA, Grantz K, Bi Q, et al. The Incubation Period of Coronavirus Disease 2019 (COVID- 19) From Publicly Reported Confirmed Cases: Estimation and Application. Annals of Internal Medicine 2020;172(9):577–82.
OpenUrl CrossRef PubMed Google Scholar

[6] 6.
Chung M, Bernheim A, Mei X, et al. CT Imaging Features of 2019 Novel Coronavirus (2019- nCoV). Radiology 2020;295(1):202–07.
OpenUrl CrossRef PubMed Google Scholar

[7] 7.
Xu Z, Shi L, Wang Y, et al. Pathological findings of COVID-19 associated with acute respiratory distress syndrome. The Lancet Respiratory Medicine 2020;8(4):420–22.
OpenUrl Google Scholar

[8] 8.
Paranjpe I, Fuster V, Lala A, et al. Association of Treatment Dose Anticoagulation With InHospital Survival Among Hospitalized Patients With COVID-19. J Am Coll Cardiol 2020;76(1):122–24.
OpenUrl FREE Full Text Google Scholar

[9] 9.
Lala A, Johnson KW, Januzzi JL, et al. Prevalence and Impact of Myocardial Injury in Patients Hospitalized With COVID-19 Infection. J Am Coll Cardiol 2020;76(5):533–46.
OpenUrl FREE Full Text Google Scholar

[10] 10.
Sigel K, Swartz T, Golden E, et al. Covid-19 and People with HIV Infection: Outcomes for Hospitalized Patients in New York City. Clin Infect Dis 2020.
Google Scholar

[11] 11.↵
Mei X, Lee H-C, Diao K-y, et al. Artificial intelligence-enabled rapid diagnosis of patients with COVID-19. Nature Medicine 2020.
Google Scholar

[12] 12.↵
NCATS. National COVID Cohort Collaborative (N3C). 2020. https://ncats.nih.gov/n3c (accessed August 2020).
Google Scholar

[13] 13.↵
Brat GA, Weber GM, Gehlenborg N, et al. International Electronic Health Record-Derived COVID-19 Clinical Course Profiles: The 4CE Consortium. medRxiv 2020:2020.04.13.20059691.
Google Scholar

[14] 14.↵
Xu J, Wang F. Federated learning for healthcare informatics. arXiv preprint arXiv:1911.06270 2019.
Google Scholar

[15] 15.↵
Kumar R, Khan AA, Zhang S, et al. Blockchain-Federated-Learning and Deep Learning Models for COVID-19 detection using CT Imaging. arXiv preprint arXiv:2007.06537 2020.
Google Scholar

[16] 16.↵
Xu Y, Ma L, Yang F, et al. A collaborative online AI engine for CT-based COVID-19 diagnosis. medRxiv 2020:2020.05.10.20096073.
Google Scholar

[17] 17.↵
Raisaro JL, Marino F, Troncoso-Pastoriza J, et al. SCOR: A secure international informatics infrastructure to investigate COVID-19. Journal of the American Medical Informatics Association 2020.
Google Scholar

Federated Learning of Electronic Health Records Improves Mortality Prediction in Patients Hospitalized with COVID-19

ABSTRACT

INTRODUCTION

Background and Significance

MATERIALS AND METHODS

Clinical Data Source and Study Population

Study Design

Definition of Outcomes

Model Development and Selection

Experimental Evaluation

RESULTS

Intercohort Comparisons

Classifier Training and Performance

Learning Framework Comparisons

Cross-model Comparisons

Effect of Gaussian Noise

DISCUSSION

CONCLUSION

Data Availability

CONTRIBUTORS

COMPETING INTERESTS

ETHICS APPROVAL

FUNDING

ACKNOWLEDGMENTS

REFERENCES

Subject Area

Citation Manager Formats

Federated Learning of Electronic Health Records Improves Mortality Prediction in Patients Hospitalized with COVID-19

ABSTRACT

INTRODUCTION

Background and Significance

MATERIALS AND METHODS

Clinical Data Source and Study Population

Study Design

Definition of Outcomes

Model Development and Selection

Experimental Evaluation

RESULTS

Intercohort Comparisons

Classifier Training and Performance

Learning Framework Comparisons

Cross-model Comparisons

Effect of Gaussian Noise

DISCUSSION

CONCLUSION

Data Availability

CONTRIBUTORS

COMPETING INTERESTS

ETHICS APPROVAL

FUNDING

ACKNOWLEDGMENTS

REFERENCES

Subject Area

Follow this preprint