Artificial Intelligence Applications for COVID-19 in Intensive Care and Emergency Settings: A Systematic Review =============================================================================================================== * Marcel Lucas Chee * Marcus Eng Hock Ong * Fahad Javaid Siddiqui * Zhongheng Zhang * Shir Lynn Lim * Andrew Fu Wah Ho * Nan Liu ## Abstract **Background** Little is known about the role of artificial intelligence (AI) as a decisive technology in the clinical management of COVID-19 patients. We aimed to systematically review and critically appraise the current evidence on AI applications for COVID-19 in intensive care and emergency settings, focusing on methods, reporting standards, and clinical utility. **Methods** We systematically searched PubMed, Embase, Scopus, CINAHL, IEEE Xplore, and ACM Digital Library databases from inception to 1 October 2020, without language restrictions. We included peer-reviewed original studies that applied AI for COVID-19 patients, healthcare workers, or health systems in intensive care, emergency or prehospital settings. We assessed predictive modelling studies using PROBAST (prediction model risk of bias assessment tool) and a modified TRIPOD (transparent reporting of a multivariable prediction model for individual prognosis or diagnosis) statement for AI. We critically appraised the methodology and key findings of all other studies. **Results** Of fourteen eligible studies, eleven developed prognostic or diagnostic AI predictive models, all of which were assessed to be at high risk of bias. Common pitfalls included inadequate sample sizes, poor handling of missing data, failure to account for censored participants, and weak validation of models. Studies had low adherence to reporting guidelines, with particularly poor reporting on model calibration and blinding of outcome and predictor assessment. Of the remaining three studies, two evaluated the prognostic utility of deep learning-based lung segmentation software and one studied an AI-based system for resource optimisation in the ICU. These studies had similar issues in methodology, validation, and reporting. **Conclusions** Current AI applications for COVID-19 are not ready for deployment in acute care settings, given their limited scope and poor quality. Our findings underscore the need for improvements to facilitate safe and effective clinical adoption of AI applications, for and beyond the COVID-19 pandemic. Keywords * artificial intelligence * Machine learning * COVID-19 * emergency department * intensive care * critical care ## 1 Introduction The ongoing coronavirus disease 2019 (COVID-19) pandemic has challenged healthcare systems and healthcare practitioners worldwide. Intensive care units (ICU) and emergency departments (ED) in badly afflicted areas have been overwhelmed by the surge in patients suspected or diagnosed with COVID-19 1-3. This exerts significant pressure on healthcare resources, necessitating novel diagnostics and care pathways to rationally deploy scarce emergency and intensive care healthcare resources. Current strategies and recommendations on clinical management and resource rationalisation draw on past pandemic experiences and expert recommendations 3-5; however, there has been growing interest in novel applications of artificial intelligence (AI) to assist in the COVID-19 response within these settings. AI is commonly defined as the use of computational methods to mimic human intelligence. Machine learning and deep learning are branches of AI which focus on automatic improvement of computer programmes through experience 6,7. Regression models, such as logistic, linear, or Cox regression, are simple forms of machine learning which already have longstanding use in medical research. More advanced machine learning, including random forest models, neural networks, or support vector machines, are also becoming more common in the medical literature, introducing more complex and diverse applications of AI. In intensive care and emergency settings, AI applications have assisted with automated patient monitoring 8-11, prognostication 12, and optimisation of staffing allocations 13-16. Given the unprecedented volume of COVID-19 patients, recent reviews have also identified resource optimisation of ICU beds as a potentially significant application of AI 17,18. Earlier systematic reviews have identified significant issues in the quality and reporting of predictive models for COVID-19 diagnosis and prognosis 19 and AI applications for classifying COVID-19 medical images 20. Shillian et al.’s 21 systematic review of machine learning studies in pre-COVID-19 ICUs reported similar issues, such as limited sample size and poor validation of predictions. However, no study has evaluated the scope and quality of all available AI applications in intensive care and emergency settings. This gap in knowledge precludes valuable improvements to the development and deployment of AI applications in these settings. We aimed to systematically review and critically appraise the current evidence on AI applications for COVID-19 in intensive care and emergency settings, focussing on methods, reporting standards, and clinical utility. ## 2 Methods We reported this systematic review according to the Preferred Reporting Items for Systematic Reviews (PRISMA) guidelines (Additional file 1). A review protocol was developed but was not publicly registered. ### 2.1 Search strategy and selection criteria We searched six databases, PubMed, Embase, Scopus, CINAHL, IEEE Xplore, and ACM Digital Library, by combining search terms related to AI, COVID-19, and intensive care or emergency settings. For brevity, the search strategy showing only the first three terms in each concept set is as follows: ((“Artificial intelligence” OR “Deep learning” OR “Machine learning” OR …) AND (“COVID-19” OR “Coronavirus disease 2019” OR “2019-nCoV” OR …) AND (Emergency OR “ED” OR “intensive care” OR …)). The complete search strategy can be found in Additional file 2. We also screened the reference lists of included articles to identify additional relevant studies. We included articles that met the following criteria: (1) applied AI; (2) investigated COVID-19 operations of ICU, ED, or emergency medical services (EMS) or analysed data from COVID-19 patients in the ED or within a prehospital setting, COVID-19 patients requiring intensive care (admission to the ICU, mechanical ventilation, or a composite including either of these outcomes), or the healthcare workers treating these patients, including ED or ICU physicians and nurses as well as paramedics; and (3) were original, peer-reviewed research articles. For this review, artificial intelligence only encompassed conventional machine learning algorithms such as random forest models, neural networks, or support vector machines. Multivariable logistic regression predictive models (including ridge and least absolute shrinkage and selection operator (LASSO) regression) were excluded. No restrictions were placed on the language of articles. ### 2.2 Literature selection and data extraction We conducted an initial search on 30 August 2020 and updated the search results on 1 October 2020. Articles were screened by title, abstract, and, if ambiguous, full text by two independent reviewers (MLC and NL). Subsequently, the two reviewers (MLC and NL) independently extracted data using a standardised data extraction form. Discrepancies in article selection and data extraction were resolved between reviewers through discussion. We extracted the following data for all included articles: country of study population, outcome predicted, sample size of the training and validation datasets, AI algorithms used, discrimination (e.g. C-index, accuracy) and calibration (e.g. calibration slope, Brier loss score) of models on the strictest form of validation, features included in the final model, and transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD) study type 22, if applicable. ### 2.3 Data analysis For studies including multivariate AI predictive models, we evaluated the risk of bias within the study methodology using prediction model risk of bias assessment tool (PROBAST) 23. PROBAST is a structured tool comprising 20 signalling questions for assessing the risk of bias and applicability across the four domains of participants, predictors, outcome, and analysis. Applicability of included studies was not assessed as our study was not concerned with a specific application of AI predictive models. In lieu of specific reporting standards for AI studies at the time of study conception 24, we assessed the reporting quality of multivariable predictive modelling studies using an adaptation of Wang et al.’s 25 modified TRIPOD statement 26 for AI models (Additional file 3). For all other studies, we summarised the study methodology, including data sources, application of AI, and validation methods, as well as the key findings of the study. ## 3 Results ### 3.1 Study characteristics From our search of the six databases, 14 studies were included in this review (Figure 1). Table 1 presents the main characteristics of the study. 11 of the 14 studies investigated predictive models and were assessed according to PROBAST and TRIPOD: eight studies developed prognostic models 27-34 and three studies developed diagnostic models 35-37. Of the remaining three studies, two evaluated the prognostic potential of existing AI-based lung segmentation software (without integration into a multivariate predictive model) 38,39 and one investigated an AI-based system for resource optimisation in the ICU 40. Eleven studies used patient data collected from the ICU and four studies used data from the ED. No study collected data from the prehospital setting, despite including prehospital-related search terms in the search strategy. View this table: [Table 1](http://medrxiv.org/content/early/2021/02/18/2021.02.15.21251727/T1) Table 1 Main study characteristics ![Figure 1:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/02/18/2021.02.15.21251727/F1.medium.gif) [Figure 1:](http://medrxiv.org/content/early/2021/02/18/2021.02.15.21251727/F1) Figure 1: PRISMA flow diagram In terms of country of study, Italy (*n*=3) and United States (*n*=3) were represented by more than one study, while Brazil, Canada, China, France, Germany, Israel, Turkey, and the United Kingdom had one study each. According to the TRIPOD classification of predictive models, two studies were classified as Type 2b (validation using a non-random split of data by time and/or location), three studies as Type 2a (validation using a random split of data such as a train-test split), four studies as Type 1b (validation using re-sampling techniques such as bootstrapping or k-fold cross-validation), and one study as Type 1a (no validation, only evaluation of apparent model performance on the same training dataset). One study that conducted development and validation using data from separate studies was considered Type 3. ### 3.2 Risk of bias Table 2 presents the risk of bias assessment of AI predictive models according to PROBAST. All 11 predictive modelling studies had a high overall risk of bias. Two out of 11 studies had an unclear risk of bias within the participant domain. Unclear risk of bias in the participant domain was mainly due to ambiguous exclusion criteria that may lead to the study population not being representative of the intended target population 33,34. View this table: [Table 2](http://medrxiv.org/content/early/2021/02/18/2021.02.15.21251727/T2) Table 2 PROBAST (prediction model risk of bias assessment tool) assessment of predictive modelling studies All three studies at a high risk of bias in the predictor domain were prognostic. Two studies 32,33 used retrospective, multicentre data and were at risk of bias from varying methods of predictor assessment at different centres. The remaining study 31 obtained predictor data from the most recent assessments available, instead of assessing predictors at the intended time of use. Two studies did not report adequately on the assessment of computed tomography (CT) 36 or other features 34, resulting in an unclear risk of bias. Two and four out of 11 studies were at high and unclear risk of bias within the outcome domain, respectively. In many prognostic studies 28,29,31,34, the criteria for ICU admission and blinding of outcome determination to predictor variables were often not reported, leading to an unclear risk of bias. Within the analysis domain, all eleven studies had insufficient outcome events per variable (EPV) (<20 EPV for model development studies and <100 for model validation studies) leading to a high risk of bias. Furthermore, no study reported on model calibration and only two studies 34,35 appropriately handled and reported on missing data. Prognostic predictive models were particularly at risk of inadequately accounting for, or reporting on, censored patients who were still hospitalised without the outcome (e.g. ICU admission) at the end of the study period. Only one study appropriately accounted for censored data by combining deep learning techniques with traditional Cox regression 33. ### 3.3 Adherence to reporting standards The modified TRIPOD checklist comprised 25 terms, including 17 terms for reporting of methods and eight terms for results. Figure 2 describes the adherence of studies to reporting standards, as assessed by the modified TRIPOD checklist. Studies reported on a median of 48% (IQR: 48-59%) of relevant TRIPOD items, with 10 of 25 TRIPOD items having 50% adherence or less. Additionally, the following eight TRIPOD items had 25% adherence or less: reporting on treatments administered to study participants (item 5c), blinding of outcome and predictor assessment (items 6b and 7b), study size determination (item 8), reporting on characteristics of study participants, including proportions of participants with missing data (item 13b), reporting of unadjusted associations between predictors and outcomes in multivariable logistic regression models (item 14b), explanation of how to use the prediction model (item 15b), and calibration and method of calibration (adjusted item 16b). ![Figure 2:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/02/18/2021.02.15.21251727/F2.medium.gif) [Figure 2:](http://medrxiv.org/content/early/2021/02/18/2021.02.15.21251727/F2) Figure 2: (a) Number of TRIPOD items reported per study and (b) Proportion of studies reporting on each TRIPOD item ### 3.4 Diagnosis Three studies investigated diagnostic AI predictive models; two studies developed models to predict the outcome of COVID-19 status at admission to the ED. Only one study was externally validated: Vasse et al. 37 developed a decision tree based on cellular population data using Random Forest for feature selection (accuracy=60.5%). Brinati et al.’s 35 Random Forest model (C-index=0.84, accuracy=82%) and Three-Way Random Forest model (accuracy=86%) achieved better performance but was validated using weaker *k*-fold cross-validation. Both studies included leucocyte or a leucocyte sub-population count as a predictor in their final model. The third study 36 developed a decision tree for determining COVID-19 infection status in the ICU based on plasma inflammatory analyte features selected by a random forest classifier. On five-fold cross-validation, this classifier achieved an accuracy of 98%. ### 3.5 Prognosis Most studies on prognostic AI predictive models (9/10, 90%) predicted ICU admission, mechanical ventilation, or a similar composite outcome of severe or critical illness. Collectively, such studies reported C-indices between 0.79-0.98. Liang et al’s 33 Deep Learning Survival Cox model had the largest training cohort of 1590 patients and achieved a C-index of 0.890, 0.852, and 0.967 when externally validated on cohorts of 801, 305, and 73 patients from Wuhan, Hubei, and Guangzhou, respectively. Schwab et al.’s 34 support vector machine achieved a superior C-index of 0.98 on a weaker internal validation and a smaller sample size for testing model performance. The artificial neural network trained by Abdulaal et al. 27 using data collected at ED admission (C-index=0.901) was the only prognostic AI model developed to predict in-hospital mortality in COVID-19 patients. Apart from predictive modelling, Durhan et al. 38 and Mushtaq et al. 39 evaluated the prognostic utility of two separate deep learning-based software that determine the normal lung proportion and total lung involvement, respectively. Scores obtained from each software achieved a C-index of 0.944 and 0.77 for predicting ICU admission, respectively. While multivariate predictive models were not developed, both studies were subject to similar issues in development and reporting, including ambiguous criteria for ICU admission, inappropriate handling of missing data using complete-case analysis, and lack of reporting on treatments received by participants and on blinding of the outcome. ### 3.6 Other applications Apart from diagnostic and prognostic applications, Belciug et al. 40 utilised an Artificial Immune System algorithm, a type of evolutionary AI algorithm, to optimise a queueing model for simulating hospital bed allocation in the ICU. The final model, intended as a tool for hospital managers, proposes an optimal admission rate and number of beds while balancing the costs associated with increasing capacity and refusing patients. The model was applied to ICU data published by the Ministry of Health of Italy and estimated a minimum rejection rate of 3.4% and 1.7% of patients requiring ICU admission from 13 March 2020 to 23 March 2020 (average daily volume of 200 patients) and 23 March 2020 to 30 March 2020 (average daily volume of 63 patients), respectively. However, these estimates were not validated. ## 4 Discussion Our study is the first systematic review of AI applications for COVID-19 in intensive care and emergency settings. Applications were largely limited to diagnostic and prognostic predictive modelling, with only one study investigating a separate application of simulating ICU bed occupancy for resource optimisation. Due to high risk of bias, inadequate validation, or poor adherence to reporting standards in all reviewed studies, we have found no AI application for COVID-19 ready for clinical deployment in intensive care or emergency settings. Among the reviewed articles, we found a limited range of AI applications being studied within intensive care and emergency settings. An exploratory review identified early detection and diagnosis, resource management of hospital beds or healthcare workers, and automatic monitoring and prognostication as possible applications of AI for the COVID-19 pandemic 17. However, current applications within the reviewed articles mainly comprised prognostic models for critical illness or diagnostic models to predict COVID-19 status, none of which are ready for clinical use. Only one preliminary study by Belciug et al. 40, which lacked validation, investigated allocative simulation and resource optimisation in the ICU, while no study investigated automatic monitoring or prognostication of COVID-19 patients. Belciug et al.’s study on ICU resource optimisation employed queueing theory, a mathematical field of study, and Artificial Immune Systems, an evolutionary AI algorithm that is uncommonly utilised in medical research. Unfamiliarity and the absence of general adoption of these methods within the medical community may contribute to the paucity of studies exploring less common but potentially impactful AI applications. As highlighted in previous literature 19,41, robust interdisciplinary collaboration and communication will be crucial in stimulating broader applications of AI for COVID-19 in intensive care and emergency settings, as well as the in medical literature at large. Assessment of AI predictive models also revealed significant deficiencies in model development, validation, and reporting. Unfortunately, these findings corroborate with earlier systematic reviews on predictive models for COVID-19 19 and in intensive care settings 21. Studies developing AI models should adhere to the TRIPOD reporting guidelines 22, PROBAST 23, or, ideally, recent AI-specific guidelines. These include the guidelines for transparency, reproducibility, ethics, and effectiveness (TREE) 42, CONSORT-AI (Consolidated Standards of Reporting Trials-Artificial Intelligence) 43, and SPIRIT-AI (Standard Protocol Items: Recommendations for Interventional Trials-Artificial Intelligence) 44. While the above guidelines provide comprehensive explanations and elaborations, we emphasise hereinafter several common problematic areas within the reviewed studies and recommendations for future studies. The most common source of bias was an inadequate sample size, which was found in all studies. A low sample size introduces the risk of over-fitting and model optimism. A benchmark for the development of logistic regression models is 20 EPV 4,23,45, while models using AI algorithms like random forest, support vector machines, and neural networks may require up to 200 EPV to account for model optimism 46; a minimum of 100 EPV is recommended for validation studies 23. Missing data also contributed significantly to bias; only two studies appropriately handled and reported on missing data. Ideally, the proportion of missing data for each variable should be reported 22 and multiple imputation should be used to avoid bias from inappropriate exclusion of participants with missing data (i.e. complete-case analysis) 47-50. However, if complete-case analysis is used, authors should provide a comparative analysis of model performance with and without excluded participants to facilitate the judgement of bias from exclusion. For prognostic studies, studies often failed to appropriately account for censored patients (e.g. neither discharged nor admitted to the ICU). Censored patients should be handled using a time-to-event analysis such as Cox regression; inappropriate exclusion of these patients may lead to a skewed dataset that includes fewer patients without the outcome, introducing bias into the model 23. For diagnostic studies, bias was often introduced by using the reverse transcription-polymerase chain reaction (RT-PCR) test as the ground truth or gold-standard for COVID-19 diagnosis, despite potentially poor sensitivity 51. We recommend repeat RT-PCR testing to minimise the likelihood of false-negative tests in both diagnostic model development and validation studies. Several key areas for improvements in reporting were identified in our study, including treatments received by participants, blinding, and study size determination. In particular, no study reported on calibration, a crucial yet often unevaluated measure of model performance 52. We recommend assessing calibration using the calibration hierarchy described by Van Calster et al. 53 instead of the commonly used Hosmer-Lemeshow test 54. This avoids artificial stratification of patients into risk groups and other limitations associated with the Hosmer-Lemeshow test 52. Studies should also endeavour to validate their data using stricter validation techniques. Studies with smaller sample sizes should utilise re-sampling techniques, such as bootstrapping or *k*-fold cross-validation. Studies with larger sample sizes should use a non-random split of data (e.g. by location or time) or perform external validation on independent data, for example, from a different study 22,23,55. Validation using the same data for model development is inappropriate as it only provides apparent model performance. Similarly, validation using a random split of data, such as a ‘train-test’ split, has lower power than re-sampling techniques 22,56 and should be avoided. In addition to the limitations in quality and reporting of AI applications, the narrow scope of applications being investigated naturally leads to fewer AI applications eventually being suitable for clinical use. While AI has been practically applied for the identification of candidate drugs for drug repurposing 57 and contact tracing 18, its application and utility for COVID-19 in clinical settings have been insignificant to date. Several studies have employed AI techniques for the detection and classification of COVID-19 images 20, however, none have been validated as a clinical diagnostic adjunct in the ED. Factors that may contribute to this lack of clinical validation include the high risk of bias within existing models 19,58, limited applicability of radiographic images for discriminating between multiple differential diagnoses, and the high prevalence of asymptomatic radiographs in patients who present soon after the onset of symptoms 59,60. Notwithstanding the high risk of bias and poor reporting of the reviewed AI models, AI algorithms tend to produce uninterpretable “black box” predictive models, which may lead to decreased acceptability of both diagnostic and prognostic AI applications amongst clinicians and hospital administrators. Some studies 32,36,37 have attempted to overcome this by using AI techniques for feature selection and presenting the final model as a decision tree or scoring system with clearly defined input variables. However, such simplifications of AI models curtail performance and limit the utility of the final model. The above barriers to the validation and integration of AI in clinical settings may preclude significant contribution of AI to combatting the COVID-19 pandemic in intensive care units and emergency departments in the near future. However, improvements in the development, validation, and reporting of AI applications will be critical in advancing the applicability and acceptance of these systems in clinical settings in later phases of the COVID-19 pandemic and in future global health crises. Encouragingly, leading journals such as the *Lancet* family of journals have committed to enforcing AI-specific guidelines such as CONSORT-AI and SPIRIT-AI for submissions with an AI intervention 61. However, concerted effort is needed from the entire research community, including journals, editors, and authors, to normalise the use of these guidelines and checklists. Such changes will encourage improved development, reporting, and eventual clinical uptake of future AI applications. ### Limitations The results from our systematic review should be considered along with the following limitations. Firstly, our search excluded non-peer-reviewed articles which may neglect the most recent literature but ensures a baseline quality of included studies. Secondly, we may have missed some relevant articles despite using a comprehensive search strategy due to publication in journals not indexed in the searched databases and variations in terminology used to describe AI algorithms and intensive care and emergency settings. We may also have missed AI applications that were deployed without publication in scientific literature; in particular, given the intense media attention and the pressure to deploy solutions quickly, AI solutions developed by governments and industry are more likely to be published in mass media formats rather than scientific journals. Thirdly, assessment according to PROBAST and, to a lesser extent, TRIPOD reporting guidelines still rely on a degree of subjectivity, despite comprehensive explanations and elaborations. Hence, other reviewers may arrive at slightly differing results. Lastly, the unprecedented volume of research on COVID-19 has resulted in a rapidly evolving body of literature. Hence, our findings are merely descriptive of the current state of affairs, which may change with welcome improvements and additions to the medical literature. ## 5 Conclusions Despite widespread interest in novel technologies for the COVID-19 pandemic, our systematic review of the literature reveals that current AI applications were limited in both the range of applications and clinical applicability. Several significant issues in the development, validation, and reporting of AI applications undermine safe and effective implementation of these systems within intensive care units or emergency departments. The integration of new AI-specific reporting guidelines like CONSORT-AI and SPIRIT-AI into research and publication processes will be a vital step in creating future AI applications that are clinically acceptable in the current pandemic, future pandemics, and within the wider medical field. We also emphasise the importance of closer interdisciplinary collaboration between AI experts and clinicians. ## Data Availability The datasets used and/or analysed during the current study are available from the corresponding author on reasonable request. ## 6 Conflict of Interest The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest. ## 7 Author Contributions NL conceived the study. MLC and NL designed the study. MLC and NL screened and reviewed the articles and extracted paper information. MLC and NL planned the formal analyses, analysed the data, and drafted the manuscript. MLC, MEHO, FJS, ZZ, SLL, AFWH, and NL made substantial contributions to results interpretation and critical revision of the manuscript. All authors read and approved the final manuscript. ## 8 Funding This work was supported by the Duke-NUS Signature Research Programme funded by the Ministry of Health, Singapore. The funder of the study had no role in study design, data collection, data analysis, data interpretation, or writing of the report. ## 10 Data Availability Statement The datasets used and/or analysed during the current study are available from the corresponding author on reasonable request. ## 9 Acknowledgements Not applicable. * Received February 15, 2021. * Revision received February 15, 2021. * Accepted February 18, 2021. * © 2021, Posted by Cold Spring Harbor Laboratory This pre-print is available under a Creative Commons License (Attribution 4.0 International), CC BY 4.0, as described at [http://creativecommons.org/licenses/by/4.0/](http://creativecommons.org/licenses/by/4.0/) ## References 1. Carenzo, L. et al. Hospital surge capacity in a tertiary emergency referral centre during the COVID-19 outbreak in Italy. Anaesthesia 75, 928–934, doi:10.1111/anae.15072 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1111/anae.15072&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=32246838&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F02%2F18%2F2021.02.15.21251727.atom) 2. Grasselli, G., Pesenti, A. & Cecconi, M. Critical Care Utilization for the COVID-19 Outbreak in Lombardy, Italy. Jama 323, doi:10.1001/jama.2020.4031 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1001/jama.2020.4031&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=32167538&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F02%2F18%2F2021.02.15.21251727.atom) 3. Phua, J. et al. Intensive care management of coronavirus disease 2019 (COVID-19): challenges and recommendations. The Lancet Respiratory Medicine 8, 506–517, doi:10.1016/s2213-2600(20)30161-2 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/s2213-2600(20)30161-2&link_type=DOI) 4. Ogundimu, E. O., Altman, D. G. & Collins, G. S. Adequate sample size for developing prediction models is not simply related to events per variable. Journal of Clinical Epidemiology 76, 175–182, doi:10.1016/j.jclinepi.2016.02.031 (2016). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.jclinepi.2016.02.031&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26964707&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F02%2F18%2F2021.02.15.21251727.atom) 5. Mojoli, F. et al. Our recommendations for acute management of COVID-19. Critical Care 24, doi:10.1186/s13054-020-02930-6 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/s13054-020-02930-6&link_type=DOI) 6. LeCun, Y., Bengio, Y. & Hinton, G. Deep learning. Nature 521, 436–444, doi:10.1038/nature14539 (2015). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/nature14539&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26017442&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F02%2F18%2F2021.02.15.21251727.atom) 7. Mitchell, T. M. Machine Learning. (McGraw-Hill, 1997). 8. Blanch, L. et al. Validation of the Better Care® system to detect ineffective efforts during expiration in mechanically ventilated patients: a pilot study. Intensive Care Medicine 38, 772–780, doi:10.1007/s00134-012-2493-4 (2012). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s00134-012-2493-4&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22297667&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F02%2F18%2F2021.02.15.21251727.atom) 9. Chen, C.-W., Lin, W.-C., Hsu, C.-H.,Cheng, K.-S. & Lo, C.-S. Detecting ineffective triggering in the expiratory phase in mechanically ventilated patients based on airway flow and pressure deflection: Feasibility of using a computer algorithm*. Critical Care Medicine 36, 455–461, doi:10.1097/01.Ccm.0000299734.34469.D9 (2008). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1097/01.CCM.0000299734.34469.D9&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=18091543&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F02%2F18%2F2021.02.15.21251727.atom) 10. Clifton, D. A. et al. A Large-Scale Clinical Validation of an Integrated Monitoring System in the Emergency Department. IEEE Journal of Biomedical and Health Informatics 17, 835–842, doi:10.1109/jbhi.2012.2234130 (2013). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1109/jbhi.2012.2234130&link_type=DOI) 11. Curtis, D. W. et al. SMART--An Integrated Wireless System for Monitoring Unattended Patients. Journal of the American Medical Informatics Association 15, 44–53, doi:10.1197/jamia.M2016 (2008). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1197/jamia.M2016&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=17947629&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F02%2F18%2F2021.02.15.21251727.atom) 12. Escobar, G. J. et al. Piloting electronic medical record-based early detection of inpatient deterioration in community hospitals. Journal of Hospital Medicine 11, S18–S24, doi:10.1002/jhm.2652 (2016). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/jhm.2652&link_type=DOI) 13. Jones, S. S. & Evans, R. S. An agent based simulation tool for scheduling emergency department physicians. AMIA Annu Symp Proc, 338–342 (2008). 14. Lin, A. X. et al. Leveraging Machine Learning Techniques and Engineering of Multi-Nature Features for National Daily Regional Ambulance Demand Prediction. Int J Environ Res Public Health 17, doi:10.3390/ijerph17114179 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.3390/ijerph17114179&link_type=DOI) 15. Sun, Y., Heng, B. H., Seow, Y. T. & Seow, E. Forecasting daily attendances at an emergency department to aid resource planning. BMC Emergency Medicine 9, doi:10.1186/1471-227x-9-1 (2009). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/1471-227x-9-1&link_type=DOI) 16. Liu, N., Zhang, Z., Wah Ho, A. F. & Ong, M. E. H. Artificial intelligence in emergency medicine. Journal of Emergency and Critical Care Medicine 2, 82–82, doi:10.21037/jeccm.2018.10.08 (2018). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.21037/jeccm.2018.10.08&link_type=DOI) 17. Vaishya, R., Javaid, M., Khan, I. H. & Haleem, A. Artificial Intelligence (AI) applications for COVID-19 pandemic. Diabetes & Metabolic Syndrome: Clinical Research & Reviews 14, 337–339, doi:10.1016/j.dsx.2020.04.012 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.dsx.2020.04.012&link_type=DOI) 18. Lalmuanawma, S., Hussain, J. & Chhakchhuak, L. Applications of machine learning and artificial intelligence for Covid-19 (SARS-CoV-2) pandemic: A review. Chaos, Solitons & Fractals 139, doi:10.1016/j.chaos.2020.110059 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.chaos.2020.110059&link_type=DOI) 19. Wynants, L. et al. Prediction models for diagnosis and prognosis of covid-19: systematic review and critical appraisal. Bmj, doi:10.1136/bmj.m1328 (2020). [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6MzoiYm1qIjtzOjU6InJlc2lkIjtzOjE3OiIzNjkvYXByMDdfMi9tMTMyOCI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIxLzAyLzE4LzIwMjEuMDIuMTUuMjEyNTE3MjcuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 20. Albahri, O. S. et al. Systematic review of artificial intelligence techniques in the detection and classification of COVID-19 medical images in terms of evaluation and benchmarking: Taxonomy analysis, challenges, future solutions and methodological aspects. Journal of Infection and Public Health 13, 1381–1396, doi:10.1016/j.jiph.2020.06.028 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.jiph.2020.06.028&link_type=DOI) 21. Shillan, D., Sterne, J. A. C., Champneys, A. & Gibbison, B. Use of machine learning to analyse routinely collected intensive care unit data: a systematic review. Critical Care 23, doi:10.1186/s13054-019-2564-9 (2019). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/s13054-019-2564-9&link_type=DOI) 22. Moons, K. G. M. et al. Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD): Explanation and Elaboration. Annals of Internal Medicine 162, doi:10.7326/m14-0698 (2015). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.7326/m14-0698&link_type=DOI) 23. Moons, K. G. M. et al. PROBAST: A Tool to Assess Risk of Bias and Applicability of Prediction Model Studies: Explanation and Elaboration. Annals of Internal Medicine 170, doi:10.7326/m18-1377 (2019). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.7326/m18-1377&link_type=DOI) 24. Collins, G. S. & Moons, K. G. M. Reporting of artificial intelligence prediction models. The Lancet 393, 1577–1579, doi:10.1016/s0140-6736(19)30037-6 (2019). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/s0140-6736(19)30037-6&link_type=DOI) 25. Wang, W. et al. A systematic review of machine learning models for predicting outcomes of stroke with structured data. Plos One 15, doi:10.1371/journal.pone.0234722 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pone.0234722&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F02%2F18%2F2021.02.15.21251727.atom) 26. Collins, G. S., Reitsma, J. B., Altman, D. G. & Moons, K. G. M. Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): the TRIPOD statement. Bmj 350, g7594–g7594, doi:10.1136/bmj.g7594 (2015). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1136/bmj.g7594&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25569120&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F02%2F18%2F2021.02.15.21251727.atom) 27. Abdulaal, A. et al. Prognostic Modeling of COVID-19 Using Artificial Intelligence in the United Kingdom: Model Development and Validation. Journal of Medical Internet Research 22, doi:10.2196/20259 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.2196/20259&link_type=DOI) 28. Assaf, D. et al. Utilization of machine-learning models to accurately predict the risk for critical COVID-19. Internal and Emergency Medicine, doi:10.1007/s11739-020-02475-0 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s11739-020-02475-0&link_type=DOI) 29. Burdick, H. et al. Prediction of respiratory decompensation in Covid-19 patients using machine learning: The READY trial. Comput Biol Med 124, 103949, doi:10.1016/j.compbiomed.2020.103949 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.compbiomed.2020.103949&link_type=DOI) 30. Burian, E. et al. Intensive Care Risk Estimation in COVID-19 Pneumonia Based on Clinical and Imaging Parameters: Experiences from the Munich Cohort. Journal of Clinical Medicine 9, doi:10.3390/jcm9051514 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.3390/jcm9051514&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=32443442&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F02%2F18%2F2021.02.15.21251727.atom) 31. Cheng, F.-Y. et al. Using Machine Learning to Predict ICU Transfer in Hospitalized COVID-19 Patients. Journal of Clinical Medicine 9, doi:10.3390/jcm9061668 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.3390/jcm9061668&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=32492874&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F02%2F18%2F2021.02.15.21251727.atom) 32. Jackson, B. R. et al. Predictors at admission of mechanical ventilation and death in an observational cohort of adults hospitalized with COVID-19. Clinical Infectious Diseases, doi:10.1093/cid/ciaa1459 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/cid/ciaa1459&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=32971532&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F02%2F18%2F2021.02.15.21251727.atom) 33. Liang, W. et al. Early triage of critically ill COVID-19 patients using deep learning. Nat Commun 11, 3543, doi:10.1038/s41467-020-17280-8 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41467-020-17280-8&link_type=DOI) 34. Schwab, P., DuMont Schütte, A., Dietz, B. & Bauer, S. Clinical Predictive Models for COVID-19: Systematic Study. Journal of Medical Internet Research 22, doi:10.2196/21439 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.2196/21439&link_type=DOI) 35. Brinati, D. et al. Detection of COVID-19 Infection from Routine Blood Exams with Machine Learning: A Feasibility Study. J Med Syst 44, 135, doi:10.1007/s10916-020-01597-4 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s10916-020-01597-4&link_type=DOI) 36. Fraser, D. D. et al. Inflammation Profiling of Critically Ill Coronavirus Disease 2019 Patients. Crit Care Explor 2, e0144, doi:10.1097/CCE.0000000000000144 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1097/CCE.0000000000000144&link_type=DOI) 37. Vasse, M. et al. Interest of the cellular population data analysis as an aid in the early diagnosis of SARS-CoV-2 infection. International Journal of Laboratory Hematology, doi:10.1111/ijlh.13312 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1111/ijlh.13312&link_type=DOI) 38. Durhan, G. et al. Visual and software-based quantitative chest CT assessment of COVID-19: correlation with clinical findings. Diagn Interv Radiol, doi:10.5152/dir.2020.20407 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.5152/dir.2020.20407&link_type=DOI) 39. Mushtaq, J. et al. Initial chest radiographs and artificial intelligence (AI) predict clinical outcomes in COVID-19 patients: analysis of 697 Italian patients. Eur Radiol, doi:10.1007/s00330-020-07269-8 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s00330-020-07269-8&link_type=DOI) 40. Belciug, S., Bejinariu, S. I. & Costin, H. An Artificial Immune System Approach for a Multi-compartment Queuing Model for Improving Medical Resources and Inpatient Bed Occupancy in Pandemics. Advances in Electrical and Computer Engineering 20, 23–30, doi:10.4316/aece.2020.03003 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.4316/aece.2020.03003&link_type=DOI) 41. Liu, N. et al. Coronavirus disease 2019 (COVID-19): an evidence map of medical literature. BMC Medical Research Methodology 20, doi:10.1186/s12874-020-01059-y (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/s12874-020-01059-y&link_type=DOI) 42. Vollmer, S. et al. Machine learning and artificial intelligence research for patient benefit: 20 critical questions on transparency, replicability, ethics, and effectiveness. Bmj, doi:10.1136/bmj.l6927 (2020). [FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiRlVMTCI7czoxMToiam91cm5hbENvZGUiO3M6MzoiYm1qIjtzOjU6InJlc2lkIjtzOjE3OiIzNjgvbWFyMjBfMS9sNjkyNyI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIxLzAyLzE4LzIwMjEuMDIuMTUuMjEyNTE3MjcuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 43. Liu, X. et al. Reporting guidelines for clinical trial reports for interventions involving artificial intelligence: the CONSORT-AI extension. The Lancet Digital Health 2, e537–e548, doi:10.1016/s2589-7500(20)30218-1 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/s2589-7500(20)30218-1&link_type=DOI) 44. Cruz Rivera, S. et al. Guidelines for clinical trial protocols for interventions involving artificial intelligence: the SPIRIT-AI extension. The Lancet Digital Health 2, e549–e560, doi:10.1016/s2589-7500(20)30219-3 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/s2589-7500(20)30219-3&link_type=DOI) 45. van Smeden, M. et al. No rationale for 1 variable per 10 events criterion for binary logistic regression analysis. BMC Medical Research Methodology 16, doi:10.1186/s12874-016-0267-3 (2016). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/s12874-016-0267-3&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=27881078&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F02%2F18%2F2021.02.15.21251727.atom) 46. van der Ploeg, T., Austin, P. C. & Steyerberg, E. W. Modern modelling techniques are data hungry: a simulation study for predicting dichotomous endpoints. BMC Medical Research Methodology 14, doi:10.1186/1471-2288-14-137 (2014). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/1471-2288-14-137&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25532820&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F02%2F18%2F2021.02.15.21251727.atom) 47. Janssen, K. J. M. et al. Missing covariate data in medical research: To impute is better than to ignore. Journal of Clinical Epidemiology 63, 721–727, doi:10.1016/j.jclinepi.2009.12.008 (2010). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.jclinepi.2009.12.008&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20338724&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F02%2F18%2F2021.02.15.21251727.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000278412800007&link_type=ISI) 48. Schafer, J. L. Multiple imputation: a primer. Statistical Methods in Medical Research 8, 3–15, doi:10.1177/096228029900800102 (2016). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1177/096228029900800102&link_type=DOI) 49. White, I. R., Royston, P. & Wood, A. M. Multiple imputation using chained equations: Issues and guidance for practice. Statistics in Medicine 30, 377–399, doi:10.1002/sim.4067 (2011). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/sim.4067&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21225900&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F02%2F18%2F2021.02.15.21251727.atom) 50. Kang, H. The prevention and handling of the missing data. Korean Journal of Anesthesiology 64, doi:10.4097/kjae.2013.64.5.402 (2013). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.4097/kjae.2013.64.5.402&link_type=DOI) 51. Fang, Y. et al. Sensitivity of Chest CT for COVID-19: Comparison to RT-PCR. Radiology 296, E115–E117, doi:10.1148/radiol.2020200432 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1148/radiol.2020200432&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F02%2F18%2F2021.02.15.21251727.atom) 52. Van Calster, B., McLernon, D. J., van Smeden, M., Wynants, L. & Steyerberg, E. W. Calibration: the Achilles heel of predictive analytics. BMC Medicine 17, doi:10.1186/s12916-019-1466-7 (2019). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/s12916-019-1466-7&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=31842878&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F02%2F18%2F2021.02.15.21251727.atom) 53. Van Calster, B. et al. A calibration hierarchy for risk models was defined: from utopia to empirical data. Journal of Clinical Epidemiology 74, 167–176, doi:10.1016/j.jclinepi.2015.12.005 (2016). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.jclinepi.2015.12.005&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26772608&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F02%2F18%2F2021.02.15.21251727.atom) 54. Hosmer, D. W. & Lemeshow, S. Applied Logistic Regression. (2000). 55. Steyerberg, E. W., Bleeker, S. E., Moll, H. A., Grobbee, D. E. & Moons, K. G. M. Internal and external validation of predictive models: A simulation study of bias and precision in small samples. Journal of Clinical Epidemiology 56, 441–447, doi:10.1016/s0895-4356(03)00047-7 (2003). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S0895-4356(03)00047-7&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=12812818&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F02%2F18%2F2021.02.15.21251727.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000183589200007&link_type=ISI) 56. Steyerberg, E. W. et al. Internal validation of predictive models. Journal of Clinical Epidemiology 54, 774–781, doi:10.1016/s0895-4356(01)00341-9 (2001). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S0895-4356(01)00341-9&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=11470385&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F02%2F18%2F2021.02.15.21251727.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000170019900003&link_type=ISI) 57. Zhou, Y., Wang, F., Tang, J., Nussinov, R. & Cheng, F. Artificial intelligence in COVID-19 drug repurposing. The Lancet Digital Health, doi:10.1016/s2589-7500(20)30192-8 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/s2589-7500(20)30192-8&link_type=DOI) 58. Tartaglione, E., Barbano, C. A., Berzovini, C., Calandri, M. & Grangetto, M. Unveiling COVID-19 from CHEST X-Ray with Deep Learning: A Hurdles Race with Small Data. International Journal of Environmental Research and Public Health 17, doi:10.3390/ijerph17186933 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.3390/ijerph17186933&link_type=DOI) 59. Laghi, A. Cautions about radiologic diagnosis of COVID-19 infection driven by artificial intelligence. The Lancet Digital Health 2, doi:10.1016/s2589-7500(20)30079-0 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/s2589-7500(20)30079-0&link_type=DOI) 60. Vancheri, S. G. et al. Radiographic findings in 240 patients with COVID-19 pneumonia: time-dependence after the onset of symptoms. European Radiology, doi:10.1007/s00330-020-06967-7 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s00330-020-06967-7&link_type=DOI) 61. The Lancet Digital, H. Guiding better design and reporting of AI-intervention trials. The Lancet Digital Health 2, doi:10.1016/s2589-7500(20)30223-5 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/s2589-7500(20)30223-5&link_type=DOI)