RT Journal Article
SR Electronic
T1 Augmented Curation of Clinical Notes from a Massive EHR System Reveals Symptoms of Impending COVID-19 Diagnosis
JF medRxiv
FD Cold Spring Harbor Laboratory Press
SP 2020.04.19.20067660
DO 10.1101/2020.04.19.20067660
A1 Wagner, Tyler
A1 Shweta, FNU
A1 Murugadoss, Karthik
A1 Awasthi, Samir
A1 Venkatakrishnan, AJ
A1 Bade, Sairam
A1 Puranik, Arjun
A1 Kang, Martin
A1 Pickering, Brian W.
A1 O’Horo, John C.
A1 Bauer, Philippe R.
A1 Razonable, Raymund R.
A1 Vergidis, Paschalis
A1 Temesgen, Zelalem
A1 Rizza, Stacey
A1 Mahmood, Maryam
A1 Wilson, Walter R.
A1 Challener, Douglas
A1 Anand, Praveen
A1 Liebers, Matt
A1 Doctor, Zainab
A1 Silvert, Eli
A1 Solomon, Hugo
A1 Anand, Akash
A1 Barve, Rakesh
A1 Gores, Gregory J.
A1 Williams, Amy W.
A1 Morice, William G.
A1 Halamka, John
A1 Badley, Andrew D.
A1 Soundararajan, Venky
YR 2020
UL http://medrxiv.org/content/early/2020/06/11/2020.04.19.20067660.abstract
AB Understanding temporal dynamics of COVID-19 patient symptoms could provide fine-grained resolution to guide clinical decision-making. Here, we use deep neural networks over an institution-wide platform for the augmented curation of clinical notes from 77,167 patients subjected to COVID-19 PCR testing. By contrasting Electronic Health Record (EHR)-derived symptoms of COVID-19-positive (COVIDpos; n=2,317) versus COVID-19-negative (COVIDneg; n=74,850) patients for the week preceding the PCR testing date, we identify anosmia/dysgeusia (27.1-fold), fever/chills (2.6-fold), respiratory difficulty (2.2-fold), cough (2.2-fold), myalgia/arthralgia (2-fold), and diarrhea (1.4-fold) as significantly amplified in COVIDpos over COVIDneg patients. The combination of cough and fever/chills has 4.2-fold amplification in COVIDpos patients during the week prior to PCR testing, and along with anosmia/dysgeusia, constitutes the earliest EHR-derived signature of COVID-19.
This study introduces an Augmented Intelligence platform for the real-time synthesis of institutional biomedical knowledge. The platform holds tremendous potential for scaling up curation throughput, thus enabling EHR-powered early disease diagnosis.
Competing Interest Statement: The authors are all employees of nference or the Mayo Clinic. The authors from nference have financial interests in the company. One or more of the investigators associated with this project and Mayo Clinic have a Financial Conflict of Interest in technology used in the research, and the investigator(s) and Mayo Clinic may stand to gain financially from the successful outcome of the research. This research has been reviewed by the Mayo Clinic Conflict of Interest Review Board and is being conducted in compliance with Mayo Clinic Conflict of Interest policies. ADB is a consultant for AbbVie, is on scientific advisory boards for nference and Zentalis, and is founder and President of Splissen Therapeutics.
Funding Statement: ADB is supported by Grants AI 110173 and AI120698 from NIAID, 109593-62-RGRL from amfAR, and the HH Sheikh Khalifa Bin Zayed Al-Nahyan named professorship from Mayo Clinic.
Author Declarations: All relevant ethical guidelines have been followed; any necessary IRB and/or ethics committee approvals have been obtained and details of the IRB/oversight body are included in the manuscript. Yes. All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived. Yes. I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov.
I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance). Yes. I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable. Yes.
The Mayo Clinic EHR dataset on which augmented curation was conducted was accessed under IRB 20-003278, "Study of COVID-19 patient characteristics with augmented curation of Electronic Health Records (EHR) to inform strategic and operational decisions". The EHR data cannot be shared or released due to HIPAA regulations. Contact the corresponding authors for additional details regarding the IRB, and refer to the Mayo Clinic IRB website for further details on our commitment to patient privacy (https://www.mayo.edu/research/institutional-review-board/overview). The summary statistics derived from the EHRs are enclosed within the manuscript.