Moving Biosurveillance Beyond Coded Data: AI for Symptom Detection from Physician Notes
=======================================================================================

* Andrew McMurry
* Amy R Zipursky
* Alon Geva
* Karen L Olson
* James Jones
* Vlad Ignatov
* Timothy Miller
* Kenneth D Mandl

## Abstract

**Background** Real-time surveillance of emerging infectious diseases necessitates a dynamically evolving, computable case definition, which frequently incorporates symptom-related criteria. For symptom detection, both population health monitoring platforms and research initiatives primarily depend on structured data extracted from electronic health records.

**Objective** To validate and test an artificial intelligence (AI) based Natural Language Processing (NLP) pipeline for detecting COVID-19 symptoms from physician notes.

**Methods** Subjects in this retrospective cohort study are patients 21 years old and younger, who presented to a pediatric emergency department (ED) at a large academic children’s hospital between March 1, 2020 and May 31, 2022. ED notes for all patients were processed with an NLP pipeline tuned to detect the mention of 11 COVID-19 symptoms based on CDC criteria. For a gold standard, 3 subject matter experts labeled 226 ED notes and had strong agreement (F1=98.6; PPV=97.2; Recall=100.0). F1, PPV, and recall were used to compare the performance of both NLP and ICD-10 to the gold standard chart review. As a formative use case, variations in symptom patterns were measured across SARS-Cov2 variant eras.

**Results** There were 85,678 ED encounters during the study period, 4.0% with patients with COVID-19. NLP was more accurate at identifying encounters with patients that had any of the COVID-19 symptoms (F1=79.6) than ICD-10 codes (F1=45.1%). NLP accuracy was higher for positive symptoms (recall=93%) than ICD-10 (recall=30%). However, ICD-10 accuracy was higher for negative symptoms (specificity=99.4%) than NLP (specificity=91.7%). Congestion or runny nose showed the highest accuracy difference: NLP F1=82.8%, ICD-10 F1=4.2%. Prevalence of NLP symptoms among patients with COVID-19 differed across variant eras. And patients with COVID-19 were more likely to have each symptom than patients without this disease. Effect sizes (odds ratios) varied across pandemic eras.

**Conclusions** This study establishes the value of AI based NLP as a highly effective tool for real-time COVID-19 symptom detection in pediatric patients, outperforming traditional ICD-10 methods. It also reveals the evolving nature of symptom prevalence across different virus variants, underscoring the need for dynamic, technology-driven approaches in infectious disease surveillance.

Keywords
*   Natural language processing
*   COVID-19
*   artificial intelligence
*   public health
*   biosurveillance
*   surveillance

## Introduction

Real time emerging infection surveillance requires a case definition that often involves symptomatology. To detect symptoms, population health monitoring systems and research studies tend to largely rely on structured data from electronic health records (EHRs), including International Classification of Diseases, 10th Revision (ICD-10) coding [1]. However, symptoms are not diagnoses and therefore, may not be consistently coded. We sought to validate and test an open source artificial intelligence (AI) based natural language processing (NLP) pipeline that includes a large language model to detect COVID-19 symptoms from physician notes. As a formative use case, we measured differences in symptom patterns across SARS-CoV2 variant eras.

## Methods

### Study Design and Setting

This is a retrospective cohort study of all patients up to 21 years old presenting to the emergency department (ED) of a large, free-standing, university-affiliated, pediatric hospital between March 1, 2020 and May 31, 2022. The Boston Children’s Hospital Committee on Clinical Investigation found the study to be exempt from human subjects oversight.

### Study Variables

The main dependent variables were a set of 11 COVID-19 symptoms based on Centers for Disease Control and Prevention (CDC) criteria [2]—fever or chills, cough, shortness of breath or difficulty breathing, fatigue, muscle or body aches, headache, new loss of taste or smell, sore throat, congestion or runny nose, nausea or vomiting, and diarrhea. We identified these symptoms by both NLP and ICD-10. For the formative use case, the study period was divided into 3 variant eras defined using Massachusetts COVID-19 data from CoVariant [3]. The pre-Delta era was March 1, 2020 to June 20, 2021, the Delta era was June 21, 2021 to December 19, 2021, and the Omicron era was December 20, 2021 onwards. A diagnosis of COVID-19 was defined as a positive SARS-CoV2 polymerase chain reaction (PCR) test or the presence of ICD-10 code U07.1 for COVID-19 during an encounter.

### AI/NLP Pipeline Development

Three reviewers reached consensus on a symptom concept dictionary [4] to capture each of the 11 COVID-19 symptoms. They relied on the Unified Medical Language System [5] which has a near comprehensive list of symptom descriptors [6] including SNOMED coded clinical terms [7], ICD-10 codes for administrative billing, abbreviations, and common language for patients [8]. The open source and free Apache cTAKES natural language processing pipeline was tuned to recognize and extract coded concepts for positive symptom mentions (based on the dictionary) from physician notes [9]. Apache cTAKES utilizes a NegEx algorithm which can help address negation [9–12]. To further address negation, we incorporated a large language model, BERT, fine-tuned for negation classification on clinical text [13,14].

### Gold Standard

Two reviewers established a gold standard by manually reviewing physician ED notes. After all notes were labeled by the cTAKES pipeline, a sample of 226 ED notes was loaded into Label Studio [15], an open source application for ground truth labeling. These notes were from patients both with and without COVID-19, and were selected to ensure that each of the 11 symptoms was mentioned in at least 30 ED notes. Some notes mentioned more than one symptom. Using an annotation guide (Supplement 1), 2 reviewers, who were masked from the terms identified by the NLP pipeline for note selection, each labeled 113 notes for mention of the 11 COVID-19 symptoms. As per the guide, only symptoms relevant to the present illness were considered positive mentions. Symptoms were not considered positive mentions if stated as past medical history, family history, social history, or an indication for a medication unrelated to the encounter.

### Inter-rater reliability

The F1 score was used to assess consistency in manual chart review. The F1 score is the balance of recall and positive predictive value (PPV) [16]. It was computed by comparing the annotations of each of the 2 initial reviewers to those of a third reviewer, who independently labeled a subset (n=56, 25%) of notes annotated by the other reviewers. The choice of F1 score as the metric for agreement was informed by the observed high frequency of true negative annotations when they were assigned by chance [9,16,17]. Reliability analyses used Python version 3.10.

### AI/NLP and ICD-10 Accuracy

Accuracy measures of true symptom prevalence for each symptom included F1-score, positive predictive value (PPV), recall (sensitivity) and specificity [18,19].

### Formative use case

The impact of pandemic variant era on COVID-19 symptomatology was examined. Descriptive statistics were used to characterize patients presenting to the ED during each pandemic era.

Symptom prevalence amongst ED patients with COVID-19 was assessed in separate analyses for each symptom using Chi-square analyses of 3x2 tables (pandemic era x symptom presence/absence) with alpha set at .05. Post-hoc Chi-square tests were used to compare each pandemic era with all others using a Bonferroni adjusted alpha of .017. To assess the effect of pandemic era, COVID-19 status, and the interaction of these variables on whether or not a patient had each symptom, logistic regression was used in separate analyses for each symptom. Bonferroni adjusted confidence limits were used for post-hoc analyses. If the interaction term was not significant, main effects for COVID-19 and variant era were reported. Data were analyzed using SAS version 9.4 (SAS Institute Inc.).

## Results

### Study population

There were 59,173 unique patients with 85,678 ED encounters during the study period. Characteristics of the entire study cohort and variant-specific cohorts are summarized in Table 1. A patient could appear in the cohort more than once if they had multiple ED encounters.

View this table:
[Table 1.](http://medrxiv.org/content/early/2023/09/25/2023.09.24.23295960/T1)

Table 1. Characteristics of patients at emergency department encounters.

### Inter-rater reliability

High consistency was demonstrated between Reviewer 3, who labeled a subset of notes, and both Reviewer 1 and Reviewer 2, who each labeled half of the notes chosen to establish the gold standard. F1-scores for the 2 reviewers were 98.8% and 98.4%, respectively. PPV was 97.6% and 96.8%, and recall 100% for both.

### AI/NLP ICD-10 Accuracy

As shown in Table 2, the F1 score for NLP was higher and thus more accurate at identifying encounters with patients that had any of the COVID-19 symptoms than ICD-10. NLP also had higher F1 accuracy for each individual symptom. In addition, NLP sensitivity (recall) of true positive symptoms was higher than ICD-10. However, NLP accuracy of true negative symptoms (specificity) was somewhat lower compared to ICD-10.

View this table:
[Table 2.](http://medrxiv.org/content/early/2023/09/25/2023.09.24.23295960/T2)

Table 2. Accuracy of COVID-19 symptom monitoring by NLP and ICD-10.
F1: accuracy measure balancing precision and recall, ICD-10: International Classification of Diseases, 10th Revision, NLP: natural language processing, PPV: positive predictive value, Recall: also known as sensitivity, Spec: specificity.

The 2 most prevalent symptoms, cough and fever, had NLP recall scores that were among the highest of the symptoms, and much higher than those for ICD-10 codes. The greatest discrepancy between NLP and ICD-10 F1 accuracy was for congestion or runny nose. The smallest difference was for diarrhea.

### Symptom Prevalence over time

During each month of the study, the percentage of encounters with asymptomatic COVID-19 positive patients was much lower using NLP compared to ICD-10 (Figure 1). Using NLP, the range was from 0 to 19% of encounters (Mean 6, SD 4), while with ICD-10, the range was 22 to 52% (Mean 38, SD 7).

![Figure 1.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2023/09/25/2023.09.24.23295960/F1.medium.gif)

[Figure 1.](http://medrxiv.org/content/early/2023/09/25/2023.09.24.23295960/F1)

Figure 1. Asymptomatic COVID-19 patients presenting to emergency departments, as measured using NLP and ICD-10.
NLP (solid red line), ICD-10 (black dotted line), ED (emergency department).

Monthly prevalence for each symptom was higher using NLP than ICD-10 (Supplement 2). The 2 most prevalent symptoms for encounters with COVID-19 patients, cough and fever, are shown in Figure 2 and Figure 3. On average, cough was identified during 52% (SD 13) of the encounters using NLP, but only 15% (SD 5) using ICD-10. And on average, fever characterized 70% (SD 11) of encounters using NLP, but 41% (SD 9) using ICD-10.

![Figure 2.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2023/09/25/2023.09.24.23295960/F2.medium.gif)

[Figure 2.](http://medrxiv.org/content/early/2023/09/25/2023.09.24.23295960/F2)

Figure 2. Prevalence of cough during emergency department encounters with patients with COVID-19.
NLP (solid red line), ICD-10 (black bars), ED (emergency department).

![Figure 3.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2023/09/25/2023.09.24.23295960/F3.medium.gif)

[Figure 3.](http://medrxiv.org/content/early/2023/09/25/2023.09.24.23295960/F3)

Figure 3. Prevalence of fever during emergency department encounters with patients with COVID-19.
NLP (solid red line), ICD-10 (black bars), ED (emergency department).

Using ICD-10, there were many months where individual symptoms were not detected. Of the 27 study months, loss of taste or smell was not detected using ICD-10 during 24 months, nor were muscle or body aches during 13. Three more symptoms had at least 3 consecutive months where each was not detected using ICD-10. These were congestion or runny nose (9), sore throat (8), and fatigue (7). Sporadic months without detection using ICD-10 were observed for headache (5), diarrhea (2), cough (1), and nausea or vomiting (1). Using NLP, sporadic months without detection were observed for just 2 symptoms, loss of taste or smell (6) and sore throat (2).

### Symptom Prevalence across variant eras

Prevalence across variant eras during encounters with patients with COVID-19 differed for each symptom identified by NLP, except for nausea or vomiting and sore throat (Table 3). Post-hoc analyses revealed several patterns. New loss of taste or smell was the only symptom that varied across all 3 eras. It was most prevalent in the pre-Delta era, followed by Delta and then Omicron. Congestion or runny nose, cough, and fever or chills, were more prevalent during Delta and Omicron than during pre-Delta, but Delta did not differ from Omicron. Muscle or body aches were more prevalent during pre-Delta than both Delta and Omicron, but Delta did not differ from Omicron. Diarrhea, fatigue, headache, and shortness of breath were more prevalent during pre-Delta than Omicron but were not different than Delta, and Delta did not differ from Omicron.

View this table:
[Table 3.](http://medrxiv.org/content/early/2023/09/25/2023.09.24.23295960/T3)

Table 3. Symptom prevalence by variant era for encounters with patients with COVID-19.
Variant eras with the same superscript across a row did not differ in post-hoc analyses.

Nausea or vomiting and sore throat did not differ by variant era. Chi-square results are in Supplement 3.

### Symptoms by COVID-19 status and variant era

The interaction of COVID-19 status and variant era on the presence of each symptom is shown in Table 4. However, because the interaction was not significant for 2 symptoms, fever and chills and sore throat, main effects for COVID-19 status are shown for both (*P* <.001). The odds ratios indicate that patients with each of these symptoms were more likely to have COVID-19 at an encounter than not have COVID-19. These symptoms were also more likely to occur during Delta and Omicron than during pre-Delta. For the remaining symptoms, the interaction term was significant and odds ratios in each variant era are shown in the table. The odds ratios comparing patients with COVID-19 to those without the disease differed among the variant eras. Several patterns were observed. For congestion or runny nose, cough, fatigue, headache, muscle or body aches, new loss of taste or smell, shortness of breath or difficulty breathing, each symptom was more likely to be observed in patients with COVID-19. However, effect sizes (odds ratios) differed among pandemic eras. For diarrhea, this symptom was more likely for COVID-19 patients in the pre-Delta and Delta eras, but not during Omicron. And nausea was more likely only in the pre-Delta era. Significant odds ratios ranged in size from 1.3 to 26.7 (Mean 4.6). The logistic regression results are in Supplement 4.

View this table:
[Table 4.](http://medrxiv.org/content/early/2023/09/25/2023.09.24.23295960/T4)

Table 4. Effect of COVID-19 status and variant era on the presence of each symptom.
Odds ratios compare patients with COVID-19 at an ED encounter to patients without the disease. If the interaction term was significant, the effect of COVID-19 during each variant era is shown. Otherwise, the effect for COVID-19 is shown.

## Discussion

### Principal Findings

We find evidence that AI-based NLP of physician notes is a superior method for capturing patient symptoms for real-time biosurveillance than reliance on traditional approaches using ICD-10. NLP was more sensitive than ICD-10 codes in identifying symptoms and some symptoms could only be detected using NLP. As a form of internal validation, the symptoms identified by the CDC as associated with COVID-19 were more prevalent in patients with than without this disease.

### Comparison with Prior Work

The study was also able to capture a nuanced picture of symptom prevalence and odds across different SARS-CoV-2 variant eras. Consistent with previous literature, symptom patterns changed over time as new variants emerged. Variants may present with differences in symptomatology as a result of a number of factors including differences in mutations in spike proteins, receptor binding, and ability to escape host antibodies [20]. As has been previously reported [21–25], we found that fever or chills was the most common COVID-19 symptom across variants. In our cohort, shortness of breath was less common in the Omicron compared to pre-Delta era. Omicron has less of an ability to replicate in the lungs compared to the bronchi, which may explain why this symptom became less common [26]. Studies have reported sore throat as a common symptom in the Omicron era, but we did not observe a significant difference across eras [27,28]. It is possible that we did not see a higher prevalence of sore throat in the Omicron era because it may be more challenging for pediatric patients to describe this symptom. One study found that sore throat was observed more often in those 5-20 years old compared to those 0-4 [28]. Similarly, a study reported that sore throat was more common in those greater than or equal to 13 years old in Omicron compared to Delta [29]. In our study cohort, approximately half of the patients were less than 5 years old. As children this age may not be able to describe their symptoms well, symptoms that are also signs, like fever or cough, might be more commonly documented in physician notes than symptoms like sore throat. New loss of taste or smell was most prevalent in the pre-Delta era, followed by Delta and then Omicron in this study. This symptom has been reported less commonly in Omicron [27,28]. Studies have postulated that patients with Omicron are less likely to present with loss of taste or smell as this variant has less penetration of the mucus layer and therefore may be less likely to infect the olfactory epithelium [30].

### Limitations

There were important limitations in our use of NLP. The NLP pipeline does not account for vital signs and so fever may not have been detected with the pipeline if it was documented in a patient’s vital signs rather than the clinical text. The cTAKES tool in the pipeline lacks the temporal context to ascertain if the mention of a symptom in a note is a new symptom or a prior symptom. We modified our technique because of this, but nevertheless may have overestimated the prevalence of symptoms in our study. Future work will involve filtering by note section so that certain components of a note like past medical history are not included. Finally, we utilized two techniques to recognize negation, but some negated symptoms (e.g., “patient had no cough”) were still captured as positive symptom mentions leading to possible overestimation of symptom prevalence.

Our formative study had some limitations. First, we examined COVID-19 symptoms in patients presenting to a single urban pediatric ED. Patients presenting to outpatient settings, who likely had milder symptoms, were not included and our results may reflect patients with more severe symptoms. And because the setting was a single site, results may not generalize to other EDs.

Second, we defined COVID-19 status as positive if a patient had a PCR positive test for COVID-19 or an appropriate ICD-10 code at the ED encounter. Patients who were COVID-19 positive on a test at home or at an outside center may not have been captured by this definition even if they presented to the ED with COVID-19 [31]. Additionally, symptoms may have differed across variant eras as a result of COVID-19 vaccinations or previous infections rather than variant differences. Literature in adults shows that vaccination is associated with a decrease in systemic symptoms [32]. The United States Food and Drug Administration authorized the use of the COVID-19 vaccine in October of 2021, during the Delta era and prior to the Omicron era, for children 5 to 11 years old [33]. Vaccination rates for pediatric patients vary by age group in Massachusetts; of those 0-19 years of age, 3%-57% have received a primary series but have not been boosted, and 3%-18% have been boosted since September 1, 2022 [34]. As such, some patients in the Delta and Omicron eras may have been vaccinated or had previous COVID-19 infections [35].

## Conclusions

In an era where rapid and accurate infectious disease surveillance is crucial, this study underscores the transformative potential of AI-based NLP for real-time symptom detection, significantly outperforming traditional methods like ICD-10 coding. The dynamic adaptability of NLP technology allows for the nuanced capture of evolving symptomatology across different virus variants, offering a more responsive and precise toolkit for biosurveillance efforts. Its integration into existing healthcare infrastructure could be a game-changer, elevating our capabilities to monitor, understand, and ultimately control the spread of emerging infectious diseases.

## Supporting information

Supplement 1 [[supplements/295960_file03.pdf]](pending:yes)

Supplement 2 [[supplements/295960_file04.pdf]](pending:yes)

Supplement 3 [[supplements/295960_file05.xlsx]](pending:yes)

Supplement 4 [[supplements/295960_file06.xlsx]](pending:yes)

## Data Availability

All data produced in the present study are available upon reasonable request to the authors.

## Conflicts of Interest

None Declared.

### Abbreviations

AI
:   artificial intelligence
ED
:   emergency department
EHR
:   electronic health record
ICD-10
:   International Classification of Diseases, 10th Revision
NLP
:   natural language processing
PPV
:   positive predictive value

## Supplements

### Supplement 1

COVID-19 symptoms annotation guide.

### Supplement 2

Detection of COVID-19 symptoms using NLP and ICD-10 by month for ED encounters with COVID-19 positive patients.

### Supplement 3

Chi-square analysis of COVID-19 symptom prevalence by pandemic variant era for ED encounters with COVID-19 positive patients. Symptoms were detected using NLP.

### Supplement 4

Logistic regression analysis of the effect of COVID-19 status, pandemic variant era, and their interaction on symptom status for ED encounters. Symptoms were detected using NLP.

## Acknowledgements

Conceptualized by KDM, AM, TM. Software and Analysis by KLO, AM, ARZ, JJ, AG, VI. First draft of manuscript by AM, KDM, ARZ. Manuscript edits by KLO, AG. Funding obtained by KDM. Work supported by the Centers for Disease Control and Prevention of the U.S. Department of Health and Human Services (HHS) as part of a financial assistance award (KDM, AM, DG, TM, KLO, IG) The contents are those of the author(s) and do not necessarily represent the official views of, nor an endorsement, by CDC/HHS, or the U.S. Government. ARZ was supported by a training grant from the National Institute of Child Health and Human Development, T32HD040128.

*   Received September 24, 2023.
*   Revision received September 24, 2023.
*   Accepted September 25, 2023.


*   © 2023, Posted by Cold Spring Harbor Laboratory

This pre-print is available under a Creative Commons License (Attribution 4.0 International), CC BY 4.0, as described at [http://creativecommons.org/licenses/by/4.0/](http://creativecommons.org/licenses/by/4.0/)

## References

1.  1.Subramanian A, Nirantharakumar K, Hughes S, Myles P, Williams T, Gokhale KM, Taverner T, Chandan JS, Brown K, Simms-Williams N, Shah AD, Singh M, Kidy F, Okoth K, Hotham R, Bashir N, Cockburn N, Lee SI, Turner GM, Gkoutos GV, Aiyegbusi OL, McMullan C, Denniston AK, Sapey E, Lord JM, Wraith DC, Leggett E, Iles C, Marshall T, Price MJ, Marwaha S, Davies EH, Jackson LJ, Matthews KL, Camaradou J, Calvert M, Haroon S. Symptoms and risk factors for long COVID in non-hospitalized adults. Nat Med 2022 Aug;28(8):1706–1714. PMID35879616
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41591-022-01909-w&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=35879616&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F09%2F25%2F2023.09.24.23295960.atom) 

2.  2.Symptoms of COVID-19. Centers for Disease Control and Prevention. Available from: [https://www.cdc.gov/coronavirus/2019-ncov/symptoms-testing/symptoms.html](https://www.cdc.gov/coronavirus/2019-ncov/symptoms-testing/symptoms.html).
    
    
3.  3.Hodcroft EB. CoVariants: SARS-CoV-2 Mutations and Variants of Interest. 2021. Available from: [https://covariants.org/](https://covariants.org/)
    
    
4.  4.Machine-Learning-for-Medical-Language / ctakes-client-py. Github; Available from: [https://github.com/Machine-Learning-for-Medical-Language/ctakes-client-py/blob/main/ctakesclient/resources/covid\_symptoms.bsv](https://github.com/Machine-Learning-for-Medical-Language/ctakes-client-py/blob/main/ctakesclient/resources/covid_symptoms.bsv) [accessed Aug 13, 2023]
    
    
5.  5.Bodenreider O. The Unified Medical Language System (UMLS): integrating biomedical terminology. Nucleic Acids Res 2004 Jan 1;32(Database issue):D267–70.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/nar/gkh061&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=14681409&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F09%2F25%2F2023.09.24.23295960.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000188079000061&link_type=ISI) 

6.  6.Köhler S, Gargano M, Matentzoglu N, Carmody LC, Lewis-Smith D, Vasilevsky NA, Danis D, Balagura G, Baynam G, Brower AM, Callahan TJ, Chute CG, Est JL, Galer PD, Ganesan S, Griese M, Haimel M, Pazmandi J, Hanauer M, Harris NL, Hartnett MJ, Hastreiter M, Hauck F, He Y, Jeske T, Kearney H, Kindle G, Klein C, Knoflach K, Krause R, Lagorce D, McMurry JA, Miller JA, Munoz-Torres MC, Peters RL, Rapp CK, Rath AM, Rind SA, Rosenberg AZ, Segal MM, Seidel MG, Smedley D, Talmy T, Thomas Y, Wiafe SA, Xian J, Yüksel Z, Helbig I, Mungall CJ, Haendel MA, Robinson PN. The Human Phenotype Ontology in 2021. Nucleic Acids Res 2021 Jan 8;49(D1):D1207–D1217. PMID33264411
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/nar/gkaa1043&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F09%2F25%2F2023.09.24.23295960.atom) 

7.  7.SNOMEDCT_US, UMLS Vocabularies,. Unified Medical Language System (UMLS). Available from: [https://www.nlm.nih.gov/research/umls/sourcereleasedocs/current/SNOMEDCT_US/index.html](https://www.nlm.nih.gov/research/umls/sourcereleasedocs/current/SNOMEDCT_US/index.html) [accessed Apr 13, 2023]
    
    
8.  8.CHV (Consumer Health Vocabulary), UMLS Vocabularies. Unified Medical Language System (UMLS). Available from: [https://www.nlm.nih.gov/research/umls/sourcereleasedocs/current/CHV/index.html](https://www.nlm.nih.gov/research/umls/sourcereleasedocs/current/CHV/index.html) [accessed Apr 13, 2023]
    
    
9.  9.Savova GK, Masanz JJ, Ogren PV, Zheng J, Sohn S, Kipper-Schuler KC, Chute CG. Mayo clinical Text Analysis and Knowledge Extraction System (cTAKES): architecture, component evaluation and applications. J Am Med Inform Assoc 2010 Sep-Oct;17(5):507–513. PMID20819853
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1136/jamia.2009.001560&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20819853&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F09%2F25%2F2023.09.24.23295960.atom) 

10. 10.Chapman WW, Bridewell W, Hanbury P, Cooper GF, Buchanan BG. A simple algorithm for identifying negated findings and diseases in discharge summaries. J Biomed Inform 2001 Oct;34(5):301–310. PMID12123149
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1006/jbin.2001.1029&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=12123149&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F09%2F25%2F2023.09.24.23295960.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000176496400001&link_type=ISI) 

11. 11.Harkema H, Dowling JN, Thornblade T, Chapman WW. ConText: an algorithm for determining negation, experiencer, and temporal status from clinical reports. J Biomed Inform 2009 Oct;42(5):839–851. PMID19435614
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.jbi.2009.05.002&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=19435614&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F09%2F25%2F2023.09.24.23295960.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000270870500010&link_type=ISI) 

12. 12.Chapman WW, Hillert D, Velupillai S, Kvist M, Skeppstedt M, Chapman BE, Conway M, Tharp M, Mowery DL, Deleger L. Extending the NegEx lexicon for multiple languages. Stud Health Technol Inform 2013;192:677–681. PMID23920642
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23920642&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F09%2F25%2F2023.09.24.23295960.atom) 

13. 13.Machine-Learning-for-Medical-Language. GitHub. Available from: [https://github.com/Machine-Learning-for-Medical-Language](https://github.com/Machine-Learning-for-Medical-Language)
    
    
14. 14.Miller T, Bethard S, Amiri H, Savova G. Unsupervised Domain Adaptation for Clinical Negation Detection, in BioNLP. 2017 :165–170.
    
    
15. 15.Tkachenko M, Malyuk M, Holmanyuk A, Liubimov N. Label Studio: Data labeling software. 2020-2022. Available from: [https://github.com/heartexlabs/label-studio](https://github.com/heartexlabs/label-studio)
    
    
16. 16.Hripcsak G, Rothschild AS. Agreement, the f-measure, and reliability in information retrieval. J Am Med Inform Assoc 2005 Jan 31;12(3):296–298. PMID15684123
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1197/jamia.M1733&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=15684123&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F09%2F25%2F2023.09.24.23295960.atom) 

17. 17.McHugh ML. Interrater reliability: the kappa statistic. Biochem Med 2012;22(3):276–282. PMID23092060
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.11613/BM.2012.031&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23092060&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F09%2F25%2F2023.09.24.23295960.atom) 

18. 18.Habibzadeh F, Habibzadeh P, Yadollahie M. The apparent prevalence, the true prevalence. Biochem Med 2022 Jun 15;32(2):020101. PMID35799992
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=35799992&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F09%2F25%2F2023.09.24.23295960.atom) 

19. 19.Monaghan TF, Rahman SN, Agudelo CW, Wein AJ, Lazar JM, Everaert K, Dmochowski RR. Foundational Statistical Principles in Medical Research: Sensitivity, Specificity, Positive Predictive Value, and Negative Predictive Value. Med Bogota Colomb 2021 May 16;57(5). PMID34065637
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=34065637&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F09%2F25%2F2023.09.24.23295960.atom) 

20. 20.Lauring AS, Hodcroft EB. Genetic Variants of SARS-CoV-2-What Do They Mean? JAMA 2021 Feb 9;325(6):529–531. PMID33404586
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1001/jama.2020.27124&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=33404586&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F09%2F25%2F2023.09.24.23295960.atom) 

21. 21.Viner RM, Ward JL, Hudson LD, Ashe M, Patel SV, Hargreaves D, Whittaker E. Systematic review of reviews of symptoms and signs of COVID-19 in children and adolescents. Arch Dis Child 2020 Dec 17; PMID33334728
    
    [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6MTI6ImFyY2hkaXNjaGlsZCI7czo1OiJyZXNpZCI7czo5OiIxMDYvOC84MDIiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyMy8wOS8yNS8yMDIzLjA5LjI0LjIzMjk1OTYwLmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 

22. 22.Götzinger F, Santiago-García B, Noguera-Julián A, Lanaspa M, Lancella L, Calò Carducci FI, Gabrovska N, Velizarova S, Prunk P, Osterman V, Krivec U, Lo Vecchio A, Shingadia D, Soriano-Arandes A, Melendo S, Lanari M, Pierantoni L, Wagner N, L’Huillier AG, Heininger U, Ritz N, Bandi S, Krajcar N, RogliĆ S, Santos M, Christiaens C, Creuven M, Buonsenso D, Welch SB, Bogyi M, Brinkmann F, Tebruegge M, ptbnet COVID-19 Study Group. COVID-19 in children and adolescents in Europe: a multinational, multicentre cohort study. Lancet Child Adolesc Health 2020 Sep;4(9):653–661. PMID32593339
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S2352-4642(20)30177-2&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=32593339&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F09%2F25%2F2023.09.24.23295960.atom) 

23. 23.King JA, Whitten TA, Bakal JA, McAlister FA. Symptoms associated with a positive result for a swab for SARS-CoV-2 infection among children in Alberta. CMAJ 2021 Jan 4;193(1):E1–E9. PMID33234533
    
    [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NDoiY21haiI7czo1OiJyZXNpZCI7czo4OiIxOTMvMS9FMSI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIzLzA5LzI1LzIwMjMuMDkuMjQuMjMyOTU5NjAuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 

24. 24.Takács AT, Bukva M, Gavallér G, Kapus K, Rózsa M, Bán-Gagyi B, Sinkó M, Szücs D, Terhes G, Bereczki C. Epidemiology and clinical features of SARS-CoV-2 infection in hospitalized children across four waves in Hungary: A retrospective, comparative study from March 2020 to December 2021. Health Sci Rep Wiley; 2022 Nov;5(6):e937. PMID36425898
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=36425898&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F09%2F25%2F2023.09.24.23295960.atom) 

25. 25.Kenney PO, Chang AJ, Krabill L, Hicar MD. Decreased Clinical Severity of Pediatric Acute COVID-19 and MIS-C and Increase of Incidental Cases during the Omicron Wave in Comparison to the Delta Wave. Viruses 2023 Jan 7;15(1). PMID36680220
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=36680220&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F09%2F25%2F2023.09.24.23295960.atom) 

26. 26.Hui KPY, Ho JCW, Cheung M-C, Ng K-C, Ching RHH, Lai K-L, Kam TT, Gu H, Sit K-Y, Hsin MKY, Au TWK, Poon LLM, Peiris M, Nicholls JM, Chan MCW. SARS-CoV-2 Omicron variant replication in human bronchus and lung ex vivo. Nature 2022 Mar;603(7902):715–720. PMID35104836
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41586-022-04479-6&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=35104836&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F09%2F25%2F2023.09.24.23295960.atom) 

27. 27.Akaishi T, Kushimoto S, Katori Y, Sugawara N, Egusa H, Igarashi K, Fujita M, Kure S, Takayama S, Abe M, Kikuchi A, Ohsawa M, Ishizawa K, Abe Y, Imai H, Inaba Y, Iwamatsu-Kobayashi Y, Nishioka T, Onodera K, Ishii T. COVID-19-Related Symptoms during the SARS-CoV-2 Omicron (B.1.1.529) Variant Surge in Japan. Tohoku J Exp Med 2022 Sep 6;258(2):103–110. PMID36002251
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=36002251&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F09%2F25%2F2023.09.24.23295960.atom) 

28. 28.Menni C, Valdes AM, Polidori L, Antonelli M, Penamakuri S, Nogal A, Louca P, May A, Figueiredo JC, Hu C, Others. Symptom prevalence, duration, and risk of hospital admission in individuals infected with SARS-CoV-2 during periods of omicron and delta variant dominance: a prospective observational study from the ZOE COVID Study. Lancet Elsevier; 2022;399(10335):1618–1624.
    
    
29. 29.Shoji K, Akiyama T, Tsuzuki S, Matsunaga N, Asai Y, Suzuki S, Iwamoto N, Funaki T, Ohmagari N. Clinical characteristics of COVID-19 in hospitalized children during the Omicron variant predominant period. J Infect Chemother 2022 Nov;28(11):1531–1535. PMID35963599
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.jiac.2022.08.004&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=35963599&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F09%2F25%2F2023.09.24.23295960.atom) 

30. 30.Butowt R, Bilińska K, von Bartheld C. Why Does the Omicron Variant Largely Spare Olfactory Function? Implications for the Pathogenesis of Anosmia in Coronavirus Disease 2019. J Infect Dis 2022 Oct 17;226(8):1304–1308. PMID35467743
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=35467743&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F09%2F25%2F2023.09.24.23295960.atom) 

31. 31.Wang L, Zipursky A, Geva A, McMurry AJ, Mandl KD, Miller TA. A computable phenotype for patients with SARS-CoV2 testing that occurred outside the hospital. medRxiv 2023 Jan 19; PMID36711461
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=36711461&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F09%2F25%2F2023.09.24.23295960.atom) 

32. 32.Bramante CT, Proper JL, Boulware DR, Karger AB, Murray T, Rao V, Hagen A, Tignanelli CJ, Puskarich M, Cohen K, Liebovitz DM, Klatt NR, Broedlow C, Hartman KM, Nicklas J, Ibrahim S, Zaman A, Saveraid H, Belani H, Ingraham N, Christensen G, Siegel L, Sherwood NE, Fricton R, Lee S, Odde DJ, Buse JB, Huling JD. Vaccination Against SARS-CoV-2 Is Associated With a Lower Viral Load and Likelihood of Systemic Symptoms. Open Forum Infect Dis 2022 May;9(5):ofac066. PMID35392460
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/ofid/ofac066&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=35392460&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F09%2F25%2F2023.09.24.23295960.atom) 

33. 33.FDA Authorizes Pfizer-BioNTech COVID-19 Vaccine for Emergency Use in Children 5 through 11 Years of Age. US Food and Drug Administration. Available from: [https://www.fda.gov/news-events/press-announcements/fda-authorizes-pfizer-biontech-covid-19-vaccine-emergency-use-children-5-through-11-years-age](https://www.fda.gov/news-events/press-announcements/fda-authorizes-pfizer-biontech-covid-19-vaccine-emergency-use-children-5-through-11-years-age)
    
    
34. 34.Weekly COVID-19 Vaccination Report. Massachusetts Department of Public Health COVID-19 Dashboard --Wednesday, April5, 2023. Available from: [https://www.mass.gov/doc/weekly-covid-19-vaccination-report-april-5-2023/download](https://www.mass.gov/doc/weekly-covid-19-vaccination-report-april-5-2023/download)
    
    
35. 35.Bhattacharyya RP, Hanage WP. Challenges in Inferring Intrinsic Severity of the SARS-CoV-2 Omicron Variant. N Engl J Med 2022 Feb 17;386(7):e14. PMID35108465
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access\_num=10.1056/NEJMP2119682/SUPPL_FILE/NEJMP2119682_DISCLOSURES.PDF&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F09%2F25%2F2023.09.24.23295960.atom)