Abstract
Background The majority of clinical studies reporting on COVID-19 symptom frequencies focus on patients already hospitalized. Thus, reported symptom frequencies may not be applicable to the general population. Here we report COVID-19 symptom frequencies for the general population in a major European city.
Methods During a scientific collaboration between the Vienna Social Fund (FSW, Vienna, Austria), the Public Health Services of the City of Vienna (MA15) and the AI-biotech company Symptoma we recorded symptom frequencies gathered by the COVID-19 chatbot of the city government of Vienna and corresponding SARS-CoV-2 nucleic acid amplification test (NAAT) results. Chatbot users answered 13 yes/no questions about symptoms and provided information about age and sex. Subsequently a medically trained professional came to their address to take a sample and NAAT results were obtained.
Findings Between November 2 and January 5, a total of 3011 persons experiencing flu-like symptoms had completed the chatbot-session and were subsequently tested by a NAAT. NAATs were performed by at home visitations of medical professionals. NAAT analysis was positive in 816 persons (27.1%). We compared the symptom frequencies between COVID-19 positive and negative users, and between male and female users. The symptoms (sorted by frequency) of users with positive NAATs were malaise (81.1%), fatigue (72.9%), headache (64.1%), cough (57.7%), fever (50.7%), sore throat (40.7%), rhinorrhea (31.0%), sneezing (28.4%), dysgeusia (27.1%), hyposmia (26.5%), dyspnea (11.4%) and diarrhea (10.9%) while 34.9% reported a close contact with a COVID-19 case. Among these the frequencies of cough, fever, hyposmia, dysgeusia, malaise, headache, close contact with COVID-19 case and fatigue were significantly (P < 0.01) increased in COVID-19 positive persons while the frequencies of dyspnea, diarrhea and sore throat were significantly (P < 0.01) decreased in COVID-19 positive persons. There was no significant difference for rhinorrhea and sneezing.
Introduction
COVID-19 symptom frequencies deliver valuable information for health authorities during this pandemic (e.g. screening the population and rapidly identifying cases). Symptom frequencies are used in applications including triage recommendation [1] and COVID-19 diagnostics [2] [3]. However, the reported symptom frequencies vary considerably across studies [1]. An example is “fever” for which symptom frequencies were reported between 7% and 91% [1]. Causes for these discrepancies include the over-representation of patients in hospital settings [2] and the lack of clarity on how symptoms were collected [1].
This study describes the COVID-19 symptom frequencies observed in a non-hospitalized metropolitan population experiencing flu-like symptoms and compares them with COVID-19 negatively tested persons also experiencing flu-like symptoms. The data were collected systematically via a standardized web interface and the COVID-19 status was determined by standardized NAAT.
Methods
Data collection
From November 2020, the City of Vienna’s online COVID-19 symptom checker provided inhabitants of Austria’s capital with an initial COVID-19 risk assessment, as well as possible options for further action, e.g. a NAAT [4] [5] test, taken by visitation at home.
The aim was to offer an additional service, complementing the medical telephone health service “1450”. The symptom checker lent itself as an easily executable digital channel, available via https://coronavirus.wien.gv.at/site/symptomchecker/.
The Vienna Social Fund (FSW), the Public Health Services of the City of Vienna (MA15) and the company Symptoma mutually developed the chatbot based on the database, artificial intelligence, and algorithms of Symptoma [3], [6], [7]. During the chatbot conversation, the user is answering a series of 13 yes/no questions about symptoms. These were fever (>38°C), cough, dyspnea, sneezing, rhinorrhea, sore throat, malaise, fatigue, diarrhea, headache, hyposmia, dysgeusia and close contact with COVID-19 case. Each user was asked the same set of questions and responses were recorded accordingly. If the chatbot AI assessed a user to have a medium/high risk for a COVID-19 infection, the user reported a positive self-test, the user returned from abroad or the user had a severe medical precondition and any type of symptom then the user was offered a NAAT [4] [5].
The statistics reported in this paper are based on the combined information of the chatbot conversations and the results of the NAATs. A total of 3011 users were screened this way between the 2nd of November 2020 and 5th of January 2021.
Data analysis
All data were anonymised prior to this analysis. Only sex, age group, the answers to the questions and the result of the NAAT were used for this study. Users who did not provide a sex information (n=20) were exlcuded from the symptom frequencies comparison between female and male. All analyses were done in Python 3.8 using the libraries numpy (1.19.4), pandas (1.1.5), scikit-learn (0.24.0) and statsmodels (0.12.1). P-values were calculated by a two-tailed Fisher’s exact test and corrected for multiple testing by the Benjamini-Hochberg method [8].
Results
Symptom frequencies among COVID-19 positiv and COVID-19 negativ users
Summary statistics of participants and numerical details can be found in Table 1. Our study cohort consists of 3011 non-hospitalised persons experiencing flu-like symptoms of which 816 (27.1%) tested positive for COVID-19 (C19+) and 2142 (71.2%) tested negative for COVID-19 (C19-). The test was unclear for 53 persons (1.7%). While we did not record the exact age of users for data protection issues (see Table 1) the median group age is 35 years for C19+ as well as for C19-.
In Figure 1, we compared the symptom frequencies between C19+ and C19-. The symptoms most frequently reported by C19+ users were malaise (81.1%), fatigue (72.9%), headache (64.1%), cough (57.7%) and fever (50.7%). Users less frequently reported sore throat (40.7%), close contact with COVID-19 case (34.9%), rhinorrhea (31.0%), sneezing (28.4%), dysgeusia (27.1%) and hyposmia (26.5%). Dyspnea (11.4%) and diarrhea (10.9%) were rarely reported (Figure 1A). C19+ users significantly (P < 0.001) more often experienced cough, fever, hyposmia, dysgeusia, malaise, headache and close contact with COVID-19 case. On the contrary, C19-users significantly (P < 0.01) more often experienced sore throat, diarrhea and dyspnea.
No significant difference between the C19+ and C19-group was found for rhinorrhea (P = 0.19) and sneezing (P = 0.11) (Figure 1A).
The largest relative increase in C19+ persons was found for hyposmia (+63% or +16 percent points), dysgeusia (+46% or +12 percent points), cough (+39% or +22 percent points) and fever (+39% or +20 percent points). The largest relative decrease in C19+ persons was found in dyspnea (−56% or −6 percent points) and diarrhea (−60% or −7 percent points) (Figure 1B).
Co-occurrence and correlation of symptoms
Co-occurrence and correlation matrices between the symptoms for C19+, C19- and for all users together are shown in Supplementary Figure 2. The three highest co-occurrences can be found for malaise and fatigue in C19+ (present in 67%) and all users (60%) as well as for malaise and headache in C19+ (59%).
The three highest correlations can be found for dysgeusia and hyposmia in C19+ (rs=0.70), all (rs=0.64) and C19-(rs=0.58). Other correlations for all users include expected pairs like malaise and fatigue (rs=0.48), sneezing and rhinorrhea (rs=0.45), malaise and headache (rs=0.37), headache and fatigue (rs=0.33) as well as fever and malaise (rs=0.31). There were no notable correlations for age and sex. For details see Supplementary Figure 2.
Symptom frequency comparison between female and male users
In the C19+ group, fever is more frequently present for men than for women (P = 0.010) while sneezing is more frequently present for women than for men (P = 0.0012).
In the C19-group, the fever frequency is also higher for men than for women (P < 0.001) but for sneezing the difference is insignificant (P = 0.27). In the C19-group, women more frequently experience headache and sore throat (P < 0.001) (Figure 2).
Discussion
To the best of our knowledge, this is the first study reporting COVID-19 symptom frequencies gathered by an automated system directly associated with experimental NAATs.
Our results showed a cough frequency of 58% which falls within the previously reported symptom frequency range of the literature [9] (range [43%-71%]). Similar agreements were found for sore throat (41% vs [5%-71%]), fever (51% vs [7%-91%]), malaise (81% vs [19%-86%]) and headache (64% vs [3%-71%]). The relatively high upper limit of these literature symptom frequency ranges might be due to the majority of previous studies being based on hospitalized patients [1].
Only the frequency of fatigue (73%) does not fall within the range of [9] (10%-57%). This discrepancy might be explained by the non-specific nature of fatigue.
The results also show that all the symptom frequencies reported here (with the exception of rhinorrhea and sneezing) are different between C19+ from C19-users. This agrees with other studies [10] [11] that also reported hyposmia, dysgeusia and fever as significantly increased in C19+ persons. Considering the high relative difference of hyposmia and dysgeusia frequencies for C19+ users, our study suggests that hyposmia and dysgeusia are specific but not sensitive, i.e. their presence likely leads to C19+ but no conclusion can be drawn from their absence.
Finally, results show symptoms were reported in different frequencies by men and women, which could be caused by sex-specific differences in the clinical course [12], [13] or women being more likely to take a test at lighter symptoms (COVID-19 positiv rates were 24.8% for women and 29.7% for men while paticipants were 55.0% female and 45.0% male).
One strength of our study is the systematic symptom collection i.e. each user was asked the same questions by the chatbot which allows for higher data consistency than handwritten reports. Additionally, the result of the NAAT is automatically associated with the reported symptoms, instead of being communicated by the users themselves which would risk the inclusion of falsely reported NAAT results. Another strength is that most symptom frequencies reported in the literature are based on hospitalized patients [1] while this study reports symptom frequencies for the general population in central Europe.
However, our study also has limitations: there is a selection bias for participants as old people are unlikely to use a chatbot (see Table 1 and [14]), there is an overrepresentation of female participants in this study (45.0% male and 55.0% female) and persons with a low risk of being C19+ were not offered a test (see methods). Also, the dyspnea frequency and difference reported here might be due to a sample bias as (a) dyspnea is often a late symptom of an infection while chatbot users might rather be at an earlier stage of an infection (b) dyspnea can be a distressing symptom and affected individuals might rather call an emergency hotline instead of using a chatbot [15] and/or (c) dyspnea might be underreported in patients with COVID-19 pneumonia which might not be the case for other (non-COVID-19-related) pulmonary diseases, such as chronic obstructive pulmonary disease, or acute hypercapnic respiratory failure [16].
Also the finding that C19-persons more often experienced sore throat and diarrhea than C19+ persons might be due to the potentially rather early time point at which the chatbot is used during an infection progression i.e. it is likely that persons use the chatbot once they start experiencing flu-like symptoms but seldomly after being a week or two into the infection. Nevertheless we believe that these differences are interesting insights for early onset diagnosis of COVID-19.
Data Availability
All relevant data is reported within the study.
Data Availability
All relevant data is contained within the manuscript
Funding
This study has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 830017 and by the Austrian Research Promotion Agency under grant agreement No 880939 (supported by the Federal Ministries Republic of Austria for Digital and Economic Affairs and Climate Action, Environment, Energy, Mobility, Innovation and Technology).
Declaration of interests
NM, SG, AM, JN, TL and BK are employees of Symptoma GmbH. JN and TL hold shares of Symptoma.
Ethical considerations
This study was exempted from ethics review by the ethics commission of the city of Vienna (MA15-EK/21-037-VK_NZ). All individuals using the chatbot agreed that their data will be used in an anonymised way.
Author contributions
Study design: BK, JN, TL. Data compilation: NM, TL, MB. Data analysis and critical revision: NM, BK, AM, JA. Writing the manuscript: NM, BK. Revising the manuscript critically: BK, SG, AM, JN, JA, MB.
Acknowledgement
We would like to thank the Vienna Social Fund (FSW), Public Health Services of the City of Vienna (MA15) and the Vienna Health Authority for providing data and making this publication possible.
Footnotes
v2