Long COVID symptoms from Reddit: Characterizing post-COVID syndrome from patient reports ======================================================================================== * Abeed Sarker * Yao Ge ## Abstract **Objective** To mine Reddit to discover long-COVID symptoms self-reported by users, compare symptom distributions across studies, and create a symptom lexicon. **Materials and Methods** We retrieved posts from the */r/covidlonghaulers* subreddit and extracted symptoms via approximate matching using an expanded meta-lexicon. We mapped the extracted symptoms to standard concept IDs, compared their distributions with those reported in recent literature and analyzed their distributions over time. **Results** From 42,995 posts by 4249 users, we identified 1744 users who expressed at least 1 symptom. The most frequently reported long-COVID symptoms were *mental health-related symptoms* (55.2%), *fatigue* (51.2%), *general ache/pain* (48.4%), *brain fog/confusion* (32.8%) and *dyspnea* (28.9%) amongst users reporting at least 1 symptom. Comparison with recent literature revealed a large variance in reported symptoms across studies. Temporal analysis showed several persistent symptoms up to 15 months after infection. **Conclusion** The spectrum of symptoms identified from Reddit may provide early insights about long-COVID. Keywords (MeSH) * Social media (MeSH ID: D061108) * Communicable Diseases (MeSH ID: D003141) * Virus Diseases (MeSH ID: D014777) * Natural language processing (MeSH ID: D009323) * Text mining (MeSH ID: D057225) ## INTRODUCTION Many patients continue to experience symptoms long after the acute phase of infection with the SARS-CoV-2 (COVID-19) virus,1,2 and currently, the persistence of symptoms 28 days after the diagnosis of COVID-19 infection is referred to as ‘*post-acute COVID syndrome*’, ‘*post-acute sequelae of SARS-CoV-2*’, ‘*long-haul COVID-19*’ or ‘*long-COVID*’.3 A growing body of research is exploring long-COVID and the constellation of symptoms that patients experience following the acute phase of COVID-19 infection.3–6 The phrase ‘long-COVID’ was coined by patients, many of whom initially suffered mild symptoms only to progress to complex illnesses.7 To date, research suggests that long-COVID symptoms may affect people of varying ages including children and those with asymptomatic acute infections.8–12 However, after more than a year since the beginning of the pandemic, there are still many unknowns associated with long-COVID, including the full spectrum of symptoms that patients experience, their progression, and their long-term manifestation. As more patients around the world continue to recover from acute COVID-19 infections, long-COVID is emerging to be a complex health problem affecting millions of people globally, and a wide range of symptoms are being discovered over time.5,9,12,13 Consequently, this syndrome is emerging as a critical research topic relevant to the pandemic with many ongoing studies globally. One potential source of information regarding long-COVID syndrome is social media, as many people, including healthcare professionals who had been infected with COVID-19, are reporting their experiences through this medium.14,15 Social media adoption is currently at an all-time high and continues to grow globally and in the United States.16,17 In the past, large volumes of user-generated information on social media have been utilized by researchers to obtain early insights about health-related topics, including infectious diseases.18,19 Due to the global nature of the current COVID-19 pandemic, there is an abundance of knowledge about diverse aspects of it in publicly available social media data. Studies have thus attempted to utilize this resource to answer targeted questions associated with COVID-19.20–23 In this study, we attempted to mine publicly available social media data from Reddit to discover and analyze the spectrum of symptoms self-reported by sufferers of long-COVID. This study builds on our prior work on the topic, which focused on extracting self-reported COVID-19 symptoms from Twitter.23 Our objectives for this study were to extend a COVID-19 symptom lexicon built using Twitter data, deploy the extended lexicon to identify self-reported long-COVID symptoms from a specific forum on Reddit, analyze the distribution of symptoms and compare them with symptoms reported in recent studies. We describe our methods and findings in the following sections. ## MATERIALS AND METHODS ### Data source and collection We collected data for study from Reddit, which consists of many topic-specific forums called subreddits. The subreddit */r/covidlonghaulers* has emerged as the go-to forum for the discussion of long-COVID related topics, and it is an information-rich source for obtaining self-reported long-COVID symptoms. As of 6th June, 2021, the subreddit had over 14,000 subscribers and over 1200 threads, most of which focus on long-COVID experiences self-reported by users. Within each thread, users report and discuss the symptoms they experience and also often provide timelines of their symptoms. This subreddit is thus an excellent resource for obtaining crowdsourced information about long-COVID syndrome. We collected all the posts from this subreddit using the PRAW application programming interface (API).24 The earliest post included in our analyses was from 25th July, 2020 and the latest post was from 6th June, 2021. ### Symptom extraction from user posts We first manually annotated a symptom lexicon from Reddit, grouping similar symptoms and mapping each symptom expression to a standard identifier in the unified medical language system (UMLS) using the National Center for Biomedical Ontology BioPortal.25 We augmented the lexicon entries for rarely occurring symptoms by adding expressions from the Medical Dictionary for Regulatory Activities (MedDRA).26 We combined this extended Reddit lexicon with the previously-developed Twitter lexicon23 to create a meta-lexicon consisting of common COVID-19 and long-COVID symptom expressions. We then further expanded the meta-lexicon automatically using the LexExp tool.27 During the manual annotation process, we discovered that users often expressed the resolution or absence of symptoms using negation expressions such as ‘*no*’, ‘*do not*’, ‘*never had*’, and so we created a small lexicon of such negations. To detect symptoms from free text, we applied an inexact matching method. Exact matching on social media free texts typically results in low recall due to the presence of nonstandard expressions and misspellings. To overcome this issue, in line with our past work, we searched through term sequence windows in the texts and computed the similarity of each text window with all entries in the meta-lexicon. Text windows that obtained similarities above a specific threshold with any entry in the lexicon were extracted and considered to be candidates for long-COVID symptoms. We used the Levenshtein ratio metric to compute similarity, with a high threshold of 0.98. For an expression of *n* terms, the term sequence windows were of the range [*n* − 1: *n* + 2]. A negation detection algorithm then searches for the occurrence of negations in the same post as the candidate symptom and estimating if the symptom appears within the scope of the negation. The negation detection algorithm preprocesses the texts by lowercasing and tokenizing, and labels any term appearing within a 3 token window or before an end-of-sentence marker following the negation term (eg., a period) as negated. We grouped the symptoms per user and generated their frequency distributions for analysis and comparison. We evaluated the performance of the algorithm using the precision, recall and F1-score metrics. To estimate precision and recall, we selected a random sample of posts, including posts from which at least one symptom was automatically detected and posts from which our algorithm did not detect any. The F1-score is the harmonic mean of precision and recall (*F*1 − *score* = (2 × *precision* × *recall*)/(*precision* + *recall*)). ### Symptom distribution by month During our exploration of the threads within the subreddit, we discovered many thread titles that specified the number of months since the initial COVID-19 diagnosis that the particular user was experiencing symptoms for (*eg*., ‘*8 month update*’, ‘*11 month long hauler just got first vaccine*’). We created a set of regular expressions to detect such expressions and computed the relative frequency distributions for the numbers of users expressing at least one symptom within these posts. We grouped the frequencies by month, starting from 3 up to 15. ## RESULTS We collected 42,995 posts from 1220 threads and 4249 users. We reviewed posts associated with a total of 450 symptom expressions (*i*.*e*., the same post could be reviewed multiple times in association with different detected symptoms) along with 25 posts without any detected symptoms to estimate the performance of our approach (recall = 0.93, precision = 0.95, F1-score = 0.94). 1744 users expressed at least 1 non-negated symptom (2576 users without accounting for negations). Figure 1 presents the symptom distributions reported by this cohort. From the histogram (left) we can see that most users reported 1 – 6 symptoms. The median number of symptoms reported per user was 3 (boxplot; right) and a handful of users reported 12 or more symptoms, with 32 being the highest number of symptoms expressed by a single user. ![Figure 1.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/06/18/2021.06.15.21259004/F1.medium.gif) [Figure 1.](http://medrxiv.org/content/early/2021/06/18/2021.06.15.21259004/F1) Figure 1. Distribution of long-COVID symptoms reported by Reddit users. Figure 2 presents the raw counts for the top 20 symptoms reported by the users and their relative frequencies as a percentage of the total number of users who reported at least 1 non-negated symptom. There are some interesting differences between the acute COVID-19 symptoms reported on Twitter, which we presented in a prior publication, and the long-COVID symptoms we detected in this study.23 The most commonly reported long-COVID symptoms are *anxiety/stress and related mental health symptoms*.† The next 4 most commonly reported symptoms are *fatigue, general & body ache & pain, confusion/disorientation*‡ and *dyspnea. Pyrexia*, which was the most commonly reported symptom among users who self-reported to have tested positive for COVID-19, is reported much less frequently for long-COVID (6th). ![Figure 2.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/06/18/2021.06.15.21259004/F2.medium.gif) [Figure 2.](http://medrxiv.org/content/early/2021/06/18/2021.06.15.21259004/F2) Figure 2. Number of users reporting each of the top 20 symptoms and their percentages. Figure 3 illustrates, via a heatmap, the relative frequencies of 24 symptoms from months 3 to 15. From the figure, it appears that *general* & *body ache & pain, fatigue, anxiety, stress and other mental health issues*, and *dyspnea* were reported to be the most persistent symptoms, with a large number of users reporting to experience it 12 months after their initial diagnosis. Some symptoms, such as pyrexia, *GI issues*, and *oropharyngeal pain* appear to lower over time. Table 1 presents the relative frequency distributions of 10 commonly-reported long-COVID symptoms from this study and 7 recently-published papers on long-COVID. *Fatigue* is consistently reported with high frequency across studies, but considerable variations are can be observed in the distribution all symptoms. View this table: [Table 1.](http://medrxiv.org/content/early/2021/06/18/2021.06.15.21259004/T1) Table 1. Comparison of long-COVID symptom distributions (in percentage) identified in this study with those reported in recent literature for frequently-reported symptoms. ![Figure 3.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/06/18/2021.06.15.21259004/F3.medium.gif) [Figure 3.](http://medrxiv.org/content/early/2021/06/18/2021.06.15.21259004/F3) Figure 3. Heatmap of relative frequencies of symptoms from months 3 to 15. ## DISCUSSION AND CONCLUSIONS While there is still limited research data available about long-COVID symptoms, our findings broadly agree with findings from recently-published studies. The ranges for the symptom frequency distributions in Table 1 are wide, which is likely due to the differing patient populations included in the studies. This also suggests that the long-COVID experiences of patients can vary considerably compared to acute COVID-19 infections, and further research is essential to better understand the chronic manifestation of this syndrome. Figure 3 shows that patients continue to experience certain symptoms, such as *fatigue, dyspnea, aches & pains*, and *mental health* symptoms up to 15 months following acute infection. Continuing surveillance of this cohort may reveal how the symptoms resolve, if at all. From these findings, it is evident that as some countries manage to lower new COVID-19 infections through vaccinations, an ongoing global challenge will be to address complications associated with long-COVID. With limited scientific data available about long-COVID and with medical care globally primarily focusing on treating acute COVID-19 cases, many long-COVID sufferers are turning to social media to discuss their persistent symptoms, finding other sufferers with similar symptoms, and identifying potential solutions to their debilitating symptoms and improving quality of life. It is possible, and perhaps likely, that many patients will continue to suffer from long-COVID symptoms in the future, and social media will be an invaluable resource for obtaining early, crowdsourced insights about the topic. While manually reviewing many of the posts from our chosen subreddit, we found numerous posts where the users expressed their frustrations regarding their healthcare providers not understanding the extent of their sufferings and/or not being able to diagnose the underlying reasons behind their persistent symptoms. For example, many users who reported *dyspnea* stated that they had no issues with their lungs during the acute phases of their COVID-19 infections, and their care providers, who relied on standard imaging methods for diagnosis, dismissed their sufferings since imaging did not reveal anything wrong. We also found that many users explained that their symptoms were not continuous, but cyclic—disappearing and returning from time to time. Many users also discussed that their long-COVID symptoms were preventing them from getting back to their usual lives and work, adding to their mental health problems. Our study has several notable limitations: the detection of symptoms is not 100% accurate, users may not report all the symptoms they experience (leading to underestimation of symptom prevalence), Reddit users tend to be younger compared to the general population, and rare symptoms may have been missed during our lexicon preparation. In our future work, we will attempt to analyze, from social media, how long-COVID typically progresses, the distribution of time spans at which users report the resolution of symptoms, and treatment regimens that appear to help. Social media data, including from Reddit and Twitter, may help the medical community to better understand long-COVID from the perspective of patients and thus enable them to improve the care they provide. Considering the large volume of data that is generated in social media about this topic, it is necessary to develop automated methods involving NLP for long-term surveillance. To aid future research, we have made the lexicon developed for this study publicly available. ## Data Availability Lexicon associated with the article is publicly available. [https://sarkerlab.org/covid\_sm\_data\_bundle](https://sarkerlab.org/covid_sm_data_bundle) ## Footnotes * † We grouped such symptoms into one because users often use non-standard expressions for these and it is difficult to separate such symptoms in a more fine-grained manner. * ‡ The most commonly used phrase by the users is *brain fog*. * Received June 15, 2021. * Revision received June 15, 2021. * Accepted June 18, 2021. * © 2021, Posted by Cold Spring Harbor Laboratory This pre-print is available under a Creative Commons License (Attribution-NonCommercial-NoDerivs 4.0 International), CC BY-NC-ND 4.0, as described at [http://creativecommons.org/licenses/by-nc-nd/4.0/](http://creativecommons.org/licenses/by-nc-nd/4.0/) ## References 1. 1.Nath A. Long-Haul COVID. Neurology. 2020;95(13):559–560. doi:10.1212/WNL.0000000000010640 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1212/WNL.0000000000010640&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F06%2F18%2F2021.06.15.21259004.atom) 2. 2.Gorna R, MacDermott N, Rayner C, et al. Long COVID guidelines need to reflect lived experience. Lancet. 2021;397(10273):455–457. doi:10.1016/S0140-6736(20)32705-7 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S0140-6736(20)32705-7&link_type=DOI) 3. 3.Mendelson M, Nel J, Blumberg L, et al. Long-COVID: An evolving problem with an extensive impact. South African Med J. 2020;111(1):10–12. doi:10.7196/SAMJ.2020.V111I11.15433 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.7196/SAMJ.2020.V111I11.15433&link_type=DOI) 4. 4.Halpin SJ, Mcivor C, Whyatt G, et al. Postdischarge symptoms and rehabilitation needs in survivors of COVID-19 infection: A cross-sectional evaluation. J Med Virol. 2021;93(2):1013–1022. doi:10.1002/jmv.26368 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/jmv.26368&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=32729939&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F06%2F18%2F2021.06.15.21259004.atom) 5. 5.Logue JK, Franko NM, McCulloch DJ, et al. Sequelae in Adults at 6 Months after COVID-19 Infection. JAMA Netw Open. 2021;4(2):e210830–e210830. doi:10.1001/jamanetworkopen.2021.0830 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1001/jamanetworkopen.2021.0830&link_type=DOI) 6. 6.Del Rio C, Collins LF, Malani P. Long-term Health Consequences of COVID-19. JAMA - J Am Med Assoc. 2020;324(17):1723–1724. doi:10.1001/jama.2020.19719 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1001/jama.2020.19719&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=33031513&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F06%2F18%2F2021.06.15.21259004.atom) 7. 7.Callard F, Perego E. How and why patients made Long Covid. Soc Sci Med. 2021;268. doi:10.1016/j.socscimed.2020.113426 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.socscimed.2020.113426&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=33199035&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F06%2F18%2F2021.06.15.21259004.atom) 8. 8.The Lancet. Facing up to long COVID. Lancet. 2020;396(10266):1861. doi:10.1016/S0140-6736(20)32662-3 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S0140-6736(20)32662-3&link_type=DOI) 9. 9.Marshall M. The four most urgent questions about long COVID. Nature. 2021;594(7862):168–170. doi:10.1038/d41586-021-01511-z [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/d41586-021-01511-z&link_type=DOI) 10. 10.Buonsenso D, Munblit D, De Rose C, et al. Preliminary evidence on long COVID in children. Acta Paediatr Int J Paediatr. 2021. doi:10.1111/apa.15870 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1111/apa.15870&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=33835507&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F06%2F18%2F2021.06.15.21259004.atom) 11. 11.Fegert JM, Vitiello B, Plener PL, Clemens V. Challenges and burden of the Coronavirus 2019 (COVID-19) pandemic for child and adolescent mental health: a narrative review to highlight clinical and research needs in the acute phase and the long return to normality. Child Adolesc Psychiatry Ment Health. 2020;14(1). doi:10.1186/s13034-020-00329-3 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/s13034-020-00329-3&link_type=DOI) 12. 12.Raveendran A V., Jayadevan R, Sashidharan S. Long COVID: An overview. Diabetes Metab Syndr Clin Res Rev. 2021;15(3):869–875. doi:10.1016/j.dsx.2021.04.007 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.dsx.2021.04.007&link_type=DOI) 13. 13.Dani M, Dirksen A, Taraborrelli P, et al. Autonomic dysfunction in ‘long COVID’: rationale, physiology and management strategies. Clin Med J R Coll Physicians London. 2021;21(1):E63–E67. doi:10.7861/CLINMED.2020-0896 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.7861/CLINMED.2020-0896&link_type=DOI) 14. 14.Mahase E. Covid-19: What do we know about “long covid”? BMJ. 2020;370. doi:10.1136/bmj.m2815 [FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiRlVMTCI7czoxMToiam91cm5hbENvZGUiO3M6MzoiYm1qIjtzOjU6InJlc2lkIjtzOjE3OiIzNzAvanVsMTRfNi9tMjgxNSI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIxLzA2LzE4LzIwMjEuMDYuMTUuMjEyNTkwMDQuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 15. 15.Ladds E, Rushforth A, Wieringa S, et al. Persistent symptoms after Covid-19: qualitative study of 114 “long Covid” patients and draft quality principles for services. BMC Health Serv Res. 2020;20(1):1–13. doi:10.1186/s12913-020-06001-y [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/s12913-020-06001-y&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F06%2F18%2F2021.06.15.21259004.atom) 16. 16.• Number of social media users 2025 | Statista. [https://www.statista.com/statistics/278414/number-of-worldwide-social-network-users/](https://www.statista.com/statistics/278414/number-of-worldwide-social-network-users/). Accessed June 9, 2021. 17. 17.Social Media Use in 2021 | Pew Research Center. [https://www.pewresearch.org/internet/2021/04/07/social-media-use-in-2021/](https://www.pewresearch.org/internet/2021/04/07/social-media-use-in-2021/). Accessed June 9, 2021. 18. 18.Shan S, Yan Q, Wei Y. Infectious or recovered? Optimizing the infectious disease detection process for epidemic control and prevention based on social media. Int J Environ Res Public Health. 2020;17(18):1–25. doi:10.3390/ijerph17186853 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.3390/ijerph17186591&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F06%2F18%2F2021.06.15.21259004.atom) 19. 19.Tang L, Bie B, Park SE, Zhi D. Social media and outbreaks of emerging infectious diseases: A systematic review of literature. Am J Infect Control. 2018;46(9):962–972. doi:10.1016/j.ajic.2018.02.010 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ajic.2018.02.010&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F06%2F18%2F2021.06.15.21259004.atom) 20. 20.Puri N, Coomes EA, Haghbayan H, Gunaratne K. Social media and vaccine hesitancy: new updates for the era of COVID-19 and globalized infectious diseases. Hum Vaccines Immunother. 2020;16(11):2586–2593. doi:10.1080/21645515.2020.1780846 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1080/21645515.2020.1780846&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=32693678&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F06%2F18%2F2021.06.15.21259004.atom) 21. 21.Aggarwal NR, Alasnag M, Mamas MA. Social media in the era of COVID-19. Open Hear. 2020;7(2):e001352. doi:10.1136/openhrt-2020-001352 [FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiRlVMTCI7czoxMToiam91cm5hbENvZGUiO3M6Nzoib3BlbmhydCI7czo1OiJyZXNpZCI7czoxMToiNy8yL2UwMDEzNTIiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyMS8wNi8xOC8yMDIxLjA2LjE1LjIxMjU5MDA0LmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 22. 22.Cinelli M, Quattrociocchi W, Galeazzi A, et al. The COVID-19 social media infodemic. Sci Rep. 2020;10(1):16598. doi:10.1038/s41598-020-73510-5 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41598-020-73510-5&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=33024152&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F06%2F18%2F2021.06.15.21259004.atom) 23. 23.Sarker A, Lakamana S, Hogg-Bremer W, Xie A, Al-Garadi MA, Yang Y-C. Self-reported COVID-19 symptoms on Twitter: an analysis and a research resource. J Am Med Informatics Assoc. 2020;27(8):1310–1315. doi:10.1093/jamia/ocaa116 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/jamia/ocaa116&link_type=DOI) 24. 24.Boe B. PRAW: The Python Reddit API Wrapper. [https://praw.readthedocs.io/en/latest/](https://praw.readthedocs.io/en/latest/). Published 2017. 25. 25.Welcome to the NCBO BioPortal | NCBO BioPortal. [https://bioportal.bioontology.org/](https://bioportal.bioontology.org/). Accessed April 15, 2020. 26. 26.Mozzicato P. MedDRA: An overview of the medical dictionary for regulatory activities. Pharmaceut Med. 2009;23(2):65–75. doi:10.1007/BF03256752 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/BF03256752&link_type=DOI) 27. 27.1. Jonathan W Sarker A. LexExp: a system for automatically expanding concept lexicons for noisy biomedical texts. Jonathan W, ed. Bioinformatics. November 2020. doi:10.1093/bioinformatics/btaa995 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/bioinformatics/btaa995&link_type=DOI) 28. 28.Orrù G, Bertelloni D, Diolaiuti F, et al. Long-COVID Syndrome? A Study on the Persistence of Neurological, Psychological and Physiological Symptoms. Healthcare. 2021;9(5):575. doi:10.3390/healthcare9050575 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.3390/healthcare9050575&link_type=DOI) 29. 29.Osikomaiya B, Erinoso O, Wright KO, et al. ‘Long COVID’: persistent COVID-19 symptoms in survivors managed in Lagos State, Nigeria. BMC Infect Dis. 2021;21(1):1–7. doi:10.1186/s12879-020-05716-x [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/s12879-021-05950-x&link_type=DOI) 30. 30.Huang C, Huang L, Wang Y, et al. 6-month consequences of COVID-19 in patients discharged from hospital: a cohort study. Lancet. 2021;397(10270):220–232. doi:10.1016/S0140-6736(20)32656-8 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S0140-6736(20)32656-8&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F06%2F18%2F2021.06.15.21259004.atom) 31. 31.Sykes DL, Holdsworth L, Jawad N, Gunasekera P, Morice AH, Crooks MG. Post-COVID-19 Symptom Burden: What is Long-COVID and How Should We Manage It? Lung. 2021;199(2):113–119. doi:10.1007/s00408-021-00423-z [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s00408-021-00423-z&link_type=DOI) 32. 32.Mandal S, Barnett J, Brill SE, et al. Long-COVID’: A cross-sectional study of persisting symptoms, biomarker and imaging abnormalities following hospitalisation for COVID-19. Thorax. 2021;76(4):396–398. doi:10.1136/thoraxjnl-2020-215818 [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6OToidGhvcmF4am5sIjtzOjU6InJlc2lkIjtzOjg6Ijc2LzQvMzk2IjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjEvMDYvMTgvMjAyMS4wNi4xNS4yMTI1OTAwNC5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 33. 33.Carfì A, Bernabei R, Landi F. Persistent symptoms in patients after acute COVID-19. JAMA - J Am Med Assoc. 2020;324(6):603–605. doi:10.1001/jama.2020.12603 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1001/jama.2020.12603&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=32644129&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F06%2F18%2F2021.06.15.21259004.atom) 34. 34.Garrigues E, Janvier P, Kherabi Y, et al. Post-discharge persistent symptoms and health-related quality of life after hospitalization for COVID-19. 2020. doi:10.1016/j.jinf.2020.08.029 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.jinf.2020.08.029&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=32853602&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F06%2F18%2F2021.06.15.21259004.atom)