PT - JOURNAL ARTICLE AU - Sanders, Abraham C. AU - White, Rachael C. AU - Severson, Lauren S. AU - Ma, Rufeng AU - McQueen, Richard AU - Alcântara Paulo, Haniel C. AU - Zhang, Yucheng AU - Erickson, John S. AU - Bennett, Kristin P. TI - Unmasking the conversation on masks: Natural language processing for topical sentiment analysis of COVID-19 Twitter discourse AID - 10.1101/2020.08.28.20183863 DP - 2021 Jan 01 TA - medRxiv PG - 2020.08.28.20183863 4099 - http://medrxiv.org/content/early/2021/03/20/2020.08.28.20183863.short 4100 - http://medrxiv.org/content/early/2021/03/20/2020.08.28.20183863.full AB - In this exploratory study, we scrutinize a database of over one million tweets collected from March to July 2020 to illustrate public attitudes towards mask usage during the COVID-19 pandemic. We employ natural language processing, clustering and sentiment analysis techniques to organize tweets relating to mask-wearing into high-level themes, then relay narratives for each theme using automatic text summarization. In recent months, a body of literature has highlighted the robustness of trends in online activity as proxies for the sociological impact of COVID-19. We find that topic clustering based on mask-related Twitter data offers revealing insights into societal perceptions of COVID-19 and techniques for its prevention. We observe that the volume and polarity of mask-related tweets has greatly increased. Importantly, the analysis pipeline presented may be leveraged by the health community for qualitative assessment of public response to health intervention techniques in real time.Competing Interest StatementThe authors have declared no competing interest.Funding StatementThis work was partially funded by a grant from the United Health FoundationAuthor DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:No IRB review requiredAll necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesAll Tweet IDs associated with this work have been made available via a publicly-accessible repository https://github.com/TheRensselaerIDEA/COVID-masks-nlp