PT - JOURNAL ARTICLE AU - Sanders, Abraham AU - White, Rachael AU - Severson, Lauren S. AU - Ma, Rufeng AU - McQueen, Richard AU - Paulo, Haniel C. Alcânatara AU - Zhang, Yucheng AU - Erickson, John S. AU - Bennett, Kristin P. TI - Unmasking the conversation on masks: Natural language processing for topical sentiment analysis of COVID-19 Twitter discourse AID - 10.1101/2020.08.28.20183863 DP - 2020 Jan 01 TA - medRxiv PG - 2020.08.28.20183863 4099 - http://medrxiv.org/content/early/2020/09/01/2020.08.28.20183863.short 4100 - http://medrxiv.org/content/early/2020/09/01/2020.08.28.20183863.full AB - In this exploratory study, we scrutinize a database of over 1 million tweets collected across the first five months of 2020 to draw conclusions about public attitudes towards the preventative measure of mask usage during the COVID-19 pandemic. In recent months, a body of literature has emerged to suggest the robustness of trends in online activity as proxies for the epidemiological and sociological impact of COVID-19. We employ natural language processing, clustering and sentiment analysis techniques to organize tweets relating to mask-wearing into high-level themes, then relay narratives for individual clusters through automatic text summarization. We find that topic clustering and visualization based on mask-related Twitter data offers revealing insights into societal perceptions of COVID-19 and techniques for its prevention. We observe that the volume and polarity of mask related tweets has greatly increased. Importantly, the analysis pipeline presented can be leveraged by the health community for the assessment of public response to health interventions in the ongoing global health crisis.Competing Interest StatementThe authors have declared no competing interest.Funding StatementThis work was partially funded by a grant from the United Health FoundationAuthor DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:No IRB review requiredAll necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesAll Tweet IDs associated with this work have been made available via a publicly-accessible repository https://github.com/TheRensselaerIDEA/COVID-masks-nlp