PT - JOURNAL ARTICLE
AU - Charles, Lauren E.
AU - Corley, Courtney D.
TI - Disease and Social Media in Post-Natural Disaster Recovery Philippines
AID - 10.1101/2021.03.22.21254137
DP - 2021 Jan 01
TA - medRxiv
PG - 2021.03.22.21254137
4099 - http://medrxiv.org/content/early/2021/03/26/2021.03.22.21254137.short
4100 - http://medrxiv.org/content/early/2021/03/26/2021.03.22.21254137.full
AB - Introduction The Philippines is plagued with natural disasters and resulting precipitating factors for disease outbreaks. The developing country has a strong disease surveillance program during and post-disaster phases; however, latent disease contracted during these emergency situations emerges once the Filipinos return to their homes. Coined the social media capital of the world, the Philippines provides an opportunity to evaluate the potential of social media use in disease surveillance during the post-recovery period. By developing and defining a non-traditional method for enhancing detection of infectious diseases post-natural disaster recovery in the Philippines, this research aims to increase the resilience of affected developing countries through advanced passive disease surveillance with minimal cost and high impact. Methods We collected 50 million geo-tagged tweets, weekly case counts for six diseases, and all natural disasters from the Philippines between 2012 and 2013. We compared the predictive capability of various disease lexicon-based time series models (e.g., Twitter’s BreakoutDetection, Autoregressive Integrated Moving Average with Explanatory Variable [ARIMAX], Multilinear regression, and Logistic regression) and document embeddings (Gensim’s Doc2Vec). Results The analyses show that the use of only tweets to predict disease outbreaks in the Philippines has varying results depending on which technique is applied, the disease type, and location.
Overall, the most consistent predictive results were from the ARIMAX model, which showed the significance of tweet value for prediction and a role of disaster in specific instances. Discussion Overall, the use of disease/sick lexicon-filtered tweets as a predictor of disease in the Philippines appears promising. Due to the consistent and large increase in the use of Twitter within the country, it would be informative to repeat the analysis on more recent years to confirm the top method for prediction. In addition, we suggest that a combination disease-specific model would produce the best results. The model would be one where the case counts of a disease are updated periodically along with the continuous monitoring of lexicon-based tweets plus or minus the time from disaster. Competing Interest Statement The authors have declared no competing interest. Funding Statement The research described is part of the Deep Science Initiative at Pacific Northwest National Laboratory (PNNL) and was performed using PNNL Institutional Computing. This effort was funded by a contract to PNNL from the United States Agency for International Development and under the PNNL Laboratory Directed Research and Development Program, a multi-program national laboratory operated by Battelle for the U.S.
Department of Energy. Author Declarations I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained. Yes. The details of the IRB/oversight body that provided approval or exemption for the research described are given below: This project was exempt from IRB/oversight by PNNL Institutional Review Board which reviews and approves research involving humans, data about humans, and human biological materials according to Federal law, DOE policy, and PNNL policy. All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived. Yes. I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance). Yes. I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable. Yes. Upon request, social media counts per region can be made available as well as disease count data.
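
The Methods mention lexicon-based time series built from tweets and compared against weekly case counts. As a rough illustration only, the sketch below shows one way such a weekly series of lexicon-filtered tweet counts might be assembled. The lexicon terms, the `(date, text)` record layout, the sample tweets, and the function names are all hypothetical; the study's actual lexicon and pipeline are not described in this record.

```python
from collections import Counter
from datetime import date

# Hypothetical lexicon; the study's actual disease/sick lexicon is not given here.
SICK_LEXICON = {"fever", "dengue", "cough", "sick"}


def matches_lexicon(text, lexicon=SICK_LEXICON):
    """Return True if any lexicon term appears as a whole word in the tweet."""
    words = {w.strip(".,!?#@:;\"'()").lower() for w in text.split()}
    return not words.isdisjoint(lexicon)


def weekly_counts(tweets, lexicon=SICK_LEXICON):
    """Count lexicon-matching tweets per (ISO year, ISO week).

    `tweets` is an iterable of (date, text) pairs; this layout is an assumption.
    """
    counts = Counter()
    for day, text in tweets:
        if matches_lexicon(text, lexicon):
            iso = day.isocalendar()
            counts[(iso[0], iso[1])] += 1
    return counts


# Made-up sample tweets, for illustration only.
sample = [
    (date(2013, 11, 11), "Down with fever since the typhoon"),
    (date(2013, 11, 12), "Traffic is terrible today"),
    (date(2013, 11, 14), "My brother has dengue, please pray"),
    (date(2013, 11, 19), "Still sick and coughing"),
]

for week, n in sorted(weekly_counts(sample).items()):
    print(week, n)
```

A weekly series of this kind could then be aligned with weekly case counts and passed as an explanatory variable to a time series model such as ARIMAX; that modeling step is not shown here.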