PT - JOURNAL ARTICLE AU - , AU - Ulahannan, Jijo Pulickiyil AU - Narayanan, Nikhil AU - Thalhath, Nishad AU - Prabhakaran, Prem AU - Chaliyeduth, Sreekanth AU - Suresh, Sooraj P AU - Mohammed, Musfir AU - Rajeevan, E AU - Joseph, Sindhu AU - Balakrishnan, Akhil AU - Uthaman, Jeevan AU - Karingamadathil, Manoj AU - Thomas, Sunil Thonikkuzhiyil AU - Sureshkumar, Unnikrishnan AU - Balan, Shabeesh AU - Vellichirammal, Neetha Nanoth TI - A citizen science initiative for open data and visualization of COVID-19 outbreak in Kerala, India AID - 10.1101/2020.05.13.20092510 DP - 2020 Jan 01 TA - medRxiv PG - 2020.05.13.20092510 4099 - http://medrxiv.org/content/early/2020/05/18/2020.05.13.20092510.short 4100 - http://medrxiv.org/content/early/2020/05/18/2020.05.13.20092510.full AB - India—the second most populated country in the world—reported its first COVID-19 case in the state of Kerala with a travel history from Wuhan. Subsequently, a surge of cases was observed in the state mainly through the individuals who traveled from Europe and the Middle East to Kerala, thus initiating an outbreak. Since public awareness through dissemination of reliable information plays a significant role in controlling the spread of the disease, the Department of Health Services, Government of Kerala initially released daily updates through daily textual bulletins. However, this unstructured data requires refinement and enrichment for upstream applications, such as visualization, and/or analysis. Here we reported a citizen science initiative that leveraged publicly available and crowd-verified data on COVID-19 outbreak in Kerala from the government bulletins, supplemented with the information from media outlets to generate reusable datasets. This data was further used to provide real-time analysis, and daily updates of COVID-19 cases in Kerala, through a user-friendly bilingual dashboard (https://covid19kerala.info/) for non-specialists. We ensured longevity and reusability of the dataset by depositing it in a public repository, aligning with open source principles for future analytical efforts. Finally, to show the scope of the sourced data, we also provided a snapshot of outbreak trends and demographic characteristics of the individuals affected with COVID-19 in Kerala during the first 99 days of the outbreak.Competing Interest StatementThe authors have declared no competing interest.Funding StatementThis study was not funded by any agencies and was purely a voluntary effort during the community-wide quarantine period by a team of technologists, academicians and students advocating open data and citizen scienceAuthor DeclarationsAll relevant ethical guidelines have been followed; any necessary IRB and/or ethics committee approvals have been obtained and details of the IRB/oversight body are included in the manuscript.YesAll necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesThe dataset associated with this manuscript is made available under the Open Data Commons Attribution License v1.0 (ODC-BY 1.0) https://zenodo.org/record/3818096