Abstract
Background Electronic data capture systems (EDCs) have the potential to improve efficiency and quality in the collection of multisite data. We quantify the volume, time, accuracy and costs of an EDC using large-scale census data from the STRATAA consortium, a comprehensive programme assessing the population dynamics and epidemiology of typhoid fever in Malawi, Nepal and Bangladesh to inform vaccine and public health interventions.
Results A census form was developed through a structured iterative process and implemented using Open Data Kit Collect running on Android-based tablets. Data were uploaded to Open Data Kit Aggregate, then auto-synced nightly to a MySQL-defined database. Data from the 3 sites were backed up centrally each day and auto-reported weekly. The costs of pre-census materials were estimated. Demographics of 308,348 individuals from 80,851 households were recorded within an average of 14.7 weeks (range: 13-16) using 65 fieldworkers. Overall, 21.7 errors (95% confidence interval: 21.4, 22.0) per 10,000 data points were found: 13.0 (95% confidence interval: 12.6, 13.5) and 24.5 (95% confidence interval: 24.1, 24.9) errors per 10,000 data points on numeric and text fields respectively. These values meet the standard quality threshold of 50 errors per 10,000 data points. The EDC's total variable cost was estimated at US$13,791.82 per site.
Conclusions The EDC is robust, allowing timely, high-volume and accurate data collection, and could be adopted in similar epidemiological settings.
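The error rates in the Results are expressed per 10,000 data points with 95% confidence intervals and compared against a threshold of 50 errors per 10,000. The sketch below illustrates that arithmetic. It is not the authors' analysis code: the abstract does not state which interval method was used, so the Wilson score interval here is an assumption, the function name `errors_per_10k` is hypothetical, and the counts in the usage lines are made-up illustrative values, not study data.

```python
import math

# Quality threshold quoted in the abstract: 50 errors per 10,000 data points.
THRESHOLD_PER_10K = 50.0


def errors_per_10k(errors: int, data_points: int, z: float = 1.96):
    """Return (rate, lower, upper), all scaled to errors per 10,000 data points.

    Uses the Wilson score interval for a binomial proportion (an assumed
    method; the abstract does not name one). z = 1.96 gives roughly 95%.
    """
    if data_points <= 0 or not 0 <= errors <= data_points:
        raise ValueError("need 0 <= errors <= data_points and data_points > 0")

    p = errors / data_points
    denom = 1 + z**2 / data_points
    centre = (p + z**2 / (2 * data_points)) / denom
    half_width = (
        z * math.sqrt(p * (1 - p) / data_points + z**2 / (4 * data_points**2))
    ) / denom

    scale = 10_000
    return p * scale, (centre - half_width) * scale, (centre + half_width) * scale


# Hypothetical counts chosen only to illustrate the calculation.
rate, lower, upper = errors_per_10k(errors=217, data_points=100_000)
print(f"{rate:.1f} errors per 10,000 (95% CI: {lower:.1f}, {upper:.1f})")
print("meets threshold:", upper < THRESHOLD_PER_10K)
```

Comparing the upper confidence bound, rather than the point estimate, with the threshold is a conservative design choice made for this sketch.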
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
Funding for the STRATAA study has been provided to AJP by a Wellcome Trust Strategic Award (no. 106158/Z/14/Z), https://wellcome.ac.uk/funding/managing-grant/grantsawarded, and the Bill and Melinda Gates Foundation (no. OPP1141321), https://www.gatesfoundation.org/How-We-Work/Quick-Links/Grants-Database. The Malawi-Liverpool-Wellcome Programme and the Oxford University Clinical Research Unit in Vietnam are supported by the Wellcome Trust with Major Overseas Programme core awards. The funders played no role in the design of the study; the collection, analysis and interpretation of data; or the writing of the manuscript.
Author Declarations
All relevant ethical guidelines have been followed; any necessary IRB and/or ethics committee approvals have been obtained and details of the IRB/oversight body are included in the manuscript.
Yes
All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Data Availability
The code scripts used to develop the EDC (ODK Collect eCRF and MySQL database objects), and the raw data for errors analysed in this paper are all available through GitHub.
Abbreviations
- EDCs: Electronic data capture systems
- STRATAA: Strategic Typhoid Alliance across Africa and Asia consortium
- ODK: Open Data Kit
- GPS: Global positioning system
- eCRF: Electronic census report form
- SQL: Structured Query Language
- CI: Confidence interval
- US$: United States dollar
- SCDM: Society for Clinical Data Management