Abstract
Background Epidemiological surveys of malaria currently rely on microscopy, polymerase chain reaction (PCR) assays or rapid diagnostic tests (RDTs) to detect Plasmodium infections. This study shows that mid-infrared (MIR) spectroscopy coupled with supervised machine learning could constitute an alternative method for rapid malaria screening, directly from dried human blood spots.
Methods Filter papers containing dried blood spots (DBS) were obtained from a cross-sectional malaria survey in twelve wards in south-eastern Tanzania in 2018/19. The DBS were scanned using an attenuated total reflection-Fourier transform infrared (ATR-FTIR) spectrometer to obtain high-resolution MIR spectra in the range 4000 cm−1 to 500 cm−1. The spectra were cleaned to compensate for atmospheric water vapor and CO2 interference bands, then used to train different classification algorithms to distinguish between malaria-positive and malaria-negative DBS papers, with PCR test results as the reference. The analysis considered 296 individuals, including 123 PCR-confirmed malaria-positives and 173 negatives. Model training was done using 80% of the dataset, after which the best-fitting model was optimized by bootstrapping of 80/20 train/test stratified splits. The trained models were evaluated by predicting Plasmodium falciparum positivity in the 20% validation set of DBS.
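As a rough illustration of the stratified 80/20 splitting described in the Methods, the sketch below repeatedly partitions the 296 labels (123 positives, 173 negatives) so that each class keeps the same proportion in the training and test parts. It is a minimal standard-library sketch, not the authors' code: the function name `stratified_split`, the random seed, and the choice of five repeats are all assumptions made for illustration, and no spectra or classifier are included.

```python
import random


def stratified_split(labels, test_fraction=0.2, rng=None):
    """Return (train_idx, test_idx), keeping the class ratio in both parts.

    Hypothetical helper for illustration only.
    """
    rng = rng or random.Random()
    by_class = {}
    for i, y in enumerate(labels):
        by_class.setdefault(y, []).append(i)

    train_idx, test_idx = [], []
    for idx in by_class.values():
        shuffled = idx[:]
        rng.shuffle(shuffled)
        # Hold out the same fraction of every class.
        n_test = round(len(shuffled) * test_fraction)
        test_idx.extend(shuffled[:n_test])
        train_idx.extend(shuffled[n_test:])
    return sorted(train_idx), sorted(test_idx)


# Class counts taken from the abstract: 123 PCR-positive, 173 PCR-negative.
labels = [1] * 123 + [0] * 173

rng = random.Random(0)  # arbitrary seed, assumed here for reproducibility
n_repeats = 5           # assumed number of repeated splits
splits = [stratified_split(labels, 0.2, rng) for _ in range(n_repeats)]

for train_idx, test_idx in splits:
    n_pos_test = sum(labels[i] for i in test_idx)
    print(len(train_idx), len(test_idx), n_pos_test)
```

In a full pipeline, a classifier would be fitted on each training part and scored on the matching held-out part, so that performance estimates reflect variation across splits rather than a single partition.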
Results Logistic regression was the best-performing model. Using PCR as the reference, the models attained overall accuracies of 92% for predicting P. falciparum infections (specificity = 91.7%; sensitivity = 92.8%) and 85% for predicting mixed infections of P. falciparum and P. ovale (specificity = 85%; sensitivity = 85%) in the field-collected specimens.
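For readers less familiar with the metrics reported above, the following sketch shows how accuracy, sensitivity and specificity are derived from predicted labels and a reference standard such as PCR. The function name `screening_metrics` and the ten toy labels are hypothetical and are not data from this study.

```python
def screening_metrics(y_true, y_pred):
    """Compute accuracy, sensitivity and specificity for binary labels.

    y_true holds the reference results (1 = positive, 0 = negative) and
    y_pred holds the model predictions. Hypothetical helper for illustration.
    """
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    return {
        "accuracy": (tp + tn) / len(y_true),
        "sensitivity": tp / (tp + fn),  # share of true positives detected
        "specificity": tn / (tn + fp),  # share of true negatives detected
    }


# Toy labels, invented for illustration only.
reference = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
predicted = [1, 1, 1, 0, 0, 0, 0, 0, 0, 1]

metrics = screening_metrics(reference, predicted)
print(metrics)
```

Sensitivity depends only on the reference-positive samples and specificity only on the reference-negative ones, which is why both are reported alongside overall accuracy when the classes are imbalanced.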
Conclusion These results demonstrate that mid-infrared spectroscopy coupled with supervised machine learning (MIR-ML) could be used to screen for malaria parasites in dried human blood spots. The approach has potential for rapid and high-throughput screening of Plasmodium infections in both non-clinical settings (e.g. field surveys) and clinical settings (diagnosis to aid case management). However, full utility will require further advances in classification algorithms, field validation of this technology in other study sites and an in-depth evaluation of the biological basis of the observed test results. Training the models on larger datasets could also improve the specificity and sensitivity of the technique. The MIR-ML spectroscopy system is robust, low-cost, and requires minimal maintenance.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
This research was supported by a Wellcome Trust Intermediate Fellowship in Public Health & Tropical Medicine awarded to FOO (Grant No. WT102350/Z/13/Z), a Howard Hughes Medical Institute (HHMI)-Gates International Research Scholarship awarded to FOO (Grant No. OPP1099295) and an MRC grant awarded to the University of Glasgow (Grant No. MR/P025501/1). EPM, DJS, SAM and JKS were also supported by Wellcome Trust International Masters Fellowships in Tropical Medicine & Hygiene (Grant Nos. WT214643/Z/18/Z, WT214644/Z/18/Z, WT212633/Z/18/Z and WT200086/Z/15/Z, respectively).
Author Declarations
All relevant ethical guidelines have been followed and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.
Yes
Any clinical trials involved have been registered with an ICMJE-approved registry such as ClinicalTrials.gov and the trial ID is included in the manuscript.
Not Applicable
I have followed all appropriate research reporting guidelines and uploaded the relevant Equator, ICMJE or other checklist(s) as supplementary files, if applicable.
Not Applicable
Data Availability
All data for this study will be available upon request.
https://github.com/MwangaEP/Mannu-ML-projects/tree/master/My%20projects/DBS%20work