PT - JOURNAL ARTICLE AU - Shrestha, Pranav AU - Lohse, Hendrik AU - Bhatla, Christopher AU - McCartney, Heather AU - Alzaki, Alaa AU - Sandhu, Navdeep AU - Oli, Pradip Kumar AU - Li, Hongquan AU - Prakash, Manu AU - Amid, Ali AU - Onell, Rodrigo AU - Au, Nicholas AU - Merkeley, Hayley AU - Kapoor, Videsh AU - Pande, Rajan AU - Stoeber, Boris TI - Morphology-based classification of sickle cell disease and β-thalassemia using a low-cost automated microscope and machine learning AID - 10.1101/2024.09.21.24314128 DP - 2024 Jan 01 TA - medRxiv PG - 2024.09.21.24314128 4099 - http://medrxiv.org/content/early/2024/09/23/2024.09.21.24314128.short 4100 - http://medrxiv.org/content/early/2024/09/23/2024.09.21.24314128.full AB - Sickle cell disease (SCD) and β-thalassemia are the most common monogenic diseases, disproportionately affecting low- and middle-income countries, where low-cost and accurate diagnostic tools are needed to reduce the global disease burden. Although the sickling test is commonly used to screen for the sickle mutation, it cannot distinguish between the asymptomatic sickle cell trait (SCT) and SCD, or identify β-thalassemia. Here, we enhanced the inexpensive sickling test using automated microscopy and morphology-based machine learning classification to detect SCD, trait conditions (SCT and β-thalassemia trait) and normal individuals with an overall area under receiver operating curve, sensitivity and specificity of 0.940 (95% confidence intervals: 0.938-0.942), 84.6% (84.2%-84.9%), and 92.3% (92.1%-92.4%), respectively. Notably, the sensitivity and specificity to detect severe disease (SCD) was over 97% and 98%, respectively, thus establishing a low-cost automated screening option for disease detection in low-resource settings. Furthermore, leveraging high-throughput microscopy, we generated an open-access dataset comprising over 300,000 images with 1.5 trillion segmented cells from 138 individuals in Canada and Nepal including individuals with sickle and/or β-thalassemia mutations, to accelerate further research.Competing Interest StatementHongquan Li and Manu Prakash are co-founders, and Pranav Shrestha is an employee, of Cephla Inc., which is a spin-out company from Stanford University, deploying Octopi globally for disease diagnostics.Clinical TrialNCT05506358Funding StatementThis research was undertaken, in part, with support from the Canada Research Chairs program, UBC Health Innovation Funding Investment (HIFI) Awards, and the UBC Four Year Doctoral Fellowship (4YF) program. The UBC Centre for Blood Research is home to the Naiman Vickars Endowment fund which has provided funds for this project.Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:Ethics committee/IRB University of British Columbia-Providence Health Care Research Institute (UBC-PHC REB Number: H21-01929) of the University of British Columbia gave ethical approval for this work (for the study in Canada) Ethics committee/IRB University of British Columbia Clinical Research Ethics Board (UBC CREB Number: H22-00294) of the University of British Columbia, and Nepal Health Research Council (NHRC Registration Number: 85/2022) of Nepal (national research ethics review board) gave ethical approval for this work (for the study in Nepal)I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.YesIn accordance with the approved research protocols and informed consent from participants, de-identified data (such as de-identified images of blood films from Octopi, and processed data such as segmentation outlines, morphological parameters, etc.) were deposited in an online public repository, Federated Research Data Repository (FRDR), an open-access repository hosted by the Digital Research Alliance of Canada. The link to the open-access dataset is provided https://doi.org/10.20383/103.0916. All the associated code (written in Python, MATLAB, or shell) for generating results presented here are also available online in a public Github repository (https://github.com/p-shrestha/erythroSight). https://doi.org/10.20383/103.0916