RT Journal Article SR Electronic T1 A Novel Machine Learning Based Framework for Developing Composite Digital Biomarkers of Disease Progression JF medRxiv FD Cold Spring Harbor Laboratory Press SP 2024.09.23.24313737 DO 10.1101/2024.09.23.24313737 A1 Zhai, Song A1 Liaw, Andy A1 Shen, Judong A1 Xu, Yuting A1 Svetnik, Vladimir A1 FitzGerald, James J. A1 Antoniades, Chrystalina A. A1 Holder, Dan A1 Dockendorf, Marissa F. A1 Ren, Jie A1 Baumgartner, Richard YR 2024 UL http://medrxiv.org/content/early/2024/09/24/2024.09.23.24313737.abstract AB Background Current methods of measuring disease progression of neurodegenerative disorders, including Parkinson’s disease (PD), largely rely on composite clinical rating scales, which are prone to subjective biases and lack the sensitivity to detect progression signals in a timely manner. Digital health technology (DHT)-derived measures offer potential solutions to provide objective, precise, and sensitive measures that address these limitations. However, the complexity of DHT datasets and the potential to derive numerous digital features that were not previously possible to measure pose challenges, including in selection of the most important digital features and construction of composite digital biomarkers.Methods We present a comprehensive machine learning based framework to construct composite digital biomarkers for progression tracking. This framework consists of a marginal (univariate) digital feature screening, a univariate association test, digital feature selection, and subsequent construction of composite (multivariate) digital disease progression biomarkers using Penalized Generalized Estimating Equations (PGEE). As an illustrative example, we applied this framework to data collected from a PD longitudinal observational study. The data consisted of OpalTM sensor-based movement measurements and MDS-UPDRS Part III scores collected at 3-month intervals for 2 years in 30 PD and 10 healthy control participants.Results In our illustrative example, 77 out of 235 digital features from the study passed univariate feature screening, with 11 features selected by PGEE to include in construction of the composite digital measure. Compared to MDS-UPDRS Part III, the composite digital measure exhibited a smoother and more significant increasing trend over time in PD groups with less variability, indicating improved ability for tracking disease progression. This digital composite measure also demonstrated the ability to classify between de novo PD and healthy control groups.Conclusion Measures from DHTs show promise in tracking neurodegenerative disease progression with increased sensitivity and reduced variability as compared to traditional clinical scores. Herein, we present a novel framework and methodology to construct composite digital measure of disease progression from high-dimensional DHT datasets, which may have utility in accelerating the development and application of composite digital biomarkers in drug development.Competing Interest StatementS.Z., A.L., J.S., Y.X., V.S., D.H., M.F.D., J.R. and R.B. are employees of Merck Sharp & Dohme LLC, a subsidiary of Merck & Co., Inc., Rahway, NJ, USA, and may own stock and/or stock options in Merck & Co., Inc., Rahway, NJ, USA. C.A.A and J.J.F were supported by the National Institute for Health Research (NIHR) Oxford Biomedical Research Centre (BRC). C.A.A and J.J.F have received research grants from UCB Pharma and Merck & Co., Inc.Funding StatementJJF and CAA were supported by the National Institute for Health Research Oxford Biomedical Research Centre. JJF has received consulting fees from Abbott and Medtronic, unrelated to this study. JJF and CAA have received research grant support from UCB Pharma and Merck & Co., Inc. We thank the participants and their families for their endless support with our research work.Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:The study was approved by a research ethics committee and the Health Research Authority (REC 16/SW/0262). All participants were informed about the study's aims and protocols, and gave their informed written consent to participate in the study.I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.YesThe original data presented in this paper is from the ongoing OxQUIP study and cannot be shared until completion of the whole study and full dissemination of results. This is expected to become possible within 24 months from the end of the study. Qualified researchers will be able to contact the Principal Investigator at the University of Oxford.