Abstract
Highlights
Many diseases are increasingly conceptualized as multifactorial, progressive processes
Robust prediction of progressive disease courses can advance risk stratification and treatment targeting
RiskPath provides optimizable timeseries AI to predict progressive disease with longitudinal cohort data
Enhanced explainability and functionality facilitates risk pathway mapping and compact models
The Bigger Picture Identifying persons at elevated risk for a disease outcome is a key prerequisite for targeting interventions to improve health. Current risk stratification tools for common diseases are aging and achieve only moderate performance. Moreover, many diseases are increasingly recognized to be complex outcomes where individual risk is determined not by a single effect modifier but by time-dependent interactions among many contributory factors over the lifecourse. There is an urgent need to improve individual-level prediction for progressive diseases and understand how multifactorial risks interact over time so that risk stratification and accompanying prevention and intervention strategies can be targeted earlier and more effectively in the disease course.
Summary Many diseases are the end outcomes of multifactorial risks that interact and increment over months or years. Timeseries AI methods have attracted increasing interest given their ability to operate on native timeseries data to predict disease outcomes. Instantiating such models in risk stratification tools has proceeded more slowly, in part limited by factors such as structural complexity, model size and explainability. Here, we present RiskPath, an explainable AI toolbox that offers advanced timeseries methods and additional functionality relevant to risk stratification use cases in classic and emerging longitudinal cohorts. Theoretically-informed optimization is integrated in prediction to specify optimal model topology or explore performance-complexity tradeoffs. Accompanying modules allow the user to map the changing importance of predictors over the disease course, visualize the most important antecedent time epochs contributing to disease risk or remove predictors to construct compact models for clinical applications with minimal performance impact.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
This work was supported by the National Institute of Mental Health under award R00MH118359
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
The IRB of the University of Utah waived ethical approval of this work and deemed it Not Human Subjects research.
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
Footnotes
The updated manuscript involves experiments with two additional deep-learning algorithms, Temporal Convolutional Networks and Transformers, and an additional timeseries dataset (e.g., the Multi-Ethnic Study of Atherosclerosis). The theoretical significance of model explainability has been explored in more detail.
Data Availability
All data produced in the present study are available upon reasonable request to the authors