Abstract
Objective Identifying children at high risk of developing obesity offers a critical window to change the course of the disease before it becomes established. Numerous studies have attempted this, but practical limitations remain, including (i) relying on data that are not part of routinely available pediatric records (such as prenatal data), (ii) focusing on prediction at a single age (and hence not being tested across ages), and (iii) not achieving good results or not adequately validating them.
Methods A customized sequential deep learning model was built to predict the risk of childhood obesity, with a particular focus on capturing temporal patterns. The model was trained only on routinely collected EHR data from 36,191 diverse children aged 0 to 10, using a list of features identified by a group of clinical experts. The model was evaluated using extensive discrimination, calibration, and utility analyses, and was validated temporally, geographically, and across various subgroups.
Results Our results are mostly better than (and never worse than) those of previous studies, including studies that focus on single-age predictions or link EHRs to external data. Specifically, the model consistently achieved an area under the receiver operating characteristic curve (AUROC) above 0.8 (with most cases around 0.9) for predicting obesity within the next 3 years for children aged 2 to 7. The validation results show the robustness of the model. Furthermore, the most influential predictors of the model match important risk factors of obesity.
Conclusions Our model can predict the risk of obesity for young children using only routinely collected EHR data, which greatly facilitates its integration with the periodicity schedule. The model can serve as an objective screening tool to inform prevention efforts, especially by helping with the delicate interactions between providers and families in primary care settings.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
Our study was supported by NIH awards P20GM103446 and U54-GM104941.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
Ethics committee/IRB of Nemours Children's Health waived ethical approval for this work
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
Footnotes
mehakg{at}smu.edu, thaoly.phan{at}nemours.org, daniel.eckrich{at}nemours.org, tim.bunnell{at}nemours.org, rbi{at}udel.edu
Disclosure: The authors declare no competing financial interests.
5. Data availability
Our code, containing the model with parameter (weight) values, is publicly available on GitHub at https://github.com/healthylaife/ObesityPrediction. Interested scholars can access the data by contacting Nemours Biomedical Research Informatics Center and signing a data use agreement.
Abbreviations
- AUPRC: Area Under Precision-Recall (PR) Curve
- AUROC: Area Under the Receiver Operating Characteristic
- BMI: Body Mass Index
- CDC: Centers for Disease Control and Prevention
- EHR: Electronic Health Record
- US: United States
- WFL: Weight-for-length
- WHO: World Health Organization