Abstract
Background Biological age (BA) has been accepted as a more accurate proxy of aging than chronological age (CA). This study aimed to use machine learning (ML) algorithms to estimate BA in the Chinese population.
Methods We used data from 9,771 middle-aged and older (≥ 45 years) Chinese adults in the China Health and Retirement Longitudinal Study. We used several ML algorithms (e.g., Gradient Boosting Regressor, Random Forest, CatBoost Regressor, and Support Vector Machine) to develop new measures of biological aging (ML-BAs) based on physiological biomarkers. R-squared value and mean absolute error (MAE) were used to determine the optimal performance of these ML-BAs. We used logistic regression models to examine the associations of the best ML-BA and a conventional aging measure – Klemera and Doubal method-biological age (KDM-BA) we previously developed – with physical disability and mortality, respectively.
Results The Gradient Boosting Regression model performed best, resulting in a ML-BA with R-squared value of 0.270 and MAE of 6.519. This ML-BA was significantly associated with disability in basic activities of daily living, instrumental activities of daily living, lower extremity mobility, and upper extremity mobility, and mortality, with odds ratios ranging from 1% to 7% (per one-year increment in ML-BA, all P <0.001), independent of CA. These associations were generally comparable to that of KDM-BA.
Conclusion This study provides a valid ML-based measure of biological aging for middle-aged and older Chinese adults. These findings support the application of ML in geroscience research and help facilitate the understanding of the aging process.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
This work was supported by the 2020 Irma and Paul Milstein Program for Senior Health project award (Milstein Medical Asian American Partnership Foundation), Suzhou Municipal Science, Technology Bureau (SS2019069) and Chinese Ministry of Science and Technology (2020YFC2005600), and the Fundamental Research Funds for the Central Universities. The funders had no role in the study design; data collection, analysis, or interpretation; in the writing of the report; or in the decision to submit the article for publication.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
Ethical approval for collecting data on human subjects was received from the institutional review board at Peking University.
All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Data Availability
The CHARLS data are publicly available through the CHARLS website at http://charls.pku.edu.cn/en.