Machine learning-based mortality prediction models for non-alcoholic fatty liver disease in the general United States population

Jia-Rui Zheng; Zi-Long Wang; Bo Feng

doi:10.1101/2024.07.10.24310253

Abstract

Background & Aims Nowadays, the global prevalence of non-alcoholic fatty liver disease (NAFLD) has reached about 25%, which is the most common chronic liver disease worldwide, and the mortality risk of NAFLD patients is higher. Our research created five machine learning (ML) models for predicting overall mortality in ultrasound-proven NAFLD patients and compared their performance with conventional non-invasive scoring systems, aiming to find a generalizable and valuable model for early mortality prediction in NAFLD patients.

Methods National Health and Nutrition Examination Survey (NHANES)-III from 1988 to 1994 and NHANES-III related mortality data from 2019 were used. 70% of subjects were separated into the training set (N = 2262) for development, while 30% were in the testing set (N= 971) for validation. The outcome was all-cause death at the end of follow-up. Twenty-nine related variables were trained as predictor features for five ML–based models: Logistic regression (LR), K-nearest neighbors (KNN), Gradient-boosted decision tree (XGBoost), Random forest (RF) and Decision tree. Five typical evaluation indexes including area under the curve (AUC), F1 score, accuracy, sensitivity and specificity were used to measure the prediction performance.

Results 3233 patients with NAFLD in total were eligible for the inclusion criteria, with 1231 death during the average 25.3 years follow up time. AUC of the LR model in predicting the mortality of NAFLD was 0.888 (95% confidence interval [CI] 0.867-0.909), the accuracy was 0.808, the sensitivity was 0.819, the specificity was 0.802, and the F1 score was 0.765, which showed the best performance compared with other models (AUC were: RF, 0.876 [95%CI 0.852-0.897]; XGBoost, 0.875 [95%CI 0.853-0.898]; Decision tree, 0.793 [95%CI 0.766-0.819] and KNN, 0.787 [95%CI 0.759-0.816]) and conventional clinical scores (AUC were: Fibrosis-4 Score (FIB-4), 0.793 [95%CI 0.777-0.809]; NAFLD fibrosis score (NFS), 0.770 [95%CI 0.753-0.787] and aspartate aminotransferase-to-platelet ratio index (APRI), 0.522 [95%CI 0.502-0.543]).

Conclusions ML–based models, especially LR model, had better discrimination performance in predicting all-cause mortality in patients with NAFLD compared to the conventional non-invasive scores, and an interpretable model like Decision tree, which only used three predictors: age, systolic pressure and glycated hemoglobin, is simple to use in clinical practice.

Introduction

NAFLD has become the most common chronic liver disease and affects up to 1 billion people worldwide, leading to a health and economic burden^1–3, and can increase the risk of end-stage liver disease and hepatocellular carcinoma (HCC)⁴. Patients with NAFLD have a significant increase of the all-cause mortality, among which the main causes are cardiovascular disease, malignant tumor as well as end-stage liver disease^{5, 6}. It is of great importance to early detect patients with a higher risk of death, which may may help to optimize the use of finite resources and provide appropriate care.

In addition to age, fibrosis stage has the best predictive power for overall mortality^{7, 8}. However, liver biopsy as the gold standard is inappropriate to screen clinically significant fibrosis because of its features like invasive, inconvenient and expensive. Some studies have shown that conventional non-invasive scores, such as NFS⁹, FIB-4¹⁰, and APRI¹¹, have prognostic significance of death for NAFLD patients^12–14. However, their results were controversial. A meta-analysis including 19 longitudinal studies showed that only the NFS > 0.676 was predictive of overall mortality, while FIB-4 and APRI failed¹⁴. And a retrospective analysis including 646 NAFLD patients proven by liver biopsy revealed that although FIB-4 and NFS could precisely predict the risk of overall mortality of NAFLD patients, owing to the AUC values were not high enough (FIB-4, 0.72 [95% CI, 0.68–0.76]; NFS, 0.72 [95% CI, 0.68–0.76] and APRI, 0.52 [95% CI, 0.47–0.57]), so they were not useful in the clinical practice and new methods are needed to confirm the prognosis of NAFLD patients ¹³.

Predictive tools using ML have been extensively developed and used in medicine in recent years because they are often superior to traditional predictive methods¹⁶, and nowadays, the utilization of ML in gastroenterology territory is in steady-state growth¹⁷. Some studies have shown that ML is superior to traditional non-invasive approaches for the prediction of liver fibrosis^18–21, such as FIB-4 and FibroScan to predict significant fibrosis (≥F2) and advanced fibrosis (≥F3) in NAFLD patients¹⁸. However, a model for predicting NAFLD mortality based on ML has not yet been developed. Our study aimed to develop, test and verify the mortality prediction model established by ML for patients with NAFLD in the USA.

Methods

Data sources and ethical approval

NHANES-III (1988–1994) database with nationwide, multilevel, stratified, clustered probability sampling design, is used to assess the health status of the civilian population in the USA. The data in the NHANES-III includes interviews, physical examinations, laboratory tests, and ultrasound examinations were conducted to assess the liver steatosis. NHANES-III data is also related to death certificates from the National Death Index (NDI) as of December 31, 2019, allowing for mortality analysis.

The survey was ratified by the ethics review committee of the National Center for health statistics, and the written informed consent of all participants was acquired to collect data. The institutional review committee dispensed with the consideration of human research because the data was fully certain.

Study population and definitions

In the NHANES-III survey, among the adult participants (20-74 years old) with gradable liver / gallbladder ultrasound results (n=13,856), we first excluded individuals with heavy drinking (men >21drinks / week, women >14 drinks/ week), viral hepatitis (serum hepatitis B surface antigen positive and /or serum hepatitis C antibody positive), iron overload (transferring saturation≥50%). In addition, participants with incomplete or missing data on mortality, physical examinations and laboratory tests were also excluded (Figure 1).

Figure 1. Study design and data partitioning flow chart.

NAFLD, non-alcoholic fatty liver disease.

NHANES-III examination includes gallbladder ultrasonography in adults aged 20-74. In order to evaluate fatty liver, the gallbladder ultrasound images were examined by three committee certified radiologists. The following five criteria were used in the review process: (I) parenchymal brightness, (II) liver to kidney contrast, (III) deep beam attenuation, (IV) bright vascular walls, and (V) gallbladder wall definition. The degree of hepatic steatosis were reported as normal, mild, moderate or severe according to these five criteria. In this study, NAFLD was defined as mild to severe hepatic steatosis, excluding whatever known causes of liver disease.

Variable selection and outcome

In this study, twenty-nine NAFLD related factors were included, such as demographic features (age, gender and ethnicity), general measurement (waist circumference, body mass index (BMI), systolic blood pressure (SBP) and diastolic blood pressure (DBP)), biochemistry tests (WBC, PLT, C-reactive protein (CRP), iron, total iron-binding capacity (TBIL), ferritin, transferrin saturation, total cholesterol, triglyceride, high-density lipoprotein (HDL) cholesterol, and uric acid), diabetes testing profile (fasting plasma glucose, glycated hemoglobin (HbA1c), fasting C-peptide and fasting insulin),and liver chemistry (aspartate aminotransferase (AST), alanine aminotransferase (ALT), alkaline phosphatase (ALP), gamma glutamyl transferase (GGT), albumin and total bilirubin).

The outcome was set as the passive mortality as of December 31, 2019 according to the follow up time of NHANES-III. For assessing the state of death (including the date of death), we performed probability matching with the NDI records. NHANES-III related death documents use the ucod_113 to code the deaths before 1998 and between 1999 and 2015, which are coded according to the Ninth Revision of the international classification of diseases (ICD-9).

Development and validation of machine learning models

The ML models in our study, including LR, KNN, Decision tree, RF and XGBoost, were trained using the selected 29 variables to predict mortality, and then 10-fold stratified cross-validation was adopted in the training process to avoid overfitting of the mode. Briefly speaking, the training data was divided into 10 hierarchical subsets, followed by using 9 subsets to train the model, and using the left one subset for verification. These training and verification processes were repeated 10 times, and each subset was used once as the verification dataset, so that we could obtain 10 estimates of prediction accuracy, and these estimates were averaged to obtain a single estimate. For the LR model, we described the absolute value of standardized beta coefficient, while for the RF and XGBoost, the feature importance was showed of each model.

Testing data were used to verify the performance of developed ML models, which was independent from the training process. Accuracy, sensitivity or recall, specificity, precision, AUC and the F1 score (the harmonic average of recall and precision) were taken as performance indicators, and then compared with those of three conventional NAFLD scores (FIB-4, NFS, APRI) on the testing dataset, where we also established calibration plots in order to observe the coherence between predicted and observed mortality during follow-up. A good calibration degree shows that from the model explanation to the random samples prediction, the predicted value of the model is closer to the actual probability of the results.

Statistical analysis

To present the patient characteristics, the mean of standard deviation (SD) for numerical variables and percentage counting for the categorical variables were used. We use Student t-test to compare the mean value between two samples, and chi-square test to compare the frequencies. For all tests, the bilateral significance level <5% was considered statistically significant (p<0.05). All statistical analyses were conducted using RStudio software (Macintosh; Intel Mac OS X 12_5_0).

Results

Characteristics of study subjects

3,233 patients with NAFLD met the inclusion criteria and were categorized into two groups at random: training set (70%, N = 2262) as well as testing set (30%, N = 971). Figure 1 shows the patient screening process. The overall mortality in patients with NAFLD is 38.7% during the average 25.3 years follow up time. The baseline characteristics of patients according to whether died or not in the training and testing sets are described in Table 1. Compared with the patients finally died, those survival NAFLD patients were more likely to be young people, women, Mexican Americans. In addition, BMI, waist circumference, SBP, DBP, TC, TG, CRP, uric acid, GGT, ALP, HbA1c, FPG, fasting C-peptide, fasting insulin of these people were lower, and PLT, TIBC, ALT, albumin were higher. There were no significant differences between the training set and testing set for all factors.

View this table:

Table 1. Baseline characteristics of the data set.

Model building and evaluation

Five ML methods, including LR, decision tree, RF, KNN and XGBoost, with all the 29 factors inputted were built for the mortality prediction of NAFLD patients. Absolute values of standardized beta coefficients for LR and feature importance for XGBoost and RF models were assessed and the results were shown in Figure 2. Age was the most important factor among all the ML models to predict mortality during follow-up period. SBP and glucose level were listed as the top 5 important variables of RF and XGBoost. The other essential factors for the models development were iron, transferrin saturation, BMI and uric acid for the LR, DBP and waist circumference for the RF, and HbA1c as well as C-peptide for the XGBoost.

Figure 2. (A) Absolute values of standardized beta coefficients for the logistic regression model. (B) Feature importances of variables for the random forest model. (C) Feature importances of variables for the XGBoost model.

BMI, Body mass index; UA, uric acid; SBP, systolic blood pressure; DBP, diastolic blood pressure; TC, total cholesterol; TG, triglyceride; HDL, high-density lipoprotein cholesterol; CRP, C-reactive protein; ALT, alanine aminotransferase; AST, aspartate aminotransferase; GGT, gamma glutamyl transferase; ALP, alkaline phosphatase; TBIL, total bilirubin; HbA1c, glycated hemoglobin; FIBC, total iron-binding capacity.

After 10-fold cross-validation, the training accuracy of ML models were 0.807 for the LR, 0.807 for the decision tree, 0.814 for the RF, 0.692 for the KNN, and 0.808 for the XGBoost.

Models performance analysis

The receiver operating characteristic (ROC) curves with AUC values of the five ML models in the testing data is shown in Figure 3. Validation of our developed ML models showed reliable performance for the mortality prediction in NAFLD patients, whose AUC values were: LR, 0.888 (0.867–0.909); RF, 0.876 (0.852–0.897); XGBoost, 0875 (0.853–0.898); decision tree, 0.793 (0.766–0.819) and KNN, 0.787 (0.759–0.816), respectively. The F1 score of above models were 0.765, 0.759, 0.745, 0.744 and 0.759, respectively. The other evaluation measures of the prediction models, including accuracy, sensitivity, specificity, positive predictive value (PPV) and negative predictive value (NPV) are described in Table 2. Among all the evaluated classifiers, the LR model had the highest sensitivity of 0.819 and NPV of 0.878. The specificity, PPV and accuracy of the RF model were the highest on the other hand, which were 0.837, 0.745 and 0.813, respectively. Table 2 shows the performance of NFS, FIB-4 and APRI on the testing data at the same time, and among all the conventional non-invasive scores, FIB-4 showed the best performance, whose accuracy was 73.0% and F1 score was 0.765. Nevertheless, the performance of all the ML models were superior to FIB-4 in all metrics.

Figure 3. Comparison of ROC curves and AUC among the developed machine-learning models and among the conventional non-invasive scores for mortality prediction.

ROC, receiver operating characteristic; AUC, area under the curve; NFS, NAFLD fibrosis score; FIB-4, fibrosis-4 score; APRI, aspartate aminotransferase-to-platelet ratio index.

View this table:

Table 2. The performance of machine learning models and conventional non-invasive scores on testing data.

AUC, area under the curve; CI, confidence interval; NPV, negative predictive value; PPV, positive predictive value; NFS, NAFLD fibrosis score; FIB-4, fibrosis-4 score; APRI, aspartate aminotransferase-to-platelet ratio index.

Finally, Supplementary Figure 1 shows the probability calibration curves of the ML models in validation and it can be seen that the predicted probability of all the models was uncertainty because they were not well-calibrated and underestimated.

Discussion

To our knowledge, this study is the first time that the NAFLD mortality prediction model based on ML has been developed and evaluated. In conclusion, we selected 29 clinical variables for NAFLD mortality prediction from the NHANES-III database, which were important in the liver diseases and generally readily available at hospital admission. After using several ML algorithms of LR, decision tree, RF, KNN and XGBoost to train these variables, the verification of the developed models revealed dependable performance with relatively high AUC values. The LR model had both a higher AUC and F1 score, which indicated a superior performance in the death classification of NAFLD patients, and showed better performance than that of decision tree, RF, KNN and XGBoost.

On the other hand, we found that the decision tree model which consisted of only three factors: age, systolic blood pressure and HbA1c (Figure 4), although not showed the best performance among all the ML models, it had a certain degree of value (accuracy was 80.1%; AUC was 0.793 and F1 score was 0.744) in terms of testing performance. The decision tree model is easy to explain and use, so it can be used more practically in clinical practice.

Figure 4. The decision logic of decision tree.

SBP, systolic blood pressure; HbA1c, glycated hemoglobin.

The performance of some conventional non-invasive scores, like FIB-4, NFS and APRI for the overall mortality prediction in NAFLD patients was also showed in our study. AUC values for the overall mortality were: FIB-4, 0.793 (0.777–0.809); NFS, 0.770 (0.753–0.787) and APRI, 0.522 (0.502–0.543), respectively, which were closely similar to the results of a retrospective analysis of 646 biopsy-proven patients with NAFLD (AUC for the overall mortality were FIB-4, 0.72 (0.68–0.76); NFS, 0.72(0.68–0.76) and APRI, 0.52 (0.47–0.57)), indicating that these scoring systems were insufficient for clinical use.

On the other hand, the availability of ML methods in the development of medical prediction models have been proven in recent years^{22, 23}. In the same way, by using a ML algorithm, a well-done prognostic model for NAFLD has been successfully developed in our study. With regard to the AUC values, the models we developed showed statistical advantages over the conventional non-invasive scores, and the F1 scores in the developed models were also significantly higher, which incited the validity of ML models in detecting NAFLD mortality. However, the calibration chart showed that the prediction of the probability of results was underestimated or overestimated, indicating that these models were only applicable to classification problems.

Our predictive models have the latent capacity for use in the clinical practice. Since we only used demographic characteristics and laboratory data as the predictor variables which are easily obtained, clinicians can use the predictive results as a reference tool to initiate treatment as early as possible. In addition, the model can be used to retrospectively evaluate the quality of care in NAFLD treatment. Nevertheless, ML models should not be used as an explicit tool to decide the withdraw of treatment.

Age is the most important factor not only in the decision tree model, but also ranks the first in all the other ML models. Many studies have proven that age is an independent risk factor for liver fibrosis in NAFLD patients^{24, 25}, and liver fibrosis is also an independent risk factor for liver-related mortality^26–28. Age is in correlation with increased cardiovascular mortality as the independent risk factor in NAFLD at the same time^{29, 30}. So the role of age in all-cause mortality can be rationally explained.

Systolic blood pressure and HbA1c are also components of the decision tree to predict the all-cause mortality. With the deepening of research, people’s understanding of NAFLD is no longer limited to the liver itself, but as a major performance of metabolic syndrome (MetS) in the liver, which is closely associated with hypertension, obesity, dyslipidemia, type 2 diabetes (T2DM), insulin resistance (IR) and cardiovascular disease^{31, 32}. Although patients with NAFLD, especially those with nonalcoholic steatohepatitis (NASH), have an increased risk of liver-related death, there is evidence that cardiovascular disease (CV) risk factors like hypertension, obesity, IR as well as T2DM are the major drivers of morbidity and mortality in NAFLD patients^33–35. A recent study using NHANES-III database found poor glycaemic control (adjusted population-attributable fraction (PAF)=28.3% for all-cause mortality) and hypertension (adjusted PAF=23% for all-cause mortality) were the largest contributors to mortality for NAFLD patients and reaching desirable glycaemic control (HbA1c of <5.7%) could avoid 28.3% of all-cause deaths³⁶.

Our study has several limitations. First, we all know that liver biopsy is the gold standard for NAFLD diagnosis, but it was ultrasound-proven in our study. Nevertheless, in population-based studies, it is the dominant imaging approach for NAFLD diagnosis and is available in primary care settings; Second, missing data is an unavoidable nature of NHANES III population dataset; Third, since we used the US registration database for training and verification of the model, we should use foreign databases for external verification in the future. However, there was no external data set like NHANES III that can be used to validate the model at the time of writing this paper; Fourth, due to the use of randomization in the modeling process, such as data segmentation, cross validation and the creation of some ML models, it may not be possible to completely reproduce the ML algorithm in our research. Finally, it might be criticized that ML models need a computing device to calculate results, and it is unrealistic to only use a single model for NAFLD patients. Since the features we selected are mainly patient background and laboratory data, we suggest that we can use ML models as a plugin for electrical health records after completing a prospective study of further performance improvement and future external validation.

But there are also some strengths in our study. First, to our knowledge, this study firstly assessed the performance of ML models in predicting all-cause mortality in NAFLD patients, based on over 3000 US individuals from NHANES III. Second, we proposed a simple model with rational performance of mortality prediction in patients with NAFLD, which will potentially be used by primary care providers in clinical practice.

Conclusion

In conclusion, a new mortality prediction model for NAFLD patients in the USA was developed using ML technology. The LR model performed best in our study, using the AUC and F1 score for measurement. On the other hand, the decision tree model, which is composed of age, systolic blood pressure and HbA1c, can produce a rational prediction performance, and the most important thing is that it is the simplest to use. Although we need to further improve the quality of performance by increasing the sample size or conducting prospective validation in the clinical environment, our research has demonstrated for the first time the potential of NAFLD prediction models based on machine learning.

Author contributions

Study concept and design (BF), acquisition, analysis and interpretation of data, drafting of the manuscript (J-RZ), critical revision of the manuscript for important intellectual content (Z-LW), and study supervision (H-S C). All authors have made a significant contribution to this study and have approved the final manuscript.

Funding

The work was supported in part by a grant from the National Major Project for Infectious Diseases Prevention and Treatment (No. 2017ZX10302201-004-001, 2017ZX10203202- 003-003).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

1.↵
Sheka AC, Adeyi O, Thompson J, et al. Nonalcoholic Steatohepatitis: A Review. Jama 2020;323:1175–1183.
OpenUrl CrossRef PubMed Google Scholar
2.
Younossi ZM, Koenig AB, Abdelatif D, et al. Global epidemiology of nonalcoholic fatty liver disease-Meta-analytic assessment of prevalence, incidence, and outcomes. Hepatology 2016;64:73–84.
OpenUrl CrossRef PubMed Google Scholar
3.↵
Younossi ZM, Blissett D, Blissett R, et al. The economic and clinical burden of nonalcoholic fatty liver disease in the United States and Europe. Hepatology 2016;64:1577–1586.
OpenUrl CrossRef PubMed Google Scholar
4.↵
Younossi Z, Stepanova M, Ong JP, et al. Nonalcoholic Steatohepatitis Is the Fastest Growing Cause of Hepatocellular Carcinoma in Liver Transplant Candidates. Clin Gastroenterol Hepatol 2019;17:748–755.e3.
OpenUrl CrossRef PubMed Google Scholar
5.↵
Younossi ZM. Non-alcoholic fatty liver disease - A global public health perspective. J Hepatol 2019;70:531–544.
OpenUrl CrossRef PubMed Google Scholar
6.↵
Kim D, Vazquez-Montesino LM, Escober JA, et al. Low Thyroid Function in Nonalcoholic Fatty Liver Disease Is an Independent Predictor of All-Cause and Cardiovascular Mortality. Am J Gastroenterol 2020;115:1496–1504.
OpenUrl Google Scholar
7.↵
Hagström H, Nasr P, Ekstedt M, et al. Fibrosis stage but not NASH predicts mortality and time to development of severe liver disease in biopsy-proven NAFLD. J Hepatol 2017;67:1265–1273.
OpenUrl CrossRef PubMed Google Scholar
8.↵
Taylor RS, Taylor RJ, Bayliss S, et al. Association Between Fibrosis Stage and Outcomes of Patients With Nonalcoholic Fatty Liver Disease: A Systematic Review and Meta-Analysis. Gastroenterology 2020;158:1611–1625.e12.
OpenUrl CrossRef PubMed Google Scholar
9.↵
Angulo P, Hui JM, Marchesini G, et al. The NAFLD fibrosis score: a noninvasive system that identifies liver fibrosis in patients with NAFLD. Hepatology 2007;45:846–54.
OpenUrl CrossRef PubMed Web of Science Google Scholar
10.↵
Shah AG, Lydecker A, Murray K, et al. Comparison of noninvasive markers of fibrosis in patients with nonalcoholic fatty liver disease. Clin Gastroenterol Hepatol 2009;7:1104–12.
OpenUrl CrossRef PubMed Google Scholar
11.↵
Wai CT, Greenson JK, Fontana RJ, et al. A simple noninvasive index can predict both significant fibrosis and cirrhosis in patients with chronic hepatitis C. Hepatology 2003;38:518–26.
OpenUrl CrossRef PubMed Web of Science Google Scholar
12.↵
Younes R, Caviglia GP, Govaere O, et al. Long-term outcomes and predictive ability of non-invasive scoring systems in patients with non-alcoholic fatty liver disease. J Hepatol 2021;75:786–794.
OpenUrl Google Scholar
13.↵
Hagström H, Nasr P, Ekstedt M, et al. Accuracy of Noninvasive Scoring Systems in Assessing Risk of Death and Liver-Related Endpoints in Patients With Nonalcoholic Fatty Liver Disease. Clin Gastroenterol Hepatol 2019;17:1148–1156.e4.
OpenUrl Google Scholar
14.↵
Liu CH, Ampuero J, Pavlides M, et al. Simple non-invasive scoring systems and histological scores in predicting mortality in patients with non-alcoholic fatty liver disease: A systematic review and meta-analysis. J Gastroenterol Hepatol 2021;36:1754–1768.
OpenUrl Google Scholar
15.
Ahn JC, Connell A, Simonetto DA, et al. Application of Artificial Intelligence for the Diagnosis and Treatment of Liver Diseases. Hepatology 2021;73:2546–2563.
OpenUrl Google Scholar
16.↵
Schwalbe N, Wahl B. Artificial intelligence and the future of global health. Lancet 2020;395:1579–1586.
OpenUrl CrossRef PubMed Google Scholar
17.↵
Spann A, Yasodhara A, Kang J, et al. Applying Machine Learning in Liver Disease and Transplantation: A Comprehensive Review. Hepatology 2020;71:1093–1105.
OpenUrl CrossRef PubMed Google Scholar
18.↵
Chang D, Truong E, Mena EA, et al. Machine learning models are superior to noninvasive tests in identifying clinically significant stages of NAFLD and NAFLD-related cirrhosis. Hepatology 2022.
Google Scholar
19.
Choi KJ, Jang JK, Lee SS, et al. Development and Validation of a Deep Learning System for Staging Liver Fibrosis by Using Contrast Agent-enhanced CT Images in the Liver. Radiology 2018;289:688–697.
OpenUrl CrossRef PubMed Google Scholar
20.
Ahmed Y, Hussein RS, Basha TA, et al. Detecting liver fibrosis using a machine learning-based approach to the quantification of the heart-induced deformation in tagged MR images. NMR Biomed 2020;33:e4215.
OpenUrl Google Scholar
21.↵
Wei R, Wang J, Wang X, et al. Clinical prediction of HBV and HCV related hepatic fibrosis using machine learning. EBioMedicine 2018;35:124–132.
OpenUrl Google Scholar
22.↵
Meyer A, Zverinski D, Pfahringer B, et al. Machine learning for real-time prediction of complications in critical care: a retrospective study. Lancet Respir Med 2018;6:905–914.
OpenUrl Google Scholar
23.↵
Raghunath S, Ulloa Cerna AE, Jing L, et al. Prediction of mortality from 12-lead electrocardiogram voltage data using a deep neural network. Nat Med 2020;26:886–891.
OpenUrl PubMed Google Scholar
24.↵
Huh Y, Cho YJ, Nam GE. Recent Epidemiology and Risk Factors of Nonalcoholic Fatty Liver Disease. J Obes Metab Syndr 2022;31:17–27.
OpenUrl Google Scholar
25.↵
Golabi P, Paik J, Reddy R, et al. Prevalence and long-term outcomes of non-alcoholic fatty liver disease among elderly individuals from the United States. BMC Gastroenterol 2019;19:56.
OpenUrl CrossRef Google Scholar
26.↵
Vernon G, Baranova A, Younossi ZM. Systematic review: the epidemiology and natural history of non-alcoholic fatty liver disease and non-alcoholic steatohepatitis in adults. Aliment Pharmacol Ther 2011;34:274–85.
OpenUrl CrossRef PubMed Google Scholar
27.
Ekstedt M, Hagström H, Nasr P, et al. Fibrosis stage is the strongest predictor for disease-specific mortality in NAFLD after up to 33 years of follow-up. Hepatology 2015;61:1547–54.
OpenUrl CrossRef PubMed Google Scholar
28.↵
Powell EE, Wong VW, Rinella M. Non-alcoholic fatty liver disease. Lancet 2021;397:2212–2224.
OpenUrl CrossRef PubMed Google Scholar
29.↵
Golabi P, Otgonsuren M, de Avila L, et al. Components of metabolic syndrome increase the risk of mortality in nonalcoholic fatty liver disease (NAFLD). Medicine (Baltimore) 2018;97:e0214.
OpenUrl PubMed Google Scholar
30.↵
Adams LA, Anstee QM, Tilg H, et al. Non-alcoholic fatty liver disease and its relationship with cardiovascular disease and other extrahepatic diseases. Gut 2017;66:1138–1153.
OpenUrl Abstract/FREE Full Text Google Scholar
31.↵
Tilg H, Moschen AR, Roden M. NAFLD and diabetes mellitus. Nat Rev Gastroenterol Hepatol 2017;14:32–42.
OpenUrl CrossRef PubMed Google Scholar
32.↵
Eslam M, Sanyal AJ, George J. MAFLD: A Consensus-Driven Proposed Nomenclature for Metabolic Associated Fatty Liver Disease. Gastroenterology 2020;158:1999–2014.e1.
OpenUrl CrossRef PubMed Google Scholar
33.↵
Dunn W, Xu R, Wingard DL, et al. Suspected nonalcoholic fatty liver disease and mortality risk in a population-based cohort study. Am J Gastroenterol 2008;103:2263–71.
OpenUrl CrossRef PubMed Google Scholar
34.
Younossi ZM, Otgonsuren M, Venkatesan C, et al. In patients with non-alcoholic fatty liver disease, metabolically abnormal individuals are at a higher risk for mortality while metabolically normal individuals are not. Metabolism 2013;62:352–60.
OpenUrl CrossRef PubMed Google Scholar
35.↵
Björkström K, Franzén S, Eliasson B, et al. Risk Factors for Severe Liver Disease in Patients With Type 2 Diabetes. Clin Gastroenterol Hepatol 2019;17:2769–2775.e4.
OpenUrl Google Scholar
36.↵
Paik JM, Deshpande R, Golabi P, et al. The impact of modifiable risk factors on the long-term outcomes of non-alcoholic fatty liver disease. Aliment Pharmacol Ther 2020;51:291–304.
OpenUrl CrossRef Google Scholar

Comments

medRxiv aims to provide a venue for anyone to comment on a medRxiv preprint. Comments are moderated for offensive or irrelevant content (this can take ~24 h). Please avoid duplicate submissions and read our Comment Policy before commenting. The content of a comment is not endorsed by medRxiv.

Community Reviews

medRxiv aims to inform readers about online discussion of this preprint occurring elsewhere. The content at the links below is not endorsed by either medRxiv or the preprint's authors.

Community reviews for this article:

There are no community reviews for this paper.

Automated Evaluations

Certain services provide automated analysis of preprints. Analyses invited by the authors are displayed at the top of this tab. Those done independently of authors are shown underneath . None of these analyses is endorsed by medRxiv.

Automated Evaluations:

There are no automated evaluations for this paper.

[1] 1.↵
Sheka AC, Adeyi O, Thompson J, et al. Nonalcoholic Steatohepatitis: A Review. Jama 2020;323:1175–1183.
OpenUrl CrossRef PubMed Google Scholar

[2] 2.
Younossi ZM, Koenig AB, Abdelatif D, et al. Global epidemiology of nonalcoholic fatty liver disease-Meta-analytic assessment of prevalence, incidence, and outcomes. Hepatology 2016;64:73–84.
OpenUrl CrossRef PubMed Google Scholar

[3] 3.↵
Younossi ZM, Blissett D, Blissett R, et al. The economic and clinical burden of nonalcoholic fatty liver disease in the United States and Europe. Hepatology 2016;64:1577–1586.
OpenUrl CrossRef PubMed Google Scholar

[4] 4.↵
Younossi Z, Stepanova M, Ong JP, et al. Nonalcoholic Steatohepatitis Is the Fastest Growing Cause of Hepatocellular Carcinoma in Liver Transplant Candidates. Clin Gastroenterol Hepatol 2019;17:748–755.e3.
OpenUrl CrossRef PubMed Google Scholar

[5] 5.↵
Younossi ZM. Non-alcoholic fatty liver disease - A global public health perspective. J Hepatol 2019;70:531–544.
OpenUrl CrossRef PubMed Google Scholar

[6] 6.↵
Kim D, Vazquez-Montesino LM, Escober JA, et al. Low Thyroid Function in Nonalcoholic Fatty Liver Disease Is an Independent Predictor of All-Cause and Cardiovascular Mortality. Am J Gastroenterol 2020;115:1496–1504.
OpenUrl Google Scholar

[7] 7.↵
Hagström H, Nasr P, Ekstedt M, et al. Fibrosis stage but not NASH predicts mortality and time to development of severe liver disease in biopsy-proven NAFLD. J Hepatol 2017;67:1265–1273.
OpenUrl CrossRef PubMed Google Scholar

[8] 8.↵
Taylor RS, Taylor RJ, Bayliss S, et al. Association Between Fibrosis Stage and Outcomes of Patients With Nonalcoholic Fatty Liver Disease: A Systematic Review and Meta-Analysis. Gastroenterology 2020;158:1611–1625.e12.
OpenUrl CrossRef PubMed Google Scholar

[9] 9.↵
Angulo P, Hui JM, Marchesini G, et al. The NAFLD fibrosis score: a noninvasive system that identifies liver fibrosis in patients with NAFLD. Hepatology 2007;45:846–54.
OpenUrl CrossRef PubMed Web of Science Google Scholar

[10] 10.↵
Shah AG, Lydecker A, Murray K, et al. Comparison of noninvasive markers of fibrosis in patients with nonalcoholic fatty liver disease. Clin Gastroenterol Hepatol 2009;7:1104–12.
OpenUrl CrossRef PubMed Google Scholar

[11] 11.↵
Wai CT, Greenson JK, Fontana RJ, et al. A simple noninvasive index can predict both significant fibrosis and cirrhosis in patients with chronic hepatitis C. Hepatology 2003;38:518–26.
OpenUrl CrossRef PubMed Web of Science Google Scholar

[12] 12.↵
Younes R, Caviglia GP, Govaere O, et al. Long-term outcomes and predictive ability of non-invasive scoring systems in patients with non-alcoholic fatty liver disease. J Hepatol 2021;75:786–794.
OpenUrl Google Scholar

[13] 13.↵
Hagström H, Nasr P, Ekstedt M, et al. Accuracy of Noninvasive Scoring Systems in Assessing Risk of Death and Liver-Related Endpoints in Patients With Nonalcoholic Fatty Liver Disease. Clin Gastroenterol Hepatol 2019;17:1148–1156.e4.
OpenUrl Google Scholar

[14] 14.↵
Liu CH, Ampuero J, Pavlides M, et al. Simple non-invasive scoring systems and histological scores in predicting mortality in patients with non-alcoholic fatty liver disease: A systematic review and meta-analysis. J Gastroenterol Hepatol 2021;36:1754–1768.
OpenUrl Google Scholar

[15] 15.
Ahn JC, Connell A, Simonetto DA, et al. Application of Artificial Intelligence for the Diagnosis and Treatment of Liver Diseases. Hepatology 2021;73:2546–2563.
OpenUrl Google Scholar

[16] 16.↵
Schwalbe N, Wahl B. Artificial intelligence and the future of global health. Lancet 2020;395:1579–1586.
OpenUrl CrossRef PubMed Google Scholar

[17] 17.↵
Spann A, Yasodhara A, Kang J, et al. Applying Machine Learning in Liver Disease and Transplantation: A Comprehensive Review. Hepatology 2020;71:1093–1105.
OpenUrl CrossRef PubMed Google Scholar

[18] 18.↵
Chang D, Truong E, Mena EA, et al. Machine learning models are superior to noninvasive tests in identifying clinically significant stages of NAFLD and NAFLD-related cirrhosis. Hepatology 2022.
Google Scholar

[19] 19.
Choi KJ, Jang JK, Lee SS, et al. Development and Validation of a Deep Learning System for Staging Liver Fibrosis by Using Contrast Agent-enhanced CT Images in the Liver. Radiology 2018;289:688–697.
OpenUrl CrossRef PubMed Google Scholar

[20] 20.
Ahmed Y, Hussein RS, Basha TA, et al. Detecting liver fibrosis using a machine learning-based approach to the quantification of the heart-induced deformation in tagged MR images. NMR Biomed 2020;33:e4215.
OpenUrl Google Scholar

[21] 21.↵
Wei R, Wang J, Wang X, et al. Clinical prediction of HBV and HCV related hepatic fibrosis using machine learning. EBioMedicine 2018;35:124–132.
OpenUrl Google Scholar

[22] 22.↵
Meyer A, Zverinski D, Pfahringer B, et al. Machine learning for real-time prediction of complications in critical care: a retrospective study. Lancet Respir Med 2018;6:905–914.
OpenUrl Google Scholar

[23] 23.↵
Raghunath S, Ulloa Cerna AE, Jing L, et al. Prediction of mortality from 12-lead electrocardiogram voltage data using a deep neural network. Nat Med 2020;26:886–891.
OpenUrl PubMed Google Scholar

[24] 24.↵
Huh Y, Cho YJ, Nam GE. Recent Epidemiology and Risk Factors of Nonalcoholic Fatty Liver Disease. J Obes Metab Syndr 2022;31:17–27.
OpenUrl Google Scholar

[25] 25.↵
Golabi P, Paik J, Reddy R, et al. Prevalence and long-term outcomes of non-alcoholic fatty liver disease among elderly individuals from the United States. BMC Gastroenterol 2019;19:56.
OpenUrl CrossRef Google Scholar

[26] 26.↵
Vernon G, Baranova A, Younossi ZM. Systematic review: the epidemiology and natural history of non-alcoholic fatty liver disease and non-alcoholic steatohepatitis in adults. Aliment Pharmacol Ther 2011;34:274–85.
OpenUrl CrossRef PubMed Google Scholar

[27] 27.
Ekstedt M, Hagström H, Nasr P, et al. Fibrosis stage is the strongest predictor for disease-specific mortality in NAFLD after up to 33 years of follow-up. Hepatology 2015;61:1547–54.
OpenUrl CrossRef PubMed Google Scholar

[28] 28.↵
Powell EE, Wong VW, Rinella M. Non-alcoholic fatty liver disease. Lancet 2021;397:2212–2224.
OpenUrl CrossRef PubMed Google Scholar

[29] 29.↵
Golabi P, Otgonsuren M, de Avila L, et al. Components of metabolic syndrome increase the risk of mortality in nonalcoholic fatty liver disease (NAFLD). Medicine (Baltimore) 2018;97:e0214.
OpenUrl PubMed Google Scholar

[30] 30.↵
Adams LA, Anstee QM, Tilg H, et al. Non-alcoholic fatty liver disease and its relationship with cardiovascular disease and other extrahepatic diseases. Gut 2017;66:1138–1153.
OpenUrl Abstract/FREE Full Text Google Scholar

[31] 31.↵
Tilg H, Moschen AR, Roden M. NAFLD and diabetes mellitus. Nat Rev Gastroenterol Hepatol 2017;14:32–42.
OpenUrl CrossRef PubMed Google Scholar

[32] 32.↵
Eslam M, Sanyal AJ, George J. MAFLD: A Consensus-Driven Proposed Nomenclature for Metabolic Associated Fatty Liver Disease. Gastroenterology 2020;158:1999–2014.e1.
OpenUrl CrossRef PubMed Google Scholar

[33] 33.↵
Dunn W, Xu R, Wingard DL, et al. Suspected nonalcoholic fatty liver disease and mortality risk in a population-based cohort study. Am J Gastroenterol 2008;103:2263–71.
OpenUrl CrossRef PubMed Google Scholar

[34] 34.
Younossi ZM, Otgonsuren M, Venkatesan C, et al. In patients with non-alcoholic fatty liver disease, metabolically abnormal individuals are at a higher risk for mortality while metabolically normal individuals are not. Metabolism 2013;62:352–60.
OpenUrl CrossRef PubMed Google Scholar

[35] 35.↵
Björkström K, Franzén S, Eliasson B, et al. Risk Factors for Severe Liver Disease in Patients With Type 2 Diabetes. Clin Gastroenterol Hepatol 2019;17:2769–2775.e4.
OpenUrl Google Scholar

[36] 36.↵
Paik JM, Deshpande R, Golabi P, et al. The impact of modifiable risk factors on the long-term outcomes of non-alcoholic fatty liver disease. Aliment Pharmacol Ther 2020;51:291–304.
OpenUrl CrossRef Google Scholar

Machine learning-based mortality prediction models for non-alcoholic fatty liver disease in the general United States population

Abstract

Introduction