Artificial Intelligence for the Diagnosis and Treatment of Diabetes Kidney Disease: a systematic review
=======================================================================================================

* Shams Mohammad Abrar

## Abstract

**Background** Diabetic nephropathy (DN) is a serious microvascular complication that affects 40% of diabetes patients. In the last decade, artificial intelligence (AI) has been widely used in both structured and unstructured clinical data to improve the treatment of patients/potential patients with DN.

**Methods** This systematic review aims to cover all applications of AI in the clinical use of DN or related topics. Studies were searched in four open-access databases (Pubmed, IEEE Xplore, DBLP Computer Science Bibliography, and ACM digital library). Finally, the author manually searched the reference list of included studies in the study for additional relevant articles.

**Results** Finally, a total of 24 original peers reviewed articles were included in this study. Through a manual data extraction, the summary of key information such as applied AI algorithm, main outcomes, performance evaluation etc. was taken. Then the included studies underwent a quality assessment criterion, assessing the reproducibility, generalizability etc. Most of the included studies revealed that the AI frameworks outperformed conventional statistical methods. A summary of the limitations, such as lack of data availability or external validation of the framework, in the included studies, was also included.

**Conclusion** The rapid advancement of the AI framework and the exponential data generation in healthcare can be utilized and applied in clinical practices. The aid of AI can be instrumental in the treatment of DN.

Keywords
*   Diabetic kidney disease
*   diabetes
*   artificial intelligence
*   nephropathy
*   systematic review
*   machine learning
*   deep learning

## 1. Introduction

Diabetes Mellitus (DM), a devastating incurable health complication, is caused due to the incapability of beta cells (β cells) in the pancreas to produce adequate effective insulin for the body to utilize - leading to chronic high levels of blood glucose (1). Globally diabetes is considered a major public threat. According to the International Diabetes Federation (IDF), every eleventh person on the planet (463 million, 95% confidence interval: 361-601 million) has diabetes, and half of these patients are undiagnosed due to the disease’s complex pathogenesis. Furthermore, diabetes accounts for ten per cent of global health expenditure (USD 760 billion) (2). Several factors have been linked to the onset of DM in patients. Inactive lifestyles, obesity, a lack of awareness, and other factors have all been linked to the recent global increase in diabetes (3).

DM, on the other hand, results in a variety of macro and microvascular complications as a result of hyperglycemia or hypoglycemia due to the lack of blood glucose (BG) control. Diabetic Nephropathy (DN)/ diabetic kidney disease (DKD) is one of the microvascular complications, which is the most common cause of end-stage renal disease in the world (4-7). DN is caused due to increased urine albumin excretion and/or impairment. It is characterized by gradual loss of kidney function from hyperfiltration at the early stage (8). The principal risk factors involving DKD are hyperglycemia, hypertension and genetic predisposition. To diagnose DN early on, it is recommended to screen microalbuminuria five years after diagnosis for type 1 diabetes (T1D) and yearly screening for type 2 diabetes (T2D) (9). However, the amount of DM patients being affected by kidney disease is rising rapidly. Prevalence of microalbuminuria is seen in about one-third of DM patients after fifteen years of disease duration, and almost half of them develop DN. Often DN occurs in patients of a certain age group or demographics. It is found that DKD is mostly common among elderly patients, very unlikely to develop among patients with less than one decade of diabetes duration (10). Meanwhile, the risk of developing DKD is higher among patients from developing countries compared to developed countries (26). The cost of bearing dialysis and other casualties is very high. As DN is more prevalent in developing countries, it almost becomes impossible for the patient’s family to bear the expenses. Meanwhile, the mortality rate of DN patients is very high, and early diagnosis or adequate treatment is necessary. Around 10% of the death in T2D is due to kidney failure (11).

Despite growing concerns and preventative measures of DKD, the rate of chronic kidney disease (CKD) among DM patients remains unchanged as of 20 years ago. Even though DN might be controlled by the maintenance of blood glucose, blood pressure, blood cholesterol and widespread use of renin-angiotensin-aldosterone system inhibitors (9), it only works at the early stages. On the other hand, the progression of DKD of individual patients shows a large variation due to the complex heterogeneous nature of the disease (41). At later stages of DN, when CKD is well developed, it is harder to control (12). It is found that early diagnosis of microalbuminuria among DM patients and monitoring will postpone or prevent overt nephropathy (13,14). As a result of risk verification or early diagnosis of DN a large sum of the socioeconomic burden suppressed on the patients and their families can be avoided - where the further progression of DN could be stopped (12).

With the help of machine learning (ML), a branch of artificial intelligence (AI) diagnosis, prognosis and clinical management of DKD patients without direct human intervention is possible. These algorithms are capable of extracting clinically relevant information from medical data with great accuracy (8). Without being explicitly programmed, machine learning algorithms can produce non-linear correlations and patterns from raw data (15). Deep learning (DL), a branch of machine learning that is aided by increased computational capabilities, entails training a large number of neural networks with many layers on large data sets. As a result, deep learning algorithms are useful for image classification in medical datasets (11). This systematic review covers the application of AI in DKD, which includes the early detection of DN to the management of DN through the application of AI.

Several systematic reviews have examined the use of AI in diabetes. Zhu et al. investigated the use of DL in DM, searching multiple databases and obtaining 40 studies (16). A total of 107 peer-reviewed research articles were included in a study by Jyotismita Chaki et al. on the use of AI in the detection and self-management of diabetic patients (17). Ioannis Kavakoitis et al. discussed the use of machine learning and data mining in diabetes care (18). These studies, on the other hand, cover the broader application of AI in DM patients. They are not focused on any specific complications. Meanwhile, Katrine et al. conducted a systematic review on the use of DL algorithms in the screening for diabetic retinopathy (DR), evaluating the diagnostic performance of DL algorithms. Diabetic retinopathy is the most common diabetic microvascular complication. The study included a total of 20 articles (19). Miguel Tejedor et al. discussed the use of reinforcement learning, a type of machine learning in which the agent learns in an interactive environment through trial and error and feedback, in diabetic glucose control (20). A systematic review with meta-analysis was performed by Md Mohaimenul et al. about the detection of DR from retinal fundus photographs, where a total of 20 studies were included (21). However, there has yet to be a systematic review of AI’s application in DN. This research aims to fill that void. This systematic review includes studies that use AI algorithms to address the early detection of DN, the prognosis of DN, DN management, and other related topics.

## 2. Overview of Big Data analytics and AI

Researchers have been craving, since the mid-twentieth century, for a support system that can make clinical decisions while processing growing quantities of clinical data (22). As the medical records are often high dimensional, heterogeneous and sparse they are often not utilized properly in clinical practices (9). Applied biology is advancing to the era of ‘big data’ due to the remarkable advances in biotechnology. The principal reason behind the establishment of big data in medicine is high throughput sequencing due to the advancement of biotechnology (23,24). Myriads of large-scale and real-world data are generated from the likes of hospital records, patients’ medical records, biomedical research etc. However, to utilize the data generated requires proper management and analysis (25). The proper application of big data in healthcare is executed by extracting patterns and trends thus creating models from the data. Consequently, big data analytics will yield more throughput insightful diagnoses, prognosis and most clinically effective treatments in a cheaper method, and it will improve healthcare marginally (26). An abundance of clinical data and information regarding diabetes patients is already available and it is increasing at an exponential rate (27).

Artificial Intelligence (AI) is a branch of computer science that aims to mimic the cognitive function of human beings. There are two main branches of AI: virtual and physical. The physical branch covers the aspects of robotics, which can assist in surgery in clinical practice. On the other hand, the virtual branch of AI includes the likes of ML. This informatics branch of AI is capable of assisting physicians in clinical diagnosis, treatment decisions etc. (28). As a result of the ‘big data’ era’s emergence, AI brings forth permanent change in the healthcare system. In addition, AI applies to both structured and unstructured data. ML and Natural Language Processing (NLP) is applied in structured and unstructured data respectively (29). The raw data available from EHR usually consisted of structured data. Thus, ML algorithms are mostly used as predictive performance models in different studies. ML is classified into three different categories: supervised learning, unsupervised learning and reinforcement learning. In both supervised and unsupervised learning, the system infers a function from labelled and unlabeled data respectively. ML workflow often involves data harmonization, preparation learning, model fitting and evaluation (30). Figure 1 demonstrates the learning process of both supervised and unsupervised learning (31). The theoretical construct of ML models gives them an edge over conventional statistical models. As they are efficient at analyzing large complex datasets, while identifying hidden associations ML is proved to be superior. Thus, over the recent decade, ML has evolved dramatically, and it is commonly used in clinical practices (16).

![Figure 1:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/10/14/2021.10.10.21264813/F1.medium.gif)

[Figure 1:](http://medrxiv.org/content/early/2021/10/14/2021.10.10.21264813/F1)

Figure 1: Supervised and unsupervised machine learning.
**a**, General workflow of supervised machine-learning approaches. First, training and test datasets are collected. Next, part of the training set is used to build the prediction model, and the other part to tune and validate the model (circular arrow). After the machine-learning model is finalized (crossed-out circular arrow), the established model is used to generate predictions on the test dataset and the model’s performance is estimated by comparing the predicted outcomes with the observed outcomes for the test dataset. **b**, Unsupervised machine learning includes clustering, anomaly detection and dimensionality reduction. Clustering algorithms group data points with similar measurements into clusters. Anomaly detection identifies outliers in the dataset. Dimensionality reduction reduces the number of random variables used to describe the data; for example, by representing an image with thousands of parameters as a smaller vector of summary features. The resulting summary vector preserves the important information in the raw data; for example, summary vectors from similar images will bear more resemblance than those obtained from irrelevant images

Note: Descriptive phrase that serves as title and description. Reprinted from “Artificial Intelligence in Healthcare” by Ognjanovic I, Year 2020, Stud Health Technol Inform, *Volume*(issue), doi: 10.3233/SHTI200677. PMID: 32990674 (7).

On the other hand, DL is a subfield of ML. The field of DL has advanced massively in the ability of machines to understand and data including the likes of images, language and speech. In contrast to ML, DL is a form of representation learning composed of numerous layers - gaining the ability to learn highly complex functions (33). DL algorithms such as Convolutional Neural Network (CNN) is designed to process data that exhibits natural spatial invariances. Therefore, DL models have achieved the accuracy of physicians in identifying cardiovascular risk from fundus images (34). Compared to the traditional Artificial Neural Networks (ANN), which are limited to three layers and trained to obtain supervised representation for specific tasks only (35), DL layers of features are not designed by anyone, but they are learned from data using a general-purpose learning procedure. The inputs of deep neural networks are processed in a non-linear manner to activate the nodes in hidden layers to learn the ‘deep structures and representations. Next, the final representation is utilized in a supervised layer to fine-tune the network using backpropagation algorithms toward representations, which are optimized for an end-to-end task (36).

## 2. Methods

A qualitative systematic review of the application of AI for the diagnosis, prognosis and management of DN was conducted. This systematic review was conducted by following the reporting checklist of Preferred Reporting Items for Systematic Reviews and Meta-Analysis (PRISMA 2020) (37). Consequently, comprehensive literature was undertaken with extensive efforts to identify the articles. This review was conducted from 2020 to 2021 (considering all articles published from 1970 - 2021), and the databases searched include PubMed, DBLP Computer Science Bibliography, IEEE Xplore and ACM digital library. The four databases used are interfaces that do not require institutional subscriptions unlike other databases such as Ovid, Scopus and Web of Science. The purpose of using open-access databases is to enhance the reproducibility of the search results (16). The last search performed was on 12th July 2021.

### 2.1 Search Strategies

The steps of searches were performed using related keywords combinations based on mesh terms. For this systematic review, related keywords of diabetes and nephropathy and AI were combined using Boolean operators (AND/OR). The search strategy is given in Table-1.

View this table:
[Table-1:](http://medrxiv.org/content/early/2021/10/14/2021.10.10.21264813/T1)

Table-1: 
The databases and the search strategy employed

At the outset, the duplicate articles from different sources were excluded from the initial collection of the obtained articles. The records were screened a total of three times by the author, to avoid any biases in the systematic review. Additional potentially eligible articles were searched manually by the author by screening the reference list of included articles.

### 2.2 Inclusion and Exclusion Criteria

The most relevant and potential articles from the initial screening were kept for full-text review. To be included in the review, the articles had to fulfil certain criteria. The study must be in English and peer-reviewed. In addition, application of AI algorithms with performance evaluation is necessary. On the other hand, the study must be applicable for Diabetic patients with potential kidney disease or persisting kidney disease. Any study excluding the application of AI in diabetes and kidney disease were not included. If the AI algorithm only applies to DM but is not related to nephropathy is excluded - vice versa is also applicable. In addition, other types of study such as letters to editors, posters and short communications were excluded. As well, the included articles had described thoroughly the algorithms used and the data processing of datasets. Traditional methods used only in DN are excluded. The author, himself, had thoroughly examined the selected articles by screening the article titles and abstracts to identify the eligible articles. If the article is not relevant to the topic at all then it is excluded at the initial phase which is conducted by screening the title and abstract.

### 2.3 Data extraction

From the final collection of the studies, key information was extracted to assess the application of AI algorithms. The data extraction was divided into 10 different categories. The categories and their information are given below:

1.  Study information: the first author’s name, year, country.

2.  Cases: summary of the specific application in different cases i.e., detection of DN, management of DN etc. If applicable the type of diabetes of the cohort was mentioned as in T1D or T2D.

3.  Data sources: The name and description of the dataset’s source is obtained and mentioned. It can be an EHR dataset from a regional hospital to national surveys.

4.  Research type: The type of study conducted, whether it was an observational study or experimental study, is mentioned in this category.

5.  Baseline characteristics: The clinical variables used to form the ML models were mentioned here. For DL algorithms the type of image taken along with basic characteristics is mentioned.

6.  Cohort size/sample size: The population included in the cohort studies is mentioned. On the other hand, the sample size for other studies is mentioned here.

7.  AI algorithm: In this category, all of the ML or DL algorithms used are mentioned.

8.  Development process: The size of the training and testing dataset is mentioned here.

9.  Performance assessment: The evaluation metrics of AI algorithms are included here. Some of the performance assessment techniques include accuracy test, sensitivity, specificity, Cohen’s k, the area under the curve (AUC) etc.

10. Limitation: The limitations of the included studies, mostly mentioned by the authors themselves, are included in this category.

### 2.4. Quality Assessment

Studies were assessed from the following categories: limits in current non-machine learning tools, feature engineer methods employed in the AI algorithm before training the dataset, the hyperparameters used in the study, valid methods to overcome overfitting and the external validation of the data. The quality assessment table of the studies consisted of binary answers of yes or no.

## 3. Results

After obtaining the results from the four databases, the duplicate articles were excluded. Then a manual inspection by the author was conducted to evaluate the remaining articles based on inclusion criteria. From the four databases a total of 694 (IEEE = 91, PubMed = 202, ACM Digital Library = 393, DBLP = 8) studies were obtained, and 11 of them were marked as ineligible by endnote. In addition, a total of 24 duplicate studies were found. The remaining 659 studies were manually screened by the author, and they were evaluated based on the inclusion and exclusion criteria. Consequently, a total of 146 studies were retrieved for eligibility assessment via full-text inspection - all of these studies either included diabetes patients and/or kidney patients and/or AI algorithms. In the end, a total of 23 articles met the inclusion criteria. The references cited in those 23 studies were screened by the author himself. Among the 831 references listed only one of the studies met the complete inclusion criteria. The details of the selected articles are presented in Table-3.

View this table:
[Table-2:](http://medrxiv.org/content/early/2021/10/14/2021.10.10.21264813/T2)

Table-2: 
PRISMA flow diagram for updated systematic reviews.

View this table:
[Table-3:](http://medrxiv.org/content/early/2021/10/14/2021.10.10.21264813/T3)

Table-3: 
The summary of findings in the included articles

The scheme of this study is summarized into four different categories based on the AI framework in use. The categories include 1) predictive AI model for the early detection of DN (40, 44, 45, 46, 48, 51, 54, 55, 59, 61), 2) AI Framework for the diagnosis of DN (42, 50, 57), 3) predictive model for the progression of DKD (39, 41, 49, 58, 62), 4) the management of DN for existing patients OR early management indication to DM patients without DKD with the help of AI (43, 47, 52, 53, 56, 60). The studies included in this systematic review are from various parts of the world. Seven of the included studies are from the USA (29%), six of them are from China (25%). In addition, three studies are from Japan (12.5%) and Korea (12.5%). Although the number of observational studies included was greater, the comprehensive review consisted of both observational study and experimental study.

A total of 8 studies (39, 40, 44, 45, 48, 49, 53, 59) included a cohort population/ sample size of 1000+ patients/samples to develop the AI model. Meanwhile, 7 studies (41, 50, 51, 52, 54, 55, 57) included a cohort size/ sample size over 10,000 patients/sample size. In addition, a large sum of 14 studies (39, 40, 42, 44, 48, 49, 51, 52, 53, 55, 56, 57, 59, 62) used ML to build the predictive model. On the other hand, 5 studies (47, 50, 54, 58, 60) used DL algorithms as the AI method in their study. Finally, 5 other studies used both ML and DL as separate baselines or in an integrated system (41, 43, 45, 46, 61). It is worth noting that a large number of the studies used EHR datasets from a regional hospital/clinic. A total of nine studies (42, 43, 46, 51, 52, 54, 58, 59, 61) used an EHR dataset from a regional hospital.

Different AI algorithms such as RF (Random Forest), LR (Logistic Regression), SVM (Support Vector Machine), feed-forward network, ANN (Artificial Neural Network), KNN (k-nearest neighbours),), MLP (Multilayer Perceptron), RNN (Recurrent Neural Network), Naive Bayes (NB), Adaboost, DT (Decision Tree), Ridge LR, Apriori, DNN (Deep Neural Network), GBM (Gradient Boosting Machine), convolutional autoencoder, One rule, Simple Logistic, Sequential Minimal Optimizer, LeNet, DetectionNet, Lesion aware DL system, PLSR (Partial Least Square Regression), Backpropagation Artificial Neural Network (BP-ANN). Above all, RF is the most used model for predictive models being used 7 times.

As most of the included studies are predictive models, they require a certain criterion to be met. The data should be pre-processed with the necessary imputation, dimensionality reduction, transformation etc. The training model used in the dataset is used to find the patterns from the feature vectors with the output vectors with the help of the AI algorithm(s). A testing dataset is also necessary from the same source as the training that is unseen to the algorithm. Therefore, the accuracy of the AI algorithm can be assessed. However, a lot of the studies did not mention the exact quantity of the development process of the dataset i.e the training and testing sample size (40 45, 46, 49, 50, 53, 54, 56, 58, 61). As a result, if the performance of the AI model is overfitting or not cannot be determined.

In the quality assessment (Table 4) the shortcomings OR strengths of the studies were assessed. To ensure the included study adds something new to clinical practices and fulfils a void that cannot be assessed by conventional statistical methods, the “limits in current non-AI approach” column is present in Table-4. If the AI model even outperformed the conventional statistical method the cell is filled with a ‘yes’. Except for two studies (59, 45), all 22 other included studies either outperformed conventional methods or accomplished the result which is not attainable through non-AI frameworks. This proves that AI algorithms are superior to conventional methods, and they might be suitable in clinical practices to aid physicians. The feature generation of the AI algorithm(s) before undergoing training is also mentioned. Feature engineering is particularly important as they are essential to select and transform raw data from clinical variables to essential features. As well as, handling missing data or imbalanced data is accounted for in the studies. From the included studies, a total of studies 19 conducted feature engineering in the raw data to make the algorithm more feasible. Four studies (39, 49, 58, 60) did not use feature engineering. Also, before the training of AI algorithms, some studies chose to tune hyperparameters to yield better outcomes. Among 24 studies only four (50, 54, 55, 58) studies did not include hyperparameters in the study. Hyperparameter tuning is necessary for ML models as the optimal combination reduces the predefined lost function as low as possible, which leads to a better performance of the model.

View this table:
[Table 4:](http://medrxiv.org/content/early/2021/10/14/2021.10.10.21264813/T4)

Table 4: 
Quality Assessment of the studies:

The model’s higher predictive accuracy makes it more reliable for clinical use. In addition to the need for high accuracy in the model, validation of the model in different settings outside of the research’s framework is critical. The vast majority of models have not been externally validated. External validation is required to determine the model’s reproducibility and generalizability to unique patients. Aside from that, it is critical to ensure that the model is tested on a multiethnic cohort before implementing the AI framework globally. Only seven (40, 43, 47, 52, 53, 57, 58) of the 24 included articles had the predictive AI model externally validated (29 percent).

## 4. Discussions

This is the first systematic review that reviews current applications of AI in DKD. The heterogeneity in the types of AI algorithms used are shown in the results - SVM, DT, RF, ANN, Naive Bayes etc. The myriads of different feature vectors used in the predictive performances were observed. Studies in this review were performed in different parts of the globe. Some of the studies included multiethnic cohorts. In most of these studies, it is observed that there persist limitations in conventional statistical models. However, ML and DL models proved to be far superior in many instances. AI methods applied in different aspects of DN had met the substantial needs, which conventional methods lack, in the proposed clinical applications. The only aspect of shortcoming in AI algorithms is the generalizability, which can be met with the growing number of EHR datasets in clinics.

It should be pointed out that most selected papers are published within the last four years, which indicates the application of AI in DN is an emerging technology. Despite being not so perfect, AI still prevails a large opportunity to improve the current uses. The predictive models can be improved a lot if expert knowledge is integrated with the data-driven approach. The physicians with adequate knowledge in DN can craft safety constraints as a guide during the training process (16). The performance of AI frameworks in clinical practices of DKD can adequately be assessed from prospective studies (63). For future work, prospective studies should be conducted in studies focused on category four. Therefore, the study would be able to both assess the performance of the AI algorithm along the acceptance and satisfaction of patients with automated approaches.

For the generalizability of the study, the best performing AI algorithm in internal validation should be tested on multiple different ethnic cohorts before adopting its use in clinical practices. On the other hand, further AI devices can be developed that can help the patients of DN to maintain the risk of ESRD at a low rate. However, caution must be adopted before adopting AI algorithms. Such as, whether AI-enabled care is safe and effective enough from the results of robust studies. As well as, the machine-generated results need to be confirmed by a physician or not etc (38).

The review has a few limitations. At the outset, the review only searched open-access databases. Thus, ignoring registers and webpages and other databases such as Scopus and Embase etc. However, this study aimed to only include articles published in peer-reviewed journals to avoid any existing biases. On the other hand, to ensure the reproducibility of the review the manuscript only included open-access databases.

In this study, a systematic effort was made by the author to identify every possible peer-reviewed article available and review AI application in DN. DN is one of the most dangerous complications caused by DM. However, with the proper caution and early detection of DN, a lot of the harmful outcomes can be avoided. The advent of computational biology, with the vast amount of EHR datasets generated every day, enables the further in-depth application of AI in the diagnosis and treatment of DN. Despite its recent emergence in healthcare, they have already outperformed conventional statistical methods. It is high time that physicians adopt AI-based algorithms in clinical approaches of DN.

## Data Availability

All data produced in the present work are contained in the manuscript

## Funding

This research received no specific grant from any funding agency in the public, commercial, or not-for-profit sectors

*   Received October 10, 2021.
*   Revision received October 10, 2021.
*   Accepted October 14, 2021.


*   © 2021, Posted by Cold Spring Harbor Laboratory

This pre-print is available under a Creative Commons License (Attribution-NonCommercial-NoDerivs 4.0 International), CC BY-NC-ND 4.0, as described at [http://creativecommons.org/licenses/by-nc-nd/4.0/](http://creativecommons.org/licenses/by-nc-nd/4.0/)

## References

1.  1.Alberti KG, Zimmet PZ. Definition, diagnosis and classification of diabetes mellitus and its complications. Part 1: diagnosis and classification of diabetes mellitus provisional report of a WHO consultation. Diabet Med. 1998;15(7):539–53.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/(SICI)1096-9136(199807)15:7<539::AID-DIA668>3.0.CO;2-S&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=9686693&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F10%2F14%2F2021.10.10.21264813.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000074777700003&link_type=ISI) 

2.  2.Jensen PB, Jensen LJ, Brunak S. Mining electronic health records: towards better research applications and clinical care. Nat Rev Genet. 2012;13(6):395–405.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/nrg3208&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22549152&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F10%2F14%2F2021.10.10.21264813.atom) 

3.  3.Shih KC, Kwong ASK, Wang JHL, Wong JKW, Ko WWK, Lai JSM, et al. Diabetic retinopathy screening during the coronavirus disease 2019 pandemic. EYE. 2020;34(7):1246–7.
    
    
4.  4.Samsu N. Diabetic nephropathy: Challenges in pathogenesis, diagnosis, and treatment. Biomed Res Int. 2021;2021:1497449.
    
    
5.  5.USRDS 2015 Annual Data Report. Chapter 1: Incidence, Prevalence, Patient Characteristics, and Treatment Modalities. [https://www.usrds.org/2015/view](https://www.usrds.org/2015/view).
    
    
6.  6.Narres M, Claessen H, Droste S, Kvitkina T, Koch M, Kuss O, et al. The incidence of end-stage renal disease in the diabetic (compared to the non-diabetic) population: A systematic review. PLoS One. 2016;11(1):e0147329.
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26812415&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F10%2F14%2F2021.10.10.21264813.atom) 

7.  7.Pippias M, Kramer A, Noordzij M, Afentakis N, Alonso de la Torre R, Ambühl PM, et al. The European Renal Association - European Dialysis and transplant association Registry annual report 2014: A summary. Clin Kidney J. 2017;10(2):154–69.
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=28584624&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F10%2F14%2F2021.10.10.21264813.atom) 

8.  8.Orasanu G, Plutzky J. The pathologic continuum of diabetic vascular disease. J Am Coll Cardiol. 2009;53(5 Suppl):S35–42.
    
    [FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6MzoiUERGIjtzOjExOiJqb3VybmFsQ29kZSI7czo0OiJhY2NqIjtzOjU6InJlc2lkIjtzOjE5OiI1My81X1N1cHBsZW1lbnQvUzM1IjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjEvMTAvMTQvMjAyMS4xMC4xMC4yMTI2NDgxMy5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 

9.  9.Gross JL, de Azevedo MJ, Silveiro SP, Canani LH, Caramori ML, Zelmanovitz T. Diabetic nephropathy: diagnosis, prevention, and treatment. Diabetes Care. 2005;28(1):164–76.
    
    [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NzoiZGlhY2FyZSI7czo1OiJyZXNpZCI7czo4OiIyOC8xLzE2NCI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIxLzEwLzE0LzIwMjEuMTAuMTAuMjEyNjQ4MTMuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 

10. 10.Gheith O, Othman N, Nampoory N, Halimb M, Al-Otaibi T. Diabetic kidney disease: difference in the prevalence and risk factors worldwide. J Egypt Soc Nephrol Transplant. 2016;16(3):65.
    
    
11. 11.Krizhevsky A, Sutskever I, Hinton G. E. In Curran Associates, Nevada; 2012.
    
    
12. 12.Kim SS, Kim JH, Kim IJ. Current challenges in diabetic nephropathy: Early diagnosis and ways to improve outcomes. Endocrinol Metab (Seoul). 2016;31(2):245–53.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.3803/EnM.2016.31.2.245&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=27246284&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F10%2F14%2F2021.10.10.21264813.atom) 

13. 13.Verhave JC, Gansevoort RT, Hillege HL, Bakker SJL, De Zeeuw D, de Jong PE, et al. An elevated urinary albumin excretion predicts de novo development of renal function impairment in the general population. Kidney Int Suppl. 2004;66(92):S18–21.
    
    
14. 14.Atkins RC, Briganti EM, Lewis JB, Hunsicker LG, Braden G, Champion de Crespigny PJ, et al. Proteinuria reduction and progression to renal failure in patients with type 2 diabetes mellitus and overt nephropathy. Am J Kidney Dis. 2005;45(2):281–7.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1053/j.ajkd.2004.10.019&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=15685505&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F10%2F14%2F2021.10.10.21264813.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000226913000006&link_type=ISI) 

15. 15. M. C. Pattern Recognition and Machine Learning. 1st ed. New York, NY: Springer; 2006.
    
    
16. 16.Zhu T, Li K, Herrero P, Georgiou P. Deep learning for diabetes: A systematic review. IEEE J Biomed Health Inform. 2021;25(7):2744–57.
    
    
17. 17.Chaki J, Thillai Ganesh S, Cidham SK, Ananda Theertan S. Machine learning and artificial intelligence based Diabetes Mellitus detection and self-management: A systematic review. J King Saud Univ - Comput Inf Sci [Internet]. 2020; Available from: [http://dx.doi.org/10.1016/j.jksuci.2020.06.013](http://dx.doi.org/10.1016/j.jksuci.2020.06.013)
    
    
18. 18.Kavakiotis I, Tsave O, Salifoglou A, Maglaveras N, Vlahavas I, Chouvarda I. Machine learning and data mining methods in diabetes research. Comput Struct Biotechnol J. 2017;15:104–16.
    
    
19. 19.Nielsen KB, Lautrup ML, Andersen JKH, Savarimuthu TR, Grauslund J. Deep learning-based algorithms in screening of diabetic retinopathy: A systematic review of diagnostic performance. Ophthalmol Retina. 2019;3(4):294–304.
    
    
20. 20.Tejedor M, Woldaregay AZ, Godtliebsen F. Reinforcement learning application in diabetes blood glucose control: A systematic review. Artif Intell Med. 2020;104(101836):101836.
    
    
21. 21.Islam M, Poly TN, Yang HC, Atique S,  Li Y-CJ. Deep learning for accurate diagnosis of glaucomatous optic neuropathy using digital fundus image: A meta-analysis. Stud Health Technol Inform. 2020;270:153–7.
    
    
22. 22.Bellemo V, Lim G, Rim TH, Tan GSW, Cheung CY, Sadda S, et al. Artificial intelligence screening for diabetic retinopathy: The real-world emerging application. Curr Diab Rep. 2019;19(9):72.
    
    
23. 23.Marx V. Biology: The big challenges of big data: Biology. Nature. 2013;498(7453):255–60.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/498255a&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23765498&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F10%2F14%2F2021.10.10.21264813.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000320283400054&link_type=ISI) 

24. 24.Mattmann CA. A vision for data science. Nature. 2013;493(7433):473–5.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/493473a&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23344342&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F10%2F14%2F2021.10.10.21264813.atom) 

25. 25.Dash S, Shakyawar SK, Sharma M, Kaushik S. Big data in healthcare: management, analysis and future prospects. J Big Data [Internet]. 2019;6(1). Available from: [http://dx.doi.org/10.1186/s40537-019-0217-0](http://dx.doi.org/10.1186/s40537-019-0217-0)
    
    
26. 26.Raghupathi W, Raghupathi V. Big data analytics in healthcare: promise and potential. Health Inf Sci Syst. 2014;2(1):3.
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F10%2F14%2F2021.10.10.21264813.atom) 

27. 27.Cichosz SL, Johansen MD, Hejlesen O. Toward big data analytics: Review of predictive models in the management of diabetes and its complications: Review of predictive models in management of diabetes and its complications. J Diabetes Sci Technol. 2015;10(1):27–34.
    
    
28. 28.Hamet P, Tremblay J. Artificial intelligence in medicine. Metabolism. 2017;69S:S36–40.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.metabol.2017.01.011&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=28126242&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F10%2F14%2F2021.10.10.21264813.atom) 

29. 29.Jiang F, Jiang Y, Zhi H, Dong Y, Li H, Ma S, et al. Artificial intelligence in healthcare: past, present and future. Stroke Vasc Neurol. 2017;2(4):230–43.
    
    [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6Njoic3ZuYm1qIjtzOjU6InJlc2lkIjtzOjc6IjIvNC8yMzAiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyMS8xMC8xNC8yMDIxLjEwLjEwLjIxMjY0ODEzLmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 

30. 30.Jordan MI, Mitchell TM. Machine learning: Trends, perspectives, and prospects. Science. 2015;349(6245):255–60.
    
    [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6Mzoic2NpIjtzOjU6InJlc2lkIjtzOjEyOiIzNDkvNjI0NS8yNTUiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyMS8xMC8xNC8yMDIxLjEwLjEwLjIxMjY0ODEzLmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 

31. 31.Ognjanovic I. Artificial intelligence in healthcare. Stud Health Technol Inform. 2020;274:189–205.
    
    
32. 32.Senanayake S, White N, Graves N, Healy H, Baboolal K, Kularatna S. Machine learning in predicting graft failure following kidney transplantation: A systematic review of published predictive models. Int J Med Inform. 2019;130(103957):103957.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ijmedinf.2019.103957&link_type=DOI) 

33. 33.Pham Q, Gamble A, Hearn J, Cafazzo JA. The need for ethnoracial equity in Artificial Intelligence for Diabetes Management: Review and recommendations. J Med Internet Res. 2021;23(2):e22320.
    
    
34. 34.Poplin R, Varadarajan AV, Blumer K, Liu Y, McConnell MV, Corrado GS, et al. Prediction of cardiovascular risk factors from retinal fundus photographs via deep learning. Nat Biomed Eng. 2018;2(3):158–64.
    
    
35. 35.Rumelhart DE, Hinton GE, Williams RJ. Learning representations by back-propagating errors. Nature. 1986;323(6088):533–6.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/323533a0&link_type=DOI) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=A1986E327500055&link_type=ISI) 

36. 36.Miotto R, Wang F, Wang S, Jiang X, Dudley JT. Deep learning for healthcare: review, opportunities and challenges. Brief Bioinform. 2018;19(6):1236–46.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/bib/bbx044&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=28481991&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F10%2F14%2F2021.10.10.21264813.atom) 

37. 37.Page MJ, McKenzie JE, Bossuyt PM, Boutron I, Hoffmann TC, Mulrow CD, et al. The PRISMA 2020 statement: an updated guideline for reporting systematic reviews. Syst Rev. 2021;10(1):89.
    
    
38. 38.Fogel AL, Kvedar JC. Artificial intelligence powers digital medicine. NPJ Digit Med. 2018;1(1):5.
    
    
39. 39.Chan L, Nadkarni GN, Fleming F, McCullough JR, Connolly P, Mosoyan G, et al. Derivation and validation of a machine learning risk score using biomarker and electronic patient data to predict progression of diabetic kidney disease. Diabetologia. 2021;64(7):1504–15.
    
    
40. 40.Ahn H-S, Kim JH, Jeong H, Yu J, Yeom J, Song SH, et al. Differential urinary proteome analysis for predicting prognosis in type 2 diabetes patients with and without renal dysfunction. Int J Mol Sci. 2020;21(12):4236.
    
    
41. 41.Belur Nagaraj S, Pena MJ, Ju W, Heerspink HL, The BEAt DKD Consortium. Machine learning–based early prediction of end stage renal disease in patients with diabetic kidney disease using clinical trials data. Diabetes Obes Metab. 2020;22(12):2479–86.
    
    
42. 42.Kitamura S, Takahashi K, Sang Y, Fukushima K, Tsuji K, Wada J. Deep learning could diagnose diabetic nephropathy with renal pathological immunofluorescent images. Diagnostics (Basel). 2020;10(7):466.
    
    
43. 43.Chen J, Tang H, Huang H, Lv L, Wang Y, Liu X, et al. Development and validation of new glomerular filtration rate predicting models for Chinese patients with type 2 diabetes. J Transl Med. 2015;13(1):317.
    
    
44. 44.Bao X, Xie S, Zhang K, Song K, Yang Y. Machine learning based information extraction for diabetic nephropathy in clinical text documents. In: 2019 6th International Conference on Systems and Informatics (ICSAI). IEEE; 2019.
    
    
45. 45.Hayashi Y. Detection of lower albuminuria levels and early development of diabetic kidney disease using an artificial intelligence-based rule extraction approach. Diagnostics (Basel). 2019;9(4):133.
    
    
46. 46.Cho BH, Yu H, Kim K-W, Kim TH, Kim IY, Kim SI. Application of irregular and unbalanced data to predict diabetic nephropathy using visualization and feature selection methods. Artif Intell Med. 2008;42(1):37–53.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.artmed.2007.09.005&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=17997291&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F10%2F14%2F2021.10.10.21264813.atom) 

47. 47.Ginley B, Lutnick B, Jen K-Y, Fogo AB, Jain S, Rosenberg A, et al. Computational segmentation and classification of diabetic glomerulosclerosis. J Am Soc Nephrol. 2019;30(10):1953–67.
    
    [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6ODoiam5lcGhyb2wiO3M6NToicmVzaWQiO3M6MTA6IjMwLzEwLzE5NTMiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyMS8xMC8xNC8yMDIxLjEwLjEwLjIxMjY0ODEzLmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 

48. 48.Huang J, Huth C, Covic M, Troll M, Adam J, Zukunft S, et al. Machine learning approaches reveal metabolic signatures of incident chronic kidney disease in individuals with prediabetes and type 2 diabetes. Diabetes. 2020;69(12):2756–65.
    
    [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6ODoiZGlhYmV0ZXMiO3M6NToicmVzaWQiO3M6MTA6IjY5LzEyLzI3NTYiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyMS8xMC8xNC8yMDIxLjEwLjEwLjIxMjY0ODEzLmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 

49. 49.Kim Y-M, Kathuria P, Delen D. Machine learning to compare frequent medical problems of African American and Caucasian diabetic kidney patients. Healthc Inform Res. 2017;23(4):241.
    
    
50. 50.Kim JE, Park SJ, Kim YC, Min S-I, Ha J, Kim YS, et al. Deep learning-based quantification of visceral fat volumes predicts post-transplant diabetes mellitus in kidney transplant recipients. Front Med (Lausanne). 2021;8:632097.
    
    
51. 51.Song X, Waitman LR, Yu AS, Robbins DC, Hu Y, Liu M. Longitudinal risk prediction of chronic kidney disease in diabetic patients using a temporal-enhanced gradient boosting machine: Retrospective cohort study. JMIR Med Inform. 2020;8(1):e15510.
    
    
52. 52.Song X, Waitman LR, Hu Y, Yu ASL, Robbins DC, Liu M. Robust clinical marker identification for diabetic kidney disease with ensemble feature selection. J Am Med Inform Assoc. 2019;26(3):242–53.
    
    
53. 53.Heinzel A, Kammer M, Mayer G, Reindl-Schwaighofer R, Hu K, Perco P, et al. Validation of plasma biomarker candidates for the prediction of eGFR decline in patients with type 2 diabetes. Diabetes Care. 2018;dc180532.
    
    
54. 54.Makino M, Yoshimoto R, Ono M, Itoko T, Katsuki T, Koseki A, et al. Artificial intelligence predicts the progression of diabetic kidney disease using big data machine learning. Sci Rep. 2019;9(1):11862.
    
    
55. 55.Rodriguez-Romero V, Bergstrom RF, Decker BS, Lahu G, Vakilynejad M, Bies RR. Prediction of nephropathy in type 2 diabetes: An analysis of the ACCORD trial applying machine learning techniques. Clin Transl Sci. 2019;12(5):519–28.
    
    
56. 56.Lin S, Guo Y. Value analysis of using urinary microalbumin in artificial intelligence medical institutions to detect early renal damage in diabetes. J Healthc Eng. 2021;2021:6678454.
    
    
57. 57.Zhang K, Liu X, Xu J, Yuan J, Cai W, Chen T, et al. Deep-learning models for the detection and incidence prediction of chronic kidney disease and type 2 diabetes from retinal fundus images. Nat Biomed Eng. 2021;5(6):533–45.
    
    
58. 58.Zhao L, Ren H, Zhang J, Cao Y, Wang Y, Meng D, et al. Diabetic retinopathy, classified using the Lesion-aware Deep Learning System, predicts diabetic end-stage renal disease in Chinese patients. Endocr Pract. 2020;26(4):429–43.
    
    
59. 59.Xu H, Kong Y, Tan S. Predictive modeling of diabetic kidney disease using random forest algorithm along with features selection. In: Proceedings of the 2020 International Symposium on Artificial Intelligence in Medical Sciences. New York, NY, USA: ACM; 2020.
    
    
60. 60.Parab J, Sequeira M, Lanjewar M, Pinto C, Naik G. Backpropagation neural network-based machine learning model for prediction of blood urea and glucose in CKD patients. IEEE J Transl Eng Health Med. 2021;9:4900608.
    
    
61. 61.Zhang J, Kowsari K, Boukhechba M, Harrison J, Lobo J, Barnes L. Sparse longitudinal representations of electronic health record data for the early detection of chronic kidney disease in diabetic patients. In: 2020 IEEE International Conference on Bioinformatics and Biomedicine (BIBM). IEEE; 2020.
    
    
62. 62.Chauhan K, Nadkarni GN, Fleming F, McCullough J, He CJ, Quackenbush J, et al. Initial validation of a machine learning-derived prognostic test (KidneyIntelX) integrating biomarkers and electronic health record data to predict longitudinal kidney outcomes. Kidney360. 2020;1(8):731–9.
    
    [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6OToia2lkbmV5MzYwIjtzOjU6InJlc2lkIjtzOjc6IjEvOC83MzEiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyMS8xMC8xNC8yMDIxLjEwLjEwLjIxMjY0ODEzLmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 

63. 63.Abràmoff MD, Lou Y, Erginay A, Clarida W, Amelon R, Folk JC, et al. Improved automated detection of Diabetic Retinopathy on a publicly available dataset through integration of deep learning. Invest Ophthalmol Vis Sci. 2016;57(13):5200–6.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1167/iovs.16-19964&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=27701631&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F10%2F14%2F2021.10.10.21264813.atom)