Abstract
Type 2 Diabetes Mellitus (T2DM) is a debilitating condition with a number of complications, including those of the oral cavity, which can further deteriorate a patient’s general health and oral health related quality of life (OHRQoL). Machine Learning (ML) can help estimate an individual’s propensity to develop poor OHRQoL, given a set of variables, and at the same time identify the features that contribute most to this outcome. Previous attempts to explain this outcome with inferential statistical methods have had limited success. The aim of this cross-sectional study is to determine the impact of T2DM on OHRQoL, to identify the features most likely to be associated with this outcome, and to compare ML and deep learning (DL) analytical methods with inferential statistics. Twelve hundred T2DM patients completed OHRQoL and demographic questionnaires and were assessed using the WHO Oral Health Assessment Form. K-means clustering was performed to label individuals as having or not having an impact on OHRQoL. Class imbalance was addressed by undersampling the majority class using informed subset selection. Using the collected data as input features, we then developed ML algorithms (Naive Bayes (NB), Random Forest (RF), Logistic Regression (LR), Kernel Support Vector Machine (SVM) and Artificial Neural Network (ANN)) to classify individuals as having or not having poor OHRQoL, and used SHapley Additive exPlanations (SHAP) analysis to assess feature importance. The best-performing model for classifying patients as having poor OHRQoL was SVM (AUC = 0.983; sensitivity = 1). SHAP values were highest for age, prosthetic need, tobacco use and years since onset of diabetes. Features closely related to diabetes, namely periodontal pockets and loss of attachment, were not identified as relevant by inferential statistics, but were deemed important features associated with poor OHRQoL by SHAP analysis.
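For readers unfamiliar with the two metrics reported above, the following is a minimal sketch of how sensitivity and AUC are commonly defined for a binary classifier. It is not the authors' code: the function names, the toy labels and scores, and the 0.3 decision threshold are all hypothetical and chosen only for illustration.

```python
from typing import Sequence


def sensitivity(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Fraction of actual positives (label 1) that were predicted as 1."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    if tp + fn == 0:
        raise ValueError("y_true contains no positive cases")
    return tp / (tp + fn)


def roc_auc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the probability that a randomly chosen positive case
    receives a higher score than a randomly chosen negative case
    (ties count as one half)."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("both classes must be present in y_true")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical toy data: 1 = poor OHRQoL, 0 = no impact on OHRQoL.
y_true = [1, 1, 1, 0, 0, 0]
scores = [0.9, 0.8, 0.4, 0.5, 0.2, 0.1]  # illustrative classifier scores
y_pred = [1 if s >= 0.3 else 0 for s in scores]  # assumed threshold of 0.3

toy_sensitivity = sensitivity(y_true, y_pred)
toy_auc = roc_auc(y_true, scores)
```

In this toy example every positive case is detected, so sensitivity is 1, while one negative case outscores one positive case, so the AUC falls below 1. A sensitivity of 1 therefore means no patient with poor OHRQoL was missed at the chosen threshold; it says nothing on its own about false positives.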
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
This study did not receive any funding.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
The IRB of Krishnadevaraya College of Dental Sciences gave ethical approval for this work.
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
Footnotes
22D1628{at}iitb.ac.in, kshitij.jadhav{at}iitb.ac.in, https://www.iitb.ac.in/
iyemurali{at}gmail.com, https://www.kcdsh.org/
profmeenajain{at}gmail.com, https://manavrachna.edu.in/mrdc
Based on certain reviewer comments, justifications and certain limitations have been mentioned in the discussion.
Data Availability
All data produced in the present study are available upon reasonable request to the authors, subject to the relevant permissions from the parent institution where the study was conducted.