Abstract
Background Early onset of type 2 diabetes and cardiovascular disease are common complications for women diagnosed with gestational diabetes. About half of the women with gestational diabetes develop postpartum prediabetes within 10 years of the index pregnancy. These women also have twice the risk of developing cardiovascular disease compared with women without a history of gestational diabetes. Currently, there is no accurate way of knowing which women with gestational diabetes are likely to develop postpartum prediabetes. This study aims to predict the risk of postpartum prediabetes in women diagnosed with gestational diabetes.
Methods We build a sparse logistic regression-based machine learning model to identify the key variables for predicting postpartum prediabetes from antenatal data on 607 UK women diagnosed with gestational diabetes, comprising maternal anthropometric and biochemical variables as well as neonatal characteristics. We evaluate the performance of the proposed model alongside other, more advanced machine learning methods using established metrics such as the area under the receiver operating characteristic curve and specificity at pre-determined values of sensitivity. We use Kullback-Leibler (K-L) divergence and information graphs to evaluate and compare different classification thresholds for targeted screening options in resource-constrained settings. We also perform a decision curve analysis to study the net standardized benefit of our model compared to the universal screening approach.
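As an illustration of the decision curve analysis mentioned above, the sketch below computes the standard net benefit, TP/n - (FP/n) * pt/(1 - pt), for a model and for a screen-everyone strategy, and standardizes it by dividing by prevalence. The function names and the toy probabilities and labels are hypothetical and are not taken from the study data; this is a minimal sketch, not the analysis pipeline used in the paper.

```python
# Minimal decision-curve sketch. All data below are made-up toy values.

def net_benefit(probs, labels, pt):
    """Net benefit of calling 'positive' whenever prob >= pt."""
    n = len(labels)
    tp = sum(1 for p, y in zip(probs, labels) if p >= pt and y == 1)
    fp = sum(1 for p, y in zip(probs, labels) if p >= pt and y == 0)
    return tp / n - (fp / n) * pt / (1 - pt)

def net_benefit_screen_all(labels, pt):
    """Net benefit of the universal strategy (everyone treated as positive)."""
    prevalence = sum(labels) / len(labels)
    return prevalence - (1 - prevalence) * pt / (1 - pt)

def standardized(nb, labels):
    """Standardized net benefit: net benefit divided by prevalence."""
    return nb / (sum(labels) / len(labels))

# Hypothetical predicted probabilities and outcomes (1 = prediabetes).
toy_probs = [0.05, 0.10, 0.20, 0.30, 0.45, 0.60]
toy_labels = [0, 0, 0, 1, 1, 1]

for pt in (0.15, 0.25, 0.35):
    model_nb = standardized(net_benefit(toy_probs, toy_labels, pt), toy_labels)
    all_nb = standardized(net_benefit_screen_all(toy_labels, pt), toy_labels)
    print(f"pt={pt:.2f}  model={model_nb:.3f}  screen-all={all_nb:.3f}")
```

A model is useful at a given threshold probability pt when its curve lies above both the screen-all curve and zero (the screen-none strategy).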
Results Strikingly, our sparse logistic regression approach selects only two variables as relevant but gives an area under the receiver operating characteristic curve of 0.72, outperforming all other methods. It can identify postpartum prediabetes in women with gestational diabetes using the Rule-in test with 92% specificity at an optimal probability threshold of 0.381 and using the Rule-out test with 92% sensitivity at an optimal probability threshold of 0.140.
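To make the two reported thresholds concrete, the sketch below shows one way a predicted probability could be triaged with the Rule-in (0.381) and Rule-out (0.140) cut-offs, together with helper code for sensitivity and specificity at a threshold. Combining the two tests into a three-way triage, the choice of `>=` at the boundary, the function names, and the toy data are all assumptions made for illustration; only the two threshold values come from the results above.

```python
# Thresholds reported in the Results; everything else is illustrative.
RULE_IN_THRESHOLD = 0.381   # Rule-in test (reported 92% specificity)
RULE_OUT_THRESHOLD = 0.140  # Rule-out test (reported 92% sensitivity)

def triage(prob):
    """Assumed three-way triage of a predicted probability."""
    if prob >= RULE_IN_THRESHOLD:
        return "rule-in"
    if prob < RULE_OUT_THRESHOLD:
        return "rule-out"
    return "indeterminate"

def sensitivity_specificity(probs, labels, threshold):
    """Return (sensitivity, specificity) when prob >= threshold is positive."""
    tp = sum(1 for p, y in zip(probs, labels) if p >= threshold and y == 1)
    fn = sum(1 for p, y in zip(probs, labels) if p < threshold and y == 1)
    tn = sum(1 for p, y in zip(probs, labels) if p < threshold and y == 0)
    fp = sum(1 for p, y in zip(probs, labels) if p >= threshold and y == 0)
    return tp / (tp + fn), tn / (tn + fp)

# Hypothetical predicted probabilities and outcomes (1 = prediabetes).
toy_probs = [0.05, 0.10, 0.20, 0.30, 0.45, 0.60]
toy_labels = [0, 0, 0, 1, 1, 1]

for p in toy_probs:
    print(p, triage(p))
print(sensitivity_specificity(toy_probs, toy_labels, RULE_IN_THRESHOLD))
print(sensitivity_specificity(toy_probs, toy_labels, RULE_OUT_THRESHOLD))
```

On real data, a high rule-in threshold trades sensitivity for specificity, and a low rule-out threshold does the reverse, which is the pattern the two reported operating points reflect.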
Conclusion We propose a simple logistic regression model that needs only the antenatal fasting glucose at OGTT and the HbA1c measured soon after the diagnosis of GDM to predict, with remarkable accuracy, the probability of postpartum prediabetes in women with gestational diabetes. We envision this as a practical solution which, coupled with targeted follow-up of high-risk women, could yield better cardiometabolic outcomes in women with a history of GDM.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
PS and YW are partly funded by the Medical Research Council, UK (MR/R020981/1). DP is funded by the Warwick-Novo Nordisk International Doctoral Training Program, and the Ph.D. of NP is funded by the Chancellors International Scholarship, University of Warwick. RS is supported by his institution, funded by the Department of Atomic Energy, Government of India.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
This study was carried out as an audit and quality improvement project using an anonymous dataset. The local clinical governance team approved the audit and formal ethical approval was not required.
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Abbreviations
- GDM: Gestational Diabetes Mellitus
- T2DM: Type 2 Diabetes Mellitus
- CVD: Cardiovascular Disease
- OGTT: Oral Glucose Tolerance Test
- LR: Logistic Regression
- ROC: Receiver Operating Characteristic
- DCA: Decision Curve Analysis