Abstract
Importance Prognostic predictions of prelabor rupture of membranes lack proper sample sizes and external validation.
Objective To develop, validate, and deploy statistical and/or machine learning prediction models using medical histories for prelabor rupture of membranes and the time of delivery.
Design A retrospective cohort design within a 2-year period (2015 to 2016) of a single-payer, government-owned health insurance database covering 75.8% of individuals in a country
Setting Nationwide healthcare providers (n=22,024) at primary, secondary, and tertiary levels
Participants Women aged 12 to 55 years who visited healthcare providers using the insurance, drawn from a ∼1% random sample of insurance holders stratified by healthcare provider and family category: (1) never visited; (2) visited only primary care; and (3) visited all levels of care
Predictors Medical histories of diagnoses and procedures (International Classification of Diseases version 10) recorded before the latest outcome visit within the database period
Main Outcomes and Measures Prelabor rupture of membranes prognostication (area under curve, with sensitivity, specificity, and likelihood ratio), the time of delivery estimation (root mean square error), and inference time (minutes), with 95% confidence interval
Results We selected 219,272 women aged 33 ± 12 years. The best prognostication achieved area under curve 0.73 (0.72 to 0.75), sensitivity 0.494 (0.489 to 0.500), specificity 0.816 (0.814 to 0.818), positive likelihood ratio 2.68 (2.63 to 2.75), and negative likelihood ratio 0.62 (0.61 to 0.63). This outperformed models from previous studies according to area under curve of an external validation set, including one using a biomarker (area under curve 0.641; sensitivity 0.419; specificity 0.863; positive likelihood ratio 3.06; negative likelihood ratio 0.67; n=1177). Meanwhile, the best estimation achieved errors of ± 2.2 and ± 2.6 weeks for predicted events and non-events, respectively. Our web application took only 5.14 minutes (5.11 to 5.18) per prediction.
Conclusions and Relevance Prelabor rupture of membranes and the time of delivery were predicted by medical histories; however, an impact study is required before clinical application.
Question Can medical histories of diagnoses and procedures in electronic health records be used to predict prelabor rupture of membranes and the time of delivery in advance among insured women nationwide?
Findings In this prognostic study applying a retrospective cohort paradigm, predictive performance was achieved and externally validated. The area under the receiver operating characteristic curve was 0.73, with estimation errors of ± 2.2 and ± 2.6 weeks for the time of delivery.
Meaning Medical histories can support preliminary prediction of prelabor rupture of membranes and estimation of the time of delivery in a wide population of insured women.
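The likelihood ratios reported in Results follow from sensitivity and specificity through the standard definitions: the positive likelihood ratio is sensitivity / (1 − specificity) and the negative likelihood ratio is (1 − sensitivity) / specificity. The sketch below is only an illustrative check of that arithmetic on the rounded point estimates in the abstract, not the authors' analysis code; the function name is hypothetical, and the previous study's value of 0.863 is read as its specificity (an assumption that the reported ratios support). Ratios computed from rounded inputs may differ slightly from values derived from unrounded data.

```python
def likelihood_ratios(sensitivity: float, specificity: float) -> tuple[float, float]:
    """Return (positive LR, negative LR) from sensitivity and specificity.

    Hypothetical helper for illustration; both inputs must lie strictly
    between 0 and 1 so that neither denominator is zero.
    """
    if not (0.0 < sensitivity < 1.0 and 0.0 < specificity < 1.0):
        raise ValueError("sensitivity and specificity must be in (0, 1)")
    lr_positive = sensitivity / (1.0 - specificity)
    lr_negative = (1.0 - sensitivity) / specificity
    return lr_positive, lr_negative


# Rounded point estimates taken from the abstract.
best_model = likelihood_ratios(sensitivity=0.494, specificity=0.816)
# Assumption: 0.863 is the previous (biomarker) model's specificity.
previous_model = likelihood_ratios(sensitivity=0.419, specificity=0.863)

print("best model:     LR+ = %.2f, LR- = %.2f" % best_model)
print("previous model: LR+ = %.2f, LR- = %.2f" % previous_model)
```

With these inputs the check gives 2.68 and 0.62 for the best model and 3.06 and 0.67 for the previous model, matching the reported ratios to two decimals.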
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
This study was funded by the Ministry of Science and Technology (MOST) in Taiwan (grant number MOST109-2221-E-038-018 and MOST110-2628-E-038-001) and the Higher Education Sprout Project from the Ministry of Education (MOE) in Taiwan (grant number DP2-110-21121-01-A-13) to Emily Chia-Yu Su. These funding bodies had no role in the design of the study and collection, analysis, and interpretation of data and in writing the manuscript.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
The dataset was made publicly available upon request. The BPJS Kesehatan approved our request (dataset request approval no.: 5064/I.2/0421). The dataset had already been deidentified before being made public; thus, ethical clearance from the Institutional Review Board of Taipei Medical University was waived.
All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Footnotes
Email addresses: herdiantrisufriyana{at}unusa.ac.id (HS); yuwei.wu{at}tmu.edu.tw (YWW); and emilysu{at}tmu.edu.tw (ECYS)
This version focuses on a clinical audience for applying the prediction model. All computational and methodological aspects are out of the scope of this paper and are described elsewhere as protocol papers. These are intended for more general implementation of the proposed protocols, fully or partially implemented in our projects and collaborations. Nevertheless, the methods in this paper are sufficiently detailed for replicating this study, with reference to the protocol papers.
Data Availability
The data that support the findings of this study are available from the social security administrator for health, Badan Penyelenggara Jaminan Sosial (BPJS) Kesehatan, in Indonesia, but restrictions apply to the availability of these data, which were used under license for the current study (dataset request approval number: 5064/I.2/0421), and so are not publicly available. Data are, however, available from the authors upon reasonable request and with permission of the BPJS Kesehatan. To get this permission, one needs to request access from the BPJS Kesehatan to their sample dataset published in August 2019. To date, the BPJS Kesehatan has published three sample datasets, in February 2019, August 2019, and December 2020. For the first and second versions, a request is submitted via https://e-ppid.bpjs-kesehatan.go.id/, while for the third, a request is submitted via https://data.bpjs-kesehatan.go.id. The R Markdown, R Script, and other files are available at https://github.com/herdiantrisufriyana/prom. To pre-process the raw data into the input dataset of this study, follow the code of the R Markdown at https://github.com/herdiantrisufriyana/medhist/tree/main/preprocessing.