Abstract
Efforts to exploit the full potential of Bayesian Networks (BNs) have so far lacked a practical approach for developing domain-specific models from large-scale public statistics, which could reduce the time required to develop probability tables and train the model. As a result, projects seeking to develop health BNs tend to run for years because they rely on obtaining ethics approval and on collecting, normalising, and discretising collections of patient electronic health records (EHRs). This work addresses this challenge by investigating a new approach to developing health BNs that combines expert elicitation with knowledge from the literature and national health statistics. The approach is evaluated through the development of a BN for pregnancy complications and outcomes using national health statistics for all births in England and Wales during 2021. When validated using vignettes against other common types of predictive models, including multivariable logistic regression and nomograms, the resulting BN produces comparable predictions. The BN was also developed, using our approach and large-scale public statistics, in a project whose duration was measured in months rather than years. The unique contributions of this paper are a new, efficient approach to BN development and a working BN capable of reasoning over a broad range of pregnancy-related conditions and outcomes.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
No funding was received in relation to this work.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
Footnotes
bridget.daley{at}lwh.nhs.uk, sam.saidi{at}sydney.edu.au, e.kyrimi{at}qmul.ac.uk, kuda.dube{at}kcl.ac.uk, crina.grosan{at}kcl.ac.uk, m.neil{at}qmul.ac.uk, louise.rose{at}kcl.ac.uk, n.fenton{at}qmul.ac.uk
ACM Reference Format
Scott McLachlan, Bridget J. Daley, Sam Saidi, Evangelia Kyrimi, Kudakwashe Dube, Crina Grosan, Martin Neil, Louise Rose, Norman E. Fenton. 2018. Approach and Method for Bayesian Network Modelling: A Case Study in Pregnancy Outcomes for England and Wales, 10 pages.
Data Availability
All data used are publicly available from national health department sources. Links to all of these sources are provided in the manuscript.