Abstract
Background Chest pain is among the most common presenting complaints in the emergency department (ED). Swift and accurate risk stratification of chest pain patients in the ED may improve patient outcomes and reduce unnecessary costs. Traditional logistic regression with stepwise variable selection has been used to build risk prediction models for ED chest pain patients. In this study, we aimed to investigate whether machine learning dimensionality reduction methods can outperform the stepwise approach in deriving risk stratification models.
Methods A retrospective analysis was conducted on the data of patients >20 years old who presented to the ED of Singapore General Hospital with chest pain between September 2010 and July 2015. Variables used included demographics, medical history, laboratory findings, heart rate variability (HRV), and heart rate n-variability (HRnV) parameters calculated from five- to six-minute electrocardiograms (ECGs). The primary outcome was 30-day major adverse cardiac events (MACE), which included death, acute myocardial infarction, and revascularization. Candidate variables identified using univariable analysis were then used to generate the stepwise logistic regression model and eight machine learning dimensionality reduction prediction models. A separate set of models was derived by excluding troponin. Receiver operating characteristic (ROC) and calibration analyses were used to compare model performance.
Results A total of 795 patients were included in the analysis, of whom 247 (31%) met the primary outcome of 30-day MACE. Patients with MACE were older and more likely to be male. All eight dimensionality reduction methods marginally but non-significantly outperformed stepwise variable selection; the multidimensional scaling algorithm performed best, with an area under the curve (AUC) of 0.901. All HRnV-based models generated in this study outperformed several existing clinical scores in ROC analysis.
Conclusions HRnV-based models using stepwise logistic regression performed better than existing chest pain scores for predicting MACE, with only marginal improvements using machine learning dimensionality reduction. Moreover, the traditional stepwise approach benefits from model transparency and interpretability; in comparison, machine learning dimensionality reduction models are black boxes, making them difficult to explain in clinical practice.
Competing Interest Statement
NL and MEHO hold patents related to using heart rate variability and artificial intelligence for medical monitoring. NL, ZXK, DG, and MEHO are currently advisers to TIIM SG. The other authors report no conflicts.
Funding Statement
This work was supported by the Duke-NUS Signature Research Programme funded by the Ministry of Health, Singapore. The funder of the study had no role in study design, data collection, data analysis, data interpretation, or writing of the report.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
Ethical approval was obtained from the Centralized Institutional Review Board (CIRB, Ref: 2014/584/C) of SingHealth, which waived patient consent.
All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Data Availability
The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.
List of abbreviations
- ACS
- acute coronary syndrome
- AMI
- acute myocardial infarction
- AUC
- area under the curve
- ApEn
- approximate entropy
- CI
- confidence intervals
- CABG
- coronary artery bypass graft
- COVID-19
- coronavirus disease 2019
- DFA
- detrended fluctuation analysis
- ECG
- electrocardiogram
- EHR
- electronic health records
- ED
- emergency department
- GRP
- Gaussian random projection
- GRACE
- global registry of acute coronary events
- HRnV
- heart rate n-variability
- HRV
- heart rate variability
- HF
- high frequency
- HEART
- History, ECG, Age, Risk factors, and initial Troponin
- IQR
- interquartile range
- KPCA
- kernel principal component analysis
- LSA
- latent semantic analysis
- LLE
- locally linear embedding
- LF
- low frequency
- MACE
- major adverse cardiac events
- Mean NN
- average of R-R intervals
- MDS
- multidimensional scaling
- NPV
- negative predictive value
- NN50
- the number of times that the absolute difference between 2 successive R-R intervals exceeds 50 ms
- NN50n
- the number of times that the absolute difference between 2 successive RRnI/RRnIm sequences exceeds 50×n ms
- PACS
- patient acuity category scale
- PCI
- percutaneous coronary intervention
- pNN50
- NN50 divided by the total number of R-R intervals
- pNN50n
- NN50n divided by the total number of RRnI/RRnIm sequences
- PPV
- positive predictive value
- PCA
- principal component analysis
- ROC
- receiver operating characteristic
- RMSSD
- square root of the mean squared differences between R-R intervals
- RRI
- R-R interval
- SampEn
- sample entropy
- SD
- standard deviation
- SDNN
- standard deviation of R-R intervals
- SRP
- sparse random projection
- STEMI
- ST-elevation myocardial infarction
- TIMI
- thrombolysis in myocardial infarction
- VLF
- very low frequency
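
The time-domain HRV definitions in the list above (Mean NN, SDNN, RMSSD, NN50, pNN50) can be made concrete with a minimal Python sketch. This is an illustration only, not the study's processing pipeline: the function name `time_domain_hrv` and the sample intervals are hypothetical, SDNN is assumed here to be the sample standard deviation (n - 1 denominator), and pNN50 follows the list's wording (NN50 divided by the total number of R-R intervals, returned as a fraction rather than a percentage).

```python
import math
from statistics import mean, stdev


def time_domain_hrv(rr_ms):
    """Time-domain HRV metrics from a sequence of R-R intervals (ms).

    Hypothetical helper for illustration; definitions follow the
    abbreviation list, with SDNN assumed to be the sample SD.
    """
    if len(rr_ms) < 2:
        raise ValueError("need at least two R-R intervals")

    # Differences between successive R-R intervals.
    diffs = [b - a for a, b in zip(rr_ms, rr_ms[1:])]

    # NN50: count of successive differences whose absolute value exceeds 50 ms.
    nn50 = sum(1 for d in diffs if abs(d) > 50)

    return {
        "mean_nn": mean(rr_ms),                              # average R-R interval
        "sdnn": stdev(rr_ms),                                # SD of R-R intervals
        "rmssd": math.sqrt(mean([d * d for d in diffs])),    # root mean squared difference
        "nn50": nn50,
        "pnn50": nn50 / len(rr_ms),                          # NN50 / number of R-R intervals
    }


# Made-up intervals, purely to show the call.
metrics = time_domain_hrv([800, 860, 790, 810, 900])
print(metrics)
```

The HRnV counterparts (for example NN50n and pNN50n) apply the same calculations to the RRnI/RRnIm sequences with the threshold scaled to 50×n ms; constructing those sequences is not described in this section, so it is not sketched here.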