High Throughput Deep Learning Detection of Mitral Regurgitation
===============================================================

* Amey Vrudhula
* Grant Duffy
* Milos Vukadinovic
* David Liang
* Susan Cheng
* David Ouyang

## Abstract

**Background** Diagnosis of mitral regurgitation (MR) requires careful evaluation of echocardiography with Doppler imaging. This study presents the development and validation of a fully automated deep learning pipeline for identifying apical-4-chamber view videos with color Doppler and detection of clinically significant (moderate or severe) mitral regurgitation from transthoracic echocardiography studies.

**Methods** A total of 58,614 studies (2,587,538 videos) from Cedars-Sinai Medical Center (CSMC) were used to develop and test an automated pipeline to identify apical-4-chamber view videos with color Doppler across the mitral valve and then assess mitral valve regurgitation severity. The model was tested on an internal test set of 1,800 studies (80,833 videos) from CSMC and externally evaluated in a geographically distinct cohort of 915 studies (46,890 videos) from Stanford Healthcare (SHC).

**Results** In the held-out CSMC test set, the view classifier demonstrated an AUC of 0.998 (0.998 - 0.999) and correctly identified 3,452 of 3,539 MR color Doppler videos (sensitivity of 0.975 (0.968-0.982) and specificity of 0.999 (0.999-0.999) compared with manually curated videos). In the external test cohort from SHC, the view classifier correctly identified 1,051 of 1,055 MR color Doppler videos (sensitivity of 0.996 (0.990 – 1.000) and specificity of 0.999 (0.999 – 0.999) compared with manually curated videos). For evaluating clinically significant MR, in the CSMC test cohort, moderate-or-severe MR was detected with AUC of 0.916 (0.899 - 0.932) and severe MR was detected with an AUC of 0.934 (0.913 - 0.953). In the SHC test cohort, the model detected moderate-or-severe MR with an AUC of 0.951 (0.924 - 0.973) and severe MR with an AUC of 0.969 (0.946 - 0.987).

**Conclusions** In this study, we developed and validated an automated pipeline for identifying clinically significant MR from transthoracic echocardiography studies. Such an approach has potential for automated screening of MR and precision evaluation for surveillance.

## Introduction

Mitral regurgitation (MR) is one of the most common forms of valve disease, affecting more than 4 million Americans.1,2,3 Often progressing insidiously and frequently underrecognized3, both primary MR as well as secondary MR can be initially asymptomatic but lead to worsening heart failure and mortality.1,4–6,7,8 There has been an increased focus on early MR diagnosis given advances in surgical and transcatheter treatment options1,9,10. Echocardiography with color Doppler is the most common method of initial evaluation of MR, with a holistic assessment combining left atrial size, effective regurgitant orifice area, regurgitant fraction, regurgitant volume, as well as other key clinical factors to accurately assess disease severity.11,12 Despite ultrasound technology becoming more widely available, accurate assessment of MR still requires experienced expert image acquisition and evaluation.

Recent advances in machine learning offer opportunities to automate time-consuming steps in the interpretation of medical imaging. Artificial intelligence (AI) has the ability to precisely phenotype subtle cardiac physiology as well as identify imaging features of disease severity not recognized by clinicians.13–16 Deep learning has been applied to echocardiography to improve the precision of common measurements, such as left ventricular ejection fraction13 and wall thickness15,17, as well as streamlining assessment of aortic stenosis18, hypertrophic cardiomyopathy (HCM)15, and cardiac amyloidosis (CA).15,19 With increased ultrasound availability, AI guidance has been developed for both image acquisition and interpretation13,20. With the increasing prevalence of MR in an aging population with co-morbid heart failure, AI could aid in MR screening and surveillance.21–25

In this study, we developed and evaluated a deep learning pipeline’s performance in automating identification of MR from standard transthoracic echocardiogram studies. We hypothesized that a deep learning approach can identify color Doppler apical-4-chamber videos and assess MR severity with high-throughput automation, and this automated pipeline was evaluated in two geographically distinct cohorts (**Figure 1**). Combined with other echocardiography AI algorithms, such an approach can be used for serial surveillance and screening of mitral regurgitation.

![Figure 1:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/02/12/2024.02.08.24302547/F1.medium.gif)

[Figure 1:](http://medrxiv.org/content/early/2024/02/12/2024.02.08.24302547/F1)

Figure 1: Computer Vision Based Mitral Regurgitation (MR) Detection:
An automated deep learning pipeline was trained to detect and stratify mitral regurgitation severity using large-scale data consisting of apical-4-Chamber (A4C) echocardiogram videos with Color Doppler across the mitral valve (CSMC). The automated pipeline showed strong and consistent performance in test sets at CSMC and SHC. These results show that deep learning can accurately detect clinically significant MR using single-view TTE videos with Doppler information. Deep learning-based MR detection tools could serve as a part of point-of-care ultrasound screening as part of clinic visits or in resource limited settings where imaging may be obtained by individuals with minimal training.

## Methods

### Study Population and Data Source

#### Cedars-Sinai Medical Center (CSMC) Cohort

A total of 58,614 transthoracic echocardiogram studies from 38,461 patients receiving care at Cedars-Sinai Medical Center (CSMC) between October 11, 2011 and June 04, 2022 were used to train and evaluate the deep learning models. A total of 2,587,538 videos (an average of 44 videos per study after excluding still images) were initially sourced from Digital Imaging and Communications in Medicine (DICOM) files and underwent de-identification, view classification, and pre-processing into AVI videos. 354,117 videos were classified as apical-4-chamber videos using an automated view classifier, and then manually curated to identify 34,714 videos with color Doppler across the mitral valve.26

Following view selection, the CSMC cohort included 34,714 videos across 30,453 unique echocardiogram studies from 22,661 patients, and a subset enriched for moderate and severe MR were used for training. A total of 20,604 videos from 18,133 studies in the dataset were split on a patient level into train (80%), validation (10%), and test (10%) cohorts to train a deep neural network for MR severity classification (**Figure 2**). MR severity for each study was determined based on the clinical echocardiogram report determined in a high volume echocardiography lab in accordance with ASE guidelines.27 When MR was characterized as an intermediate category (“trace to mild” or “mild to moderate” or “moderate to severe”), videos were placed in the more severe categories. Both primary and secondary MR were included. Studies with concomitant mitral stenosis, other prosthetic valves, and heart failure were also included in both training and validation datasets.

![Figure 2:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/02/12/2024.02.08.24302547/F2.medium.gif)

[Figure 2:](http://medrxiv.org/content/early/2024/02/12/2024.02.08.24302547/F2)

Figure 2: CSMC and Stanford Dataset Isolation.
34,714 Color Doppler A4C videos were isolated from a larger set of videos from CSMC. A view classifier was trained and used to isolate A4C Mitral Doppler videos from 915 studies containing 1,055 suitable videos from Stanford Healthcare. The MR classification model was then benchmarked on an internal test set from CSMC and an external test set from Stanford Healthcare.

#### Stanford Healthcare (SHC) Cohort

The pipeline was evaluated on 915 studies (containing a total of 46,890 videos) from SHC’s high-volume academic echocardiography lab. The automated view classification pipeline was compared with manual curation of videos within those studies to evaluate specificity. All videos identified by the view classifier were used for downstream MR severity model validation. Model output was compared with MR severity determined by expert cardiologists from the clinical reports. This study was approved by the Institutional Review Boards at Cedars-Sinai Medical Center and Stanford Healthcare. The need for informed consent was waived as the study involved secondary analysis of existing data.

### AI Model Training

Deep learning models were trained using the PyTorch Lightning deep learning framework. When patients had multiple echocardiogram videos and studies, each video was considered an independent example during training, with care not to have patient overlap across training, validation, and test cohorts. Video-based convolutional neural networks (R2+1D) were used for view classification and MR severity assessment.28 This model architecture was previously used for other echocardiography tasks and shown to be effective.17 The models were initialized with random weights and trained using a binary cross entropy loss function for up to 100 epochs, using an ADAM optimizer, an initial learning rate of 1e-2, and a batch size of 24 on two NVIDIA RTX 3090 GPUs. Early stopping was performed based on the validation loss.

The view classifier was trained using the 34,714 manually curated videos with color Doppler across the mitral valve as cases and 49,263 other apical-4-chamber videos as controls. Controls were a combination of videos that did not have color Doppler or had color Doppler window not focused on the mitral valve (videos focused on the tricuspid valve, intra-atrial septum, or ventricular septum). The MR severity model was trained on 6,206 videos without MR, 6,128 videos with mild MR, 6,174 videos with moderate MR and 2,042 videos with severe MR. This process is summarized in **Figure 2**.

### Statistical Analysis

Model performance was evaluated using area under the receiver operating characteristic curve (AUROC) and confusion matrices. F1-score, recall (sensitivity), positive predictive value (PPV), and negative predictive value (NPV) for both greater than moderate MR and severe MR. During external validation, the view classifier and MR classifier were evaluated serially as an automated pipeline. Statistical analysis was performed in Python (version 3.8.0) and R (version 4.2.2). Confidence intervals were computed via bootstrapping with 10,000 samples. Reporting of study results are consistent with guidelines put forth by CONSORT-AI.29,30

### Model Explainability

The key imaging features identified by the MR severity model were evaluated using saliency mapping generated using the Integrated Gradients method.31 This method generated a heatmap for every frame of the video, summarized as a final 2-dimensional heatmap generated by using the maximum value along the temporal axis for each pixel location in the video. Pixels brighter in intensity and closer to yellow were more salient to model predictions, while those darker in color were less important to the model’s final prediction. When assessing videos with no MR, heatmaps were obtained by taking the maximum of saliency maps for the moderate and severe class output neurons for each pixel location (**Figure 4**).

## Results

### Study Population

A total of 58,614 studies from 38,461 patients were used to train the deep learning pipeline. From 2,587,538 initial videos, a total of 354,117 videos were identified as apical-4-chamber and subsequently manually curated to identify videos that had color Doppler across the mitral valve. The manually curated color Doppler videos were used to train a view-classification model and linked with clinician reports to train the MR severity model. Patient characteristics are presented in **Tables 1 & 2** and are representative of the general CSMC patient population that received echocardiograms. The data was split on patient level for training and validation and had similar patient age, ejection fraction, left atrial volume index and proportions of male sex, coronary artery disease, and atrial fibrillation.

View this table:
[Table 1 -](http://medrxiv.org/content/early/2024/02/12/2024.02.08.24302547/T1)

Table 1 - Clinical and demographic characteristics represented in the training, validation, and internal test data sets for the 83,977 apical-4-chamber videos used to train, validate, and test the mitral doppler A4C view classifier.
Values outside and inside parentheses represent number and percent, respectively, for categorical variables and mean and standard deviation for continuous variables.

View this table:
[Table 2 -](http://medrxiv.org/content/early/2024/02/12/2024.02.08.24302547/T2)

Table 2 - Clinical and demographic characteristics of the 20,604 apical-4-chamber videos used to train, validate, and test the MR model.
Values outside and inside parentheses represent number and percent, respectively, for categorical variables and mean and standard deviation for continuous variables.

### View Classifier Performance Across Two Institutions

On a test set of 3,109 studies (132,767 videos) from CSMC not seen during model training, the view classifier identified 3,452 videos (97.5% of manually identified cases). This corresponds to an of AUC of 0.998 (0.998 – 0.999), and at the Youden Index, with a sensitivity of 0.975 (0.968 - 0.982) and specificity of 0.999 (0.999-0.999). To evaluate generalization of the view classification model at a geographically distinct site, we evaluated its performance on 915 studies from SHC. The view classifier isolated 1,091 videos from a total of 46,890 videos, while manual review identified 1,055 videos with color Doppler across the mitral valve. The view classifier correctly identified 1,051 (99.6%) of manually curated videos, with 4 videos not found by the AI pipeline and 40 false positives. This corresponds to a sensitivity of 0.996 (0.990 – 1.000) and specificity of 0.999 (0.999 – 0.999).

### Mitral Regurgitation Severity Performance Across Two Institutions

The MR severity model showed strong performance in distinguishing MR severity and identifying clinically significant mitral regurgitation (**Figure 3**). In the internal CSMC test set not used during model training, the model demonstrated an AUC of 0.916 (0.899 - 0.932) in detecting ≥ moderate MR and an AUC of 0.934 (0.913 - 0.953) for severe MR. The AI model had an NPV of 0.954 (0.940 - 0.967) for severe MR and an NPV of 0.863 (0.835 - 0.890) for ≥ moderate MR. Further information on MR model performance is presented in **Table 3**. The MR severity model performance was similar across institutions. In the SHC cohort, the model identified severe MR with an AUC of 0.969 (0.946 - 0.987) and ≥ moderate MR with an AUC of 0.951 (0.924 - 0.973). In this cohort, the model had an NPV of 0.977 (0.962 – 0.990) for severe MR and an NPV of 0.986 (0.974 – 0.995) for ≥ moderate MR.

![Figure 3:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/02/12/2024.02.08.24302547/F3.medium.gif)

[Figure 3:](http://medrxiv.org/content/early/2024/02/12/2024.02.08.24302547/F3)

Figure 3: Model Performance Across Severity and Institution -
**A.** Receiver operating characteristic (ROC) curves for detection of Severe or ≥ Moderate MR at CSMC and Stanford. “≥ Moderate” included moderate, moderate to severe, and severe MR. **3B and 3C:** MR Classification on test set videos from CSMC and Stanford, respectively. Confusion matrix colormap values were scaled based on the proportion of actual disease cases in each class that were predicted in each possible disease category. This was done to allow for relative comparison of model performance across disease classes (None, Mild, Moderate, and Severe) given class imbalance.

View this table:
[Table 3 -](http://medrxiv.org/content/early/2024/02/12/2024.02.08.24302547/T3)

Table 3 - Model performance across institutions
- AUC, PPV, NPV, Recall and F1-score for MR on an internal test set from CSMC and an external validation set at Stanford. 95% confidence intervals were obtained by bootstrapping 10,000 samples. “≥ moderate” includes moderate and severe MR.

### Model Interpretation

Notably, saliency maps for our model demonstrate that the model focuses on the clinically relevant imaging features of mitral regurgitation. Saliency maps from Integrated Gradients were used to identify regions of interest in each video contributing the most to detection of MR severity (**Figure 4**).31 These interpretability techniques demonstrated localization of the activation signal in the color Doppler window and primarily highlighting the regurgitant jet, indicating that the model used appropriate, physiologic features of the mitral regurgitation to make predictions. Frame-by-frame saliency visualizations are shown in **Supplemental Videos S1-S4**.

![Figure 4:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/02/12/2024.02.08.24302547/F4.medium.gif)

[Figure 4:](http://medrxiv.org/content/early/2024/02/12/2024.02.08.24302547/F4)

Figure 4: Saliency Map Visualization for MR classification models.
Echocardiogram videos with severe MR from CSMC (top left) and Stanford (bottom left) are shown on the left, while videos with no MR from CSMC (top right) and Stanford (bottom right) are shown on the right. Saliency maps were computed using the Integrated Gradients method. A final 2-dimensional heatmap was generated by using the maximum value along the temporal axis for each pixel location in the video. Pixels brighter in color and closer to yellow were more salient to model predictions, while those darker in color were less important to the model’s final prediction. Severe MR was assessed by using the activation function for severe disease to generate a heatmap. When assessing controls, heatmaps were generated by stacking heatmaps for severe and moderate classes and taking the maximum between the two at each pixel location.

## Discussion

We developed and validated an automated pipeline for assessing for mitral regurgitation in echocardiography. From a full transthoracic echocardiogram study, the algorithm automatically screens for appropriate A4C videos with color Doppler on the mitral valve and then assesses MR severity. For both severe MR as well as ≥ moderate MR, the model demonstrated strong performance (> 0.916 AUC and > 0.863 NPV). This automated workflow worked in unselected external validation studies without preselection or exclusion of other concomitant comorbidities. Given these characteristics, our deep learning model could aid in the preliminary assessment of MR, facilitate review of institutional databases, or expand access for screening in low-resource settings.

Our algorithm learns features of mitral regurgitation that generalize across variability in imaging practices in two geographically distinct sites. Many prior echocardiography AI models primarily focused on black-and-white standard 2D B-mode images, while our study focuses on the AI assessment of color Doppler videos and utilized a video-based model for the incorporation of rich temporal information, both crucial for accurate MR assessment. Incorporation of Doppler information greatly expands the opportunities for AI in echo, particularly in valve disease. In expert clinical interpretation, a variety of metrics beyond just color Doppler and views are synthesized together to come up with a holistic assessment of MR severity. Intriguingly, our AI algorithm generally results in concordant interpretations with the comprehensive clinical approach while relying only on the A4C view, suggesting there is significant overlapping information as well as dependence on the A4C view in standard clinical practice.

While promising, the present work carries limitations. Echocardiographic assessment of MR depends on appropriate images being obtained, with different views potentially maximizing the visualized regurgitant jet. This algorithm would not overcome incomplete input information and insufficient images that would result underestimation of MR. Future work could focus on automatically quantifying parameters like valve leaflet thickness, effective regurgitant orifice area, regurgitant volume and fraction.

The present work builds upon prior work in the space of echocardiography and AI. Several recent works have reported strides in computer vision and echocardiography, including automated view classification32,33, phenotyping of left ventricular hypertrophy15, assessment of LV systolic function13, aortic stenosis risk stratification, and detection of complex congenital heart defects.34 Prior work in machine learning applied to MR has primarily focused on structured data and non-deep learning approaches. The combination of our algorithm with previously published works using AI to guide novices in acquiring imaging could potentially increase access to screening of MR.20,35

In summary, we introduce a model to screen for and stratify mitral regurgitation severity from transthoracic echocardiogram videos. To do so, we provide a workflow for isolating mitral valve color Doppler videos and automation of MR severity assessment. The models were evaluated to have good performance in internal and external test cohorts. The use of such a model, with a high AUC, NPV, and generalizability across sites, can open the door for screening of mitral valve disease in the primary care setting or in low-resource environments.

## Data Availability

The dataset of videos used in this study is not publicly available due to its potentially identifiable nature.

## Disclosures

A.V., G.D., M.V., and D.L. report no disclosures. S.C. reports consulting fees from UCB and Viz.ai and research grants NIH R01-HL131532 and NIH R01-HL142983. D.O. reports support from NIH NHLBI, NIH grant R00-HL157421 and AstraZeneca Alexion, as well as consulting from EchoIQ, Ultromics, Pfizer, InVision, Korean Society of Echo, and Japanese Society of Echo.

## Code Statement

Our code and model weights are available at [https://github.com/echonet/MR](https://github.com/echonet/MR)

## Acknowledgements

A.V. is a research fellow supported by the Sarnoff Cardiovascular Research Award. S.C. acknowledges support from the Erika J Glazer Family Foundation.

*   Received February 8, 2024.
*   Revision received February 8, 2024.
*   Accepted February 11, 2024.


*   © 2024, Posted by Cold Spring Harbor Laboratory

The copyright holder for this pre-print is the author. All rights reserved. The material may not be redistributed, re-used or adapted without the author's permission.

## References

1.  1.Enriquez-Sarano M, Akins CW, Vahanian A. Mitral regurgitation. Lancet. 2009;373:1382– 1394.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S0140-6736(09)60692-9&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=19356795&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F02%2F12%2F2024.02.08.24302547.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000265300100036&link_type=ISI) 

2.  2.Harb SC, Griffin BP. Mitral Valve Disease: a Comprehensive Review. Curr Cardiol Rep. 2017;19:73.
    
    
3.  3.Dziadzko V, Clavel M-A, Dziadzko M, Medina-Inojosa JR, Michelena H, Maalouf J, Nkomo V, Thapa P, Enriquez-Sarano M. Outcome and undertreatment of mitral regurgitation: a community cohort study. Lancet. 2018;391:960–969.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S0140-6736(18)30473-2&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F02%2F12%2F2024.02.08.24302547.atom) 

4.  4.Otto CM, Verrier ED. Mitral regurgitation--what is best for my patient? N. Engl. J. Med. 2011;364:1462–1463.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1056/NEJMe1102013&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21463152&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F02%2F12%2F2024.02.08.24302547.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000289467200012&link_type=ISI) 

5.  5.Del Forno B, De Bonis M, Agricola E, Melillo F, Schiavi D, Castiglioni A, Montorfano M, Alfieri O. Mitral valve regurgitation: a disease with a wide spectrum of therapeutic options. Nat Rev Cardiol. 2020;17:807–827.
    
    
6.  6.Ocher R, May M, Labin J, Shah J, Horwich T, Watson KE, Yang EH, Calfon Press, Marcella A. Mitral Regurgitation in Female Patients: Sex Differences and Disparities. Catheter Cardiovasc Interv. 2023;2:101032.
    
    
7.  7.Simpson TF, Kumar K, Samhan A, Khan O, Khan K, Strehler K, Fishbein S, Wagner L, Sotelo M, Chadderdon S, Golwala H, Zahr F. Clinical Predictors of Mortality in Patients with Moderate to Severe Mitral Regurgitation. Am J Med. 2022;135:380–385.e3.
    
    
8.  8.Messika-Zeitoun D, Candolfi P, Vahanian A, Chan V, Burwash IG, Philippon J-F, Toussaint J-M, Verta P, Feldman TE, Iung B, Glineur D, Mesana T, Enriquez-Sarano M. Dismal Outcomes and High Societal Burden of Mitral Valve Regurgitation in France in the Recent Era: A Nationwide Perspective. J Am Heart Assoc. 2020;9:e016086.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1161/JAHA.120.016086&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F02%2F12%2F2024.02.08.24302547.atom) 

9.  9.Tribouilloy CM, Enriquez-Sarano M, Schaff HV, Orszulak TA, Bailey KR, Tajik AJ, Frye RL. Impact of preoperative symptoms on survival after surgical correction of organic mitral regurgitation: rationale for optimizing surgical indications. Circulation. 1999;99:400–405.
    
    [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6MTQ6ImNpcmN1bGF0aW9uYWhhIjtzOjU6InJlc2lkIjtzOjg6Ijk5LzMvNDAwIjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjQvMDIvMTIvMjAyNC4wMi4wOC4yNDMwMjU0Ny5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 

10. 10.David TE, Ivanov J, Armstrong S, Rakowski H. Late outcomes of mitral valve repair for floppy valves: Implications for asymptomatic patients. J Thorac Cardiovasc Surg. 2003;125:1143–1152.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1067/mtc.2003.406&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=12771888&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F02%2F12%2F2024.02.08.24302547.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000183266600023&link_type=ISI) 

11. 11.Fadel BM, Bakarman H, Dahdouh Z, Di Salvo G, Mohty D. Spectral Doppler interrogation of mitral regurgitation-spot diagnosis. Echocardiography. 2015;32:1179–1183.
    
    
12. 12.Hagendorff A, Knebel F, Helfen A, Stöbe S, Haghi D, Ruf T, Lavall D, Knierim J, Altiok E, Brandt R, Merke N, Ewen S. Echocardiographic assessment of mitral regurgitation: discussion of practical and methodologic aspects of severity quantification to improve diagnostic conclusiveness. Clin Res Cardiol. 2021;110:1704–1733.
    
    
13. 13.Ouyang D, He B, Ghorbani A, Yuan N, Ebinger J, Langlotz CP, Heidenreich PA, Harrington RA, Liang DH, Ashley EA, Zou JY. Video-based AI for beat-to-beat assessment of cardiac function. Nature. 2020;580:252–256.
    
    
14. 14.Elias P, Poterucha TJ, Rajaram V, Moller LM, Rodriguez V, Bhave S, Hahn RT, Tison G, Abreau SA, Barrios J, Torres JN, Hughes JW, Perez MV, Finer J, Kodali S, Khalique O, Hamid N, Schwartz A, Homma S, Kumaraiah D, Cohen DJ, Maurer MS, Einstein AJ, Nazif T, Leon MB, Perotte AJ. Deep Learning Electrocardiographic Analysis for Detection of Left-Sided Valvular Heart Disease. J Am Coll Cardiol. 2022;80:613–626.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.jacc.2022.05.029&link_type=DOI) 

15. 15.Duffy G, Cheng PP, Yuan N, He B, Kwan AC, Shun-Shin MJ, Alexander KM, Ebinger J, Lungren MP, Rader F, Liang DH, Schnittger I, Ashley EA, Zou JY, Patel J, Witteles R, Cheng S, Ouyang D. High-Throughput Precision Phenotyping of Left Ventricular Hypertrophy With Cardiovascular Deep Learning. JAMA Cardiol. 2022;7:386–395.
    
    
16. 16.He B, Kwan AC, Cho JH, Yuan N, Pollick C, Shiota T, Ebinger J, Bello NA, Wei J, Josan K, Duffy G, Jujjavarapu M, Siegel R, Cheng S, Zou JY, Ouyang D. Blinded, randomized trial of sonographer versus AI cardiac function assessment. Nature. 2023;616:520–524.
    
    
17. 17.Soto JT, Weston Hughes J, Sanchez PA, Perez M, Ouyang D, Ashley EA. Multimodal deep learning enhances diagnostic precision in left ventricular hypertrophy. Eur Heart J Digit Health. 2022;3:380–389.
    
    
18. 18.Holste G, Oikonomou EK, Mortazavi BJ, Coppi A, Faridi KF, Miller EJ, Forrest JK, McNamara RL, Ohno-Machado L, Yuan N, Gupta A, Ouyang D, Krumholz HM, Wang Z, Khera R. Severe aortic stenosis detection by deep learning applied to echocardiography. Eur Heart J [Internet]. 2023; Available from: doi:10.1093/eurheartj/ehad456
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/eurheartj/ehad456&link_type=DOI) 

19. 19.Vrudhula A, Stern L, Cheng PC, Ricchiuto P, Daluwatte C, Witteles R, Patel J, Ouyang D. Impact of case and control selection on training AI screening of cardiac amyloidosis [Internet]. bioRxiv. 2023;Available from: doi:10.1101/2023.03.30.23287941
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1101/2023.03.30.23287941&link_type=DOI) 

20. 20.Narang A, Bae R, Hong H, Thomas Y, Surette S, Cadieu C, Chaudhry A, Martin RP, McCarthy PM, Rubenson DS, Goldstein S, Little SH, Lang RM, Weissman NJ, Thomas JD. Utility of a Deep-Learning Algorithm to Guide Novices to Acquire Echocardiograms for Limited Diagnostic Use. JAMA Cardiol. 2021;6:624–632.
    
    
21. 21.Nkomo VT, Gardin JM, Skelton TN, Gottdiener JS, Scott CG, Enriquez-Sarano M. Burden of valvular heart diseases: a population-based study. Lancet. 2006;368:1005–1011.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S0140-6736(06)69208-8&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=16980116&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F02%2F12%2F2024.02.08.24302547.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000240698700029&link_type=ISI) 

22. 22.Martin A-C, Bories M-C, Tence N, Baudinaud P, Pechmajou L, Puscas T, Marijon E, Achouh P, Karam N. Epidemiology, Pathophysiology, and Management of Native Atrioventricular Valve Regurgitation in Heart Failure Patients. Front Cardiovasc Med. 2021;8:713658.
    
    
23. 23.Avierinos J-F, Tribouilloy C, Grigioni F, Suri R, Barbieri A, Michelena HI, Ionico T, Rusinaru D, Ansaldi S, Habib G, Szymanski C, Giorgi R, Mahoney DW, Enriquez-Sarano M, Mitral regurgitation International DAtabase (MIDA) Investigators. Impact of ageing on presentation and outcome of mitral regurgitation due to flail leaflet: a multicentre international study. Eur Heart J. 2013;34:2600–2609.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/eurheartj/eht250&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23853072&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F02%2F12%2F2024.02.08.24302547.atom) 

24. 24.Grave C, Tribouilloy C, Tuppin P, Weill A, Gabet A, Juillière Y, Cinaud A, Olié V. Fourteen-Year Temporal Trends in Patients Hospitalized for Mitral Regurgitation: The Increasing Burden of Mitral Valve Prolapse in Men. J Clin Med Res [Internet]. 2022;11. Available from: doi:10.3390/jcm11123289
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.3390/jcm11123289&link_type=DOI) 

25. 25.Mitchell E, Walker R. Global ageing: successes, challenges and opportunities. Br J Hosp Med. 2020;81:1–9.
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F02%2F12%2F2024.02.08.24302547.atom) 

26. 26.Zhang J, Gajjala S, Agrawal P, Tison GH, Hallock LA, Beussink-Nelson L, Lassen MH, Fan E, Aras MA, Jordan C, Fleischmann KE, Melisko M, Qasim A, Shah SJ, Bajcsy R, Deo RC. Fully automated echocardiogram interpretation in clinical practice. Circulation. 2018;138:1623–1635.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1161/CIRCULATIONAHA.118.034338&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F02%2F12%2F2024.02.08.24302547.atom) 

27. 27.Zoghbi WA, Adams D, Bonow RO, Enriquez-Sarano M, Foster E, Grayburn PA, Hahn RT, Han Y, Hung J, Lang RM, Little SH, Shah DJ, Shernan S, Thavendiranathan P, Thomas JD, Weissman NJ. Recommendations for noninvasive evaluation of native valvular regurgitation. J Am Soc Echocardiogr. 2017;30:303–371.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.echo.2017.01.007&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F02%2F12%2F2024.02.08.24302547.atom) 

28. 28.Tran D, Wang H, Torresani L, Ray J, LeCun Y, Paluri M. A Closer Look at Spatiotemporal Convolutions for Action Recognition [Internet]. arXiv [cs.CV]. 2017 [cited 2023 Oct 19];Available from: [http://arxiv.org/abs/1711.11248](http://arxiv.org/abs/1711.11248)
    
    
29. 29.Selvaraju RR, Cogswell M, Das A, Vedantam R, Parikh D, Batra D. Grad-CAM: Visual explanations from deep networks via gradient-based localization. Int J Comput Vis. 2020;128:336–359.
    
    
30. 30.Saito T, Rehmsmeier M. The precision-recall plot is more informative than the ROC plot when evaluating binary classifiers on imbalanced datasets. PLoS One. 2015;10:e0118432.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pone.0118432&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25738806&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F02%2F12%2F2024.02.08.24302547.atom) 

31. 31.Sundararajan M, Taly A, Yan Q. Axiomatic attribution for deep networks [Internet]. arXiv [cs.LG]. 2017 [cited 2023 Nov 4];Available from: [http://arxiv.org/abs/1703.01365](http://arxiv.org/abs/1703.01365)
    
    
32. 32.Steffner K, Christensen M, Gill G, Bowdish M, Rhee J, Kumaresan A, He B, Zou J, Ouyang D. Deep learning for transesophageal echocardiography view classification [Internet]. bioRxiv. 2023; Available from: [https://www.medrxiv.org/content/10.1101/2023.06.11.23290759.abstract](https://www.medrxiv.org/content/10.1101/2023.06.11.23290759.abstract)
    
    
33. 33.Madani A, Arnaout R, Mofrad M, Arnaout R. Fast and accurate view classification of echocardiograms using deep learning. NPJ Digit Med [Internet]. 2018;1. Available from: doi:10.1038/s41746-017-0013-1
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41746-017-0013-1&link_type=DOI) 

34. 34.Arnaout R, Curran L, Zhao Y, Levine JC, Chinn E, Moon-Grady AJ. An ensemble of neural networks provides expert-level prenatal detection of complex congenital heart disease. Nat Med. 2021;27:882–891.
    
    
35. 35.Chiu I-M, Lin C-HR, Yau F-FF, Cheng F-J, Pan H-Y, Lin X-H, Cheng C-Y. Use of a Deep-Learning Algorithm to Guide Novices in Performing Focused Assessment With Sonography in Trauma. JAMA Netw Open. 2023;6:e235102.