Application of Principal Component Analysis to Heterogenous Fontan Registry Data Identifies Independent Contributing Factors to Decline ======================================================================================================================================= * Margaret R. Ferrari * Michal Schäfer * Kendall S. Hunter * Michael V. Di Maria ## Abstract Single ventricle heart disease is a severe and life-threatening illness, and improvements in clinical outcomes of those with Fontan circulation have not yet yielded acceptable survival over the past two decades. Patients are at risk of developing a diverse variety of Fontan-associated comorbidities that ultimately requires heart transplant. Our observational cohort study goal was to determine if principal component analysis (PCA) applied to data collected from a substantial Fontan cohort can predict functional decline (N=140). Heterogeneous data broadly consisting of measures of cardiac and vascular function, exercise (VO2max), lymphatic biomarkers, and blood biomarkers were collected over 11 years at a single site; in that time, 16 events occurred that are considered here in a composite outcome measure. After standardization and PCA, principal components (PCs) representing >5% of total variance were thematically labeled based on their constituents and tested for association with the composite outcome. Our main findings suggest that the 6th PC (PC6), representing 7.1% percent of the total variance in the set, is greatly influenced by blood serum biomarkers and superior vena cava flow, is a superior measure of proportional hazard compared to EF, and displayed the greatest accuracy for classifying Fontan patients as determined by AUC. In bivariate hazard analysis, we found that models combining systolic function (EF or PC5) and lymphatic dysfunction (PC6) were most predictive, with the former having the greatest AIC, and the latter having the highest c-statistic. Our findings support our hypothesis that a multifactorial model must be considered to improve prognosis in the Fontan population. ## Introduction Patients born with single ventricle heart disease (SVD), a severe and rare congenital heart defect (CHD), are subjected to three palliative surgeries that culminate in the Fontan circulation [1]. While staged palliation addresses the primary concerns of obstructed systemic blood flow and cyanosis in a condition like hypoplastic left heart syndrome, a range of Fontan associated comorbidities, including lymphatic, liver, and cardiac damage, are often apparent in adolescence [2]. Because morbidity and mortality after Fontan surgery remain unacceptably high [2], new approaches to predict patient decline are sorely needed. The goal of this work is to develop a new prognostic model for patients with SVD and explore machine learning methods as a tool of risk stratification in the Fontan population. Principal component analysis (PCA) is a data reduction technique that is often applied to large data sets in research [3–7], although it has not yet been applied to the Fontan population outside of waveform analysis [8, 9]. Examples where PCA has been utilized to find novel associations in large datasets that would have not have been amenable to more conventional statistical approaches include: Scientists have used PCA to identify patterns of inflammatory and adhesion molecules that contribute to muscle weakness acquired in the intensive care unit [4]. A similar study in a population of adults used PCA to identify inflammatory markers that precede major adverse cardiovascular events (MACE) following heart attack and found that the PC influenced by interleukin-6 and interleukin-8 was a better predictor of MACE at one year than univariate cytokine measures [6]. Additionally, PCA has recognized lymphocyte-monocyte-neutrophil indices that contribute to disease severity in several cohorts of COVID-19 patients [5]. The aforementioned studies support our hypothesis that a PCA approach may be necessary to understand and predict outcomes in heterogenous disease states like the Fontan population, where a very large number of potential predictor variables exist that stem from anatomic, surgical, imaging, laboratory, functional testing domains. We have previously applied PCA to characterize non-pulsatile cavopulmonary flow waveforms [8, 9]; here we expand that approach to include heterogenous biomarkers. Another benefit of using a PCA approach in clinical data analysis is inclusion of correlative/colinear parameters, of which many statistical outcomes models prohibit [5]. Our primary objective in this study was to assess our previously defined novel waveform measures and other clinical parameters in a heterogenous PCA approach, all in support of the overall hypothesis that machine-learning extracted PCs will delineate patients with Fontan-associated comorbidities and reveal parameters that indicate circulatory failure in patients with a Fontan circulation. More specifically, a machine-learning extracted PC, which will consist of a pattern of abnormalities in multiple of cardiac and non-cardiac test results, previously unrecognized as an important preditor of outcomes, will be associated with a composite outcome of Fontan failure. ## Methods One-hundred and forty SVD patients that underwent cardiac MRI (cMRI) at Children’s Hospital Colorado between July 2011 and August 2022 were included in this retrospective cohort study, permitted by the Colorado Multiple Institutional Review Board as a portion of Fontan at Altitude Registry for Outcomes (FAROUT). All patients cared for in the Fontan Multidisciplinary Clinic at the Children’s Hospital Colorado have undergone surveillance testing for end-organ damage and Fontan-associated comorbidities by way of a clinical practice guideline since 2016 and were included. The FAROUT registry was queried and abstracted data was used as a foundation for a study database, in addition to our single site venous flow patterns [10]. Variables collected are shown in Table I. For the purposes of survival analysis, study subject status was evaluated as of December 1, 2022, and a composite outcome was defined as the development of plastic bronchitis (PB, n=1), protein-losing enteropathy (PLE, n=2), referral to transplant (RTT, n=9), received a transplant (n=4), or death (n=0) from the time of cMRI to time to follow up. View this table: [Table I.](http://medrxiv.org/content/early/2024/07/12/2024.07.11.24310309/T1) Table I. Measures used in heterogenous PCA, including novel single vessel waveform data, hemodynamic, global cardiovascular, blood, kidney, liver, and respiratory biomarkers, and their mean or median and corresponding standard deviation or interquartile range. ### cMRI Acquisition Phase images and corresponding magnitude images of the superior vena cava (SVC), inferior vena cava (IVC), and left pulmonary artery (LPA) were obtained using a PC-MRI, ECG gated sequence as previously described [11, 12] by applying a 1.5 or 3.0 Magnetom Avanto (Siemens Medical Solutions, Erlangen, Germany) or Ingenia (Philips Medical System, Best, Netherlands) Tesla magnet using a phased-array body surface coil. A free breathing PC-MRI sequence was used under the following conditions: time to repetition, 14-28 milliseconds/25-40 cardiac phases; time to echo, 2.2-3.5 milliseconds; matrix, 160 x 256; flip angle, 25 degrees; 100% k space sampling; cross-sectional pixel resolution, 0.82 x 0.82 mm2 and 1.56 x 1.56 mm2; slice thickness, 5 millimeters. Heart rate dependent, PC-MRI acquisition varied 2-3 minutes for each vessel. Aliasing was accommodated for using the following velocity-encoding values: SVC and IVC, 75-100 cm/second; LPA, 50-100 cm/second. The AAo, SVC, and IVC images were acquired in axial cine stack and the LPA in vertical long axis, all orthogonal to flow. ### Flow Profile Analysis Flow profile characteristics were assessed as described previously with slight modifications [9]. Flow waveforms were acquired by precise parallel segmentation of 2D phase-contrast image series in Circle CVi42 (Calgary, Canada). Flow data was captured for each patient at the SVC, IVC, and LPA and imported into MATLAB (Natick, MA). Each waveform was normalized by dividing each flow point by patient BSA to minimize size effect on the raw data. Data was interpolated using cubic spline interpolation to 40 points, guaranteeing size-matched array lengths for further analyses. Three data matrices were created containing single-site flow data and the size of each data matrix varied and is as follows: SVC (124 x 40), IVC (132 x 40), LPA (125 x 40). ### Clinical Biomarkers Global cardiovascular and ventricular indicators (EF, CO, CI, EDVi, ESVi, SVi) have long been established as the benchmark for Fontan patient status [8, 13–15] and therefore were included in the analysis for validation purposes. VVCR, mean catheterization pressures (mPAP and SVC mean pressure), VO2max, BNPmax and O2 saturations were also included in the outcomes analysis, all of which have been independently linked to Fontan circulation health and outcomes [16–19]. We have previously determined that biomarkers indicative of lymphatic function and PLE, specifically aspartate aminotransferase, alkaline phosphatase, cystatin-c and creatinine were associated with caval flow patterns [9], strongly indicating that these parameters may identify, or be a predictive of, which Fontan patients will experience circulatory failure. Similarly, biomarkers such as albumin, total protein, blood urea nitrogen and platelet count were included due to previous reports relating these to Fontan patient cyanosis and pulmonary blood flow [20, 21]. ### Principal Component Analysis PCA requires that the input data matrix has a value assigned to each position, and therefore after the exported registry was read into MATLAB, patients with more than five measures missing were removed from analysis. This was done in an effort to maintain a missing data rate of less than 5% [22], and the resulting clinical parameters and demographic information can be found in Table I. Patients that had missing values less than or equal to five were replaced with the column median, as a value is required for each position in the input matrix. Columns were normalized by subtracting the column mean from each sample and dividing by the corresponding column standard deviation, and the resulting matrix was the input for PCA. Scree plots were created to determine PCs representing greater than 5% of total variance. Interpretation of which clinical parameters had the greatest influence on each PC were graphically determined by visualizing PC eigenvectors. ### Statistics Statistical analyses were performed in GraphPad Prism and began by determining the univariate Cox hazard ratio (HR) for PCs 1-10 and EF, a measure of systolic function that serves as a benchmark diagnostic for patients with a Fontan circulation [23]. Akaike’s information criterion (AIC) and the c-statistic were also gathered, and measures with the greatest c-statistic were used to create receiver operating characteristic (ROC) curves and determine the area under the curve (AUC) and Youden’s index, defined as (sensitivity(x) – specificity(x)) −1. The optimum sensitivity, specificity and clinical threshold for grouping was found and used to defined groups for Kaplan-Meier survival analysis. The Mantel-Cox log-rank test was used to determine if a significant difference existed between Kaplan-Meier curves. Univariate parameters with the greatest c-statistics were used, up to two parameters at a time, for multivariate (bivariate) regression analysis and AIC was used to compare univariate and multivariate predictive models. ## Results ### Principal Component Analysis The size of the original data matrix imported to MATLAB was 140 x 31 and was reduced to 115 x 31 after removal of patients missing greater than five measures. The remaining matrix had a 4.15% rate of missing data, and therefore fell within the 5% acceptable rate [22]. Columns of the matrix included scores for single site SVC, IVC, and LPA flow patterns, each representing a patient’s contribution to that PC’s waveform pattern, EF, EDVi, ESVi, SVi, CO, CI, BSA, BNP max, AST, VVCR, lowest SaO2, SVC mean pressure, mPAP, albumin, platelets, alkaline phosphatase, total protein, creatinine, BUN, and cystatin-C (Table I). Following PCA, a scree plot was used to identify the percent each PC contributed to the overall variance in the original data set and can be seen in Figure 1. The first PC accounted for about 17.5% of the original data matrix variance, followed by approximately 11.5% for PC2, 8.5% for PC3, and subsequently decreased as the PC number increased (Figure 1). Each of the first 7 PCs accounted for more than 5% of the total variance, and together explained 70.3% of that variance; PCs up to PC10 (3.6% of total variance, 77.7% cumulative variance) were considered for survival analysis. ![Figure 1.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/07/12/2024.07.11.24310309/F1.medium.gif) [Figure 1.](http://medrxiv.org/content/early/2024/07/12/2024.07.11.24310309/F1) Figure 1. The scree pot displays each PC (x axis) and the percent variance it represents in the original data set (y axis). Interpretation of the first two PCs was aided by the biplot displayed in Figure 2, where the blue lines represent the eigenvectors, or PC coefficients, and the length and direction represents that parameter’s influence on each PC. For example, further distance from the origin on the x-axis means greater contribution to PC1, therefore EDVi, ESVi, EF, and VVCR contributed greatest to PC1 (Figure 2). PC2 variance is explained by deviance from zero along the y-axis, and major influencing parameters include CO, SVi, LPA PC1 and IVC PC1 (Figure 2). The red data points represent the scores, or how each patient sample contributes to the PCs. ![Figure 2.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/07/12/2024.07.11.24310309/F2.medium.gif) [Figure 2.](http://medrxiv.org/content/early/2024/07/12/2024.07.11.24310309/F2) Figure 2. Principal component biplot that displays the scores returned from PCA, or each sample, in the original dimension and are represented by the red data points. The eigenvectors, or the coefficients, for each clinical parameter are displayed as the blue lines and the direction and length represent the influence each parameter has on PC1 (x axis) and PC2 (y axis). The clinical implications of each PC were examined using bar graphs of the PC eigenvectors, where the clinical parameters (x axis) and their relative contributions to each PC (y axis) are displayed in Figure 3. The first PC was highly influenced by cardiac parameters, including EF, EDVi, ESVi, CI, VVCR, AST, SVC PC1 Scores, and IVC PC1 scores, which primarily describe cardiac function and the downstream effects in the Fontan circulation. The fourth PC was highly influenced by IVC and LPA waveform patterns in addition to cavopulmonary pressures and cystatin-C. PC5 was influenced by cardiac parameters representative of systolic function, such as EF, SVi, CI, and VVCR, and waveform patterns IVC PC2, SVC PC1 scores, and BUN. Albumin, alkaline phosphatase, total protein, BUN, BNP max, and SVC waveforms scores influenced PC 6. ![Figure 3.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/07/12/2024.07.11.24310309/F3.medium.gif) [Figure 3.](http://medrxiv.org/content/early/2024/07/12/2024.07.11.24310309/F3) Figure 3. Bar graphs for PC1 and PCs 4-6 display each clinical parameter considered in PCA, or the column headers, and the amount (y axis, 0 up to 1) that parameter influences each PC. ### Survival Analysis Univariate Cox proportional hazard ratio was determined for each PC and can be found in Table II. The single best predictor of which patient is at a greater hazard is PC6 (AIC=109), followed by the standard measure used for prediction in this population, EF (AIC=111) (Table II). PC1 and PC5 also performed well, with AICs of 113 and 115. The hazard ratio for EF and each PC is displayed in a forest plot in Figure 3, and the bars represent the 95% confidence intervals. If a parameter’s confidence interval crossed one, it was not statistically significant (Figure 4). ![Figure 4.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/07/12/2024.07.11.24310309/F4.medium.gif) [Figure 4.](http://medrxiv.org/content/early/2024/07/12/2024.07.11.24310309/F4) Figure 4. The forest plot displays the hazard ratio for each PC and EF and the corresponding 95% confidence intervals. Bars that cross 1, or the null hypothesis, represents no difference in hazards between patients that experienced an event versus those that did not. View this table: [Table II.](http://medrxiv.org/content/early/2024/07/12/2024.07.11.24310309/T2) Table II. Univariate cox regression hazard ratio for each PC and EF, the gold standard for systolic function and patient decline in the Fontan population, and the accompanying 95% confidence intervals, their p-value, c-statistic and AIC. Parameters with the greatest c-statistics were tested as classifiers of patients with Fontan decline using ROC curves, and PC6 returned the greatest AUC at 0.767, followed by PC5 (AUC=0.740), and EF (AUC=0.696) (Figure 5). The accompanying optimum sensitivity and specificity, determined using the greatest distance from the null hypothesis line or Youden’s index, was also found and shows that, while EF is highly specific (0.771) and therefore able to designate patients with Fontan failure correctly (low EF is almost always accompanied by SVD circulatory failure), its sensitivity is lacking at 0.643 (Figure 5). Sensitivity determines a classifier’s ability to label patients without Fontan decline correctly, and all PCs had the same, if not superior, sensitivity compared to EF (Figure 5). PC6 had the greatest sensitivity (0.786) with a reasonably balanced specificity at 0.686, suggesting it can rule healthy Fontan patients out as having circulatory failure, and PC1 had both optimum sensitivity and specificity at 0.714 (Figure 4). PC6 also displayed the greatest maximum effective biomarker, represented by Youden’s index of 0.471 (Figure 5). ![Figure 5.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/07/12/2024.07.11.24310309/F5.medium.gif) [Figure 5.](http://medrxiv.org/content/early/2024/07/12/2024.07.11.24310309/F5) Figure 5. ROC curves for each PC that returned the greatest c-statistics and EF and the accompanying AUC, p-value, optimum sensitivity and specificity and the corresponding clinical cut off. Grouping patients based on Youden’s index allows for Kaplan-Meier curve generation, displayed in Figure 6. Though EF was determined to have statistically significant differences in survival using the Mantel Cox log rank test (p=0.0006), PC1 and PC5 performed better with p values of 0.0003 and 0.0005 (Figure 6). PC6 also had highly significant differences in survival (p=0.0008), though it was not as significant as EF, and PC4 also displayed a significant difference in survival (p=0.07) (Figure 6). ![Figure 6.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/07/12/2024.07.11.24310309/F6.medium.gif) [Figure 6.](http://medrxiv.org/content/early/2024/07/12/2024.07.11.24310309/F6) Figure 6. Kaplan-Meier curves for each PC that returned the greatest c-statistics and EF, and the p value returned from the log-rank Mantel Cox p-value. Groups were created using the Youden’s index defined cut-off values. The inputs for multivariate Cox hazard regression analysis were EF, ESVi, and PC1 and PCs 4-6 and were chosen based on their univariate c-statistics. The models developed and each covariate’s HR estimate, 95% CI, p-value, c-statistic, and AIC are listed in Table III. The greatest AIC, and therefore predictive model, was model B (0.807, AIC=97) and included EF and PC6. However, model F, consisting of PC5 and PC6, has a greater c-statistic (0.845, AIC=103) and therefore is more probable to randomly identify a patient that experienced an event has a greater risk score than a patient that did not experience an event. View this table: [Table III.](http://medrxiv.org/content/early/2024/07/12/2024.07.11.24310309/T3) Table III. Multivariate models were developed, up to two measures at a time, for each PC and ystolic measures EF and ESVi, the corresponding HR estimates, 95% CIs, p-values, c-statistic and AIC (used for model selection). ## Discussion In this study, we explored univariate and multivariate associations between composite outcomes in a relatively large, single-center Fontan cohort, evaluating standard and ML-derived parameters both singly as well as through heterogeneous feature reduction (PCA). Through use of PCA, we determined PCs that appear to relate to specific features of Fontan decline, and that these features were significant univariate and multivariate predictors of a composite event. PC6 was identified as a highly significant measure of hazard in our cohort and was the greatest univariate predictor of outcomes with an AIC of 109. The parameters that contributed the greatest to this PC have also been associated with development of lymphatic dysfunction, specifically PLE, in the Fontan population and include albumin, alkaline phosphatase, total protein, BUN and BNP max [23–25]. Cavopulmonary flow patterns were also found to influence PC6 and have previously been suspected as contributors to PLE [23]. PC6 was also the most accurate classifier of patients with Fontan decline from those without, which supports our understanding of PLE development and poorer prognosis in the Fontan population. It is also worth noting that only two patients (of 16) experienced a composite outcome of PLE. Additionally, with a sensitivity of 78.6% and specificity of 68.6%, this PC may be clinically useful in categorizing (or risk stratifying) patients. For example, after undergoing surveillance testing for end-organ damage and Fontan-related comorbidities, a patient could have their CMR-derived flow waveforms and other biomarkers examined here projected into a known feature set (heterogeneous PCs), after which these latter PCs would be used to classify the patient as either at-risk or not, based on the present analysis. Additional work, in terms of data collection and validation, as well as potentially longitudinal studies targeting causality, must be performed to identify the role parameters represented by PC6 play in the development of PLE, though this study provides a promising foundation for further research. Survival was significantly predicted by all PCs identified as having a significant hazard ratio (1, 4, 5, 6), though two performed superior to EF, the standard measure of health and cardiac function in this population. PC1 performed better at prediction than EF alone and represented components of general cardiac function, including EF, EDVi, ESVi, VVCR, CI, and cavopulmonary flow measures. Our findings suggest that PCA is a supported method for inclusion of colinear parameters, and inclusion of such measures does in fact improve prediction of outcomes better than a single measure of ventricular function or multivariate testing. PC5 also improved prediction of outcomes compared to EF and was influenced by EF, SVi, CI, BSA, VVCR, BUN, SVC and LPA waveform patterns, most of which are affected by systolic ventricular function. However, little improvement was noted in a multivariate model that included EF and PC1 (AIC=112) and EF and PC5 (AIC=110) compared to the individual parameter’s AIC scores [EF (111), PC1 (115), and PC5 (109)] which may be due to redundant information (i.e. EF, and thus, systolic function, is now accounted for twice in the model). These results suggest that a PCA approach can improve outcomes prediction in the Fontan population and continue to support the hypothesis that machine-learning extracted PCs will clearly delineate SVD patients with Fontan-associated comorbidities to those without, in addition to our previously published hypothesis that a multivariable approach, in this case PCA, improves prediction of this heterogenous patient population with multiple organ systems in various stages of failure [9]. The best predictive model explored in this study, determined by AIC, is the multivariate Cox regression model B (Table III) including covariates EF and PC6. If PC6 is in fact linked to PLE, our findings suggest a combination of systolic ventricular function and measures indicative of lymphatic dysfunction may be an avenue for improved prognostication. This model, however, did not have the greatest c-statistic, which suggests it may not be suitable for ranking patients according to risk. Model F, including covariates PC5 and PC6, had the greatest c-statistic and is most suitable in determining which patients are at a higher risk. The limitations of this study, as previously described [8], include those inherent to PCA. Linear data reduction does not consider non-linear reduction methods and, as the name suggests, compresses the original data for usability and is accompanied by a loss of, ideally insignificant, original data variance. Additionally, several patients were removed from analysis because PCA requires that the input matrix has no missing data. The cohort may contain a selection bias, because not all of our Fontan patients received a routine CMR examination in the past. Finally, machine learning methods thrive on large datasets, and while the final set used for PCA (N=115) is large for a pediatric population, clearly multicenter studies or learning networks that pool such data will offer even greater insights into disease progression. Despite limitations, this work has established that a heterogenous approach to PCA is beneficial to outcomes prediction in Fontan patients, and that our novel single site venous waveform patterns contribute to PCs predictive of decline. ### Conclusion The goal of this study was to determine if a heterogenous PCA approach applied to the Fontan cohort can predict functional decline in this population. Our main findings suggest that PC6, which represented roughly 7% of the overall variance and is greatly influenced by blood serum biomarkers and SVC flow, is a superior measure of proportional hazard in this population compared to EF. We also found that PC6 displayed the greatest accuracy for classifying Fontan patients, as determined by AUC, and we identified two PCs that indeed predicted survival in this population better than EF. Our findings support our suspicions that a multifactorial model must be considered to improve prognosis in the Fontan population. ## Data Availability The data that support the findings of this study are available from the corresponding author, KSH, upon reasonable request ## Sources of Funding This research was supported by the Jayden DeLuca Foundation, NIH CTSA Grant UL1 TR002535, and the American Heart Association Children’s Heart Foundation Predoctoral Congenital Heart Defect Research Award 20PRE35260057 to MRF. ## Disclosures All of the authors have nothing to disclose. ## Acknowledgements This research was supported by the Jayden DeLuca Foundation, NIH CTSA Grant UL1 TR002535, and the American Heart Association Children’s Heart Foundation Predoctoral Congenital Heart Defect Research Award 20PRE35260057 to MRF. The authors would like to additionally thank Dr Dunbar Ivy for his support of the dissertation work that led to this manuscript. ## Abbreviations SVD : single ventricle disease PLE : protein losing enteropathy PB : plastic bronchitis FALD : Fontan-associated liver disease PC MRI : phase contrast magnetic resonance imaging VVCR : ventricular vascular coupling ration VO2 : rate of oxygen consumption AAo : ascending aorta SVC : superior vena cava IVC : inferior vena cava LPA : left pulmonary artery cMRI : cardiac MRI TCPC : total cavopulmonary connection PCA : principal component analysis PCs : principal components EF : ejection fraction EDVi : end diastolic volume index ESVi : end systolic volume index CI : cardiac index BNP : B-type natriuretic peptide GGT : gamma-glutamyl transferase AST : aspartate aminotransferase SaO2 : arterial oxygen saturation FEV1 : forced expiratory volume in one second mSVCP : mean SVC pressure mPAP : mean pulmonary artery pressure HLHS : hypoplastic left heart syndrome TA : tricuspid atresia DORV : double outlet right ventricle DILV : double inlet left ventricle HRHS : hypoplastic right heart syndrome TCPC : total cavopulmonary circuit FVC : forced vital capacity RV : residual volume TLC : total lung capacity Alk phos : alkaline phosphatase BUN : blood urea nitrogen * Received July 11, 2024. * Revision received July 11, 2024. * Accepted July 12, 2024. * © 2024, Posted by Cold Spring Harbor Laboratory This pre-print is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), CC BY-NC 4.0, as described at [http://creativecommons.org/licenses/by-nc/4.0/](http://creativecommons.org/licenses/by-nc/4.0/) ## References 1. 1.Kritzmire, S.M. and A.E. Cossu, Hypoplastic Left Heart Syndrome, in StatPearls. 2020: Treasure Island (FL). 2. 2.Rychik, J., Forty years of the Fontan operation: a failed strategy. Semin Thorac Cardiovasc Surg Pediatr Card Surg Annu, 2010. 13(1): p. 96–100. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1053/j.pcsu.2010.02.006&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20307870&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F12%2F2024.07.11.24310309.atom) 3. 3.Lee, Y.K., E.R. Lee, and B.U. Park, PRINCIPAL COMPONENT ANALYSIS IN VERY HIGH-DIMENSIONAL SPACES. Statistica Sinica, 2012. 22(3): p. 933–956. 4. 4.Witteveen, E., et al., Increased Early Systemic Inflammation in ICU-Acquired Weakness; A Prospective Observational Cohort Study. Crit Care Med, 2017. 45(6): p. 972–979. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1097/CCM.0000000000002408&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F12%2F2024.07.11.24310309.atom) 5. 5.Qi, Y., et al., Lymphocyte-monocyte-neutrophil index: a predictor of severity of coronavirus disease 2019 patients produced by sparse principal component analysis. Virol J, 2021. 18(1): p. 115. 6. 6.Kristono, G.A., et al., An IL-6-IL-8 score derived from principal component analysis is predictive of adverse outcome in acute myocardial infarction. Cytokine X, 2020. 2(4): p. 100037. 7. 7.Federolf, P.A., K.A. Boyer, and T.P. Andriacchi, Application of principal component analysis in clinical gait research: identification of systematic differences between healthy and medial knee-osteoarthritic gait. J Biomech, 2013. 46(13): p. 2173–8. 8. 8.Schafer, M., et al., Flow profile characteristics in Fontan circulation are associated with the single ventricle dilation and function: principal component analysis study. Am J Physiol Heart Circ Physiol, 2020. 318(5): p. H1032–H1040. 9. 9.Margaret R. Ferrari, M.S., Kendall S. Hunter, Michael V. Di Maria, Coupled waveform patterns in the arterial and venous fontan circulation are related to parameters of pulmonary, lymphatic and cardiac function. International Journal of Cardiology Congenital Heart Disease, 2023. 11(100429). 10. 10.Ferrari, M.R., et al., Central Venous Waveform Patterns in the Fontan Circulation Independently Contribute to the Prediction of Composite Survival. Pediatric Cardiology, 2023. 11. 11.Schäfer, M., et al., Characterization of CMR-derived haemodynamic data in children with pulmonary arterial hypertension. European Heart Journal - Cardiovascular Imaging, 2016. 18(4): p. 424–431. 12. 12.Schafer, M., et al., Main pulmonary arterial wall shear stress correlates with invasive hemodynamics and stiffness in pulmonary hypertension. Pulm Circ, 2016. 6(1): p. 37–45. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1086/685024&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=27076906&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F12%2F2024.07.11.24310309.atom) 13. 13.Ghelani, S.J., et al., Longitudinal changes in ventricular size and function are associated with death and transplantation late after the Fontan operation. J Cardiovasc Magn Reson, 2022. 24(1): p. 56. 14. 14.Rathod, R.H., et al., Cardiac magnetic resonance parameters predict transplantation-free survival in patients with fontan circulation. Circ Cardiovasc Imaging, 2014. 7(3): p. 502–9. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6ODoiY2lyY2N2aW0iO3M6NToicmVzaWQiO3M6NzoiNy8zLzUwMiI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDI0LzA3LzEyLzIwMjQuMDcuMTEuMjQzMTAzMDkuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 15. 15.Ishizaki, U., et al., Global strain and dyssynchrony of the single ventricle predict adverse cardiac events after the Fontan procedure: Analysis using feature-tracking cine magnetic resonance imaging. J Cardiol, 2019. 73(2): p. 163–170. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.jjcc.2018.07.005&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F12%2F2024.07.11.24310309.atom) 16. 16.Saiki, H., et al., Ventricular-Arterial Function and Coupling in the Adult Fontan Circulation. J Am Heart Assoc, 2016. 5(9). 17. 17.Diller, G.P., et al., Exercise intolerance in adult congenital heart disease: comparative severity, correlates, and prognostic implication. Circulation, 2005. 112(6): p. 828–35. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6MTQ6ImNpcmN1bGF0aW9uYWhhIjtzOjU6InJlc2lkIjtzOjk6IjExMi82LzgyOCI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDI0LzA3LzEyLzIwMjQuMDcuMTEuMjQzMTAzMDkuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 18. 18.Mori, M., et al., Catheter-measured Hemodynamics of Adult Fontan Circulation: Associations with Adverse Event and End-organ Dysfunctions. Congenit Heart Dis, 2016. 11(6): p. 589–597. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F12%2F2024.07.11.24310309.atom) 19. 19.Loomba, R.S., et al., Timing of Fontan Completion in Children with Functionally Univentricular Hearts and Isomerism: The Impact of Age, Weight, and Pre-Fontan Arterial Oxygen Saturation. Pediatr Cardiol, 2019. 40(4): p. 753–761. 20. 20.Tomkiewicz-Pajak, L., et al., Iron deficiency and hematological changes in adult patients after Fontan operation. J Cardiol, 2014. 64(5): p. 384–9. 21. 21.Cheng, A.L., et al., Elevated Low-Shear Blood Viscosity is Associated with Decreased Pulmonary Blood Flow in Children with Univentricular Heart Defects. Pediatr Cardiol, 2016. 37(4): p. 789–801. 22. 22.Schafer, J.L., Multiple imputation: a primer. Stat Methods Med Res, 1999. 8(1): p. 3–15. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1177/096228029900800102&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=10347857&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F12%2F2024.07.11.24310309.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000083699900002&link_type=ISI) 23. 23.Rychik, J., et al., Evaluation and Management of the Child and Adult With Fontan Circulation: A Scientific Statement From the American Heart Association. Circulation, 2019: p. CIR0000000000000696. 24. 24.Papadopoulou-Legbelou, K., M. Kavga, and M. Fotoulaki, Protein-losing enteropathy after Fontan operation: enteric capsule findings and management with atrial pacing. Hippokratia, 2017. 21(4): p. 208. 25. 25.Chin, A.J., et al., Serum alkaline phosphatase reflects post-Fontan hemodynamics in children. Pediatr Cardiol, 2009. 30(2): p. 138–45. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s00246-008-9292-2&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=18685798&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F12%2F2024.07.11.24310309.atom)