Protein identification for stroke progression via Mendelian Randomization in Million Veteran Program and UK Biobank =================================================================================================================== * Andrew Elmore * Nimish Adhikari * April E Hartley * Hugo Javier Aparicio * Dan C. Posner * Gibran Hemani * Kate Tilling * Tom R Gaunt * Peter Wilson * JP Casas * John Michael Gaziano * George Davey Smith * Lavinia Paternoster * Kelly Cho * Gina M Peloso ## Abstract **Background** Individuals who have experienced a stroke, or transient ischemic attack, face a heightened risk of future cardiovascular events. Identification of genetic and molecular risk factors for subsequent cardiovascular outcomes may identify effective therapeutic targets to improve prognosis after an incident stroke. **Methods** We performed genome-wide association studies (GWAS) for subsequent major adverse cardiovascular events (MACE) (Ncases=51,929, Ncntrl=39,980) and subsequent arterial ischemic stroke (AIS) Ncases=45,120, Ncntrl=46,789) after first incident stroke within the Million Veteran Program and UK Biobank. We then used genetic variants associated with proteins (pQTLs) to determine the effect of 1,463 plasma protein abundances on subsequent MACE using Mendelian randomization (MR). **Results** Two variants were significantly associated with subsequent cardiovascular events: rs76472767 (OR=0.75, 95% CI = 0.64-0.85, p= 3.69x10-08) with subsequent AIS and rs13294166 (OR=1.52, 95% CI = 1.37-1.67, p=3.77x10-08) with subsequent MACE. Using MR, we identified 2 proteins with an effect on subsequent MACE after a stroke: *CCL27* (effect OR= 0.77, 95% CI = 0.66-0.88, adj. p=0.05), and *TNFRSF14* (effect OR=1.42, 95% CI = 1.24-1.60, adj. p=0.006). These proteins are not associated with incident AIS and are implicated to have a role in inflammation. **Conclusions** We found evidence that two proteins with little effect on incident stroke appear to influence subsequent MACE after incident AIS. These associations suggest that inflammation is a contributing factor to subsequent MACE outcomes after incident AIS and highlights potential novel targets. Keywords * Progression * Prognosis * Stroke * AIS * MACE * Mendelian Randomization * GWAS ## Introduction Stroke remains a significant public health concern worldwide. With its potential to cause profound disabilities and mortality, it necessitates continued research efforts to unravel its multifaceted aetiology, identify modifiable risk factors, and develop effective therapeutic interventions. Arterial ischemic stroke (AIS) accounts for approximately 85% of all stroke cases and arises from occlusion of cerebral blood vessels, leading to inadequate perfusion and a subsequent ischemic cascade(1). Through the study of incident stroke events, modifiable factors such as hypertension, diabetes, dyslipidaemia, atrial fibrillation, obesity, and lifestyle behaviours have been identified, which may offer promising targets for prevention (2). Whether targeting the same factors offer avenues for effective treatment after the incident event is unclear. Genome wide association studies (GWAS) are usually performed on disease status for incident events but expanding them to subsequent events could provide us with novel biological insights about stroke progression, which may be more relevant for drug identification opportunities(3). GWAS of stroke incidence have previously observed 32 loci associated with stroke and stroke subtypes(5) with a recent study adding 5 more novel loci for stroke incidence(5). GWAS of disease progression can provide genetic risk factors that may be independent of the incident event. Since GWAS of disease progression include only individuals with incident disease, this can lead to the statistical problem of collider bias (or index-event bias), where shared confounders between incident and subsequent events can uncover spurious associations and biased estimates of effects, even amongst genetic risk factors(3) Mendelian randomization (MR) is an established statistical method that uses genetic variants to assess putative causal relationships between genetically proxied protein abundance on incident AIS and subsequent AIS and MACE(6). The main advantage MR has over traditional observational epidemiological methods is that MR can imply causality between an exposure and an outcome because it is less liable to common epidemiological biases, such as confounding and reverse causality. For biases that MR does not account for, sensitivity analyses can assess whether results are robust. One such method is colocalization, which is used to identify if a genetic variant is shared by two traits and is a necessary condition for causality(7). In this study, we perform GWAS of subsequent AIS and MACE after incident AIS in the Million Veteran Program and UK Biobank stratified by ancestry and meta-analysed across ancestries. We then use our subsequent events GWAS to perform MR for plasma protein abundances using pQTLs from UK Biobank Pharma Proteomics Project. Our genetic study aims to mimic a stroke prevention trial where recruitment into the trial is based on having a primary stroke event. ## Methods ### Genome Wide Association Studies #### Phenotype Definitions Incident stroke was defined as any diagnosis of AIS or transient ischaemic attack (TIA) using hospital linked data. People who experienced their initial stroke more than one year prior to recruitment were excluded from our stroke phenotype. Specific International Classification of Disease (ICD) codes used for both MVP and UKB can be found in the supplementary information. Subsequent AIS/TIA was defined as any secondary diagnosis of AIS/TIA at least 90 days after the incident diagnosis, to avoid recoding of the primary event, and would be considered events after the acute phase of an incident AIS/TIA. Individuals who did not survive at least thirty days after their incident stroke diagnosis were excluded from analyses of subsequent outcomes, to emulate a target clinical trial. Subsequent MACE was defined as any subsequent stroke, myocardial infarction (MI), or death due to atherosclerotic cardiovascular disease (ASCVD), with the first event that happens after 90 days used to construct the MACE phenotype. Vascular disease events occurring before or after the initial stroke were excluded, but events greater than 90 days post stroke were included. #### UK Biobank (UKB) UKB is a prospective cohort study with over 500,000 participants aged 40-69 (average 56.5) years when recruited in 2006-2010 and 54% of participants are women(8). Information on the genotype imputation, quality control and GWAS is available in the Supplementary Information. #### Million Veteran Program (MVP) MVP is a continually growing cohort of over 850,000 participants by 2021(9), 8% women, with an average age of 61.9 years(10). Information on the genotyping, imputation, quality control and GWAS is available in the Supplementary Information. #### Collider Bias Sensitivity Analysis and Correction To perform a correction for collider bias for subsequent stroke we used Slope-Hunter(11), a method that uses a mix of thresholding and mixed model clustering to quantify the bias and present a corrected estimate of the progression effect of a subsequent stroke. Slope-Hunter assumes that SNPs can be divided into clusters based on their causal relationship with incident and subsequent events and uses SNPs associated with the incident event only to provide an estimate of the bias correction factor for the study, hence is more robust to the correlation between incident and subsequent events. However, when investigating specific SNPs and their associated regions, collider bias correction may only be necessary if there is an association of the variant with incident AIS to begin with. For that reason, we have compared Slope-Hunter adjusted results with non-Slope-Hunter adjusted results, as well as compared the results with the associated region in the incident GWAS. Each Slope-Hunter calculation was performed for each specific ancestry as the collider bias may behave differently in each subset of data. We used the Slope-Hunter method with a default p-value threshold of 0.001 to correct the summary statistics for further analyses. We used the 1000genomes reference panel for clumping matched by ancestry group, with an r2 threshold of 0.1. #### Expected vs. Observed Replication To determine whether the GWAS results of subsequent stroke are different from incident stroke, we used the approach described in Okbay et. al(12), that determines replication performance by accounting for differential power. Here we used the approach to determine the extent to which incidence stroke GWAS hits are replicated in subsequent stroke, compared against the power-adjusted expected replication rate. ### Multi-Ancestry Comparison and Meta-analysis Two meta-analyses were performed. First, European only meta-analysis was performed across UKB and MVP. Secondly a meta-analysis of all individuals, including each ancestry from MVP (European, African and Hispanic) and Europeans from UKB was conducted using a fixed-effects model. We completed this meta-analysis for both original and Slope-hunter adjusted results, and compared the results. Both meta-analyses and heterogeneity score calculations were performed using the software METAL(13). The Cochran’s Q-Statistic was used to test for heterogeneity between ancestries(13). We set our genome-wide significance threshold to 5x10-08. We ran tissue expression analysis on subsequent stroke states using Functional Mapping and Annotation of GWAS (FUMA)(14), including MAGMA(15) Tissue Expression Analysis to investigate if there were any significant correlations between the subsequent GWAS and tissue expression. ### Mendelian Randomization against Protein Abundance Using the meta-analysed GWAS results and existing available protein quantitative trait loci (pQTL) data sets, we performed MR for each outcome with a panel of 1,463 plasma proteins as potential causal risk factors. Measuring proteins at population scale could help discover novel clinical biomarkers and improve fine-mapping of causal genes linked to complex diseases(16). To account for multiple testing, p-values were adjusted using false discovery rate (FDR), and are subsequently reported as adjusted p. pQTLs were extracted from pQTL studies from UK Biobank Pharma Proteomics Project (UKB-PPP) (54,306 participants, 1,463 proteins, Olink platform) (16). To ensure the robustness of the instruments (pQTLs), we attempted to replicate the MR results using 3 independent pQTL datasets; Atherosclerosis Risk in the Community (ARIC, European and African ancestry) (9,084 participants, 4,657 proteins, SOMAScan platform)(17), deCODE (35,559 participants, 4,907 proteins, SOMAScan platform) (18), and INTERVAL (3,301 participants, 3,622 proteins, SOMAScan platform) (19) MR for subsequent stroke MACE and AIS were performed on the multi-ancestry meta-analysis. MR for incident stroke AIS was also performed on the largest known stroke GWAS published(4). MR analyses were performed using the ‘TwoSampleMR’ R package(20).The pQTL data sets were meta-analysed across ancestries (16,18,19). We used a two-sample MR framework to estimate the putative causal effect of genetically proxied protein abundance to incident and subsequent stroke. MR estimates were generated using the Wald ratio method for instruments consisting of single SNPs, which included all of our instruments. MR relies on three assumptions for identifying a putative causal effect(21), the genetic instrument should: 1) associate with the exposure (relevance), 2) have no shared causal factors with the outcome (independence), and 3) solely influence the outcome through the impact of the risk factor of primary concern (exclusion restriction). The relevance assumption was tested by generating the F-statistic for each instrument, where an F-statistic > 10 is evidence against weak instrument bias(22). The exclusion restriction assumption is difficult to assess with single SNP instruments, as is common for molecular traits. Therefore, we additionally performed colocalization. Finally, to explore if there was any evidence of heterogeneity of effects between genetic ancestries, for SNPs used in MR, we compared the associations with the outcomes across ancestries. #### Colocalization Colocalization is a phenomenon whereby genetic factors at a particular locus are shared between two or more traits. The package *coloc* was used to assess whether two association signals are consistent with a shared causal variant(7). We assessed the posterior probabilities of if the analysed SNPs share the same causal variant (known as H4)(7), where H4 ≥ 80% indicates strong evidence, and 80% > H4 ≥ 60% indicates moderate evidence of colocalization. #### Collider Bias Analysis and Correction We tested whether SNPs used in the MR were associated with the published stroke incidence to determine the potential for collider bias and ran MR against both the uncorrected meta-analysed GWAS as well as the Slope-Hunter adjusted meta-analysed GWAS, as explained previously. ### Compare Results Against Known Druggable Targets Once significant SNPs from GWAS results and proteins from MR results were identified, we cross-referenced these with known existing SNPs, as well as existing literature around stroke onset and progression. Finally, we compared the results from the pQTL MR against known druggable targets from Open Targets(23). ## Results ### Genome Wide Association Studies After exclusions based on ancestry and relatedness, 93,422 individuals who had an incident stroke across the UKB and MVP were analysed (86,237 for MVP, 7,185 for UKB), among which 51,929 had subsequent MACE and 45,120 has subsequent AIS. Stroke cases were older, more commonly male, with a higher proportion of smokers and individuals with hypertension, type 2 diabetes, anti-hypertensive use, and lipid-lowering medication use than individuals who had never experienced an AIS (**Table S1**). There were no genome-wide significant associations in the multi-ancestry meta-analysis for subsequent AIS or MACE events (**Figure S1 and S2**), but we did observe 2 genome-wide significant (p<5x10-08) genetic variants in specific ancestry analyses: rs76472767 near gene *RNF220* on chromosome 1 in the AFR GWAS for subsequent MACE (slope-hunter corrected p = 3.69x10-08) and rs13294166 near gene *LINC01492* on chromosome 9 in the AFR GWAS for subsequent AIS (uncorrected p=3.77x10-08) (**Figure 2, Table S2**). For these two associations, we compared the results before and after Slope-Hunter correction as well as with the results in the incident AIS. We observed that none of the significantly associated variants were associated with incident AIS, and therefore the Slope-Hunter correction for collider bias may not have been necessary and the uncorrected results may be considered unbiased (**Figure 2**). However, the Slope-Hunter correction for collider bias may lead to slight differences in the results. ![Figure 1:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/02/01/2024.01.31.24302111/F1.medium.gif) [Figure 1:](http://medrxiv.org/content/early/2024/02/01/2024.01.31.24302111/F1) Figure 1: Flowchart of the methodological processes for analysing stroke data. MVP: Million Veterans Program; UKB: United Kingdom Biobank; AIS Acute Ischemic Stroke; MACE: Major Acute Cardiovascular Events; GWAS: Genome Wide Association Study; SNP: Single Nucleotide Polymorphism; pQTL: Protein Quantitative Trait Loci; MR: Mendelian Randomization ![Figure 2:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/02/01/2024.01.31.24302111/F2.medium.gif) [Figure 2:](http://medrxiv.org/content/early/2024/02/01/2024.01.31.24302111/F2) Figure 2: Forest Plots of Initial AIS and corrected and uncorrected subsequent events for the two genome wide significant SNPs. Generalized linear mixed effect regression models were used to test for associations between minor alleles and Subsequent MACE in stroke patients, adjusted for the first 10 genetic principal components. AIS: Acute Ischemic Stroke; MACE: Major Acute Cardiovascular Events; SH corrected: Result corrected by Slope-Hunter; OR: Odds ratio; SNP: Single Nucleotide Polymorphism We tested for tissue enrichment of subsequent stroke GWAS signals using expression data with MAGMA in FUMA. However, no tissues were expressed above any statistically significant threshold (**Figure S4**). #### Expected vs. Observed Replication We sought to determine if the genetic factors for incident stroke were also relevant for subsequent stroke. Of the 91 SNPs previously reported to associate with incident stroke, we observed that 77 replicated in our incidence GWAS at p < 0.05 (91 expected given the power difference). By contrast, our subsequent MACE GWAS replicated only 33 (compared to 82 expected, pdiff = 3x10-35), suggesting there is overlapping, but also distinct genetic aetiology of incident stroke and subsequent MACE. This pattern was consistent when using collider bias corrected results (**Table S3**). ### Mendelian Randomization against pQTL Data The subsequent AIS MR results were similar to subsequent MACE, but due to lower sample size, had larger p-values and wider confidence intervals, therefore, we focus our MR study on the subsequent MACE results (full results in **Table S4**). We observed 6 genes for incident stroke and 2 genes for subsequent stroke that have a significant MR result (adj. p<0.05) and supporting colocalization evidence (PP H4>60%) (**Table 1**). For all 6 genes, all MR results are based on single instrumental variant since only one cis pQTL was available and the Wald ratio was used. View this table: [Table 1:](http://medrxiv.org/content/early/2024/02/01/2024.01.31.24302111/T1) Table 1: MR and colocalization results from UKB-PPP pQTL dataset against both incident and subsequent MACE. MR: Mendelian randomization; MACE: Major Acute Cardiovascular Events; pQTL: Protein Quantitative Trait Loci; MR: Mendelian randomization; UKB-PPP: United Kingdom Biobank Pharma Proteomics Project; coloc h4: Posterior probability that the analysed SNPs in the region share one common causal variant. #### Incident pQTL Results We identified 6 proteins (CST6, FGF5, FURIN, GRK5, MMP12, SCARA5) with evidence for a putative causal effect on incident AIS (adj. p<0.05). However, none of these showed evidence for a putative causal effect on subsequent MACE (**Table S4**). All except SCARA5 showed very strong evidence for colocalization, while SCARA5 showed moderate evidence of colocalization (**Figure S5**). #### Subsequent MACE pQTL Results Two proteins (CCL27 and TNFRSF14) showed evidence for a putative causal effect on subsequent stroke (adj. p<0.05, **Table 1**). Neither of these proteins showed a putative causal effect on incident stroke. Genetically predicted higher levels of CCL27 showed evidence of a protective effect against subsequent stroke (OR=0.77, 95% CI = 0.66, 0.88). In contrast, higher predicted TNFRSF14 levels increased risk of subsequent stroke (OR=1.419, 95% CI =1.24, 1.60). After collider bias correction using Slope-Hunter, the MR results for CCL27 and TNFRSF14 were not significantly affected; CCL27: 0.77 (95% CI = 0.65, 0.89), TNFRSF14: 1.419 (95% CI = 1.24, 1.60). (**Table S4**). However, as these specific variants were not associated with incident disease (**Table 1**) the potential for collider bias was minimal. Both proteins implicated have a role in inflammation(24,25). There was evidence for colocalization of CCL27 and TNFRSF14 protein and subsequent MACE (**Table 1, Table S5, Figure S6 and S7**). The assignment of the remaining probability mostly to H1 (representing association only with protein trait and not stroke outcome) suggests this analysis has limited power, rather than suggesting there are independent effects that don’t colocalise (H3). #### Verification of pQTL instruments using other datasets To verify that the protein instruments identified in UKB-PPP were valid, we replicated the MR results using 3 other independent pQTL data sets (ARIC, deCODE and INTERVAL). For 5 of the 9 significant MR results, MR using independent pQTL data sets showed consistent putative causal effects (**Table S6**). Due to the differing power of the pQTL data sets, only pQTLs were filtered for having F-statistic value above 10 (**Table S7**). #### Multi-Ancestry Comparison of MR Results and Meta-analysis There is little evidence that the 3 proteins reported as having putative causal effects on subsequent stroke have different putative causal effects across the three ancestries tested. However, this is primarily due to the very wide confidence intervals (and small sample sizes) within Hispanic and African subgroups (**Figure 3**). ![Figure 3:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/02/01/2024.01.31.24302111/F3.medium.gif) [Figure 3:](http://medrxiv.org/content/early/2024/02/01/2024.01.31.24302111/F3) Figure 3: breakdown of significant MR results in subsequent stroke states by ancestry, as well as a comparison to incident stroke (Cochran Q p>=0.1). MR: Mendelian randomization; MACE: Major Acute Cardiovascular Events; AIS: Acute Ischemic Stroke; MR: Mendelian randomization; OR: Odds Ratio #### Comparing MR Results Against Potential Druggable Targets Of the 720 genes related to “Ischemic Stroke” in Open Targets(23), 103 had instrumentable protein products (p<1x10-11) in UKB-PPP(16). Five of these met an FDR-adjusted significance threshold in MR for incident stroke (ANGPT1, FGF5, FURIN, MMP12, TFPI), but none in the MR of subsequent stroke. Of the 5 targets with evidence of a causal effect on incidence, 4 were previously identified as putatively causal for incident stroke in MR studies(26,27), while FURIN is novel. ANGPT1, TFPI, and MMP12 have evidence of a causal effect on incident AIS, while FURIN and FGF5 have existing genetic associations. Using the Therapeutic Target Database(28) to find existing therapeutic drugs for these genes, we determined that TFPI and ANGPT1 have phase 3 clinical trials associated with them, MMP12 has a phase 1 trial, and FURIN has pre-clinical trial. None of the drugs which started clinical trials were designed for stroke (**Table S8**). For markers associated with subsequent MACE, currently, there are no clinical trials of drugs targeting TNFRSF14, CCL27. ## Discussion We observed several loci associated with subsequent stroke events that have a role in inflammation. There exists a link between inflammation and stroke. While the immune response starts locally, inflammatory mediators propagate, which leads to a systemic inflammatory response, followed by immunosuppression(29). Changes in TNF and IL6 levels have been observed in patients at the onset of stroke(30). This response may be due to a state of immunodepression that occurs post-stroke, as there are increased risks of poststroke infections(29). There is increasing evidence that greater inflammation is associated with AIS progression. It is unclear whether inflammation is transitory, related to the severity of the ischemia, and the ischemia-inflammation association post stroke is not well characterized(31). The discovery of two proteins having a predicted causal effect on subsequent MACE after stroke suggests that inflammation is a contributing factor to subsequent MACE outcomes after incident stroke AIS(24,25). TNFRSF14 (also known as HVEM) signals via TRAF2/3 pathway, role in immune cell survival. TNFRSF14 is a receptor for 4 ligands: TNFSF14 (LIGHT), LTA, BTLA and CD160. First two are TNF cytokines, 2nd two are Ig-related membrane proteins. HVEM has been shown to contribute to plaque destabilization and rupture(32). The LIGHT protein is known to have prognostic predictive value for composite cardiovascular events(33). The TNF-alpha family also has been suggested as being a risk factor to stroke(34). CD160 has been shown to be a potential indicator of the progression of atherosclerosis(35). Plasma measures of three of these four ligands were available in the UKB proteomics data, but (despite having strong instruments available F>500) neither had a causal effect on subsequent MACE (LTA p=0.634; CD160 p=0.184, TNFSF14 p=0.303). CCL27 is a cytokine involved in maintaining immune homeostasis in barrier tissues(36). A third protein, IL19, showed slighty weaker evidence of a causal effect on subsequent MACE (OR=0.878, 95% CI = 0.81, 0.94, adj. p=0.053, coloc h4=69%) and also no effect in incident AIS (OR=0.963, 95% CI = 0.92, 1.00, adj. p=0.496). IL19 is an anti-inflammatory marker(37), and diminishes cerebral infarction and neurological deficits following cerebral ischemia in mice, potentially through the elevated expression of genes related to pro-inflammatory cytokines(38). As increased IL19 levels appear to be an anti-inflammatory marker, and increased TNFRSF14 levels show as a correlative effect as a known inflammation marker, this leads to the notion of inflammation as a contributor to subsequent MACE outcomes. As CCL27 is used in maintaining immune homeostasis in barrier tissues, is more difficult to ascertain what a negative effect size could infer without further investigation. We observed genetic variants that appear exclusively associated with subsequent MACE and AIS after an incident AIS. This might imply novel biological insights into the disease progression of stroke. We observed that all 6 proteins that show a putatively causal effect on incident AIS do not appear to affect subsequent strokes. All individuals in this study were diagnosed, treated, and likely given blood pressure medications, statins, or both, which could mask an effect on subsequent stroke risk. Of the 5 targets identified by MR from the drug target list in Open Targets (ANGPT1, FGF5, FURIN, MMP12, TFPI), none are associated with subsequent MACE. This suggests that these proteins may be important therapeutic targets to reduce risk for incident AIS, but not for subsequent MACE. Existing targets in Open Targets is in part populated by correlations in genetic association, thus why they initially became candidate targets. We postulate that genetic variants and genes for incident stroke are not good targets for drug discovery of subsequent stroke events. We note that while Cochran’s Q-Statistic for heterogeneity did not show evidence for different effects between ancestries (**Table S9)**, the difference in sample sizes between ancestries remain large. Results from European ancestry were overall much stronger due to higher power. More data around individuals of non-European ancestries is necessary to investigate this further. We had several limitations to our study. First, MACE is defined as a combination of MI, AIS/TIA and ASCVD death. For incident MACE in MVP, we have observed that MI accounts for a larger proportion of the MACE phenotype compared to AIS/TIA(39). However, our subsequent cohort has a smaller proportion of MI events than expected in the MACE phenotype **(Table S10**.) This is likely due to the selection on incident AIS. Secondly, despite analyzing relatively large datasets for disease progression, our results have limited statistical power due to sample sizes. Thirdly, as cis-only pQTL data sets are normally instrumented by a single SNP, the Wald ratio is the only available means of estimation for MR, which restricts the type of sensitivity analyses we can perform. Colocalization reduces the risk of confounding by linkage disequilibrium, as it requires the presence of a common causal variant responsible for both traits but can’t exclude potential horizontal pleiotropy, and other sources of pleiotropy could not be tested in this study. Finally, a lack of sufficient sample sizes and access to data in ancestries other than European, African, or Hispanic mean that we cannot ascertain whether these results are generalisable across all ancestries, or if there are genetic differences by ancestry. All individuals that were diagnosed with stroke will have likely been put on common preventative medication for subsequent stroke, this treatment may be altering the progression GWAS results, however no data is available to compare against individuals diagnosed with stroke but have not been treated. ## Conclusion We observed two novel SNPs associated with subsequent stroke events that warrant further replication. We also performed MR to identify putative causal proteins for risk of subsequent MACE in stroke patients. We observed putatively causal evidence for two novel proteins (CCL27 and TNFRS14) associated with subsequent MACE risk in pQTL, suggesting that inflammation is a contributing factor to subsequent MACE outcomes after incident stroke AIS. ## Data Availability Data is available upon request ## Sources of Funding This research is based on data from the Million Veteran Program, Office of Research and Development, Veterans Health Administration, and was supported by Veterans Affairs Merit Awards BX004821 and CX001025. This publication does not represent the views of the Department of Veteran Affairs or the United States Government. Please see supplementary information for MVP Core Acknowledgements. This study was also supported by the NIHR Biomedical Research Centre at the University Hospitals Bristol and Weston NHS Foundation Trust and the University of Bristol. This publication is the work of the authors who will serve as guarantors for the contents of this paper. The views expressed in this publication are those of the author(s) and not necessarily those of the NHS, the National Institute for Health Research. AE, LP, TRG, GDS, GH and AEH receive support from the UK Medical Research Council Integrative Epidemiology Unit at the University of Bristol (MC\_UU\_00011/4, MC\_UU\_00011/1, MC\_UU_00032/01, MC_UU_00032/03). HA is supported by American Academy of Neurology Career Development Award and National Institutes of Health R01NS017950. ## Disclosures The following authors have nothing to declare: AE, NA, HJA, DCP, GH, KT, PWFW, JPC, JMG, GDS, LP, KC, GMP TRG receives funding from Biogen and GSK for unrelated work. AEH started working for Novo Nordisk after contributing to this manuscript. ## Acknowledgments We acknowledge the VA Million Veteran Program (MVP) participants. This study was supported by the National Institute for Health and Care Research Bristol Biomedical Research Centre. The views expressed are those of the author(s) and not necessarily those of the NIHR or the Department of Health and Social Care. The UK Biobank data was obtained under the UK Biobank resource application 81499. We would like to thank the participants of MVP and UKB studies. ## Footnotes * * Andrew Elmore and Nimish Adhikari are joint first authors. Lavinia Paternoster, Kelly Cho, and Gina M Peloso jointly supervised the research. ## Non-standard Abbreviations and Acronyms GWAS : Genome Wide Association Study MR : Mendelian randomization MACE : Major Adverse Cardiovascular Events AIS : Arterial Ischemic Stroke TIA : Transient Ischaemic Attack SNP : Single Nucleotide Polymorphism pQTL : Protein Quantitative Trait Loci UKB : United Kingdom Biobank MVP : Million Veteran Program * Received January 31, 2024. * Revision received January 31, 2024. * Accepted February 1, 2024. * © 2024, Posted by Cold Spring Harbor Laboratory This pre-print is available under a Creative Commons License (Attribution 4.0 International), CC BY 4.0, as described at [http://creativecommons.org/licenses/by/4.0/](http://creativecommons.org/licenses/by/4.0/) ## References 1. 1.Benjamin EJ, Virani SS, Callaway CW, Chamberlain AM, Chang AR, Cheng S, et al. Heart Disease and Stroke Statistics—2018 Update: A Report From the American Heart Association. Circulation [Internet]. 2018 Mar 20 [cited 2023 Jul 31];137(12). Available from: [https://www.ahajournals.org/doi/10.1161/CIR.0000000000000558](https://www.ahajournals.org/doi/10.1161/CIR.0000000000000558) 2. 2.O’Donnell MJ, Chin SL, Rangarajan S, Xavier D, Liu L, Zhang H, et al. Global and regional effects of potentially modifiable risk factors associated with acute stroke in 32 countries (INTERSTROKE): a case-control study. The Lancet. 2016 Aug;388(10046):761–75. 3. 3.1. Barsh GS Paternoster L, Tilling K, Davey Smith G. Genetic epidemiology and Mendelian randomization for informing disease therapeutics: Conceptual and methodological challenges. Barsh GS, editor. PLoS Genet. 2017 Oct 5;13(10):e1006944. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=doi:10.1371/journal.pgen.1006944&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=28981501&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F02%2F01%2F2024.01.31.24302111.atom) 4. 4.Malik R, Chauhan G, Traylor M, Sargurupremraj M, Okada Y, Mishra A, et al. Multiancestry genome-wide association study of 520,000 subjects identifies 32 loci associated with stroke and stroke subtypes. Nat Genet. 2018 Apr;50(4):524–37. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41588-018-0058-3&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=29531354&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F02%2F01%2F2024.01.31.24302111.atom) 5. 5.Surakka I, Wu KH, Hornsby W, Wolford BN, Shen F, Zhou W, et al. Multi-ancestry meta-analysis identifies 5 novel loci for ischemic stroke and reveals heterogeneity of effects between sexes and ancestries. Cell Genom. 2023 Aug 9;3(8):100345. 6. 6.Davey Smith G, Ebrahim S. ‘Mendelian randomization’: can genetic epidemiology contribute to understanding environmental determinants of disease?*. International Journal of Epidemiology. 2003 Feb;32(1):1–22. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/ije/dyg070&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=12689998&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F02%2F01%2F2024.01.31.24302111.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000182341300001&link_type=ISI) 7. 7.1. Williams SM Giambartolomei C, Vukcevic D, Schadt EE, Franke L, Hingorani AD, Wallace C, et al. Bayesian Test for Colocalisation between Pairs of Genetic Association Studies Using Summary Statistics. Williams SM, editor. PLoS Genet. 2014 May 15;10(5):e1004383. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pgen.1004383&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24830394&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F02%2F01%2F2024.01.31.24302111.atom) 8. 8.Sudlow C, Gallacher J, Allen N, Beral V, Burton P, Danesh J, et al. UK Biobank: An Open Access Resource for Identifying the Causes of a Wide Range of Complex Diseases of Middle and Old Age. PLOS Medicine. 2015 Mar 31;12(3):e1001779. 9. 9.Nguyen XMT, Whitbourne SB, Li Y, Quaden RM, Song RJ, Nguyen HNA, et al. Data Resource Profile: Self-reported data in the Million Veteran Program: survey development and insights from the first 850736 participants. International Journal of Epidemiology. 2023 Feb 1;52(1):e1–17. 10. 10.Gaziano JM, Concato J, Brophy M, Fiore L, Pyarajan S, Breeling J, et al. Million Veteran Program: A mega-biobank to study genetic influences on health and disease. Journal of Clinical Epidemiology. 2016 Feb 1;70:214–23. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.jclinepi.2015.09.016&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26441289&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F02%2F01%2F2024.01.31.24302111.atom) 11. 11.Mahmoud O, Dudbridge F, Davey Smith G, Munafo M, Tilling K. A robust method for collider bias correction in conditional genome-wide association studies. Nat Commun. 2022 Feb 2;13(1):619. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41467-022-28119-9&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=35110547&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F02%2F01%2F2024.01.31.24302111.atom) 12. 12.Okbay A, Beauchamp JP, Fontana MA, Lee JJ, Pers TH, Rietveld CA, et al. Genome-wide association study identifies 74 loci associated with educational attainment. Nature. 2016 May 26;533(7604):539–42. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/nature17671&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=27225129&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F02%2F01%2F2024.01.31.24302111.atom) 13. 13.Willer CJ, Li Y, Abecasis GR. METAL: fast and efficient meta-analysis of genomewide association scans. Bioinformatics. 2010 Sep 1;26(17):2190–1. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/bioinformatics/btq340&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20616382&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F02%2F01%2F2024.01.31.24302111.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000281738900017&link_type=ISI) 14. 14.Watanabe K, Taskesen E, Van Bochoven A, Posthuma D. Functional mapping and annotation of genetic associations with FUMA. Nat Commun. 2017 Nov 28;8(1):1826. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41467-017-01261-5&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F02%2F01%2F2024.01.31.24302111.atom) 15. 15.De Leeuw CA, Mooij JM, Heskes T, Posthuma D. MAGMA: Generalized Gene-Set Analysis of GWAS Data. Tang H, editor. PLoS Comput Biol. 2015 Apr 17;11(4):e1004219. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pcbi.1004219&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25885710&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F02%2F01%2F2024.01.31.24302111.atom) 16. 16.Sun BB, Chiou J, Traylor M, Benner C, Hsu YH, Richardson TG, et al. Genetic regulation of the human plasma proteome in 54,306 UK Biobank participants [Internet]. Genetics; 2022 Jun [cited 2023 Jul 27]. Available from: [http://biorxiv.org/lookup/doi/10.1101/2022.06.17.496443](http://biorxiv.org/lookup/doi/10.1101/2022.06.17.496443) 17. 17.Zhang J, Dutta D, Köttgen A, Tin A, Schlosser P, Grams ME, et al. Plasma proteome analyses in individuals of European and African ancestry identify cis-pQTLs and models for proteome-wide association studies. Nat Genet. 2022 May;54(5):593–602. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41588-022-01051-w&link_type=DOI) 18. 18.Ferkingstad E, Sulem P, Atlason BA, Sveinbjornsson G, Magnusson MI, Styrmisdottir EL, et al. Large-scale integration of the plasma proteome with genetics and disease. Nat Genet. 2021 Dec;53(12):1712–21. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41588-021-00978-w&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F02%2F01%2F2024.01.31.24302111.atom) 19. 19.Sun BB, Maranville JC, Peters JE, Stacey D, Staley JR, Blackshaw J, et al. Genomic atlas of the human plasma proteome. Nature. 2018 Jun;558(7708):73–9. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41586-018-0175-2&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=29875488&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F02%2F01%2F2024.01.31.24302111.atom) 20. 20.Hemani G, Zheng J, Elsworth B, Wade KH, Haberland V, Baird D, et al. The MR-Base platform supports systematic causal inference across the human phenome. eLife. 2018 May 30;7:e34408. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.7554/eLife.34408&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=29846171&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F02%2F01%2F2024.01.31.24302111.atom) 21. 21.Sanderson E, Glymour MM, Holmes MV, Kang H, Morrison J, Munafò MR, et al. Mendelian randomization. Nat Rev Methods Primers. 2022 Feb 10;2(1):1–21. 22. 22.Lawlor DA, Harbord RM, Sterne JAC, Timpson N, Davey Smith G. Mendelian randomization: Using genes as instruments for making causal inferences in epidemiology. Statist Med. 2008 Apr 15;27(8):1133–63. 23. 23.Koscielny G, An P, Carvalho-Silva D, Cham JA, Fumis L, Gasparyan R, et al. Open Targets: a platform for therapeutic target identification and validation. Nucleic Acids Res. 2017 Jan 4;45(D1):D985–94. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/nar/gkw1055&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=27899665&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F02%2F01%2F2024.01.31.24302111.atom) 24. 24.Zhang Q, Zhu L, Wang G, Zhao Y, Xiong N, Bao H, et al. Ionizing radiation promotes CCL27 secretion from keratinocytes through the cross talk between TNF-α and ROS: Zhang et al. J Biochem Mol Toxicol. 2017 Mar;31(3):e21868. 25. 25.Šedý JR, Bjordahl RL, Bekiaris V, Macauley MG, Ware BC, Norris PS, et al. CD160 Activation by Herpesvirus Entry Mediator Augments Inflammatory Cytokine Production and Cytolytic Function by NK Cells. The Journal of Immunology. 2013 Jul 15;191(2):828–36. 26. 26.Chen L, Peters JE, Prins B, Persyn E, Traylor M, Surendran P, et al. Systematic Mendelian randomization using the human plasma proteome to discover potential therapeutic targets for stroke. Nat Commun. 2022 Oct 17;13(1):6143. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41467-022-33675-1&link_type=DOI) 27. 27.Benjamins JW, Yeung MW, Vegte YJ van de, Said MA, Linden T van der, Ties D, et al. Genomic insights in ascending aortic size and distensibility. eBioMedicine [Internet]. 2022 Jan 1 [cited 2023 Oct 19];75. Available from: [https://www.thelancet.com/journals/ebiom/article/PIIS2352-3964(21)00577-6/fulltext](https://www.thelancet.com/journals/ebiom/article/PIIS2352-3964(21)00577-6/fulltext) 28. 28.Zhou Y, Zhang Y, Zhao D, Yu X, Shen X, Zhou Y, et al. Therapeutic Target Database: Describing target druggability information. Nucleic Acids Research. 2023 Sep 15;gkad751. 29. 29.Anrather J, Iadecola C. Inflammation and Stroke: An Overview. Neurotherapeutics. 2016 Oct;13(4):661–70. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s13311-016-0483-x&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=27730544&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F02%2F01%2F2024.01.31.24302111.atom) 30. 30.Tuttolomondo A, Di Raimondo D, Di Sciacca R, Pinto A, Licata G. Inflammatory Cytokines in Acute Ischemic Stroke. CPD. 2008 Nov 1;14(33):3574–89. 31. 31.Wang Q, Tang X, Yenari M. The inflammatory response in stroke. Journal of Neuroimmunology. 2007 Mar;184(1–2):53–68. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.jneuroim.2006.11.014&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=17188755&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F02%2F01%2F2024.01.31.24302111.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000245727400007&link_type=ISI) 32. 32.Lee WH, Kim SH, Lee Y, Lee BB, Kwon B, Song H, et al. Tumor Necrosis Factor Receptor Superfamily 14 Is Involved in Atherogenesis by Inducing Proinflammatory Cytokines and Matrix Metalloproteinases. Arteriosclerosis, Thrombosis, and Vascular Biology. 2001 Dec;21(12):2004–10. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NzoiYXR2YmFoYSI7czo1OiJyZXNpZCI7czoxMDoiMjEvMTIvMjAwNCI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDI0LzAyLzAxLzIwMjQuMDEuMzEuMjQzMDIxMTEuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 33. 33.Hsu CY, Tseng WK, Wu YW, Lin TH, Yeh HI, Chang KC, et al. Circulating TNFSF14 (Tumor Necrosis Factor Superfamily 14) Predicts Clinical Outcome in Patients With Stable Coronary Artery Disease. ATVB. 2019 Jun;39(6):1240–52. 34. 34.Bokhari FA, Shakoori TA, Butt A, Ghafoor F. TNF-ALPHA: A RISK FACTOR FOR ISCHEMIC STROKE. J Ayub Med Coll Abbottabad. 35. 35.Piotrowska M, Spodzieja M, Kuncewicz K, Rodziewicz-Motowidło S, Orlikowska M. CD160 protein as a new therapeutic target in a battle against autoimmune, infectious and lifestyle diseases. Analysis of the structure, interactions and functions. European Journal of Medicinal Chemistry. 2021 Nov 15;224:113694. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ejmech.2021.113694&link_type=DOI) 36. 36.Davila ML, Xu M, Huang C, Gaddes ER, Winter L, Cantorna MT, et al. CCL27 is a crucial regulator of immune homeostasis of the skin and mucosal tissues. iScience. 2022 Jun;25(6):104426. 37. 37.Jain S, Gabunia K, Kelemen SE, Panetti TS, Autieri MV. The Anti-Inflammatory Cytokine Interleukin 19 Is Expressed By and Angiogenic for Human Endothelial Cells. ATVB. 2011 Jan;31(1):167–75. 38. 38.Zhu H, Hu S, Li Y, Sun Y, Xiong X, Hu X, et al. Interleukins and Ischemic Stroke. Front Immunol. 2022 Jan 31;13:828447. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.3389/fimmu.2022.828447&link_type=DOI) 39. 39.Vassy JL, Posner DC, Ho YL, Gagnon DR, Galloway A, Tanukonda V, et al. Cardiovascular Disease Risk Assessment Using Traditional Risk Factors and Polygenic Risk Scores in the Million Veteran Program. JAMA Cardiology. 2023 Jun 1;8(6):564–74.