RT Journal Article SR Electronic T1 Characterization of a COPD-Associated NPNT Functional Splicing Genetic Variant in Human Lung Tissue via Long-Read Sequencing JF medRxiv FD Cold Spring Harbor Laboratory Press SP 2020.10.20.20203927 DO 10.1101/2020.10.20.20203927 A1 Saferali, Aabida A1 Xu, Zhonghui A1 Sheynkman, Gloria M. A1 Hersh, Craig P. A1 Cho, Michael H. A1 Silverman, Edwin K. A1 Laederach, Alain A1 Vollmers, Christopher A1 Castaldi, Peter J. YR 2020 UL http://medrxiv.org/content/early/2020/11/03/2020.10.20.20203927.abstract AB Chronic obstructive pulmonary disease (COPD) is a leading cause of death worldwide. Genome-wide association studies (GWAS) have identified over 80 loci that are associated with COPD and emphysema, however for most of these loci the causal variant and gene are unknown. Here, we utilize lung splice quantitative trait loci (sQTL) data from the Genotype-Tissue Expression project (GTEx) and short read sequencing data from the Lung Tissue Research Consortium (LTRC) to characterize a locus in nephronectin (NPNT) associated with COPD case-control status and lung function. We found that the rs34712979 variant is associated with alternative splice junction use in NPNT, specifically for the junction connecting the 2nd and 4th exons (chr4:105898001-105927336) (p=4.02×10−38). This association colocalized with GWAS data for COPD and lung spirometry measures with a posterior probability of 94%, indicating that the same causal genetic variants in NPNT underlie the associations with COPD risk, spirometric measures of lung function, and splicing. Investigation of NPNT short read sequencing revealed that rs34712979 creates a cryptic splice acceptor site which results in the inclusion of a 3 nucleotide exon extension, coding for a serine residue near the N-terminus of the protein. Using Oxford Nanopore Technologies (ONT) long read sequencing we identified 13 NPNT isoforms, 6 of which are predicted to be protein coding. Two of these are full length isoforms which differ only in the 3 nucleotide exon extension whose occurrence differs by genotype. Overall, our data indicate that rs34712979 modulates COPD risk and lung function by creating a novel splice acceptor which results in the inclusion of a 3 nucelotide sequence coding for a serine in the nephronectin protein sequence. Our findings implicate NPNT splicing in contributing to COPD risk, and identify a novel serine insertion in the nephronectin protein that warrants further study.Competing Interest StatementP. Castaldi has received personal fees and grant support from GlaxoSmithKline and Novartis. C.Hersh has received grants from NHLBI, Bayer, Boehringer-Ingelheim, Novartis and Vertex. M. Cho has received grant support from GSK and Bayer, and speaking or consulting fees from AstraZeneca and Illumina. E. Silverman has received grant support from GSK and Bayer. A. Laederach reports grants from the NIH NHLBI and consultant fees from Ribometrix. C. Vollmers has filed patent applications on aspects of the R2C2 method.Funding StatementThis work was funded by R01 HL124233, R01 HL147326, R01 HL111527, U01 HL089897, U01 HL089856, R01HL125583, R01HL130512, T32HL007427. Research reported in this publication was supported by the NHLBI and FDA Center for Tobacco Products (CTP). The content is solely the responsibility of the authors and does not necessarily represent the official views of the NIH or the Food and Drug Administration. The Genotype-Tissue Expression (GTEx) Project was supported by the Common Fund of the Office of the Director of the National Institutes of Health, and by NCI, NHGRI, NHLBI, NIDA, NIMH, and NINDS. The data used for the analyses described in this manuscript were obtained from: the GTEx Portal between January and August of 2020 and via the GTEx Terra Workspace during the same time interval. Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:The LTRC study was approved by the Institutional Review Boards at Partners Healthcare (Protocol #2018P000186). All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesAll data will be publicly available through dbGaP (accession number phs001662.v1.p1)