PT - JOURNAL ARTICLE AU - Degenhardt, Frauke AU - Mayr, Gabriele AU - Wendorff, Mareike AU - Boucher, Gabrielle AU - Ellinghaus, Eva AU - Ellinghaus, David AU - ElAbd, Hesham AU - Rosati, Elisa AU - Hübenthal, Matthias AU - Juzenas, Simonas AU - Abedian, Shifteh AU - Vahedi, Homayon AU - BK, Thelma AU - Yang, Suk-Kyun AU - Ye, Byong Duk AU - Cheon, Jae Hee AU - Datta, Lisa Wu AU - Daryani, Naser Ebrahim AU - Ellul, Pierre AU - Esaki, Motohiro AU - Fuyuno, Yuta AU - McGovern, Dermot PB AU - Haritunians, Talin AU - Hong, Myhunghee AU - Juyal, Garima AU - Jung, Eun Suk AU - Kubo, Michiaki AU - Kugathasan, Subra AU - Lenz, Tobias L. AU - Leslie, Stephen AU - Malekzadeh, Reza AU - Midha, Vandana AU - Motyer, Allan AU - Ng, Siew C AU - Okou, David T AU - Raychaudhuri, Soumya AU - Schembri, John AU - Schreiber, Stefan AU - Song, Kyuyoung AU - Sood, Ajit AU - Takahashi, Atsushi AU - Torres, Esther A AU - Umeno, Junji AU - Alizadeh, Behrooz Z. AU - Weersma, Rinse K AU - Wong, Sunny H AU - Yamazaki, Keiko AU - Karlsen, Tom H AU - Rioux, John D AU - Brant, Steven R AU - for the MAAIS Recruitment Center AU - Franke, Andre AU - for the International IBD Genetics Consortium TI - Trans-ethnic analysis of the human leukocyte antigen region for ulcerative colitis reveals shared but also ethnicity-specific disease associations AID - 10.1101/2020.07.29.20162552 DP - 2020 Jan 01 TA - medRxiv PG - 2020.07.29.20162552 4099 - http://medrxiv.org/content/early/2020/07/30/2020.07.29.20162552.short 4100 - http://medrxiv.org/content/early/2020/07/30/2020.07.29.20162552.full AB - Inflammatory bowel disease (IBD) is a chronic inflammatory disease of the gut. Genetic association studies have identified the highly variable human leukocyte antigen (HLA) region as the strongest susceptibility locus for IBD, and specifically DRB1*01:03 as a determining factor for ulcerative colitis (UC). However, for most of the association signal such a delineation could not be made due to tight structures of linkage disequilibrium within the HLA. The aim of this study was therefore to further characterize the HLA signal using a trans-ethnic approach. We performed a comprehensive fine mapping of single HLA alleles in UC in a cohort of 9,272 individuals with African American, East Asian, Puerto Rican, Indian and Iranian descent and 40,691 previously analyzed Caucasians, additionally analyzing whole HLA haplotypes. We computationally characterized the binding of associated HLA alleles to human self-peptides and analysed the physico-chemical properties of the HLA proteins and predicted self-peptidomes. Highlighting alleles of the HLA-DRB1*15 group and their correlated HLA-DQ-DR haplotypes, we identified consistent associations across different ethnicities but also identified population-specific signals. We observed that DRB1*01:03 is mostly present in individuals of Western European descent and hardly present in non-Caucasian individuals. We found peptides predicted to bind to risk HLA alleles to be rich in positively charged amino acids such. We conclude that the HLA plays an important role for UC susceptibility across different ethnicities. This research further implicates specific features of peptides that are predicted to bind risk and protective HLA proteins.Competing Interest StatementThe authors have declared no competing interest.Funding StatementThis project received infrastructure support from the DFG Excellence Cluster No. 306 "Inflammation at Interfaces". M.W. and H.E. are supported by the German Research Foundation (DFG) through the Research Training Group 1743, "Genes, Environment and Inflammation". E.E. received funding from the European Union Seventh Framework Program (FP7-PEOPLE-2013-COFUND; grant agreement No. 609020 (Scientia Fellows)). S.A. is supported by joint funding from the University Medical Center Groningen, Groningen, The Netherlands, and Institute for Digestive System Disease, Tehran University of Medical Sciences, Tehran, Iran. Funding for the Multicenter African American IBD Study (MAAIS) samples, for the GENESIS samples, and for the African Americans recruited by Cedars Sinai was provided by the U.S.A. National Institutes of Health (NIH) grants DK062431 (S.R.B.), DK 087694 (S.K.), and DK062413 (D.P.B.M), respectively. This work was supported by a grant from the BioBank Japan Project and, in part, by a Grant-in-Aid for Scientific Research (B) (26293180) funded by the Ministry of Education, Culture, Sports, Science, and Technology, Japan. This research was supported by a Mid-career Researcher Program grant through the National Research Foundation of Korea to K.S. (2017R1A2A1A05001119), funded by the Ministry of Science, Information & Communication Technology and Future Planning, and a grant of the Korea Health Technology R&D Project through the Korea Health Industry Development Institute (KHIDI), funded by the Ministry of Health & Welfare (grant number: HI18C0094), Republic of Korea. Funding for the Indian samples was provided by the Centre of Excellence in Genome Sciences and Predictive Medicine (Grant # BT/01/COE/07/UDSC/2008) from the Department of Biotechnology, Government of India). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:The recruitment of study subjects was approved by the ethics committees or institutional review boards of all individual participating centers or countries. Written informed consent was obtained from all study participants.All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesThe ImmunoChip data used in this study are proprietary to the IIBGDC genetics consortium and may be requested from the consortium. Any data produced within this study, may be requested from the corresponding authors upon reasonable request including association statistics of imputed and genotyped SNVs.(A)Alpha chain of an HLA protein(B)Beta chain of an HLA proteinAAAfrican American population of this studyAFRAfrican American population of the 1000 Genomes/HapMap population see also https://www.internationalgenome.org/category/population/)AMRAdmixed American population of the 1000 Genomes/HapMap population (see also https://www.internationalgenome.org/category/population/)AFAllele FrequencyCEUUtah Residents (CEPH) with Northern and Western European Ancestry of the 1000 Genomes/HapMap population (see also https://www.internationalgenome.org/category/population/)CIConfidence IntervalEASEast Asian population of the 1000 Genomes/HapMap population (see also https://www.internationalgenome.org/category/population/)EURCaucasian population of this population or (mentioned within the context of the 1000Genomes/HapMap population European data of the latter; see also https://www.internationalgenome.org/category/population/)F1, F3Atchley Factors 1 and 3, that contain information on 54 amino acid propertiesHLAHuman Leukocyte AntigenHLA-AHuman Leukocyte Antigen gene locus AHLA-BHuman Leukocyte Antigen gene locus BHLA-CHuman Leukocyte Antigen gene locus CHLA-DRAHuman Leukocyte Antigen gene locus DRAHLA-DRB1Human Leukocyte Antigen gene locus DRB1HLA-DRB3Human Leukocyte Antigen gene locus DRB3HLA-DRB4Human Leukocyte Antigen gene locus DRB4HLA-DRB5Human Leukocyte Antigen gene locus DRB5HLA-DQA1Human Leukocyte Antigen gene locus DQA1HLA-DQB1Human Leukocyte Antigen gene locus DQB1HLA-DPA1Human Leukocyte Antigen gene locus DPA1HLA-DPB1Human Leukozyten Antigen gene locus DPB1INDIndian populationIRNIranian populationJPNJapanese populationKORKorean populationMAFMinor Allele FrequencyMLEMaximum Likelihood EstimatorMLTMaltese populationPRIPuerto Rican populationP1-P9Pockets 1 to 9 of the HLA protein within the HLA peptide binding siteQCQuality ControlSASSouth Asian population of the 1000 Genomes/HapMap population (see also https://www.internationalgenome.org/category/population/)SNPSingle Nucleotide Polymorphism (MAF >= 1%)SNVSingle Nucleotide Variation (MAF < 1%)xHLAextended HLA regionYRIYoruba in Ibadan, Nigeria population of the 1000 Genomes/HapMap population (see also https://www.internationalgenome.org/category/population/)