Abstract
Conventional HLA imputation methods drop their performance for infrequent alleles, which reduces reliability of trans-ethnic MHC fine-mapping due to inter-ethnic heterogeneity in allele frequency spectra. We developed DEEP*HLA, a deep learning method for imputing HLA genotypes. Through validation using the Japanese and European HLA reference panels (n = 1,118 and 5,112), DEEP*HLA achieved the highest accuracies in both datasets (0.987 and 0.976) especially for low-frequency and rare alleles. DEEP*HLA was less dependent of distance-dependent linkage disequilibrium decay of the target alleles and might capture the complicated region-wide information. We applied DEEP*HLA to type 1 diabetes GWAS data of BioBank Japan (n = 62,387) and UK Biobank (n = 356,855), and successfully disentangled independently associated class I and II HLA variants with shared risk between diverse populations (the top signal at HLA-DRβ1 amino acid position 71; P = 6.2 ×10−119). Our study illustrates a value of deep learning in genotype imputation and trans-ethnic MHC fine-mapping.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
This study was supported by the Japan Society for the Promotion of Science (JSPS) KAKENHI.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
This study was approved by the ethical committee of Osaka University Graduate School of Medicine.
All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Data Availability
The Japanese HLA data have been deposited at the National Bioscience Database Center (NBDC) Human Database (research ID: hum0114). Independent HLA genotype data of Japanese population is available in the Japanese Genotype-phenotype archive (JGA; accession ID: JGAS00000000018). T1DGC HLA reference panel can be download at a NIDDK central repository with a request (https://repository.niddk.nih.gov/studies/t1dgc-special/). GWAS data of the BBJ are available at the NBDC Human Database (research ID: hum0014). UKBB GWAS data is available upon request (https://www.ukbiobank.ac.uk/).