ABSTRACT
Acute myeloid leukemia (AML) is the most common type of acute leukemia in adults. Its risk factors include rare and highly penetrant somatic mutations. Genome-wide association studies (GWAS) have also identified four common inherited variants associated with AML risk, but these findings have not yet been confirmed in many independent datasets. Here, we performed a replication study with 567 AML cases from the Leucegene cohort and 1,865 controls from the population-based cohort CARTaGENE (CaG). Because genotypes were generated using different technologies in the two datasets (e.g. low- vs. high-coverage whole-genome sequencing), we applied stringent quality-control filters to minimize type I errors. We showed using data reduction methods (e.g. principal component analysis [PCA] and uniform manifold approximation and projection [UMAP]) that our approach successfully integrated the Leucegene and CaG genetic data. We replicated the association between cytogenetically normal (CN)-AML and rs3916765, a variant located near HLA-DQA2 (odds ratio [95% confidence interval] = 1.88 [1.21-2.93], P- value=0.005). The effect size of this association was stronger when we restricted the analyses to AML patients with NPM1 mutations (odds ratios >2.35). We found HLA- DOB to be the most significantly upregulated gene in Leucegene participants with the CN-AML protective A-allele at rs3916765. We further found that several HLA class II genes are also differentially expressed albeit at lower statistical significance. Our results confirm that a common genetic variant at the HLA locus associates with AML risk, providing new opportunities to improve disease prognosis and treatment.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
Leucegene sequencing was supported by the Government of Canada through Genome Canada and the Ministere de Economie, de Innovation et des Exportations du Quebec through Genome Quebec. Leukemic samples, cytogenetic and clinical data of this cohort were provided by the Quebec leukemia cell bank supported by grants from the Cancer Research Network of the Fonds de recherche du Quebec Sante. V.P.L. is supported by the Fondation Charles Bruneau and by the Cole Foundation and holds a Fonds de Recherche en Sante du Quebec clinician scientist award. The sequencing of the CaG dataset was covered by grants from Genome Canada, Genome Quebec, and CHU Ste-Justine (Projet GenoRefQ). G.L. is supported by a Canada Research Chair (tier 1) and holds funds from the Canadian Institutes of Health Research (Projects #426541 and #486808) and the Montreal Heart Institute Foundation. This research was enabled in part by support provided by Calcul Quebec (https://www.calculquebec.ca/en/) and Compute Canada (www.computecanada.ca).
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
The Montreal Heart Institute Ethics Committee gave ethical approval for this work (Project #2017-2247).
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
DATA AVAILABILITY
Accessing the CaG WGS data requires the submission of a request application; the procedure is explained at: https://cartagene.qc.ca/en/researchers/access-request.html. The code used to analyze the data and generate the figures is available upon request.