ABSTRACT
Acute Myeloid Leukemia (AML) is a heterogeneous disease that may occur at any age. Although it has been shown that the incidence of AML increases with age and that different genetic alterations are present in younger versus older patients, the current AML classifications do not include age as a factor in classifying patients. In this work, we analyzed 404 RNA-seq samples with AML initial diagnosis to highlight the differences between younger and older patients in favorable and adverse risk using a k-mer-based approach for transcriptomic machine learning models. We predicted the risk of patients with more than 90% accuracy. We also provided lists of genes of interest for AML that were able to distinguish favorable from adverse ELN risks. From these lists, we selected prognostic biomarkers that have an impact on survival. Furthermore, we analyzed the biological context involved in the transcriptome complexity of younger and older patients. We observed differences in tumor profiles as well as in the presence of immune and stromal cell populations with specific features in older patients.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
This work has been supported by La Ligue Contre le Cancer and the Agence Nationale de la Recherche (TranSipedia and FullRNA projects).
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
The data analyzed in this study are publicly available in the dbGAP database(https://www.ncbi.nlm.nih.gov/gap/) with the accessions ID phs001657.v1.p1 and phs001657.v2.p1, and the GEO database (https://www.ncbi.nlm.nih.gov/geo/) accessions ID GSE49642, GSE52656, GSE62190, and GSE62852.
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
Data Availability
The data generated during the study are available from the corresponding author upon reasonable request.
Data availability
The data analyzed in this study are publicly available in the dbGAP database (https://www.ncbi.nlm.nih.gov/gap/) with the accessions ID phs001657.v1.p1 and phs001657.v2.p1, and the GEO database (https://www.ncbi.nlm.nih.gov/geo/) accessions ID GSE49642, GSE52656, GSE62190, and GSE62852. The data generated during the study are available from the corresponding author upon reasonable request.