PT - JOURNAL ARTICLE
AU - Wang, Shiyu
AU - Fang, Xixian
AU - Wen, Xiang
AU - Yang, Congying
AU - Yang, Ying
AU - Zhang, Tianxiao
TI - Prioritization of risk genes for Alzheimer’s disease: an analysis framework using spatial and temporal gene expression data in the human brain based on support vector machine
AID - 10.1101/2023.02.06.23285522
DP - 2023 Jan 01
TA - medRxiv
PG - 2023.02.06.23285522
4099 - http://medrxiv.org/content/early/2023/02/08/2023.02.06.23285522.short
4100 - http://medrxiv.org/content/early/2023/02/08/2023.02.06.23285522.full
AB - Background: Alzheimer’s disease (AD) is a complex disorder, and its risk is influenced by multiple genetic and environmental factors. In this study, an AD risk gene prediction framework based on spatial and temporal features of gene expression data (STGE) was proposed. Methods: We proposed an AD risk gene prediction framework based on spatial and temporal features of gene expression data. The gene expression data of providers of different tissues and ages were used as model features. Human genes were classified as AD risk or non-risk sets based on information extracted from relevant databases. Support vector machine (SVM) models were constructed to capture the expression patterns of genes believed to contribute to the risk of AD. Results: The recursive feature elimination (RFE) method was utilized for feature selection. Data for 64 tissue-age features were obtained before feature selection, and this number was reduced to 19 after RFE was performed. The SVM models were built and evaluated using 19 selected and full features. The area under curve (AUC) values for the SVM model based on 19 selected features (0.740 [0.690–0.790]) and full feature sets (0.730 [0.678–0.769]) were very similar.
Fifteen genes predicted to be risk genes for AD with a probability greater than 90% were obtained. Conclusion: The newly proposed framework performed comparably to previous prediction methods based on protein-protein interaction (PPI) network properties. A list of 15 candidate genes for AD risk was also generated to provide data support for further studies on the genetic etiology of AD.
Competing Interest Statement: The authors have declared no competing interest.
Funding Statement: This study was supported by the National Natural Science Foundation of China (NSFC) Young Scientists Fund (31900407). The funding body did not participate in the design, conduct, or writing of the study.
Author Declarations: I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained. Yes.
The details of the IRB/oversight body that provided approval or exemption for the research described are given below: https://gtexportal.org/home/; http://www.alzdata.org/; https://www.ebi.ac.uk/gwas/
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals. Yes.
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov.
I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance). Yes.
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable. Yes.
The data that support the findings of this study are openly available from the databases described in the Methods section. The code for this study can be found at https://doi.org/10.5281/zenodo.7553711.
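
The abstract reports model quality as AUC values and selects candidate genes whose predicted risk probability exceeds 90%. As a reading aid, the following is a minimal sketch of those two steps using only the Python standard library. It is not the authors' code (which is at the Zenodo DOI in the record). The gene names, labels, and scores are hypothetical placeholders, and `auc` and `high_confidence` are illustrative helper names.

```python
def auc(labels, scores):
    """Area under the ROC curve, computed as the fraction of
    (positive, negative) pairs in which the positive example
    receives the higher score; ties count as half a win."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


def high_confidence(genes, scores, threshold=0.9):
    """Return genes whose predicted probability exceeds the threshold."""
    return [g for g, s in zip(genes, scores) if s > threshold]


# Hypothetical data: 1 = known risk gene, 0 = non-risk gene.
genes = ["GENE_A", "GENE_B", "GENE_C", "GENE_D", "GENE_E", "GENE_F"]
labels = [1, 1, 0, 1, 0, 0]
scores = [0.95, 0.80, 0.85, 0.60, 0.30, 0.92]

if __name__ == "__main__":
    print("AUC:", round(auc(labels, scores), 3))
    print("Candidates above 90%:", high_confidence(genes, scores))
```

In this toy data, a gene labeled non-risk can still score above the threshold. That is the situation in which a classifier of this kind would nominate a new candidate gene for follow-up.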