RT Journal Article SR Electronic T1 Machine Learning for Interpretation of DNA Variants of Maturity-Onset Diabetes of the Young Genes Based on ACMG Criteria JF medRxiv FD Cold Spring Harbor Laboratory Press SP 2020.05.20.20108035 DO 10.1101/2020.05.20.20108035 A1 Liu, Yichuan A1 Qu, Huiqi A1 Wenocur, Adam S. A1 Qu, Jingchun A1 Chang, Xiao A1 Glessner, Joseph A1 Sleiman, Patrick A1 Tian, Lifeng A1 Hakonarson, Hakon YR 2020 UL http://medrxiv.org/content/early/2020/05/23/2020.05.20.20108035.abstract AB Background Maturity-onset diabetes of the young (MODY) is a group of dominantly inherited monogenic diabetes, with HNF4A-MODY, GCK-MODY and HNF1A-MODY being the three most common genes responsible. Molecular diagnosis of MODY is important for precise treatment. While a DNA variant causing MODY can be assessed by the criteria of the American College of Medical Genetics and Genomics (ACMG) guidelines, gene-specific assessment of disease-causing mutations is important to differentiate between the MODY subtypes. As the ACMG criteria were not originally designed for machine learning algorithms, they are not true independent variables.Methods In this study, we applied machine learning models for interpretation of DNA variants in MODY genes defined by the ACMG criteria based on Human Gene Mutation Database (HGMD) and ClinVar.Results The results show highly predictive abilities with accuracy over 95%, suggest that this model could serve as a fast, gene-specific method for physicians or genetic counselors assisting with diagnosis and reporting, especially when confronted by contradictory ACMG criteria. Also, the weight of the ACMG criteria shows gene specificity which advocates for the application of machine learning methods with the ACMG criteria to capture the most relevant information for each disease-related variant.Conclusion Our results highlight the need for different weights of the ACMG criteria in relation with different MODY genes for accurate functional classification. For proof of principle, we applied the ACMG criteria as feature vectors in a machine learning model obtaining precision-based result.Competing Interest StatementThe authors have declared no competing interest.Funding StatementNo external funding was received.Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesAll necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesAll data referred to in the manuscript are publicly available from these databases: HGMD 2019 version: http://www.hgmd.cf.ac.uk/ac/index.php ClinVar: https://www.ncbi.nlm.nih.gov/clinvar/ Common SNP 151: https://www.ncbi.nlm.nih.gov/snp/