RT Journal Article SR Electronic T1 Predicting and elucidating the etiology of fatty liver disease using a machine learning-based approach: an IMI DIRECT study JF medRxiv FD Cold Spring Harbor Laboratory Press SP 2020.02.10.20021147 DO 10.1101/2020.02.10.20021147 A1 Atabaki-Pasdar, Naeimeh A1 Ohlsson, Mattias A1 Viñuela, Ana A1 Frau, Francesca A1 Pomares-Millan, Hugo A1 Haid, Mark A1 Jones, Angus G A1 Thomas, E Louise A1 Koivula, Robert W A1 Kurbasic, Azra A1 Mutie, Pascal M A1 Fitipaldi, Hugo A1 Fernandez, Juan A1 Dawed, Adem Y A1 Giordano, Giuseppe N A1 Forgie, Ian M A1 McDonald, Timothy J A1 Rutters, Femke A1 Cederberg, Henna A1 Chabanova, Elizaveta A1 Dale, Matilda A1 De Masi, Federico A1 Thomas, Cecilia Engel A1 Allin, Kristine H A1 Hansen, Tue H A1 Heggie, Alison A1 Hong, Mun-Gwan A1 Elders, Petra JM A1 Kennedy, Gwen A1 Kokkola, Tarja A1 Pedersen, Helle Krogh A1 Mahajan, Anubha A1 McEvoy, Donna A1 Pattou, Francois A1 Raverdy, Violeta A1 Häussler, Ragna S A1 Sharma, Sapna A1 Thomsen, Henrik S A1 Vangipurapu, Jagadish A1 Vestergaard, Henrik A1 ‘t Hart, Leen M A1 Adamski, Jerzy A1 Musholt, Petra B A1 Brage, Soren A1 Brunak, Søren A1 Dermitzakis, Emmanouil A1 Frost, Gary A1 Hansen, Torben A1 Laakso, Markku A1 Pedersen, Oluf A1 Ridderstråle, Martin A1 Ruetten, Hartmut A1 Hattersley, Andrew T A1 Walker, Mark A1 Beulens, Joline WJ A1 Mari, Andrea A1 Schwenk, Jochen M A1 Gupta, Ramneek A1 McCarthy, Mark I A1 Pearson, Ewan R A1 Bell, Jimmy D A1 Pavo, Imre A1 Franks, Paul W YR 2020 UL http://medrxiv.org/content/early/2020/02/11/2020.02.10.20021147.abstract AB Background Non-alcoholic fatty liver disease (NAFLD) is highly prevalent and causes serious health complications in type 2 diabetes (T2D) and beyond. Early diagnosis of NAFLD is important, as this can help prevent irreversible damage to the liver and ultimately hepatocellular carcinomas.Methods and Findings Utilizing the baseline data from the IMI DIRECT participants (n=1514) we sought to expand etiological understanding and develop a diagnostic tool for NAFLD using machine learning. Multi-omic (genetic, transcriptomic, proteomic, and metabolomic) and clinical (liver enzymes and other serological biomarkers, anthropometry, and measures of beta-cell function, insulin sensitivity, and lifestyle) data comprised the key input variables. The models were trained on MRI image-derived liver fat content (<5% or ≥5%). We applied LASSO (least absolute shrinkage and selection operator) to select features from the different layers of omics data and Random Forest analysis to develop the models. The prediction models included clinical and omics variables separately or in combination. A model including all omics and clinical variables yielded a cross-validated receiver operator characteristic area under the curve (ROCAUC) of 0.84 (95% confidence interval (CI)=0.82, 0.86), which compared with a ROCAUC of 0.82 (95% CI=0.81, 0.83) for a model including nine clinically-accessible variables. The IMI DIRECT prediction models out-performed existing non-invasive NAFLD prediction tools.Conclusions We have developed clinically useful liver fat prediction models (see: www.predictliverfat.org) and identified biological features that appear to affect liver fat accumulation.Competing Interest StatementThe authors of this manuscript have the following competing interests: PWF is a consultant for Novo Nordisk, Lilly and Zoe Global Ltd and has received research grants from numerous diabetes drug companies. HR is an employee and shareholder of Sanofi. MIM: The views expressed in this article are those of the author(s) and not necessarily those of the NHS, the NIHR, or the Department of Health. MIM has served on advisory panels for Pfizer, NovoNordisk and Zoe Global, has received honoraria from Merck, Pfizer, Novo Nordisk and Eli Lilly, and research funding from Abbvie, Astra Zeneca, Boehringer Ingelheim, Eli Lilly, Janssen, Merck, NovoNordisk, Pfizer, Roche, Sanofi Aventis, Servier, and Takeda. As of June 2019, MIM is an employee of Genentech, and a holder of Roche stock. AM is a consultant for Lilly and has received research grants from several diabetes drug companies.Funding StatementThe work leading to this publication has received support from the Innovative Medicines Initiative Joint Undertaking under grant agreement n°115317 (DIRECT), resources of which are composed of financial contribution from the European Union's Seventh Framework Programme (FP7/2007-2013) and EFPIA companies’ in kind contribution. This work was supported by a European Research Council award ERC- 2015-CoG - 681742_NASCENT. RK was funded by the Novo Nordisk Foundation (NNF18OC0031650) as part of a postdoctoral fellowship. AGJ is supported by an NIHR Clinician Scientist award (17/0005624). TJM is funded by an NIHR clinical senior lecturer fellowship. S.Bru acknowledges support from the Novo Nordisk Foundation (grants NNF17OC0027594 and NNF14CC0001). ATH is a Wellcome Trust Senior Investigator and is also supported by the NIHR Exeter Clinical Research Facility. JMS acknowledges support from Science for Life Laboratory (Plasma Profiling Facility) and Knut and Alice Wallenberg Foundation (Human Protein Atlas), Erling-Persson Foundation (KTH Centre for Precision Medicine). MIM is supported by the following grants; Wellcome (090532, 098381, 106130, 203141, 212259); NIH (U01-DK105535). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.Author DeclarationsAll relevant ethical guidelines have been followed; any necessary IRB and/or ethics committee approvals have been obtained and details of the IRB/oversight body are included in the manuscript.YesAll necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesData cannot be shared publicly because of GDPR restrictions on data privacy.