PT - JOURNAL ARTICLE AU - Gardiner, Laura-Jayne AU - Carrieri, Anna Paola AU - Bingham, Karen AU - Macluskie, Graeme AU - Bunton, David AU - McNeil, Marian AU - Pyzer-Knapp, Edward O. TI - Combining explainable machine learning, demographic and multi-omic data to identify precision medicine strategies for inflammatory bowel disease AID - 10.1101/2021.03.03.21252821 DP - 2021 Jan 01 TA - medRxiv PG - 2021.03.03.21252821 4099 - http://medrxiv.org/content/early/2021/03/05/2021.03.03.21252821.short 4100 - http://medrxiv.org/content/early/2021/03/05/2021.03.03.21252821.full AB - Inflammatory bowel diseases (IBDs), including ulcerative colitis and Crohn’s disease, affect several million individuals worldwide. These diseases are heterogeneous at the clinical, immunological and genetic levels and result from a complex interaction between the host and environmental factors. Investigating drug efficacy in cultured human fresh IBD tissues can improve our understanding of the reasons why certain medications are more or less effective for different patients.We propose an explainable machine learning (ML) approach that combines bioinformatics and domain insight, to informatively integrate multi-modal data to predict inter-patient specific variation in drug response. Using explanation of our models, we interpret the models’ predictions inferring unique combinations of important features associated with human tissue pharmacological responses. The inferred multi-modal features originate from multi-omic data (genomic and transcriptomic), demographic, medicinal and pharmacological data and all are associated with drug efficacy generated by a preclinical human fresh IBD tissue assay.To pharmacologically assess patient variation in response to IBD treatment, we used the reduction in the release of the inflammatory cytokine TNFα from the fresh IBD tissues in the presence or absence of test drugs, as a measure of drug efficacy. The TNF pathway is a common target in current therapies for IBD; we initially explored the effects of a mitogen-activated protein kinase (MAPK) inhibitor on the production of TNFα; however, we later show the approach can be applied to other targets, test drugs or mechanisms of interest. Our best model was able to predict TNFα levels from a combination of integrated demographic, medicinal and genomic features with an error as low as 4.98% on unseen patients. We incorporated transcriptomic data to validate and expand insights from genomic features. Our results showed variations in drug effectiveness between patients that differed in gender, age or condition and linked new genetic polymorphisms in our cohort of IBD patients to variation in response to the anti-inflammatory treatment BIRB796 (Doramapimod).Our approach models drug response in a relevant human tissue model of IBD while also identifying its most predictive features as part of a transparent ML-based precision medicine strategy.Competing Interest StatementThe authors APC, LJG, EPK, MM declare that they have no competing interests. REPROCELL Europe Ltd is a commercial provider of laboratory-based tests for preclinical research. GM, KB and DCB are all paid employees of REPROCELL Europe. These commercial affiliations do not alter adherence of the authors to journal policies on sharing data and materials.Funding StatementThis work was supported by the STFC Hartree Centre's Innovation Return on Research programme, funded by the Department for Business, Energy & Industrial Strategy. The functional pharmacology experiments and whole exome sequencing for this study were part-funded via a project grant from Precision Medicine Scotland Innovation Centre (PMS-IC), provided to REPROCELL Europe Ltd. PMS-IC is funded by the Scottish Funding Council and Scottish Enterprise. The funding organisations did not play an additional role in the study design, data collection and analysis, decision to publish or preparation of the manuscript and only provided financial support in the form of authors' salaries and research materials.Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:The West of Scotland Research Ethics Committee (12/ws/0069) granted approval for this study, and patients provided written consent complying with the declaration of Helsinki.All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesThe experimental datasets that were used in this study are available from the ENA Sequence Read archive study PRJEB43220 at https://www.ebi.ac.uk/ena/browser/view/PRJEB43220. Data will be available upon publication.