PT - JOURNAL ARTICLE AU - Machado Reyes, Diego AU - Burch, Myson AU - Parida, Laxmi AU - Bose, Aritra TI - A Multimodal Foundation Model for Discovering Genetic Associations with Brain Imaging Phenotypes AID - 10.1101/2024.11.02.24316653 DP - 2024 Jan 01 TA - medRxiv PG - 2024.11.02.24316653 4099 - http://medrxiv.org/content/early/2024/11/04/2024.11.02.24316653.short 4100 - http://medrxiv.org/content/early/2024/11/04/2024.11.02.24316653.full AB - Due to the intricate etiology of neurological disorders, finding interpretable associations between multi-omics features can be challenging using standard approaches. We propose COMICAL, a contrastive learning approach leveraging multi-omics data to generate associations between genetic markers and brain imaging-derived phenotypes. COMICAL jointly learns omic representations utilizing transformer-based encoders with custom tokenizers. Our modality-agnostic approach uniquely identi-fies many-to-many associations via self-supervised learning schemes and cross-modal attention encoders. COMICAL discovered several significant associations between genetic markers and imaging-derived phenotypes for a variety of neurological disorders in the UK Biobank as well as predicting across diseases and unseen clinical outcomes from the learned representations. Source code of COMICAL along with pre-trained weights, enabling transfer learning is available at https://github.com/IBM/comical.Competing Interest StatementThe authors have declared no competing interest.Funding StatementThis study was funded by IBM.Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:Data analysis was performed under UK Biobank application 50658 using existing publicly available and deidentified data and was IRB exempt.I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.YesData is available from UK Biobank.