Abstract
Deep learning has emerged as a powerful approach in various domains, including biological network analysis. This paper investigates the advancements in computational techniques for inferring gene regulatory networks (GRNs) and introduces MCNET, a state-of-the-art deep learning algorithm. MCNET integrates multi-omics data to infer GRNs and extract biologically significant representations from single-cell RNA sequencing (scRNA-seq) data. By incorporating attention mechanisms and graph convolutional networks, MCNET captures intricate regulatory relationships among genes. Extensive benchmarking on diverse scRNA-seq datasets demonstrates MCNET’s superiority over existing methods in GRN inference, scRNA-seq data visualization, clustering, and simulation. Notably, MCNET accurately predicts gene regulations on cell-type marker genes in the mouse cortex, validated by epigenetic data. The introduction of MCNET paves the way for advanced analysis of scRNA-seq data and provides a powerful tool for inferring GRNs in a multi-omics context. Moreover, this paper addresses the integration of multiomics data in gene regulatory network inference, proposing MCNET as a method that efficiently analyzes and visualizes homogeneous gene regulatory networks derived from diverse omics data. The inference capability of MCNET is evaluated through extensive experiments with simulation data and applied to analyze the biological network of psychiatric disorders using human brain data.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
This study was funded by Birla Institute of Technology and Science, Pilani, Hyderabad Campus.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
We performed MCNET on a human brain dataset that consists of psychiatric diseases and control with 131 samples. The multi-omics data of gene expression, CNV, and DNA methylation used in the preparation of this paper were obtained from the human prefrontal cortex. The human brain data contains 39 samples of schizophrenia, 35 of bipolar disorder, 12 of major depression patients, and 44 of healthy control samples, where each sample has 25,833 of gene expression measurement, 1,028 of CNV, and 24,399 of DNA methylation. We considered the psychiatric disorder data as a group combining the three psychiatric disorder samples, since the psychiatric disorders share many common biological features. The 495 genes were introduced to the String database which was already publicly available at (http://string-db.org). Human Data which was used in this study is available at SMRI Online Genomics Database (https://www.stanleygenomics.org).
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
Footnotes
anshtiwari9899{at}gmail.com
strankatwar{at}gmail.com
Ansh Tiwari et al. This is an open-access article distributed under the terms of the Creative Commons Attribution 3.0 United States License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability
All data produced in the present study are available upon reasonable request to the authors.