Network reinforcement driven drug repurposing for COVID-19 by exploiting disease-gene-drug associations

Yonghyun Nam; Jae-Seung Yun; Seung Mi Lee; Ji Won Park; Ziqi Chen; Brian Lee; Anurag Verma; Xia Ning; Li Shen; Dokyoon Kim

doi:10.1101/2020.08.11.20173120

Abstract

Currently, the number of patients with COVID-19 has significantly increased. Thus, there is an urgent need for developing treatments for COVID-19. Drug repurposing, which is the process of reusing already-approved drugs for new medical conditions, can be a good way to solve this problem quickly and broadly. Many clinical trials for COVID-19 patients using treatments for other diseases have already been in place or will be performed at clinical sites in the near future. Additionally, patients with comorbidities such as diabetes mellitus, obesity, liver cirrhosis, kidney diseases, hypertension, and asthma are at higher risk for severe illness from COVID-19. Thus, the relationship of comorbidity disease with COVID-19 may help to find repurposable drugs. To reduce trial and error in finding treatments for COVID-19, we propose building a network-based drug repurposing framework to prioritize repurposable drugs. First, we utilized knowledge of COVID-19 to construct a disease-gene-drug network (DGDr-Net) representing a COVID-19-centric interactome with components for diseases, genes, and drugs. DGDr-Net consisted of 592 diseases, 26,681 human genes and 2,173 drugs, and medical information for 18 common comorbidities. The DGDr-Net recommended candidate repurposable drugs for COVID-19 through network reinforcement driven scoring algorithms. The scoring algorithms determined the priority of recommendations by utilizing graph-based semi-supervised learning. From the predicted scores, we recommended 30 drugs, including dexamethasone, resveratrol, methotrexate, indomethacin, quercetin, etc., as repurposable drugs for COVID-19, and the results were verified with drugs that have been under clinical trials. The list of drugs via a data-driven computational approach could help reduce trial-and-error in finding treatment for COVID-19.

1. Introduction

Coronavirus typically affects the respiratory tract of humans, and it leads to mild severe respiratory tract infections. Since December 2019, the novel coronavirus, well known as severe acute respiratory syndrome coronavirus 2 (SARS-COV-2) or coronavirus disease 2019 (COVID-19), has been declared a global pandemic by the World Health Organization¹. As of June 14^th, there are about 755 million confirmed cases and about 42.3 million deaths worldwide. The most common symptoms of COVID-19 are fever, dry cough, and fatigue. Additional symptoms include nasal congestion, headaches, conjunctivitis, and sore throat. In severe cases, symptoms of dyspnea and chest pain were reported.

Several recent studies analyzed hospitalized patients’ records to investigate diseases comorbid with COVID-19. Patients with these comorbidities such as diabetes mellitus, obesity, liver cirrhosis, kidney diseases, hypertension, and asthma have a higher risk for severe illness from COVID-19^2-7. Among confirmed cases of COVID-19, patients with any of the above comorbidities do worse than those without. Since the coronavirus leads to death in severe cases and is highly contagious between humans, there is an urgent need for developing therapeutic treatments or vaccines. However, there are currently no effective medications⁸.

To overcome the pandemic, many researchers and companies are currently developing new treatments and vaccines for COVID-19. However, this process usually takes at least a decade of work and about $300-600 million of funding. In order to overcome this global pandemic quickly, treatments must be developed immediately, but this can be difficult if we only employ traditional drug discovery methods. One way to hasten our search for effective treatments involves drug repurposing. Drug repurposing (or drug repositioning) is the process of reusing of already-approved drugs for new medical conditions. Many clinical trials for COVID-19 patients using treatments for other diseases are already in place or will be performed in the clinical sites shortly¹⁰. So far, candidate treatments such as Remdesivir (originally designed to treat the Ebola virus), hydroxychloroquine (the now-infamous malaria drug), and lopinavir/ritonavir (an antiretroviral drug targeting HIV) have been used on hospitalized COVID-19 patients. In addition to these clinical trials, many studies have recently been conducted using computational drug repurposing strategies. Zhou et al constructed a drug-human coronavirus network using protein associations between coronavirus and 2,938 drugs and discovered 16 repurposable drugs¹¹. They constructed a protein interactome network using coronavirus and drug-related genes and then predicted candidate drugs using network proximities. Ge et al proposed a data-driven drug repurposing framework that integrates coronavirus-related data and identifies drug candidates from 6,255 drugs¹². Muralidharan et al found candidate drugs capable of molecular docking or drug binding by modeling the protein structure of coronavirus and drugs with structure-based virtual screening identified four commercially available drugs¹³. These previous studies used only protein/gene information or the molecular structure of coronavirus that can be targeted by repurposable drugs. This means that unfortunately clinical information has not been utilized. Because COVID-19 patients with comorbidity disease have a high risk of illness, this unexplored relationship of comorbidity disease may help to find repurposable drugs.

In this study, we propose a network-based drug repurposing framework to predict candidate therapeutic agents for COVID-19. Our framework combines heterogeneous relational data such as disease-gene associations, disease-drug associations, drug-target gene associations, and comorbidity information with the novel coronavirus. Initially, we construct the comprehensive disease-gene-drug association networks by combining the disease-disease networks, gene-gene networks, and drug-drug networks. Then, the network propagation algorithm is applied to predict candidate repurposable drugs for COVID-19.

In more detail, from the associations between diseases, genes, and drugs, we construct a disease-gene-drug network (DGDr-Net) to understand the interactions between diseases, genes, and drugs. Then, the comorbidity information was added to the DGDr-Net to enhance the interactions between COVID-19 and other diseases. The constructed networks have tripartite relationships among diseases, genes, and drugs and these interactions can be represented in the form of layered networks. In establishing a tripartite relationship between a specific disease, gene, and drug, the following three procedures should be considered for drug repurposing: (1) Find a disease that is genetically or clinically similar to COVID-19 (at the disease level); (2) Find the key gene that serves as the bio-marker or some other therapeutic evidence for COVID-19 at the gene level; (3) Find a drug that can be used to treat COVID-19 by combining or finding a drug that is already being used to treat other diseases. Given the nature of tripartite relationships, all three steps need to be done simultaneously rather than independently. Thus, we construct a disease-gene-drug network consisted of three single-layered networks: a disease-disease network (disease layer), a gene-gene network (gene layer), and a drug-drug network (drug layer). Additionally, each single network is connected according to association information. Within the network, drug scores for COVID-19 are calculated by network-based label propagations that are using graph-based semi-supervised learning. Since the treatments of COVID-19 are not yet found, there is no label information associated with COVID-19 in the drug network. In those cases, the semi-supervised approach that can be applied when label information is insufficient is a very suitable method for recommending candidate drugs^{14, 15}. Semi-supervised learning can deal with few labeled data and can perform predictions by propagating the label information to unlabeled nodes along with edges¹⁶. When the COVID-19 in the disease layer is set as a seed node for propagation, then the label information is transmitted through the edges of the disease-gene-drug network. The comorbidity and disease-gene association information in the disease layer are propagated to the gene and drug layer to inform the priority of the repurposable drug of COVID-19. From the resulting scores, the priorities of candidate drugs can be identified. The top-tier candidate drugs are recommended by repurposable drugs for coronavirus. The predicted results can help reduce trial-and-error in finding treatment for COVID-19.

2. Methods

We propose a network-based drug repositioning method that utilizes and integrates biochemical and clinical COVID-19 data such as disease comorbidity, protein associations, and drug-target gene information. The overall framework to find repurposable drugs for treating COVID-19 consists of mainly two parts as shown in Figure 1: (a) Constructing a Disease-Gene-Drug network and (b) Scoring for candidate repurposable drugs. We first construct the Disease-Gene-Drug networks (DGDr-Net) to figure out the interactions between the biological components related to COVID-19. The constructed networks have tripartite relationships among diseases, genes, and drugs and these interactions can be represented in the form of layered networks. Next, drug scores for COVID-19 are calculated by network-based label propagations using graph-based semi-supervised learning^{14, 15, 17}. From the resulting scores, the priorities of candidate drugs can be identified. The top-tier candidate drugs are recommended by repurposable drugs for coronavirus.

Figure 1. Schematic description for the proposed method

a) Disease-Gene-Drug networks with COVID-19, b) scoring results with genes/drugs for targeting COVID-19,

2.1 Disease-Gene-Drug network

We construct the Disease-Gene-Drug Networks (DGDr-Net) to show the COVID-19 centric interactome with components for diseases, genes, and drugs. The interactions among them have a tripartite relationship that can be represented in the form of layered networks. The DGDr-Net is a graph, G = (V,E,S), where V represents the set of nodes, E represents the set of edges, and S represents the set of layers. The proposed networks have three single layers with S = {S_D, S_G, S_Dr}, depending on the type of components (e.g., disease, gene, and drug) as shown in Figure 1(a). In the network, let v_i ∊ V denote a node and e_ij ∊ E to denote the edge that connects two vertices v_i and v_j. The edge weights in the single layer have weighted values and unweighted values between different layers. The edge weights in a single layer w_ij between two nodes v_i and v_j is calculated by Gaussian similarity kernel as follows: where dist(v_i, v_j) is a distance measure for adjacent nodes (v_i ~ v_j). The edge values between different layers can take a value 1 if the associations exist, otherwise 0. Then, the value of the similarity for DGDr-Net is represented by a matrix W = {w_ij}. To easily describe the DGDr-Net with three single layers, the similarity matrix W can be expressed in block-wise as follows:

The block diagonal matrix (W_D, W_G, and W_Dr) represents a similarity for single networks with disease (D), gene (G), and drug (Dr) layers respectively, and the block off-diagonal matrix represents the connections between different layers.

2.2 Scoring algorithms for repurposing

With the DGDr-Net, a scoring algorithm is applied to find repurposable drugs. Predicted resulting scores can be obtained simultaneously from each node of the disease layer, gene layer, and drug layer. In the disease layer, it is possible to obtain quantitatively how many associations between target and other diseases are. Disease scores for COVID-19 reflect both shared gene information with other diseases and comorbidity information. In the gene layer, disease-disease associations of the disease layer and protein interaction of the gene layer are propagated through the edges, and the resulting score indicates the priority of candidate target genes for treating COVID-19. In the drug layer, label information from COVID-19 in the disease layer is propagated through the edges of DGDr-Net. Resulting drug scores can identify the priority of candidate drugs for drug repurposing. Graph-based semi-supervised learning (SSL) is employed for scoring algorithms, as only one disease node (COVID-19) is given. SSL can perform predictions by propagating the label information to the nodes along with the edges even if there is only one labeled node¹⁸.

The formulation of the scoring algorithm using SSL is as follows. Consider a DGDr-Net, G = (V, E, S), with node V(= V^D ∪ V^G ∪ V^Dr) corresponding to the n(= n_D + n_G + n_Dr) vertices. Let y = (y₁, …, y_n)^T denote the label set of nodes, denote the set of resulting scores, and W be a similarity matrix of DGDr-Net. Unlike the general classification problem where the target variable has a binary label (‘+1’ or ‘-1’), the problem setting of scoring in a semi-supervised approach has a unary label (‘+1’). More specifically, a disease node of COVID-19 19 is set to a unary label y_COVID ∊ {+1}, and the other nodes set to zero (y/y_C0VID ∊ {0}). There are two assumptions: a loss function (predicted scores in unlabeled nodes should be close to the given label of y_i in labeled nodes) and smoothness condition (predicted scores in adjacent unlabeled nodes should not be too different). These assumptions are reflected in the value of f by minimizing the following quadratic function with the graph Laplacian matrix L:

The graph Laplacian L is defined as L = D − W, where D = diag(d_i), and d_i = Σ_j w_ij. The user-specified parameter μ trades off loss and smoothness (to reduce complexity, μ is set to 1/║L║₁).

Then, the solution of this problem becomes . The resulting score f is rearranged in each layer:

To recommend the repurposable drugs, n_Dr drugs are sorted in descending order according to the resulting score in drug layers. The rankings in drug lists can be analyzed in two ways: (1) to suggest possible candidate repurposable drugs and/or (2) to reduce the trial-and-error nature of current and future COVID-19 clinical trials.

3. Results of Network Construction

3.1 Data sources

We collected a list of components and relational data from public databases. Table 1 describes the data sources used to construct DGDr-Net. For a single disease network, 5,819 diseases and 31,991 disease-gene associations were obtained from the comparative toxicogenomics database (CTD)¹⁹. To screen diseases strongly associated with COVID-19 and to construct a more accurate COVID-19 centric disease network, community detection was applied²⁰. 592 diseases were selected for disease nodes. To construct a gene network, we use the protein-protein interactions (PPIs) from the Search Tool for the Retrieval of Interacting Genes/Proteins (STRING) databases. The STRING DB collects and integrates every possible publicly available PPIs from gene neighborhood, gene fusion, gene co-occurrence, gene co-expression, experiments/biochemistry, and annotated pathway data²¹. To avoid false positive information, we select 26,681 genes (proteins) and 841,068 interactions with the high confidence level (≥ 0.7). For a drug network, 9,540 drugs (compounds) and 525,207 drug-target gene associations were obtained from DrugBank and CTD databases^{19, 22}.

View this table:

Table 1. List of data sources to construct DGDr-Net

COVID-19 is associated with 17 genes for biomarker or therapeutic evidence (e.g., CCL2, TNF, IL10, CXCL8, IL6, IL1B, AGT, IL2, CXCL10, CCL3, TMPRSS2, IL7, IL2RA, CSF3, TMPRSS4, ACE2, and BSG) as shown in Figure 2a. Also, 194 drugs in our network are currently in clinical trials for COVID-19²³. A more detailed description of data and how to construct networks is explained below.

Figure 2. Snapshot of the disease network focused on COVID-19 and 18 comorbidity diseases

a) Disease-gene network, b) Reported comorbidity relationships with COVID-19, c) combined disease network

3.2 Disease network construction

The disease network is a sub-graph G = (V_D, W_D) when S = {S_D} in the multi-layered DGDr-Net. A set of nodes V_D represent a disease and W_D represents the similarity between diseases. To obtain the COVID-19 centric disease network, we construct and integrate two types of networks; (a) networks with disease-gene associations and (b) networks with comorbidity relations. In the framework for drug repurposing using disease interactome, the former network allows us to use genetic information for other related diseases while the latter network allows us to utilize clinical data. First, we constructed networks with disease-gene associations using 5,819 diseases and 31,991 disease-gene associations. Each disease is represented as 26,681-dimensional gene vectors with binary attributes. If a disease is associated with a gene, its respective element in the disease vectors has a value (‘1’), otherwise (‘0’). (Note that attribute value 0 in disease vector does not mean that disease has no relationship with a certain gene. It could occur that the association has not been found yet). With 5,819 disease vectors, the edge weights for networks with disease-gene associations, W_D1, were calculated by Eq(1) using the cosine distance dist(v_i, v_j) = 1−(v_i · v_i)/(║ v_i║ · ║ v_j║). Next, we constructed a network with comorbidity relations involving the 18 aforementioned diseases highly comorbid with COVID-19. The 18 aforementioned comorbid diseases were curated by recent researches^2-7. The edge weights for comorbidity network, W_D2, has binary values, each of which stands for reported comorbidity relation (‘1’) or not (‘0’). Figure 2(b) depicts a comorbidity network with 18 comorbidity diseases and COVID-19. To screen tightly associated diseases with COVID-19 and construct more eligible disease networks, community detection was applied²⁰. Two modules include COVID-19 and 18 comorbidity diseases. Finally, 592 diseases in two modules were used in the disease network. With two types of networks, network integration was performed. The combined similarity W_D is represented as a linear combination of both networks with coefficient β₁ and β₂. To find the coefficients β for the linear combinations, grid search was performed in the range (0.1, 1) based on β₁. While adjusting the coefficients, DGDr-Net was applied a scoring algorithm to aim for recommending clinical trial drugs for COVID-19, and average precision was used as a metric. As a result, we choose β₁ = 0.3 for networks with disease-gene associations and β₂ = 0.7 for networks with comorbidity relation. Final disease networks are shown in Figure 2(c).

3.3 Gene network construction

Gene network is a sub-graph, G = (V_G, W_G), when S = {S_G} in DGDr-Net. The protein-protein interaction (PPIs) is employed for gene networks from the STRING database²¹. To avoid false positive information, we select 26,681 genes (proteins) and 841,068 interactions with a high confidence level (≥ 0.7). In the gene network, the edge stands for protein interaction, in which the edge weight indicates the presence or absence (‘1’ or ‘0’ respectively).

3.4 Drug network construction

Drug network is a sub-graph, G = (V_Dr, W_Dr), when S = {S_Dr} in multi-layered DGDr-Net. 9,540 drugs (compounds) and 525,207 drug-target gene associations were obtained from DrugBank and CTD databases^{19, 22}. Each drug is composed of 26,681-dimensional binary attributes, each of which stands for existing target gene associations (‘1’) or not (‘0’). The similarity matrices for the drug network W_Dr were calculated by cosine distance and transformed by Gaussian similarity kernel by Eq(1).

3.5 Connections between different single layers

Recall similarity matrix W for DGDr-Net in Equation (2). The block off-diagonal matrix represents the connections between different layers. From the relational data in Table 1, the connection weights between different single layers have binary values of 1 if there are associations of two nodes in different layers, 0 otherwise.

4. Scoring results for repurposable drugs and clinical implication

4.1 Experimental Settings

We performed a scoring algorithm to predict candidate repurposable drugs for COVID-19 using tripartite relationships of disease, gene, and drug in DGDr-Net. It can align drugs through the connections between nodes in the network with label propagations. Currently, no therapeutic drugs for coronavirus have been developed, which means that there are no ground truths that can be used as labeled data in machine learning perspective. SSL can deal with few labeled data and perform prediction by propagating the label information to the nodes in three layers along with edges¹⁸. To find repurposable drugs, we performed the scoring algorithms with a label for COVID-19 only in the disease layer. Also, all of the drug information related to COVID-19 were assumed to be unknown which means the edge connections between COVID-19 and all of the drugs were not included in the DGDr-Net. Out of 2,173 highlighted drugs, 194 drugs are currently in clinical trials for COVID-19²³. To verify the performance of our scoring results, we assume these 194 drugs as ground truths. Only one node (COVID-19) in the disease layer was set as “1” for the labeled set, and the remaining 590 diseases, 26,681 genes and 2,173 drugs were set as “0” for the unlabeled set. After scoring, we tested how many drugs in clinical trials were highly ranked.

4.2 Implication and Validation of DGDr-Net

The scoring algorithm with DGDr-Net provides score values for all nodes (disease, gene, drugs) in the three layers. Disease scores represent which diseases are strongly associated with COVID-19, and gene scores represent the priorities of target genes for biomarkers. Also, drug scores represent the priorities of candidate drugs for COVID-19. Figure 3 shows the list of top 30 ranked diseases, genes, and drugs when COVID-19 is given. The colored nodes represent COVID-19 related diseases, genes, and drugs, and they are sorted in descending order according to the resulting scores in each layer. Blue nodes indicate diseases, green nodes indicate the genes, and red nodes indicate the drugs that are recommended for repositioning. Also, we show the tripartite connections among diseases, genes, and drugs such as the disease-gene associations and drug-disease associations.

Figure 3. Associations between top 30 ranked diseases, genes, and drugs

Figure 3(a) represents the associations between 30 high ranked diseases (blue nodes) and genes (green nodes) focused on COVID-19 (black node). The blue-colored circles indicate diseases with comorbidity information, and the blue outlined circles indicate diseases connected to COVID-19 from sharing gene information. 18 diseases with comorbidity information are at the top of the ranking. Among them, 10 diseases (asthma, chronic obstructive pulmonary disease (COPD), HIV infections, cardiovascular diseases, cerebrovascular disorders, hypertension, hepatic cirrhosis, obesity, Alzheimer’s disease, and coronary artery disease) have strong associations with COVID-19 because they not only have common genes with COVID-19 but also have additional comorbidity information factored in to the analysis. 8 diseases (kidney diseases, morbid obesity, common variable immunodeficiency, obstructive sleep apnea, chronic hepatitis B, chronic hepatitis C, chronic hepatitis, and diabetes mellitus) were connected with COVID-19 primarily via the comorbidity information. Among the comorbid diseases reported as an underlying medical condition, there was generally a positive correlation between the number of genes shared with COVID-19 and the given score. The remaining 12 diseases are linked by sharing gene hypothesis, and diseases with high edge weights have a high score. The score results in the gene layer is largely interpreted in two ways: (1) the priorities of the strongly associated genes with COVID-19 and (2) a list of genes that can be candidate target genes for treatments. The priority of already known genes with COVID-19 is the result of the label propagation by reflecting both the disease interactions and protein interactions.

Figure 3(b) represents the associations between diseases and candidate drugs. The red-colored circles in the drug lists denotes currently in clinical trial drugs and red outlined circles are currently not yet. As mentioned above, we have not yet developed a COVID-19 therapeutic agent. Thus, we recommend all 30 highlighted drugs as candidate repositioning drugs. The rankings in drug lists can be analyzed in two ways: (1) The results can suggest the possible candidate repurposable drugs from the rank, (2) To reduce the trial-and-error of clinical trials, the list of drugs can be compared with current clinical trial drugs for COVID-19.

4.3 Clinical utility of the proposed drugs

Figure 4 depicts the scoring results of top candidate drugs for COVID-19. On the graph, the solid gray line stands for the value of scores f of the candidate drugs in descending orders. Out of 2,173 drugs, we assume that these drugs should be repurposable drugs for COVID-19. The red circles on the line correspond to the current clinical trial drugs, and the black one is not. Steroids such as dexamethasone, prednisolone, and hydrocortisone were recommended as top candidates. Among them, dexamethasone, an anti-inflammatory drug, had the highest scores. Dexamethasone is a very cheap steroid that reduces inflammation by mimicking anti-inflammatory hormones produced by the body. COVID-19 treatment guidelines recommend using dexamethasone 6 mg per day for up to 10 days for hospitalized patients with COVID-19^{24, 25}. On the score curve, the inserted drugs between clinical-trial drugs are regarded as strong candidates even though they are not used in clinical trials.

Figure 4. Score curve of top 30 repurposable drugs for COVID-19

Several proposed drugs need to be considered for clinical application. First, dexamethasone, prednisolone, and hydrocortisone are steroid analogues, which affect immune and inflammatory functions. There have been reports suggesting that steroids may be effective in the control of systemic inflammation or ‘cytokine storm’ in severe COVID-19 cases²⁵, although there are several on-going trials on the effectiveness of steroid treatment¹⁰. The current study also supports the possibility of steroid therapy in patients with COVID-19. Until now, there has been controversy regarding the use of NSAIDs in COVID-19 patients. Although several literatures warned the use of NSAIDs, others disagreed²⁶. With acetaminophen being included as one of the highlighted drugs in our study, we wish to bring attention to the role NSAIDs may play in helping an individual with COVID-19. Given the body’s inflammatory response to the virus, researchers have been studying the effects of some immune-modulating drugs including methotrexate, cyclosporin^27,28, although there is a paucity of information on other immune-modulating medications or cytotoxic drugs including azathioprine, doxorubicin, valproic acid, and Arsenic Trioxide.

Based on the current study, further studies are needed to evaluate the possibility of immune-modulating drugs in the context of COVID-19. Several proposed drugs also need attention such as ACE inhibitors (enalapril), lipid lowering medications (simvastatin), hormonal medication (estradiol, progesterone) and antidiabetic drugs (rosiglitazone, troglitazone). These medications may be more effective in specific populations with specific comorbidities such as kidney disease, diabetes, or coronary/cardiovascular disease. We may also have to evaluate the efficacy of these medications in specific populations. However, of course, before any of these potential treatments are given to help patients suffering from COVID-19, rigorous clinical trials are required.

5. Conclusions

In this study, we developed a network-based drug repurposing framework for recommending repurposable drugs by utilizing and combining COVID-19 related data such as disease comorbidity, protein associations, and drug-target gene information. We first constructed a form of layered disease-gene-drug networks (DGDr-Net) to figure out the interactions among the diseases, genes, and drugs related to COVID-19. With the network, we apply semi-supervised scoring algorithms to identify candidate repurposable drugs. However, the information we have about coronavirus associated drugs is not yet sufficient because we do not yet have a cure for COVID-19. Therefore, in the scoring procedures, similar to the Cold Start Problem in recommendation systems, the candidate drugs were identified without any inferences between COVID-19 and drugs.

We provided the priorities of candidate repurposable drugs and candidate target genes based on predicted scores. Of the top 30 drugs, 17 drugs were currently in clinical trials for the treatment of COVID-19. Steroids such as dexamethasone, prednisolone, and hydrocortisone are recommended as top candidates. Among them, dexamethasone, an anti-inflammatory drug, had the highest scores.

A unique strength of our experimental approach is the inclusion of comorbidity factors to our network. This allowed us to more accurately identify repurposable drugs while affording a bird’s-eye view of the intricate associations between COVID-19 and other related diseases, drugs, and genes. One particular limitation of our study is the relatively small sample size of the few databases we utilized; however, this concern is quickly alleviated as the robust yet flexible nature of a network-based approach allows us to very easily supplement and correct our current model. As we receive the newest information regarding the novel coronavirus, we can easily update the candidate drug/gene components of our networks, performing a set of updated calculations and generating an updated gene and drug candidate list almost instantly. With this in mind, we hope our approach may help clinicians and scientists make the hard decisions regarding which drugs or gene targets to test first in this global race for a cure.

References

[1].↵
Zhu N, Zhang D, Wang W, Li X, Yang B, Song J, Zhao X, Huang B, Shi W, Lu R: China Novel Coronavirus Investigating and Research Team. A novel coronavirus from patients with pneumonia in China, 2019. N Engl J Med 2020, 382:727–33.
OpenUrl CrossRef PubMed Google Scholar
[2].↵
Guan W-j, Liang W-h, Zhao Y, Liang H-r, Chen Z-s, Li Y-m, Liu X-q, Chen R-c, Tang C-l, Wang T: Comorbidity and its impact on 1590 patients with Covid-19 in China: A Nationwide Analysis. European Respiratory Journal 2020, 55.
Google Scholar
[3].
Richardson S, Hirsch JS, Narasimhan M, Crawford JM, McGinn T, Davidson KW, Barnaby DP, Becker LB, Chelico JD, Cohen SL: Presenting characteristics, comorbidities, and outcomes among 5700 patients hospitalized with COVID-19 in the New York City area. Jama 2020.
Google Scholar
[4].
Li B, Yang J, Zhao F, Zhi L, Wang X, Liu L, Bi Z, Zhao Y: Prevalence and impact of cardiovascular metabolic diseases on COVID-19 in China. Clinical Research in Cardiology 2020, 109:531–8.
OpenUrl Google Scholar
[5].
Wang B, Li R, Lu Z, Huang Y: Does comorbidity increase the risk of patients with COVID-19: evidence from meta-analysis. Aging (Albany NY) 2020, 12:6049.
OpenUrl Google Scholar
[6].
Cheng Y, Luo R, Wang K, Zhang M, Wang Z, Dong L, Li J, Yao Y, Ge S, Xu G: Kidney disease is associated with in-hospital death of patients with COVID-19. Kidney international 2020.
Google Scholar
[7].↵
Kuo C-L, Pilling LC, Atkins JL, Masoli JA, Delgado J, Kuchel GA, Melzer D: APOE e4 genotype predicts severe COVID-19 in the UK Biobank community cohort. medRxiv 2020.
Google Scholar
[8].↵
Sanders JM, Monogue ML, Jodlowski TZ, Cutrell JB: Pharmacologic treatments for coronavirus disease 2019 (COVID-19): a review. Jama 2020, 323:1824–36.
OpenUrl PubMed Google Scholar
[9].
Hurle M, Yang L, Xie Q, Rajpal D, Sanseau P, Agarwal P: Computational drug repositioning: from data to therapeutics. Clinical Pharmacology & Therapeutics 2013, 93:335–41.
OpenUrl Google Scholar
[10].↵
Lythgoe MP, Middleton P: Ongoing clinical trials for the management of the COVID-19 pandemic. Trends in Pharmacological Sciences 2020.
Google Scholar
[11].↵
Zhou Y, Hou Y, Shen J, Huang Y, Martin W, Cheng F: Network-based drug repurposing for novel coronavirus 2019-nCoV/SARS-CoV-2. Cell discovery 2020, 6:1–18.
OpenUrl Google Scholar
[12].↵
Ge Y, Tian T, Huang S, Wan F, Li J, Li S, Yang H, Hong L, Wu N, Yuan E: A data-driven drug repositioning framework discovered a potential therapeutic agent targeting COVID-19. bioRxiv 2020.
Google Scholar
[13].↵
Muralidharan N, Sakthivel R, Velmurugan D, Gromiha MM: Computational studies of drug repurposing and synergism of lopinavir, oseltamivir and ritonavir binding with SARS-CoV-2 Protease against COVID-19. Journal of Biomolecular Structure and Dynamics 2020:1-6.
Google Scholar
[14].↵
Nam Y, Kim M, Chang H-S, Shin H: Drug repurposing with network reinforcement. BMC bioinformatics 2019, 20:383.
OpenUrl Google Scholar
[15].↵
Nam Y, Jhee JH, Cho J, Lee J-H, Shin H: Disease gene identification based on generic and disease-specific genome networks. Bioinformatics 2019, 35:1923–30.
OpenUrl Google Scholar
[16].↵
Zhu XJ: Semi-supervised learning literature survey. University of Wisconsin-Madison Department of Computer Sciences, 2005.
Google Scholar
[17].↵
Zhu X, Goldberg AB: Introduction to semi-supervised learning. Synthesis lectures on artificial intelligence and machine learning 2009, 3:1–130.
OpenUrl Google Scholar
[18].↵
Chapelle O, Scholkopf B, Zien A: Semi-supervised learning (chapelle, o. et al., eds.; 2006)[book reviews]. IEEE Transactions on Neural Networks 2009, 20:542-.
OpenUrl Google Scholar
[19].↵
Davis AP, Grondin CJ, Johnson RJ, Sciaky D, McMorran R, Wiegers J, Wiegers TC, Mattingly CJ: The comparative toxicogenomics database: update 2019. Nucleic acids research 2019, 47:D948-D54.
OpenUrl CrossRef Google Scholar
[20].↵
Blondel VD, Guillaume J-L, Lambiotte R, Lefebvre E: Fast unfolding of communities in large networks. Journal of statistical mechanics: theory and experiment 2008, 2008:P10008.
OpenUrl CrossRef Google Scholar
[21].↵
Szklarczyk D, Morris JH, Cook H, Kuhn M, Wyder S, Simonovic M, Santos A, Doncheva NT, Roth A, Bork P: The STRING database in 2017: quality-controlled protein–protein association networks, made broadly accessible. Nucleic acids research 2016:gkw937.
Google Scholar
[22].↵
Wishart DS, Feunang YD, Guo AC, Lo EJ, Marcu A, Grant JR, Sajed T, Johnson D, Li C, Sayeeda Z: DrugBank 5.0: a major update to the DrugBank database for 2018. Nucleic acids research 2018, 46:D1074-D82.
OpenUrl CrossRef PubMed Google Scholar
[23].↵
DrugBank: COVID-19 information. 2020.
Google Scholar
[24].↵
Johnson RM, Vinetz JM: Dexamethasone in the management of covid-19. British Medical Journal Publishing Group, 2020.
Google Scholar
[25].↵
Ledford H: Coronavirus breakthrough: dexamethasone is first drug shown to save lives. Nature 2020, 582:469-.
OpenUrl Google Scholar
[26].↵
Russell B, Moss C, Rigg A, Van Hemelrijck M: COVID-19 and treatment with NSAIDs and corticosteroids: should we be limiting their use in the clinical setting? ecancermedicalscience 2020, 14.
Google Scholar
[27].↵
Frohman EM, Cruz RA, Longmuir R, Steinman L, Zamvil SS, Villemarette-Pittman NR, Frohman TC, Parsons MS: Part II. High-dose methotrexate with leucovorin rescue for severe COVID-19: An immune stabilization strategy for SARS-CoV-2 induced ‘PANIC’attack. Journal of the Neurological Sciences 2020:116935.
Google Scholar
[28].↵
Glowacka P, Rudnicka L, Warszawik-Hendzel O, Sikora M, Goldust M, Gajda P, Stochmal A, Blicharz L, Rakowska A, Olszewska M: The Antiviral Properties of Cyclosporine. Focus on Coronavirus, Hepatitis C Virus, Influenza Virus, and Human Immunodeficiency Virus Infections. Biology 2020, 9:192.
OpenUrl Google Scholar

Posted August 14, 2020.

Download PDF

Author Declarations

Data/Code

Citation Tools

Get QR code

Tweet Widget

Subject Area

Pharmacology and Therapeutics

Reviews and Context

Comment

TRIP Peer Reviews

Community Reviews

Automated Services

Blogs/Media

Author Videos

Subject Areas

All Articles

Addiction Medicine (418)
Allergy and Immunology (740)
Anesthesia (217)
Cardiovascular Medicine (3171)
Dentistry and Oral Medicine (355)
Dermatology (268)
Emergency Medicine (469)
Endocrinology (including Diabetes Mellitus and Metabolic Disease) (1126)
Epidemiology (13143)
Forensic Medicine (17)
Gastroenterology (878)
Genetic and Genomic Medicine (4971)
Geriatric Medicine (458)
Health Economics (761)
Health Informatics (3130)
Health Policy (1114)
Health Systems and Quality Improvement (1156)
Hematology (418)
HIV/AIDS (987)
Infectious Diseases (except HIV/AIDS) (14443)
Intensive Care and Critical Care Medicine (894)
Medical Education (462)
Medical Ethics (121)
Nephrology (511)
Neurology (4720)
Nursing (250)
Nutrition (699)
Obstetrics and Gynecology (857)
Occupational and Environmental Health (772)
Oncology (2432)
Ophthalmology (691)
Orthopedics (272)
Otolaryngology (335)
Pain Medicine (315)
Palliative Medicine (88)
Pathology (523)
Pediatrics (1263)
Pharmacology and Therapeutics (535)
Primary Care Research (536)
Psychiatry and Clinical Psychology (4057)
Public and Global Health (7290)
Radiology and Imaging (1631)
Rehabilitation Medicine and Physical Therapy (973)
Respiratory Medicine (953)
Rheumatology (468)
Sexual and Reproductive Health (486)
Sports Medicine (409)
Surgery (527)
Toxicology (66)
Transplantation (225)
Urology (196)

Comments

medRxiv aims to provide a venue for anyone to comment on a medRxiv preprint. Comments are moderated for offensive or irrelevant content (this can take ~24 h). Please avoid duplicate submissions and read our Comment Policy before commenting. The content of a comment is not endorsed by medRxiv.

medRxiv aims to inform readers about online discussion of this preprint occurring elsewhere. The content at the links below is not endorsed by either medRxiv or the preprint's authors.

Community reviews for this article:

There are no community reviews for this paper.

Automated Evaluations

Certain services provide automated analysis of preprints. Analyses invited by the authors are displayed at the top of this tab. Those done independently of authors are shown underneath . None of these analyses is endorsed by medRxiv.

Automated Evaluations:

There are no automated evaluations for this paper.

[1] [1].↵
Zhu N, Zhang D, Wang W, Li X, Yang B, Song J, Zhao X, Huang B, Shi W, Lu R: China Novel Coronavirus Investigating and Research Team. A novel coronavirus from patients with pneumonia in China, 2019. N Engl J Med 2020, 382:727–33.
OpenUrl CrossRef PubMed Google Scholar

[2] [2].↵
Guan W-j, Liang W-h, Zhao Y, Liang H-r, Chen Z-s, Li Y-m, Liu X-q, Chen R-c, Tang C-l, Wang T: Comorbidity and its impact on 1590 patients with Covid-19 in China: A Nationwide Analysis. European Respiratory Journal 2020, 55.
Google Scholar

[3] [3].
Richardson S, Hirsch JS, Narasimhan M, Crawford JM, McGinn T, Davidson KW, Barnaby DP, Becker LB, Chelico JD, Cohen SL: Presenting characteristics, comorbidities, and outcomes among 5700 patients hospitalized with COVID-19 in the New York City area. Jama 2020.
Google Scholar

[4] [4].
Li B, Yang J, Zhao F, Zhi L, Wang X, Liu L, Bi Z, Zhao Y: Prevalence and impact of cardiovascular metabolic diseases on COVID-19 in China. Clinical Research in Cardiology 2020, 109:531–8.
OpenUrl Google Scholar

[5] [5].
Wang B, Li R, Lu Z, Huang Y: Does comorbidity increase the risk of patients with COVID-19: evidence from meta-analysis. Aging (Albany NY) 2020, 12:6049.
OpenUrl Google Scholar

[6] [6].
Cheng Y, Luo R, Wang K, Zhang M, Wang Z, Dong L, Li J, Yao Y, Ge S, Xu G: Kidney disease is associated with in-hospital death of patients with COVID-19. Kidney international 2020.
Google Scholar

[7] [7].↵
Kuo C-L, Pilling LC, Atkins JL, Masoli JA, Delgado J, Kuchel GA, Melzer D: APOE e4 genotype predicts severe COVID-19 in the UK Biobank community cohort. medRxiv 2020.
Google Scholar

[8] [8].↵
Sanders JM, Monogue ML, Jodlowski TZ, Cutrell JB: Pharmacologic treatments for coronavirus disease 2019 (COVID-19): a review. Jama 2020, 323:1824–36.
OpenUrl PubMed Google Scholar

[9] [9].
Hurle M, Yang L, Xie Q, Rajpal D, Sanseau P, Agarwal P: Computational drug repositioning: from data to therapeutics. Clinical Pharmacology & Therapeutics 2013, 93:335–41.
OpenUrl Google Scholar

[10] [10].↵
Lythgoe MP, Middleton P: Ongoing clinical trials for the management of the COVID-19 pandemic. Trends in Pharmacological Sciences 2020.
Google Scholar

[11] [11].↵
Zhou Y, Hou Y, Shen J, Huang Y, Martin W, Cheng F: Network-based drug repurposing for novel coronavirus 2019-nCoV/SARS-CoV-2. Cell discovery 2020, 6:1–18.
OpenUrl Google Scholar

[12] [12].↵
Ge Y, Tian T, Huang S, Wan F, Li J, Li S, Yang H, Hong L, Wu N, Yuan E: A data-driven drug repositioning framework discovered a potential therapeutic agent targeting COVID-19. bioRxiv 2020.
Google Scholar

[13] [13].↵
Muralidharan N, Sakthivel R, Velmurugan D, Gromiha MM: Computational studies of drug repurposing and synergism of lopinavir, oseltamivir and ritonavir binding with SARS-CoV-2 Protease against COVID-19. Journal of Biomolecular Structure and Dynamics 2020:1-6.
Google Scholar

[14] [14].↵
Nam Y, Kim M, Chang H-S, Shin H: Drug repurposing with network reinforcement. BMC bioinformatics 2019, 20:383.
OpenUrl Google Scholar

[15] [15].↵
Nam Y, Jhee JH, Cho J, Lee J-H, Shin H: Disease gene identification based on generic and disease-specific genome networks. Bioinformatics 2019, 35:1923–30.
OpenUrl Google Scholar

[16] [16].↵
Zhu XJ: Semi-supervised learning literature survey. University of Wisconsin-Madison Department of Computer Sciences, 2005.
Google Scholar

[17] [17].↵
Zhu X, Goldberg AB: Introduction to semi-supervised learning. Synthesis lectures on artificial intelligence and machine learning 2009, 3:1–130.
OpenUrl Google Scholar

[18] [18].↵
Chapelle O, Scholkopf B, Zien A: Semi-supervised learning (chapelle, o. et al., eds.; 2006)[book reviews]. IEEE Transactions on Neural Networks 2009, 20:542-.
OpenUrl Google Scholar

[19] [19].↵
Davis AP, Grondin CJ, Johnson RJ, Sciaky D, McMorran R, Wiegers J, Wiegers TC, Mattingly CJ: The comparative toxicogenomics database: update 2019. Nucleic acids research 2019, 47:D948-D54.
OpenUrl CrossRef Google Scholar

[20] [20].↵
Blondel VD, Guillaume J-L, Lambiotte R, Lefebvre E: Fast unfolding of communities in large networks. Journal of statistical mechanics: theory and experiment 2008, 2008:P10008.
OpenUrl CrossRef Google Scholar

[21] [21].↵
Szklarczyk D, Morris JH, Cook H, Kuhn M, Wyder S, Simonovic M, Santos A, Doncheva NT, Roth A, Bork P: The STRING database in 2017: quality-controlled protein–protein association networks, made broadly accessible. Nucleic acids research 2016:gkw937.
Google Scholar

[22] [22].↵
Wishart DS, Feunang YD, Guo AC, Lo EJ, Marcu A, Grant JR, Sajed T, Johnson D, Li C, Sayeeda Z: DrugBank 5.0: a major update to the DrugBank database for 2018. Nucleic acids research 2018, 46:D1074-D82.
OpenUrl CrossRef PubMed Google Scholar

[23] [23].↵
DrugBank: COVID-19 information. 2020.
Google Scholar

[24] [24].↵
Johnson RM, Vinetz JM: Dexamethasone in the management of covid-19. British Medical Journal Publishing Group, 2020.
Google Scholar

[25] [25].↵
Ledford H: Coronavirus breakthrough: dexamethasone is first drug shown to save lives. Nature 2020, 582:469-.
OpenUrl Google Scholar

[26] [26].↵
Russell B, Moss C, Rigg A, Van Hemelrijck M: COVID-19 and treatment with NSAIDs and corticosteroids: should we be limiting their use in the clinical setting? ecancermedicalscience 2020, 14.
Google Scholar

[27] [27].↵
Frohman EM, Cruz RA, Longmuir R, Steinman L, Zamvil SS, Villemarette-Pittman NR, Frohman TC, Parsons MS: Part II. High-dose methotrexate with leucovorin rescue for severe COVID-19: An immune stabilization strategy for SARS-CoV-2 induced ‘PANIC’attack. Journal of the Neurological Sciences 2020:116935.
Google Scholar

[28] [28].↵
Glowacka P, Rudnicka L, Warszawik-Hendzel O, Sikora M, Goldust M, Gajda P, Stochmal A, Blicharz L, Rakowska A, Olszewska M: The Antiviral Properties of Cyclosporine. Focus on Coronavirus, Hepatitis C Virus, Influenza Virus, and Human Immunodeficiency Virus Infections. Biology 2020, 9:192.
OpenUrl Google Scholar

Network reinforcement driven drug repurposing for COVID-19 by exploiting disease-gene-drug associations

Abstract

1. Introduction