Abstract
Background Plasma cell mastitis (PCM) is a nonbacterial breast inflammation with severe clinical manifestations, yet treatment options for PCM remain limited. Although the mechanism of PCM remains unclear, mounting evidence suggests that dysregulation of the immune system is closely associated with its pathogenesis. Drug combinations, or combination therapy, can offer improved efficacy and reduced toxicity by hitting multiple discrete cellular targets.
Methods We developed a knowledge graph architecture towards immunotherapy and systematic immunity that consists of herbal drug-target interactions, together with a novel scoring system that selects drug combinations based on target-hitting rates and phenotype relativeness. We used this knowledge graph to identify an herbal drug combination for PCM and subsequently evaluated the efficacy of the combination in a clinical trial.
Results Our clinical data suggest that the herbal drug combination significantly reduced the serum levels of various inflammatory cytokines, downregulated serum IgA and IgG levels, reduced the recurrence rate and reversed the clinical symptoms of PCM patients, with improvements in general health status.
Conclusions In summary, we report that an herbal drug combination identified by a knowledge graph can alleviate the clinical symptoms of plasma cell mastitis patients. The herbal drug combination holds promise as an effective remedy for PCM, acting through the regulation of immunoinflammatory pathways and the improvement of systematic immune level. In particular, the herbal drug combination significantly reduced the recurrence rate of PCM, a major obstacle for PCM treatment. Our data suggest that the herbal drug combination may feature prominently in future PCM treatment.
Funding C. Liu’s lab was supported by grants from the Public Health Science and Technology Project of Shenyang (Grant: 22-321-32-18); Y. Yang’s laboratory was supported by the National Natural Science Foundation of China (Grant: 81874301), the Fundamental Research Funds for Central University (Grant: DUT22YG122) and the Key Research project of ‘be Recruited and be in Command’ in Liaoning Province (Personal Target Discovery for Metabolic Diseases).
Clinical trial number ClinicalTrials.gov: NCT05530226
Introduction
Plasma cell mastitis (PCM) is a serious inflammatory condition of the breast that occurs in young and middle-aged women outside pregnancy and lactation1. The main histopathological characteristic of PCM is the infiltration of plasma cells and lymphocytes into breast tissue2. Interestingly, PCM resembles breast cancer in its macroscopic and microscopic characteristics3. In addition, mounting evidence suggests that dysregulation of the immune system is closely associated with the pathogenesis of PCM. The incidence of PCM has been rising quickly, yet treatment options remain limited. In clinical practice, surgical resection and hormone therapy remain the two major treatments for PCM. Unfortunately, neither surgical resection nor hormone therapy prevents recurrence of PCM. Moreover, the serious side effects of hormone therapy are still problematic4. Therefore, the discovery of effective PCM treatments or therapeutics with minimal side effects is clearly warranted.
Traditional Chinese Medicine (TCM) has a long history in the prevention and treatment of complex diseases in eastern Asia5-7. Moreover, for decades, TCM has often been used as alternative or complementary medicine in the west. Indeed, Chinese herbal compounds have been successfully applied in the treatment of PCM in conjunction with western medicine8. In clinical practice, TCM relies on herbal prescriptions or formulae (also called ‘Fangji’), which may exhibit coordinated or synergistic effects through the combination of multiple herbal drugs9. However, the design of formulae in TCM is based solely on the principle of ‘syndrome differentiation’ according to the medicinal properties of herbal entities. Moreover, the molecular mechanisms of the ‘formulae’ in TCM remain elusive.
The knowledge graph has emerged as an advanced technology in the field of artificial intelligence that connects entities in a graph based on their existing intricate relationships10. In particular, a knowledge graph can enable the rational design and identification of combination therapies for a specific disease or phenotype11. Recently, we constructed a knowledge graph for the discovery of herbal drug combinations towards immunotherapy and systematic immunity. Subsequently, we identified a synergistic combination of herbal drugs for PCM via a scoring system based on target hitting rates and phenotype relativeness. To verify our design concept, we conducted a clinical trial of this herbal drug combination (ClinicalTrials.gov Registration: NCT05530226). Our clinical results demonstrated that the herbal drug combination identified by the knowledge graph markedly suppressed various inflammatory cytokines in serum, relieved clinical symptoms and reduced the recurrence rate of PCM patients, with improved global health status.
Results and Discussion
In a previous study, we collected and compiled 240 targets for immunotherapy and systematic immunity from literature data12. More recently, we used text-mining techniques to collect 345 herbal drug entities documented in TCM books or announced by the National Administration of Traditional Chinese Medicine. The relationships between the herbal drugs and the immunotherapy targets were also extracted by text mining and then manually curated for the construction of the knowledge graph. We defined an ontology list of 13 terms describing the relations (edges) between herbal drug entities and immunotherapy targets, based on manual curation of literature data (Supplemental Materials, Edge_ontology_terms). Moreover, we collected the medicinal property attributes of each herbal compound from the Pharmacopoeia of China13. In total, we compiled and integrated 64 medicinal property attributes of the herbal drug entities into the knowledge graph (Supplemental Materials, attributes of the medicinal properties). These medicinal properties are used throughout the design of herbal drug combinations. Finally, we built the knowledge graph with the Neo4j and Py2Neo tools; it consists of 895 nodes and 2197 edges (Figure 1 and Figure supplement 1) and can be visited online (http://www.ikgg.org/).
Subsequently, we employed a scoring system (a so-called recommendation system) to assess and predict synergistic herbal drug combinations from the knowledge graph. Of note, the scoring function identifies herbal drug combinations that are most related to specific phenotypes and that hit the most discrete cellular targets, while still following the principle of ‘syndrome differentiation’ as described in the Pharmacopoeia of China (Materials and Methods). We used this scoring function to select herbal drug combinations consisting of eight herbal entities, because ‘formulae’ consisting of eight drugs are regarded as an ‘essence combination’ in the TCM community. In short, we employed a combination generator that randomly generates combinations of eight herbal drug entities for ten rounds, each of which consists of 1,000 random drug combinations. All the generated drug combinations from the ten rounds were then ranked and evaluated. Notably, the scores in each of the ten rounds followed a normal distribution (Figure 2). The top twenty combinations from each round, as ranked by the scoring function, were further curated and inspected by TCM experts. Remarkably, we identified a specific drug combination that ranked among the top twenty in all ten rounds of calculation. The combination consists of eight herbal drug entities: ‘Fructus forsythiae’, ‘Herba violae’, ‘Uniflower swisscentaury root’, ‘Danshen’, ‘Astragalus’, ‘Taraxacum’, ‘Liquorice’ and ‘Honeysuckle’ (see Figure 2).
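The round-based selection described above can be sketched in a few lines of Python. This is a minimal illustration only, not the authors' implementation: the function names, the toy herb pool (`herb_a` to `herb_f`), the toy weights and the toy scoring function are all hypothetical, and the real scoring function is the one described in Materials and Methods. The defaults mirror the numbers in the text (eight entities, ten rounds, 1,000 combinations per round, top twenty).

```python
import random
from collections import Counter
from typing import Callable, FrozenSet, List, Sequence

Combination = FrozenSet[str]


def run_round(pool: Sequence[str], size: int, n_samples: int, top_k: int,
              score: Callable[[Combination], float],
              rng: random.Random) -> List[Combination]:
    """Draw n_samples random combinations and return the top_k distinct ones."""
    samples = {frozenset(rng.sample(pool, size)) for _ in range(n_samples)}
    # Rank by descending score; ties are broken by name to keep the order stable.
    ranked = sorted(samples, key=lambda c: (-score(c), sorted(c)))
    return ranked[:top_k]


def recurrent_top_combinations(pool: Sequence[str],
                               score: Callable[[Combination], float],
                               size: int = 8, rounds: int = 10,
                               n_samples: int = 1000, top_k: int = 20,
                               seed: int = 0) -> List[Combination]:
    """Return combinations that appear in the top_k of every round."""
    rng = random.Random(seed)
    counts: Counter = Counter()
    for _ in range(rounds):
        counts.update(run_round(pool, size, n_samples, top_k, score, rng))
    return [combo for combo, n in counts.items() if n == rounds]


# Toy usage with hypothetical herbs and weights (NOT data from the study).
TOY_WEIGHTS = {"herb_a": 3, "herb_b": 2, "herb_c": 1,
               "herb_d": 0, "herb_e": 0, "herb_f": 0}


def toy_score(combo: Combination) -> float:
    """Placeholder score: the sum of the toy weights of the herbs."""
    return float(sum(TOY_WEIGHTS[h] for h in combo))


stable = recurrent_top_combinations(sorted(TOY_WEIGHTS), toy_score,
                                    size=2, rounds=10, n_samples=500, top_k=3)
```

The toy pool is kept deliberately small so that every possible pair is sampled in each round; with a pool of 345 entities the sampled combinations would differ far more between rounds.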
Next, we extracted the subgraph for this herbal drug combination and created a network diagram with Cytoscape14 (Figure 3). In total, the eight herbal drug entities in the combination regulate 46 cellular targets related to immunotherapy and systematic immunity, such as HIF-1, iNOS, IL-17, IL-6, IL-1β, mTOR, NLRP3, PD-L1, STAT3, TGF-β, TLR2 and TLR4 (Figure 3). Notably, the medicinal properties of the eight drug entities fall into three major categories: ‘Heat-clearing and detoxicating’, ‘Qi-tonifying’ and ‘Blood-activating menstruation regulating’. Moreover, we conducted pathway analysis for the herbal drug combination using the Reactome knowledgebase15. This analysis indicated that the combination may modulate several pathways related to systematic immunity, including ‘Toll-like receptors cascades’, ‘MAP kinase activation’, ‘Adaptive immune system’, ‘Growth hormone receptor signaling’, ‘Cytokine signaling in immune system’ and ‘Innate immune system’ (Figure 4 and Supplemental Materials). We believe these findings may account for the therapeutic profile of the herbal drug combination towards PCM.
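As a minimal sketch of what extracting the subgraph of a combination amounts to, the Python snippet below filters a list of drug-target edges down to the chosen drugs and collects the distinct targets they reach. The edge representation, the function names and the toy edges (`drug_1` to `drug_3`, relation `regulates`) are assumptions made for illustration; they are not relations taken from the knowledge graph, and the authors used Neo4j and Cytoscape rather than this code.

```python
from typing import Iterable, List, Set, Tuple

Edge = Tuple[str, str, str]  # (drug, relation, target)


def extract_subgraph(edges: Iterable[Edge],
                     combination: Iterable[str]) -> List[Edge]:
    """Keep only the edges whose source drug belongs to the combination."""
    chosen = set(combination)
    return [edge for edge in edges if edge[0] in chosen]


def distinct_targets(subgraph: Iterable[Edge]) -> Set[str]:
    """Collect the distinct targets hit by the drugs in the subgraph."""
    return {target for _, _, target in subgraph}


# Hypothetical toy edges; the drug names and relations are placeholders.
TOY_EDGES: List[Edge] = [
    ("drug_1", "regulates", "IL-6"),
    ("drug_1", "regulates", "STAT3"),
    ("drug_2", "regulates", "IL-6"),
    ("drug_2", "regulates", "TLR4"),
    ("drug_3", "regulates", "mTOR"),
]

sub = extract_subgraph(TOY_EDGES, ["drug_1", "drug_2"])
targets = distinct_targets(sub)
```

Counting distinct targets rather than edges is what makes a shared target (here IL-6) count once, which is the sense in which the eight-drug combination is reported to regulate 46 targets.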
Subsequently, we evaluated the efficacy of the herbal drug combination in a clinical trial for PCM patients (ClinicalTrials.gov number: NCT05530226). TCM experts examined the herbal drug combination (‘Formulae’) against the ‘Jun-Chen-Zuo-Shi’ principle and adjusted the dosage of each drug entity in the combination. The clinical trial is a non-randomized, open-label, single-arm study investigating the efficacy and safety of the herbal drug combination. To reveal the therapeutic effects of the TCM drug combination, we selected patients who were treated with western medicine in the real world as the comparison. The patients were therefore divided into a TCM treatment group (experimental group, EG) and a western medicine treatment group (control group, CG). All patients were diagnosed with PCM by biopsy of breast tissue before being recruited into the clinical trial. Patients in the CG group were treated orally with methylprednisolone, whereas patients in the EG group were treated orally with the herbal drug combination (Materials and Methods). Of note, methylprednisolone is a standard corticosteroid for the treatment of inflammatory conditions in clinical practice16 and was therefore used in the CG group.
Efficacy was assessed every 2 cycles and the results were summarized after six months of treatment. The baseline characteristics are shown in Table S1. Our results demonstrated that several inflammatory cytokines in the serum, including IL-2, IL-4, IL-6, IFN-γ, IL-1β and TNF-α, were significantly downregulated in PCM patients after treatment with the herbal drug combination as compared to the CG group treated with methylprednisolone (Figure 5). We chose to measure these cytokines because they are often regarded as serum cytokine markers during the pathogenic development of PCM17. In addition, we found that serum IgA and IgG levels were markedly suppressed in the herbal drug combination group as compared to the control group (Figure 6). Of note, both IgA and IgG have been found to be crucial diagnostic serum markers for PCM patients18. Moreover, IgA is regarded as a major component of mucosal immunity, which is closely related to the pathogenesis of PCM19,20. Therefore, our data suggest that the herbal drug combination may regulate mucosal immunity and consequently downregulate serum IgA and IgG levels. Furthermore, we conducted standard Quality of Life questionnaire studies for PCM patients in the clinical trial. Our results indicated that the symptom score, pain score and global health status of PCM patients were significantly improved after treatment with the herbal drug combination as compared to the control group (Figure 7). Notably, the recurrence rate of PCM patients in the treatment group was 3.75%, compared with 12.5% in the control group (Table 1). Moreover, the incidence rate of adverse events in the treatment group was 6.25%, compared with 11.25% in the control group (Table 1).
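The reported percentages can be related to patient counts with simple arithmetic. The Python sketch below assumes 80 patients per group, which is an inference from the 160 patients divided 1:1 described in Materials and Methods and is not stated in Table 1 as quoted here; the function name is hypothetical.

```python
def implied_count(rate_percent: float, group_size: int) -> int:
    """Convert a percentage into the whole number of patients it implies."""
    count = rate_percent / 100 * group_size
    rounded = round(count)
    # Reject rates that do not correspond to a whole number of patients.
    if abs(count - rounded) > 1e-9:
        raise ValueError(f"{rate_percent}% of {group_size} is not a whole number")
    return rounded


GROUP_SIZE = 80  # assumption: 160 patients split 1:1 between EG and CG

recurrence_eg = implied_count(3.75, GROUP_SIZE)    # treatment group
recurrence_cg = implied_count(12.5, GROUP_SIZE)    # control group
adverse_eg = implied_count(6.25, GROUP_SIZE)       # treatment group
adverse_cg = implied_count(11.25, GROUP_SIZE)      # control group
```

Under that assumption the four rates correspond to 3, 10, 5 and 9 patients respectively, so each percentage is consistent with a group of 80.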
In addition, we observed that clinical symptoms of PCM patients in the EG group, such as swelling, abscess and fistula, were reversed after treatment with the herbal drug combination (Figure 8, Table_S2 in Supplemental File). The clinical symptom score in the EG group was ∼4.68, compared with ∼5.98 in the CG group (Table 2). These results suggest that the herbal drug combination may achieve better efficacy for the treatment of PCM than methylprednisolone.
With the increasing amount of biomedical data, the traditional drug discovery campaign has been revolutionized by artificial intelligence techniques that accelerate the process and reduce the cost21. In recent years, the knowledge graph, a technique that provides structured relations among entities as well as unstructured semantic relations associated with entities, has been introduced into the domain of drug discovery11. Although the pathogenesis of PCM remains largely unclear, numerous reports indicate that overactivation of immunoinflammatory pathways plays an important role in the development of PCM22. The major advantage of using Traditional Chinese Medicine is that an herbal drug combination can hit multiple discrete targets related to immunoinflammatory pathways, with improved efficacy and reduced toxicity. Herein, for the first time, we showcase the identification of an herbal drug combination for PCM via a knowledge graph. In contrast to the conventional principle of ‘syndrome differentiation’, our knowledge graph, which consists of intricate relations between herbal drug entities and immunotherapy targets and is coupled with a scoring function, can automatically identify novel herbal drug combinations that hit the most discrete targets, making this strategy unique in the TCM community. We acknowledge that including the chemical ingredients of the herbal drugs may affect the outcome of our analysis and design; unfortunately, including chemical ingredients in the knowledge graph is technically difficult because the datasets available for herbal drugs in the field of TCM are limited and incomplete. Nevertheless, our strategy captures the prominent design features of drug combinations for a complex disease such as PCM.
In the future, we plan to include multiple types of omics data such as genomic, transcriptomic, proteomic, metagenomic or metabolomics data into the knowledge graph to reveal novel targets and enable novel drug discovery.
In the present study, our results revealed that the herbal drug combination identified by the knowledge graph could suppress several key immunoinflammatory cytokines, enhance systematic immune levels and significantly reduce the recurrence rate of PCM patients. Of note, recurrence has become a major obstacle after surgical resection for PCM in clinical practice, and hormone therapy may increase the risk of side effects for PCM patients. Therefore, our herbal drug combination approach may provide a new avenue for PCM treatment with a lower recurrence rate and a reduced incidence of adverse events.
Conclusion
In summary, we report the identification and clinical assessment of an herbal drug combination for plasma cell mastitis (PCM). The herbal drug combination holds promise as an effective remedy for PCM, acting through the regulation of immunoinflammatory pathways and the improvement of systematic immune level. In particular, the herbal drug combination significantly reduced the recurrence rate of PCM, a major obstacle for PCM treatment. Our data suggest that the herbal drug combination may feature prominently in future PCM treatment. Moreover, these promising results underscore the potential of knowledge graphs to identify drug combinations or other novel therapeutics across multiple types of human disorders.
Data Availability
All data referred to in the manuscript are available to the public.
Conflict of interest statement
Ling Han is an employee of China Resources Sanjiu Medical & Pharmaceutical; Manji Wang is an employee of Shanghai BeautMed Corporation.
Author Contributions
Q.Y. and G.C. performed data mining and data curation and constructed the knowledge graph; Q.Y., Y.L. and Y.Y. designed the scoring system for the knowledge graph; Q.Y. and Y.Y. participated in the design and styling of the web interface; D.Z. helped to collect and analyze the medicinal properties of the herbal drug entities; L.H. and N.N. participated in the design of the herbal drug combination product; M.W. and G.C. analyzed the clinical data and clinical pictures of the PCM patients; H.Y. and N.N. coordinated the clinical trial and its registration; C.L. and Y.Y. provided funding for the project; Y.Y. and C.L. initiated and coordinated the whole project; Y.Y. wrote the manuscript.
Declaration of Ethics
The protocol was approved by the Institutional Review Board (IRB) of the China Medical University (approval number: 2021PS024T). This study was registered with ClinicalTrials.gov: NCT05530226. All patients provided written informed consent.
Materials and Methods
Construction of knowledge graph towards immunotherapy
We employed data mining techniques to collect and compile 240 targets of immunotherapy and systematic immunity from the PubMed database. Next, we collected and compiled 345 herbal drug entities officially released by the National Health Commission of China and the National Administration of Traditional Chinese Medicine. The relations between the herbal drug entities and the immunotherapy targets were extracted from the PubMed database and then subjected to manual curation. We used thirteen ontology terms (see Supplemental Materials) to describe these relations (edges) in the knowledge graph. Moreover, 64 medicinal property attributes of the herbal drug entities were collected and compiled from the Pharmacopoeia of China. Finally, we built the knowledge graph with the Neo4j and Py2Neo tools; it consists of 895 nodes and 2197 edges.
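To make the structure concrete, the following Python sketch models the three ingredients described above (labelled nodes, typed drug-target edges and medicinal property attributes) as a small in-memory container. It is a plain-Python stand-in under stated assumptions and not the Neo4j/Py2Neo implementation: the class and method names, the node labels `HerbalDrug` and `Target`, the relation `inhibits` and the placeholder `drug_1` are all hypothetical, and the thirteen ontology terms themselves are listed only in the Supplemental Materials.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Set, Tuple


@dataclass
class KnowledgeGraph:
    """Minimal in-memory stand-in for a drug-target knowledge graph."""
    nodes: Dict[str, str] = field(default_factory=dict)        # name -> label
    edges: List[Tuple[str, str, str]] = field(default_factory=list)
    properties: Dict[str, Set[str]] = field(default_factory=dict)

    def add_node(self, name: str, label: str) -> None:
        self.nodes[name] = label

    def add_edge(self, drug: str, relation: str, target: str) -> None:
        # Edges are only allowed from a herbal drug node to a target node.
        if self.nodes.get(drug) != "HerbalDrug":
            raise KeyError(f"unknown herbal drug: {drug}")
        if self.nodes.get(target) != "Target":
            raise KeyError(f"unknown target: {target}")
        self.edges.append((drug, relation, target))

    def add_property(self, drug: str, medicinal_property: str) -> None:
        self.properties.setdefault(drug, set()).add(medicinal_property)

    def targets_of(self, drug: str) -> Set[str]:
        return {t for d, _, t in self.edges if d == drug}


# Toy usage; "drug_1" and the "inhibits" relation are placeholders.
kg = KnowledgeGraph()
kg.add_node("drug_1", "HerbalDrug")
kg.add_node("IL-6", "Target")
kg.add_node("TLR4", "Target")
kg.add_edge("drug_1", "inhibits", "IL-6")
kg.add_edge("drug_1", "inhibits", "TLR4")
kg.add_property("drug_1", "Heat-clearing and detoxicating")
```

In a graph database the same information would be stored as labelled nodes with relationship types and node properties; the container above only illustrates that shape.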
Scoring system of the knowledge graph
To this end, we developed a scoring system to assess and predict synergistic combinations of herbal drug entities (number of drugs, n) as below. Herein, f(x) represents the penalty function: f(x) is set to 0 if the medicinal properties of the drug combination fall under the contraindication rules in the Pharmacopoeia of China; otherwise, f(x) is set to 1. g(x) is the target diversity function given above, in which pi is calculated as pi = w/t, where t is the total number of targets within the knowledge graph and w is the total number of overlapping targets that the drug combination may hit. Hence, the target diversity function measures the diversity of the targets that the drug combination may hit. In other words, if each drug entity in the combination hits distinct targets, g(x) is set to 1. The last term of the scoring system measures the relativeness of each drug entity in the combination and is calculated as follows. In brief, h1 represents the target hitting rate of each drug entity in the combination and is calculated as h1 = ni/t, where ni is the number of targets hit by each drug entity in the combination and t, again, is the total number of targets constituting the knowledge graph for the disease. Notably, the concept of hitting rates towards discrete targets has been used in scoring functions for the selection of synergistic drug combinations23.
h2 represents the phenotype relativeness of each drug entity in the combination and is calculated as h2 = c2 * 1/x, where x is the number of drug entities in the combination and c2 is a parameter: if the drug entity is related to the phenotype of the disease (that is, it co-occurs with the disease phenotype in the literature), c2 is set to 1; otherwise, c2 is set to 0. h3 represents the literature relativeness, or confidence, of each drug entity in the combination and is calculated as follows, in which l is the number of studies or publications that validated the association of the drug entity with the specific disease (in this knowledge graph, cancer immunotherapy), and j and k indicate whether the relation of the drug entity with the disease has been validated in cell lines or in patient (or animal) tissues, respectively. Namely, if the drug entity was validated in cancer cell lines or in patient tissues, j or k is set to 1, respectively; otherwise, j or k is set to 0. c3 is a parameter and is set to 1 here. Therefore, a high h3 score indicates that the drug combination is more relevant to cancer immunotherapy, with high confidence of literature relativeness. Collectively, our scoring system can be used to select drug combinations that are most relevant to disease phenotypes and that hit the most discrete targets related to immunotherapy.
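The terms whose definitions are spelled out in the text, namely the penalty f(x), the hitting rate h1 = ni/t and the phenotype relativeness h2 = c2 * 1/x, can be sketched in Python as follows. The function and argument names are hypothetical, and the example numbers are illustrative only. The sketch deliberately stops at the individual terms: it does not implement g(x) or h3, and it does not combine the terms into a total score.

```python
def penalty(violates_contraindication: bool) -> int:
    """f(x): 0 if the combination breaks a contraindication rule, else 1."""
    return 0 if violates_contraindication else 1


def target_hitting_rate(n_hit: int, total_targets: int) -> float:
    """h1 = ni / t for a single drug entity."""
    if total_targets <= 0:
        raise ValueError("total_targets must be positive")
    return n_hit / total_targets


def phenotype_relativeness(related_to_phenotype: bool, n_drugs: int) -> float:
    """h2 = c2 * 1/x, with c2 = 1 if the drug co-occurs with the phenotype."""
    if n_drugs <= 0:
        raise ValueError("n_drugs must be positive")
    c2 = 1 if related_to_phenotype else 0
    return c2 * (1 / n_drugs)


# Illustrative values: a drug hitting 12 of the 240 targets, in a combination
# of eight drugs, that co-occurs with the disease phenotype in the literature.
h1_example = target_hitting_rate(12, 240)
h2_example = phenotype_relativeness(True, 8)
f_example = penalty(violates_contraindication=False)
```

Because f(x) takes only the values 0 and 1, it presumably acts as a gate that removes contraindicated combinations from consideration, although how it enters the final score is not restated here.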
Design of the clinical trial
In brief, 160 female patients diagnosed with plasma cell mastitis (PCM) at Shengjing Hospital Affiliated to China Medical University were recruited into the clinical trial between January 2021 and February 2022. Patients were randomly divided 1:1 into an experimental group (EG) and a control group (CG). Notably, to demonstrate the therapeutic effect of the TCM drug combination, we selected patients who were treated with western medicine in the real world during the same period. The patients were therefore divided into a TCM treatment group (experimental group) and a western medicine treatment group (control group). There was no significant difference between the two groups in baseline data such as age, body mass index, clinical classification, marriage and child-bearing history (Supplemental File, Table_S1). Patients in the CG group were treated orally with methylprednisolone tablets, 20 mg once a day. Patients in the EG group were treated orally with one 20 g bag of the herbal drug combination twice a day, once in the morning and once in the evening, for 2 months. The herbal drug combination was prepared as granules according to the following formula: Taraxacum 15 g, Fructus forsythiae 15 g, Honeysuckle 10 g, Uniflower swisscentaury root 8 g, Herba violae 20 g, Danshen 10 g, Astragalus 20 g, Liquorice 8 g. The granules were prepared and provided by Shengjing Hospital Affiliated to China Medical University.
Clinical trial protocol
The clinical trial of the herbal drug combination was registered at ClinicalTrials.gov under the title “A Single Arm Study of Traditional Chinese Medicine for Plasma Cell Mastitis” with the registration code NCT05530226. The detailed clinical trial protocol is provided as a separate document in the Supplemental Files, named ‘PCM_Clinical_Protocol’.
Measurement of serum inflammatory cytokines by ELISA assay
Venous blood from the CG and EG groups was collected in sterile non-anticoagulant test tubes before and after treatment. The level of CRP was measured by immune transmission turbidimetry on an automatic biochemical analyzer, following the procedure of the CRP kit. The levels of serum cytokines were measured by ELISA (Elabscience) according to the manufacturer’s instructions.
Measurement of serum immunoglobulin level
Venous blood from PCM patients in the two groups was collected in sterile non-anticoagulant tubes before and after treatment. Serum IgG and IgA were measured by rate scattering turbidimetry using an Array 360 System automatic specific protein analyzer (Beckman Company, USA).
Assessment of clinical symptoms of PCM patients
The clinical symptoms were evaluated by an attending physician with board certification in pathology. The patients were scored before and after treatment according to the standard rating scale for PCM (see Supplemental Materials).
Statistics
All data are expressed as mean ± SEM. Multiple group comparisons of quantitative data were performed using one-way analysis of variance (ANOVA) followed by Tukey’s test, whereas pairwise comparisons were performed using the t test in GraphPad Prism 8 (GraphPad Software, La Jolla, CA, USA). Results were considered statistically significant at p<0.05.
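As a minimal sketch of the summary statistics named above, the Python snippet below computes mean ± SEM for one sample and the t statistic for a pairwise comparison. It assumes an unpaired Student's t test with pooled variance, which the text does not specify, and it uses only the standard library, so it returns the t statistic and degrees of freedom without a p value. The function names are hypothetical and the two samples are made-up numbers, not study data.

```python
import math
import statistics
from typing import Sequence, Tuple


def mean_sem(values: Sequence[float]) -> Tuple[float, float]:
    """Return the mean and the standard error of the mean (SD / sqrt(n))."""
    n = len(values)
    return statistics.mean(values), statistics.stdev(values) / math.sqrt(n)


def student_t(a: Sequence[float], b: Sequence[float]) -> Tuple[float, int]:
    """Unpaired Student's t statistic (pooled variance) and degrees of freedom."""
    na, nb = len(a), len(b)
    df = na + nb - 2
    pooled_var = ((na - 1) * statistics.variance(a)
                  + (nb - 1) * statistics.variance(b)) / df
    std_err = math.sqrt(pooled_var * (1 / na + 1 / nb))
    return (statistics.mean(a) - statistics.mean(b)) / std_err, df


# Hypothetical values for illustration only (NOT measurements from the trial).
group_a = [1.0, 2.0, 3.0, 4.0, 5.0]
group_b = [2.0, 3.0, 4.0, 5.0, 6.0]

mean_a, sem_a = mean_sem(group_a)
t_stat, dof = student_t(group_a, group_b)
```

The p value would then be read from the t distribution with `dof` degrees of freedom, which is the step a statistics package performs.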
Acknowledgments
Y. Yang’s laboratory was supported by the National Natural Science Foundation of China (Grant: 81874301), the Fundamental Research Funds for Central University (Grant: DUT22YG122) and the Key Research project of ‘be Recruited and be in Command’ in Liaoning Province (Personal Target Discovery for Metabolic Diseases); C. Liu’s lab was supported by grants from the National Natural Science Foundation of China (No. 81572609), China Medical University Major Construction Project (No. 2017ZDZX05) and Liaoning Colleges Innovative Talent Support Program (Cancer Stem Cell Origin and Biological Behavior).