2.1. Identification of DEGs
Based on the differentially expressed genes analysis associated with the gene sets enrichment variation analysis strategy, a total of 80 differentially expressed genes (DEGs) involved in the 44 MSigDB C2 curated gene sets were identified (
Figure 3A,B), and the top 20 DEGs’ expression heatmap is shown in
Figure 3C. Of which, 38 genes involved in the Singh NFE2L2 targets gene sets, Chang dominant negative gene sets immortalized by HPV31 and Lin silenced gene sets by tumor microenvironment were up-regulated (
Tables S1 and S2), including IF16 (interferon-inducible protein 6), IRF9 (interferon regulatory factor 9), IFIT1 (IFN-induced protein 1 with tetratricopeptide repeats), ISG15 (Interferon-stimulated gene 15), BST2 (bone marrow stromal cell antigen 2), OAS3 (2′-5′-oligoadenylate synthetase 3), OAS1 (2′-5′-oligoadenylate synthetase 1), DDX60 (DEAD box polypeptide 60), CYP1A1 (cytochrome P450 1A1), CEACAM6 (carcinoembryonic antigen-related cell adhesion molecule 6), keratin genes KRT81, and so on.
Among the differentially expressed genes associated with enrichment gene sets, 42 genes involved in the 41 gene sets were down-regulated (
Tables S1 and S2), such as the genes involved in the Iizuka (
Table S1) liver cancer progression pathway, including PPIF (peptidylprolyl isomerase F), TMED2 (transmembrane trafficking protein 2 with emp24 domain), SAFB (scaffold attachment factor B), SQLE (squalene epoxidase), PICALM (phosphatidylinositol binding clathrin assembly protein), STIP1 (stress-induced phosphoprotein 1), CYB561 (cytochromes b561), CCT2 (chaperonin 2β with TCP1 domain); the genes sets involved Thum systolic heart failure pathway, including CCNG2 (cyclin G2), TMED2 (transmembrane emp24 domain trafficking protein 2), FH (fumarate hydratase), TAF9B (ATA-box binding protein associated factor 9b), CCT2 (chaperonin-containing t-complex polypeptide 1 beta), transmembrane receptor NOTCH2, PICALM (subfamily A (MS4A), and CCNL2 (cyclin L2); and also Reactome DNA strand elongation, Reactome regulated proteolysis of P75NTR, and other gene sets were downregulated, with logFC form −0.89~−0.27.
In order to obtain a biological interpretation of those genes in the GO and KEGG pathway functional groups, GO and KEGG enrichment analysis were performed with the clueGO plug [
23] in Cystoscape [
24]. Results indicated that those genes that were up-regulated were rich in terms of type I interferon signaling response to virus, defense to other organisms, regulation of viral genome replication, and 2′-5′-oligoadenylate synthetase activity, and those activated may be because of the up-regulation of IRF9, IFI6, IFI27, ISG15, IFIT1, OAS1, and OAS3 (
Figure 4A). Further investigation with the KEGG pathway enrichment analysis showed those up-regulated genes could cause the activation of the IFN-induced pathway, type II interferon signaling pathway, and regulation of protein ISGylation by the ISG15 deconjugating enzyme USP18 pathway (
Figure 4B). The genes that were down-regulated were rich in terms of protein kinase complex, transcription factor TFTC complex-1, the SAGA-complex, and cargo loading into vesicle (
Figure 4A). Further investigation with KEGG pathway enrichment analysis showed those genes could negatively affect the transport of fringe-modified NOTCH to the plasma membrane pathway (
Figure 4B).
2.2. Similar Small Molecule Detection
Detection of the similar small molecule with the Comparative Toxicogenomics Database (CTD) (
http://ctdbase.org/) [
16] and connectivity map (CMAP2) (
https://portals.broadinstitute.org/cmap/) [
17,
18] database provides a better understanding the molecular mechanism of bufadienolide-like chemicals, and its potential value as a novel agent for cancer therapy. Based on the results with detecting the CTD Database, valproic acid, cyclosporine, and estradiol had the most similar target with bufadienolide-like chemicals (
Figure 5). Valproic acid, a histone deacetylase inhibitor, which once was widely used as an antiepileptic, has recently also shown anti-cancer activity in an vitro/vivo model [
25]. Estradiol is a sex hormone with anticancer activity, and is also widely used for the treatment of breast cancer, especially for postmenopausal women [
26,
27,
28].
Based on the results from the CMAP2 database (
https://portals.broadinstitute.org/cmap/) [
17,
18], V03AF, G03GB, C05AX, and C05CX were the top matching drugs with bufadienolide-like chemicals (
Table 1). V03AF, a type of detoxifying agent for antineoplastic treatment, had an opposing effect on the expression of bufadienolide-like chemicals. This result provided evidence for bufadienolide-like chemicals’ potential value as a novel agent for cancer therapy. G03GB, one type of sex hormone and a modulator of the genial system, had the most similar expression profile with bufadienolide-like chemicals. This means the bufadienolide-like chemicals also use estradiol, epimestrol and cyclofenil in breast cancer. C05AX and C05CX are two types of vasoprotectives agents, indicating that bufadienolide-like chemicals also have a potential use as vasoprotectives-like drugs.
From the evidence from detecting the similar small molecules with the CTD database and CMAP2 database, it was indicated that bufadienolide-like chemicals were one kind of steroid with the same physiological activity as estradiol and G03GB (ATC code), with potential value for use in cancer, especially breast cancer.
2.3. The Tissue Specific Co-Expression Network and Breast Cancer Associated Subnetwork Regulated by Bufadienolide-Like Chemicals
It is clear that most of the genes exert their function by collaborating with other genes in the network, which represent rigid molecular machines, cellular structures, or dynamic signaling pathways [
29]. Here, a breast tissue specific co-expression network with DEGs was generated with the TCSBN database (
http://inetmodels.com/) [
19] through the NetworkAnalyst web server (
https://www.networkanalyst.ca/) [
18]. Results indicated that the co-expression networks consisted of 743 nodes and 876 edges (
Figure 6 and
Table 2).
Furthermore, a functional enrichment analysis with KEGG pathways revealed that the co-expression networks with DEGs were enriched in pathways related to tight junction, PPAR signaling pathway, mTOR signaling pathway, influenza A, tuberculosis, N-Glycan biosynthesis, terpenoid backbone biosynthesis, Notch signaling pathway, regulation of cyclin-dependent protein kinase activity, and steroid biosynthesis (
Table 2). The GO BP term enrichment analysis showed those genes mostly involved in the establishment or maintenance of cell polarity, triglyceride metabolic process, protein targeting to membrane, defense response to virus, tuberculosis, post-translational protein modification, coenzyme biosynthetic process, gamete generation, transcription, DNA-dependent, positive regulation of translation, endoplasmic reticulum unfolded protein response, regulation of cyclin-dependent protein kinase activity, steroid biosynthetic process, regulation of the transcription of DNA-dependent, intra-Golgi vesicle-mediated transport term, and other rigid molecular machines in the biological process.
Based on the novel differentially regulated sub-networks detection tool, PhenomeScape [
20], which could combine the fold changes of genes into the knowledge of networks and disease phenotypes, a series of differentially regulated sub-networks associated with phenotypes were identified with the random walk algorithm. In this research, seven phenotypes related to breast cancer were selected as the seed phenotypes (Table 5); subsequently, a total of 19 differentially regulated sub-networks enriched in the breast cancer phenotype related subnetwork were identified (
Table 3). The sub-networks distributed by bufadienolide-like chemicals included RNA splicing (
p-value = 2.00 × 10
−3), apoptotic process (
p-value = 2.00 × 10
−3), extracellular matrix organization (
p-value = 1.00 × 10
−3), canonical Wnt signaling pathway (
p-value = 2.20 × 10
−2), synaptic transmission (
p-value = 1.40 × 10
−2), negative regulation of the JAK-STAT cascade (
p-value = 4.20 × 10
−2), adherens junction organization (
p-value = 3.80 × 10
−2), BMP signaling pathway (
p-value = 4.10 × 10
−2), negative regulation of cell migration (
p-value = 1.30 × 10
−2), and activation of signaling protein activity involved in the unfolded protein response (
p-value = 1.90 × 10
−2) (
Figure 7).
The subnetwork A (
Figure 7A), related to the RNA splicing function, was the first identified dysregulation subnetwork. It showed the genes involved in the mRNA splicing spliceosome were down-regulated, including the serine- and arginine- rich splicing factor members, SRSF4, SRSF5, and SRSF6, and peroxisome proliferator activated receptor gamma coactivator (PPARGC1A). The apoptotic process (
Figure 7B) could have been dysregulated by bufadienolide-like chemicals, and this dysregulation was performed with the increased expression of SYT11, PARK2, PYHIN1, APC, RNF40, SERPINB3, TIAM2, ITSN1, SH3GL2, CASP1, GATA4, ITSN2, and PDE4DIP. Several cancer signaling pathways, including the Wnt signaling pathway, the JAK-STAT signaling pathway, and the BMP signaling pathway, also could had been dysregulated by bufadienolide-like chemicals (
Figure 7D,F,H). This suggests that bufadienolide-like chemicals could increase the apoptotic process through a series of pathways or regulation networks. The subnetwork C (
Figure 7C) was mostly related to the extracellular matrix organization being upregulated, including the genes, TIMP4, MMP3, SPARC, DPT, and ACAN. Also in this subnetwork, those genes that referred to the regulation of cell migration were downregulated, including the genes, TNFAIP6, DCN, SPARC, THBS1, and CCL8. This means the increase of the extracellular matrix may have hindered the migration of the tumor. Also, negative synaptic transmission, adherens junction organization, and regulation of cell migration was found in subnetwork E, G, and I (
Figure 7E,G,I). Several metabolic processes were also discovered, including the drug metabolic process, xenobiotic metabolic process, oligosaccharide metabolic process, etc. All other PhenomeScape networks can be found in
Supplementary Figure S1.
Although, there is no evidence to prove the bufadienolide-like chemicals having obvious toxicity with the CEBS database (
https://manticore.niehs.nih.gov/cebssearch/) [
30]. In this research, in order to identify the potential cardiotoxicity of bufadienolide-like chemicals, 11 cardiotoxicity relation phenotypes (Table 6), including arrhythmia (HP:0011675), atrial fibrillation (HP:0005110), atrial flutter (HP:0004749), and other phenotypes, were chosen as seed phenotypes of cardiotoxicity with the aim of searching for the potential dysregulation subnetworks with cardiotoxicity. Results indicated six subnetwork related to membrane depolarization during the action potential (
p-value = 3.70 × 10
−3,
Figure 8A), retinoic acid receptor binding (
p-value = 2.00 × 10
−3,
Figure 8B), GABA receptor binding ((
p-value = 3.00 × 10
−3,
Figure 8C), positive regulation of nuclear division (
p-value = 5.00 × 10
−3,
Figure 8D), negative regulation of viral genome replication (
p-value = 3.00 × 10
−3,
Figure 8E), and negative regulation of viral life cycle (
p-value = 1.00 × 10
−3), which were identified as potential cardiotoxicity subnetworks disturbed by bufadienolide-like chemicals (
Table 4 and
Figure 8). The subnetwork related to membrane depolarization may be the key potential cardiotoxic target of bufadienolide-like chemicals. These were also be observed by several widely used anticancer drugs with cardiotoxicity. For example, Adriamycin, Gleevec, and Herceptin were observed with a membrane depolarization appearance during clinical research [
31,
32].
Hub genes, mostly the highly connected nodes in the network, were identified by node degree and the MCC (Maximal clique centrality) algorithm with the Cytoscape plugin, cytoHubba [
33]. Based on the threshold of the degree (degree > 5) and the MCC algorithm, 10 genes with MCC scores ranging from 126 to 953 were identified as hub genes (
Figure 9A,B). Ten hub genes, including three 2′-5′-oligoadenylate synthetase genes, OAS1, OAS2, and OAS3; five interferon-induced genes, ISG15, IFIT1, IFI6, IFI44, and IFIL44L; and two other genes, including the kelch-like family member 35 (KLHL35) and Golgi Membrane Protein 1 (GOLM1) were identified. These were selected as the hub genes. Further investigation with TCGA [
21] and the Kaplan-Meier databases [
22] indicated that the 10 hub genes except KLHL35 were increased both in the treatment with bufadienolide-like chemicals and the TCGA breast cancer sample (
Figure 9C). Six hub genes, including IFIT1, ISG15, IFI6, GOLM5, KLHL35, and OAS2, were associated with the total survival probability in breast cancer patients (
Figure 9D). Further analysis of the correlation between the hub genes and the total survival time in breast cancer indicated that the high expression of GOLM5, KLHL35, and OAS2 was associated with a better survival probability.