Graph Neural Network-Based Modeling with Subcategory Exploration for Drug Repositioning

Lu, Rong; Liang, Yong; Lin, Jiatai; Chen, Yuqiang

doi:10.3390/electronics13193835

Open AccessArticle

Graph Neural Network-Based Modeling with Subcategory Exploration for Drug Repositioning

¹

Faculty of Innovation Engineering, Macau University of Science and Technology, Macau 999078, China

²

School of Artificial Intellgence, Dongguan Polytechnic, Dongguan 523808, China

³

Peng Cheng Laboratory, Shenzhen 518118, China

⁴

School of Computer Science and Engineering, South China University of Technology, Guangzhou 510641, China

^*

Author to whom correspondence should be addressed.

Electronics 2024, 13(19), 3835; https://doi.org/10.3390/electronics13193835 (registering DOI)

Submission received: 14 September 2024 / Revised: 23 September 2024 / Accepted: 25 September 2024 / Published: 28 September 2024

(This article belongs to the Special Issue Network Security Management in Heterogeneous Networks)

Download

Browse Figures

Versions Notes

Abstract

:

Drug repositioning is a cost-effective approach to identifying new indications for existing drugs by predicting their associations with new diseases or symptoms. Recently, deep learning-based models have become the mainstream for drug repositioning. Existing methods typically regard the drug-repositioning task as a binary classification problem to find the new drug–disease associations. However, drug–disease associations may encompass some potential subcategories that can be used to enhance the classification performance. In this paper, we propose a prototype-based subcategory exploration (PSCE) model to guide the model learned with the information of a potential subcategory for drug repositioning. To achieve this, we first propose a prototype-based feature-enhancement mechanism (PFEM) that uses clustering centroids as the attention to enhance the drug–disease features by introducing subcategory information to improve the association prediction. Second, we introduce the drug–disease dual-task classification head (D3TC) of the model, which consists of a traditional binary classification head and a subcategory-classification head to learn with subcategory exploration. It leverages finer-grained pseudo-labels of subcategories to introduce additional knowledge for precise drug–disease association classification. In this study, we conducted experiments on four public datasets to compare the proposed PSCE with existing state-of-the-art approaches and our PSCE achieved a better performance than the existing ones. Finally, the effectiveness of the PFEM and D3TC was demonstrated using ablation studies.

Keywords:

drug repositioning; prototype; subcategory exploration; graph neural network

1. Introduction

Drug development is crucial for the treatment of diseases [1,2]. Traditional drug development is divided into three stages: the discovery stage, preclinical stage, and clinical stage. Developing a new drug typically requires 10–20 years and costs billions of dollars, which poses considerable challenges. To address these issues, drug repositioning offers an alternative approach by identifying new therapeutic uses for existing approved drugs. This strategy significantly reduces the drug development time and lowers the costs [3,4,5]. Consequently, drug repositioning is widely applied by research-based pharmaceutical companies in their drug-discovery efforts.

Drug-repositioning algorithms can be typically categorized into feature-based, matrix-factorization-based, and network-based methods to predict the associations between drugs and diseases [6,7]. (1) Feature-based methods involve analyzing the chemical and biological properties of drugs, as well as the phenotypic characteristics of diseases, using data-driven machine learning models to predict potential connections between drugs and diseases [8]. (2) Matrix-factorization-based methods decompose the interaction matrix between drugs and diseases into feature vectors through mathematical techniques to compute their similarity, thereby predicting new indications for drugs. This approach can handle large-scale datasets, flexibly integrate more prior information, identify potential connections between drugs and diseases, and aid in the rapid discovery of new therapeutic approaches [9,10]. (3) Network-based drug-repositioning methods aim to use the internal association matrix (e.g., drug–drug or disease–disease matrix) to predict the external associations between drugs and diseases, which can be regarded as a binary classification task for each drug–disease association [11].

With the development of neural networks, network-based algorithms have gradually become the mainstream for drug-repositioning tasks. As a typical approach, Xuan et al. developed a drug-repositioning approach based on convolutional neural networks (CNNs) and bidirectional long short-term memory (BiLSTM) networks, with the BiLSTM module using an attention mechanism to learn path representations of drug–disease pairs by balancing contributions from different paths [12]. Graph convolutional networks (GCNs) are also widely used in this task because the connection nature of association matrices can be transformed into graphs to capture the features of drug–drug or disease–disease associations. For example, Wang et al. utilized bipartite graph convolution operations to model macroscopic and microscopic information exchange between drugs and diseases through protein nodes, thus effectively leveraging interaction relationships to predict potential diseases that drugs may treat [13]. Yu et al. further introduced a hierarchical-attention-based graph convolutional network for drug repositioning by utilizing relationships at different graph convolution layers to enhance the predictive accuracy [14].

Since the drug-repositioning model learns from a small-scale internal association matrix, it struggles to acquire sufficient knowledge for effective drug repositioning [15]. However, the aforementioned network-based methods hardly rely on the introduced information for model training. Meanwhile, we observe that there might be diversity in the associations between each drug and disease, and it is possible to further explore subcategories of these associations to introduce more information for model learning. Therefore, the main challenge of this work is how to uncover this potential diversity or subcategory knowledge to improve the classification performance for drug repositioning.

In this paper, we propose a prototype-based subcategory exploration (PSCE) model to introduce the potential knowledge of subcategories for model training for drug repositioning. First, we propose a prototype-based feature-enhancement mechanism (PFEM) that employs the K-means method [16,17] to obtain the clustering subcategories for each sample, and the clustering centroids are regarded as the class-relevant prototypes [18,19,20]. In the proposed PFEM, prototypes are used to attach attention to original graph features to obtain the enhanced features. Second, we introduce a drug–disease dual-task classification head (D3TC) of the model, which consists of a traditional binary classification head and a subcategory-classification head to learn with subcategory exploration. It leverages finer-grained pseudo-labels of subcategories to introduce additional knowledge for precise drug–disease association classification. We conducted experiments on four public datasets to compare with several existing drug-repositioning methods. In the experiment, the PSCE achieved a state-of-the-art performance. Finally, we conducted ablation studies to demonstrate the effectiveness of the proposed PFEM and D3TC. The contributions of this paper are summarized as follows:

This paper presents a prototype-based feature-enhancement mechanism (PFEM) by making full use of the potential knowledge of subcategories for model training, based on which the classification performance in the drug-repositioning task can be significantly improved.
For the proposed PFEM, we propose a drug–disease dual-task classification head (D3TC) of the model for subcategory exploration to learn the potential feature representation of subcategories by building additional constraints to improve the performance of the drug–disease association predictions.
Experimental comparisons showed that the PSCE could achieve state-of-the-art performance with respect to the best existing drug-repositioning methods on four datasets.

2. Materials and Methods

As shown in Figure 1, in this section, we systematically introduce the PSCE method proposed for the drug-repositioning task. We first introduce the datasets we used, and then we show the overall framework of our model and provide detailed introductions to the two main modules of our model: the PFEM and D3TC modules. Finally, we present the implementation details of our approach.

2.1. Datasets

We used four datasets to demonstrate the effectiveness and evaluate the performance of our method: Gdataset [21], Cdataset [22], Ldataset [14], and LRSSL [23]. These datasets are widely used in the drug-repositioning task. Among them, the Gdataset includes 1933 confirmed drug–disease associations, including 593 drugs from the DrugBank database and 313 diseases from the OMIM database.The Cdataset contains 663 drugs, 409 diseases, and 2352 drug–disease interaction pairs. The Ldataset was compiled from the CTD dataset, which includes 18,416 associations between 269 drugs and 598 diseases. The last dataset, namely, LRSSL, contains 3051 validated drug–disease associations involving 763 drugs and 681 diseases. The specific statistical information of these datasets is shown in Table 1.

In our method, by observing the relationship between the disease and drug features in the feature space, we propose a novel feature that combines clustering features to calculate the similarity between drugs and diseases. To better interpret the features, we also propose a method that divides the binary classification task into more subtasks through unsupervised clustering so that the model can better distinguish hard samples.

2.2. Overview

We used the drug–disease association matrix, drug–drug similarity matrix, and disease–disease similarity matrix to construct a graph network structure and obtain potential drug–disease relationships. The drug–disease association matrix X represents the known associations between drugs and diseases and is a binary

p * q

matrix, where p and q represent the numbers of drug and disease types, respectively. Each element

X_{i j}

in X indicates the association between drug

R_{i}

and disease

D_{j}

, where if there is an association,

X_{i j} = 1

, and otherwise,

X_{i j} = 0

:

X_{i j} = \{\begin{matrix} 1 & If R_{i} is associated with D_{j} \\ 0 & otherwise \end{matrix}

(1)

The drug–drug similarity matrix R represents the similarity between drugs and is a

p * p

matrix, where p is the number of drug types. Each element

R_{i j}

in R represents the degree of similarity between the i-th drug and the j-th drug, which is specifically defined as

R_{i j} = \{\begin{matrix} k & If R_{i} is associated with R_{j}, 0 < k < 1 \\ 0 & otherwise \end{matrix}

(2)

Similarly, the disease–disease similarity matrix D represents the similarity between diseases and is defined as

D_{i j} = \{\begin{matrix} k & If D_{i} is associated with D_{j}, 0 < k < 1 \\ 0 & otherwise \end{matrix}

(3)

The purpose of the drug-repositioning task is to predict unknown potential associations between drugs and diseases by studying the similarity between drugs, the similarity between diseases, and the known associations between drugs and diseases.

2.3. Model Architecture

Existing methods have already shown the effectiveness of a GNN in constructing associations between drugs and diseases. Our method takes the drug similarity matrix R and the disease similarity matrix D as the input of the network to construct the corresponding graph structures

G_{R}

and

G_{D}

, respectively, according to the element adjacency relationship. The obtained graph structures are then fed into the graph neural network for preliminary feature extraction, which results in drug features

F_{R} = G (R, G_{R})

and disease features

F_{D} = G (D, G_{D})

. To better represent the features of similar drugs/diseases, we used a clustering feature-enhancement method (PFEM) to strengthen the expression ability of the features, thus obtaining the enhanced drug features

\hat{F_{R}} = f (F_{R})

and disease features

\hat{F_{D}} = f (F_{D})

. We obtained the drug–disease similarity features by unfolding the obtained drug features and disease features in the form of a tensor product, which was then used for the predictions:

F_{R_D} = [\begin{matrix} \hat{F_{R}^{1}} ⨁ \hat{F_{D}^{1}}, & \hat{F_{R}^{1}} ⨁ \hat{F_{D}^{2}}, & \dots, & \hat{F_{R}^{1}} ⨁ \hat{F_{D}^{q}} \\ \hat{F_{R}^{2}} ⨁ \hat{F_{D}^{1}}, & \hat{F_{R}^{2}} ⨁ \hat{F_{D}^{2}}, & \dots, & \hat{F_{R}^{2}} ⨁ \hat{F_{D}^{q}} \\ \dots & \dots & \dots & \dots \\ \hat{F_{R}^{p}} ⨁ \hat{F_{D}^{1}}, & \hat{F_{R}^{p}} ⨁ \hat{F_{D}^{2}}, & \dots, & \hat{F_{R}^{p}} ⨁ \hat{F_{D}^{q}} \end{matrix}

(4)

where p is the number of drug types, q is the number of disease types, and ⨁ represents the concatenation operation.

We then used the drug–disease association matrix

M_{R_D}

as the label to supervise the learning of these features. In previous methods, a simple decoder was used to parse the features to achieve classification, but we believe that simple binary classification cannot distinguish some difficult samples, and thus, we propose a new classification head (D3TC) to improve the classification performance and obtain the final prediction probability matrix

Y = D (\hat{F_{R}}, \hat{F_{D}})

.

2.4. Prototype-Based Feature-Enhancement Mechanism

In order to obtain the underlying associations between drugs and between diseases, previous methods often relied on the k-nearest neighbor graph of the similarity matrix to construct stronger similarity. However, in this paper, we believe that features with closer clustering in the feature space have stronger similarity. To enhance this similarity, and thus, obtain more subtle associations between diseases and drugs, we propose a prototype-based feature-enhancement method (PFEM).

We used the features extracted by the graph neural network as the initial features for enhancement. For the drug features

F_{R} \in R^{p * s}

, where p is the number of drug types and s is the feature dimension, we performed k-means clustering on the features to group the p drugs into k clusters and obtained the feature of each cluster center

{\bar{F}}_{R}^{i} (0 < i < k)

. We then fused each drug’s own feature

F_{R}^{j} (0 < j < p)

with the feature of the cluster center

{\bar{F}}_{R}^{i}

it belonged to to obtain the enhanced features

\hat{F_{R}} = δ (F_{R}, {\bar{F}}_{R}^{i})

. Specifically, we used an attention mechanism to acquire more representative features. Similarly, for the disease features

F_{D} \in R^{q * s}

, we adopted the same method to obtain the enhanced disease features

\hat{F_{D}} = δ (F_{D}, {\bar{F}}_{D}^{i})

.

2.5. Drug–Disease Dual-Task Classification Head

Although we obtained representative features, predicting the potential similarity probability between drugs and diseases is still a challenging task, as there are still some difficult samples. The traditional decoder treats this prediction task as a binary classification problem that results in classification results with high inter-class similarity, which hinders the formation of diverse features. To obtain a better prediction performance, we propose a drug–disease dual-task classification head (D3TC).

In addition to the binary classification task of predicting whether there is an association between a drug and a disease, we further extended each class into T sub-classes that represent different degrees of relevance and irrelevance (e.g., extremely irrelevant, possibly irrelevant, possibly relevant, extremely relevant). This encourages the model to not only focus on the differences in binary classification but also on the differences in different degrees, thus ultimately obtaining a more subtle feature representation:

\begin{matrix} Y_{p} \to Y_{c} = \{\begin{matrix} (0, . . ., 0, Y_{c}^{T + 1}, . . ., Y_{c}^{2 T}), & Y_{p} = 1 \\ (Y_{c}^{1}, . . ., Y_{c}^{T}, 0, . . ., 0), & Y_{p} = 0 \end{matrix} \end{matrix}

(5)

where

Y_{p} \in {0, 1}

is the binary classification label,

Y_{c} \in {0, 1}^{2 T}

is the one-hot pseudo-label for the molecular sub-classes, and → represents the process of using the original labels to generate a subcategory label.

First, we trained the binary classification model until it converged. Then, we extracted deep features for each sample and obtained the pseudo-labels for the sub-classes through unsupervised clustering. Finally, we jointly trained the network using both the binary classification labels and the sub-class pseudo-labels. To better train the network, we used a weighted binary cross-entropy loss to supervise the binary classification task:

L_{w b c e} = - \sum_{j} w_{j} \sum_{i} y_{j i} l o g (\hat{y_{j i}}) + (1 - y_{j i}) l o g (1 - \hat{y_{j i}})

(6)

At the same time, we introduced focal loss and center loss to learn the knowledge of the pseudo-labels. This allowed us to bring the samples of the same class closer in the feature space and push the samples of different classes farther apart. By introducing focal loss, we reduced the weight of the easy samples and focused more on the difficult samples, which helped to push the different classes apart in the feature space:

L_{F o c a l} = - \sum_{i} [{(1 - \hat{y_{i}})}^{γ} y_{i} l o g \hat{y_{i}} + {(\hat{y_{i}})}^{γ} (1 - y_{i}) l o g (1 - \hat{y_{i}})]

(7)

Center loss was used to minimize the intra-class variability by encouraging the feature vectors of the same class to be close to their corresponding class centers. The center loss was defined as

L_{C e n t e r} = \frac{1}{2} \sum_{i = 1}^{N} ∥ x_{i} - c_{y_{i}} ∥_{2}^{2}

(8)

By combining the loss of the binary classification and the sub-class pseudo-labels, we optimized the classification model:

L_{t o t a l} = L_{w b c e} + λ (L_{F o c a l} + L_{C e n t e r})

(9)

3. Results

In this section, we first give the implementation details of the proposed PSCE in Section 3.1 and describe the evaluation metrics in Section 3.2. Then, we give the results of the local leave-one-out 10-time 10-fold cross-validation in Section 3.3 and ablation study in Section 3.4.

3.1. Implementation Details

During the training process, we divided the training samples and validation samples based on the drug–disease association matrix. For each element in the matrix, we could treat it as a sample. We randomly split these samples into a training set and a validation set at a ratio of 9:1, and adopted a 5-fold cross-validation experiment to obtain the model’s performance.

Our model used the Adam optimizer for optimization, with a learning rate of 0.01. The mini-batch size for the model training was set to 2000, and a 5-fold cross-validation was adopted. Our experiments were conducted on PyTorch 1.13.1 and a workstation equipped with a 24 GB NVIDIA RTX3090 GPU. In the PFEM, the number of clustering centers was set to half the number of samples, and in the D3TC, the number of sub-classes T was set to five. In the loss function, the weights in

L_{w b c e}

were set according to the ratio of the number of positive samples to negative samples in the training set. The value of

λ

was set to 0.005.

3.2. Evaluation Metrics

We used two metrics, namely, the area under the receiver operating characteristic (AUROC) [29] and the area under the precision–recall curve (AUPRC) [30], to evaluate the performance of our model. These two metrics are widely used for evaluating the performance of binary classification models. The AUROC measures the trade-off between the true positive rate (TPR) and the false positive rate (FPR) across different classification thresholds. It represents the probability that a randomly selected positive sample will be ranked higher than a randomly selected negative sample by the classifier. In contrast, the AUPRC evaluates the trade-off between the precision and recall across different classification thresholds. It provides a more comprehensive assessment of the classifier’s performance, especially when dealing with imbalanced datasets where the positive and negative classes are significantly unequal.

3.3. Comparison with Existing Methods

In this section, we present the results of the local leave-one-out 10-time 10-fold cross-validation to compare the proposed PSCE method with six representative methods to examine the robustness and effectiveness of our PSCE for discovering novel drug candidates for new diseases without any treatment information on four datasets, which are mentioned in Section 2.1. The six representative methods were MBiRW [22], BNNR [24], iDrug [25], NIMCGCN [26], DRHGCN [27], and DRWBNCF [28]. In this experiment, we used the AUROC and AUPRC metrics to evaluate the performances of methods.

Table 1 presents the quantization results of our PSCE method compared with six existing methods. In this table, we highlighted the best and second-best performances in red and blue, respectively. The results demonstrate that our method consistently achieved the best performance across the Gdataset, Cdataset, and Ldataset. In the LRSSL dataset experiment, although our method attained the second-best performance for the AUROC metric, it still achieved the best performance for the AUPRC metric. The last column of the table displays the mean performances across the four datasets, where it shows that our method performed well on all datasets and achieved comprehensive optimality.

In Figure 2, we visualized the mean performance of this experiment on four datasets using a bar chart. This figure demonstrates that our PSCE method outperformed the others and achieved a state-of-the-art performance. To intuitively demonstrate the robustness and effectiveness, we visualized the performance of the 10-time 10-fold cross-validation for each time in Figure 3. We can observe that our method, like other methods, demonstrated consistent results across repeated experiments, where the outcomes remained within a certain range and exhibited no significant random fluctuations. This indicates that the quantification results of our method are robust. The performance stability of our PSCE method was evident, where it consistently maintained a high performance. This visual confirmation aligned with the quantitative results presented in our table, which further verified the effectiveness of our method. Additionally, this stability across various datasets underscored the reliability of our approach in different experimental conditions. The robustness of our method ensures that it can be confidently applied in practical scenarios since it maintains accuracy and efficiency. Overall, these observations highlight the strength and dependability of our PSCE method in achieving superior quantification results. Compared with the existing methods, especially NIMCGCN and DRHGCN, which are also GCN-based methods, the proposed PSCE learned additional potential knowledge with subcategory pseudo-labels, and the experimental results demonstrated that our method could indeed achieve better and more robust performance than the existing ones.

3.4. Ablation Study on the Proposed PFEM and D3TC

This work presents a novel drug-repositioning model (PSCE) that incorporates two modules: PFEM and D3TC, which were designed for subcategory exploration. To investigate the effectiveness of these two modules, we conducted the ablation study detailed in this section. In these experiments, we compared the impacts of different combinations of the two modules, with the quantification results reported in Table 2.

In Table 2 and Table 3, we see that when using each module individually, only a comparable performance could be achieved. However, combining both modules yielded the best performance; this even led to significant improvements, such as an increase of about 0.1–0.2 on the Cdataset. This not only indicates that both modules are effective but also that they are complementary. By integrating the two modules, they can leverage each other’s strengths, thus resulting in superior performance. Figure 4 visually illustrates the quantization results described above with a line chart. We can observe that combining the two modules achieved significant and stable improvements over using them individually. This visual representation further validated the effectiveness of our method.

4. Conclusions

In conclusion, our proposed PSCE model represents a significant advancement in the field of drug repositioning by effectively incorporating subcategory information into the prediction process. Through the innovative use of a prototype-based feature-enhancement mechanism (PFEM) and a dual-task classification head (D3TC), we demonstrated that it is possible to achieve more precise and reliable drug–disease association predictions. The PFEM’s clustering centroids and the D3TC’s subcategory exploration enable our model to leverage finer-grained pseudo-labels, thus providing a richer source of information compared with traditional binary classification methods. Experimental results on four public datasets confirmed that our PSCE model outperformed the current state-of-the-art approaches, which underscored the potential of our method to improve the accuracy and efficiency of drug-repositioning tasks. The effectiveness of both PFEM and D3TC was further validated through comprehensive ablation studies, which highlighted the robustness and applicability of our approach.

Author Contributions

Conceptualization, R.L. and Y.L.; methodology, R.L.; software, J.L.; validation, R.L. and J.L.; formal analysis, R.L.; investigation, R.L.; resources, Y.L.; data curation, R.L.; writing—original draft preparation, R.L.; writing—review and editing, J.L.; visualization, J.L.; supervision, Y.L.; project administration, R.L. and Y.C.; funding acquisition, Y.L. All authors read and agreed to the published version of this manuscript.

Funding

This work was supported by the Artificial Intelligence Technology Application Research and Service Center of Dongguan Polytechnic School-Level Fund Project of Dongguan Polytechnic (no. 2023c27), and was also supported by SSL Sci-tech Commissioner Program Project (no. 20234400-01KCJ-G).

Data Availability Statement

The data underlying this article are available in our provided github repository at https://github.com/lu-rong/PSCE_DR/ (accessed on 13 September 2024).

Conflicts of Interest

The authors declare no conflicts of interest.

References

Morgan, S.; Grootendorst, P.; Lexchin, J.; Cunningham, C.; Greyson, D. The cost of drug development: A systematic review. Health Policy 2011, 100, 4–17. [Google Scholar] [CrossRef]
Ingber, D.E. Human organs-on-chips for disease modelling, drug development and personalized medicine. Nat. Rev. Genet. 2022, 23, 467–491. [Google Scholar] [CrossRef]
Jourdan, J.P.; Bureau, R.; Rochais, C.; Dallemagne, P. Drug repositioning: A brief overview. J. Pharm. Pharmacol. 2020, 72, 1145–1151. [Google Scholar] [CrossRef]
Jarada, T.N.; Rokne, J.G.; Alhajj, R. A review of computational drug repositioning: Strategies, approaches, opportunities, challenges, and directions. J. Cheminformatics 2020, 12, 1–23. [Google Scholar] [CrossRef] [PubMed]
Hua, Y.; Dai, X.; Xu, Y.; Xing, G.; Liu, H.; Lu, T.; Chen, Y.; Zhang, Y. Drug repositioning: Progress and challenges in drug discovery for various diseases. Eur. J. Med. Chem. 2022, 234, 114239. [Google Scholar] [CrossRef] [PubMed]
Yu, J.L.; Dai, Q.Q.; Li, G.B. Deep learning in target prediction and drug repositioning: Recent advances and challenges. Drug Discov. Today 2022, 27, 1796–1814. [Google Scholar] [CrossRef] [PubMed]
Dang, Q.; Liang, Y.; Ouyang, D.; Miao, R.; Ling, C.; Liu, X.; Xie, S. Improved Computational Drug-Repositioning by Self-Paced Non-Negative Matrix Tri-Factorization. IEEE/ACM Trans. Comput. Biol. Bioinform. 2022, 20, 1953–1962. [Google Scholar] [CrossRef]
Hou, L.; Samaras, D.; Kurc, T.M.; Gao, Y.; Davis, J.E.; Saltz, J.H. Patch-based convolutional neural network for whole slide tissue image classification. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 2424–2433. [Google Scholar]
Peska, L.; Buza, K.; Koller, J. Drug-target interaction prediction: A Bayesian ranking approach. Comput. Methods Programs Biomed. 2017, 152, 15–21. [Google Scholar] [CrossRef]
Ceddia, G.; Pinoli, P.; Ceri, S.; Masseroli, M. Matrix factorization-based technique for drug repurposing predictions. IEEE J. Biomed. Health Inform. 2020, 24, 3162–3172. [Google Scholar] [CrossRef]
Zeng, X.; Zhu, S.; Liu, X.; Zhou, Y.; Nussinov, R.; Cheng, F. deepDR: A network-based deep learning approach to in silico drug repositioning. Bioinformatics 2019, 35, 5191–5198. [Google Scholar] [CrossRef]
Xuan, P.; Ye, Y.; Zhang, T.; Zhao, L.; Sun, C. Convolutional neural network and bidirectional long short-term memory-based method for predicting drug–disease associations. Cells 2019, 8, 705. [Google Scholar] [CrossRef]
Wang, Z.; Zhou, M.; Arnold, C. Toward heterogeneous information fusion: Bipartite graph convolutional networks for in silico drug repurposing. Bioinformatics 2020, 36, i525–i533. [Google Scholar] [CrossRef] [PubMed]
Yu, Z.; Huang, F.; Zhao, X.; Xiao, W.; Zhang, W. Predicting drug–disease associations through layer attention graph convolutional network. Briefings Bioinform. 2021, 22, bbaa243. [Google Scholar] [CrossRef] [PubMed]
Chen, H.G.; Zhou, X.H. MNBDR: A Module Network Based Method for Drug Repositioning. Genes 2021, 12, 25. [Google Scholar] [CrossRef] [PubMed]
Ahmed, M.; Seraj, R.; Islam, S.M.S. The k-means algorithm: A comprehensive survey and performance evaluation. Electronics 2020, 9, 1295. [Google Scholar] [CrossRef]
Ikotun, A.M.; Ezugwu, A.E.; Abualigah, L.; Abuhaija, B.; Heming, J. K-means clustering algorithms: A comprehensive review, variants analysis, and advances in the era of big data. Inf. Sci. 2023, 622, 178–210. [Google Scholar] [CrossRef]
Zhou, T.; Wang, W.; Konukoglu, E.; Van Gool, L. Rethinking semantic segmentation: A prototype view. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA, 18–24 June 2022; pp. 2582–2593. [Google Scholar]
Zhu, X.; Toisoul, A.; Perez-Rua, J.M.; Zhang, L.; Martinez, B.; Xiang, T. Few-shot action recognition with prototype-centered attentive learning. arXiv 2021, arXiv:2101.08085. [Google Scholar]
Rymarczyk, D.; Struski, Ł.; Górszczak, M.; Lewandowska, K.; Tabor, J.; Zieliński, B. Interpretable image classification with differentiable prototypes assignment. In Proceedings of the European Conference on Computer Vision, Tel Aviv, Israel, 23–27 October 2022; pp. 351–368. [Google Scholar]
Gottlieb, A.; Stein, G.Y.; Ruppin, E.; Sharan, R. PREDICT: A method for inferring novel drug indications with application to personalized medicine. Mol. Syst. Biol. 2011, 7, 496. [Google Scholar] [CrossRef]
Luo, H.; Wang, J.; Li, M.; Luo, J.; Peng, X.; Wu, F.X.; Pan, Y. Drug repositioning based on comprehensive similarity measures and Bi-Random walk algorithm. Bioinformatics 2016, 32, 2664–2671. [Google Scholar] [CrossRef]
Liang, X.; Zhang, P.; Yan, L.; Fu, Y.; Peng, F.; Qu, L.; Shao, M.; Chen, Y.; Chen, Z. LRSSL: Predict and interpret drug–disease associations based on data integration using sparse subspace learning. Bioinformatics 2017, 33, 1187–1196. [Google Scholar] [CrossRef]
Yang, M.; Luo, H.; Li, Y.; Wang, J. Drug repositioning based on bounded nuclear norm regularization. Bioinformatics 2019, 35, i455–i463. [Google Scholar] [CrossRef] [PubMed]
Chen, H.; Cheng, F.; Li, J. iDrug: Integration of drug repositioning and drug-target prediction via cross-network embedding. PLOS Comput. Biol. 2020, 16, 1–20. [Google Scholar] [CrossRef] [PubMed]
Li, J.; Zhang, S.; Liu, T.; Ning, C.; Zhang, Z.; Zhou, W. Neural inductive matrix completion with graph convolutional networks for miRNA-disease association prediction. Bioinformatics 2020, 36, 2538–2546. [Google Scholar] [CrossRef] [PubMed]
Cai, L.; Lu, C.; Xu, J.; Meng, Y.; Wang, P.; Fu, X.; Zeng, X.; Su, Y. Drug repositioning based on the heterogeneous information fusion graph convolutional network. Briefings Bioinform. 2021, 22, bbab319. [Google Scholar] [CrossRef]
Meng, Y.; Lu, C.; Jin, M.; Xu, J.; Zeng, X.; Yang, J. A weighted bilinear neural collaborative filtering approach for drug repositioning. Briefings Bioinform. 2022, 23, bbab581. [Google Scholar] [CrossRef]
Nahm, F.S. Receiver operating characteristic curve: Overview and practical use for clinicians. Korean J. Anesthesiol. 2022, 75, 25–36. [Google Scholar] [CrossRef]
Miao, J.; Zhu, W. Precision–recall curve (PRC) classification trees. Evol. Intell. 2022, 15, 1545–1569. [Google Scholar] [CrossRef]

Figure 1. Illustration of the proposed PSCE pipeline. The middle part of this diagram shows the main process of the entire pipeline. (a) The proposed prototype-based feature enhancement mechanism (PFEM), (b) the feature concatenation and split steps, and (c) the proposed drug–disease dual-task classification head (D3TC).

Figure 2. Bar chart of performance that compares our PSCE and six existing methods. The blue and green bars represent the performances according to the AUROC and AUPRC metrics, respectively.

Figure 3. Visualization of performance generated by our PSCE and existing methods. Top and bottom represent the scatter plot of performances according to AUROC and AUPRC metrics, respectively.

Figure 4. The effects of different combinations of the proposed PFEM and D3TC on four datasets. The top and bottom represent the line charts of performances with AUROC and AUPRC metrics, respectively.

Table 1. Comparison of the proposed PSCE with six respective algorithms for drug repositioning under 10-fold cross-validation on Gdataset, Cdataset, the LRSSL dataset, and the Ldataset. The red and blue markers indicate the best and second-best performances, respectively.

Methods	Metrics	Performance on Datasets (Mean ± Sd)
Methods	Metrics	Gdataset	Cdataset	LRSSL	Ldataset	Avg
MBiRW [22]	AUROC	0.896 ± 0.014	0.920 ± 0.008	0.893 ± 0.015	0.765 ± 0.007	0.868
MBiRW [22]	AUPRC	0.106 ± 0.019	0.161 ± 0.019	0.030 ± 0.004	0.032 ± 0.003	0.082
BNNR [24]	AUROC	0.937 ± 0.010	0.952 ± 0.010	0.922 ± 0.012	0.866 ± 0.004	0.919
BNNR [24]	AUPRC	0.328 ± 0.029	0.431 ± 0.020	0.226 ± 0.021	0.142 ± 0.007	0.282
iDrug [25]	AUROC	0.905 ± 0.019	0.926 ± 0.010	0.900 ± 0.008	0.838 ± 0.005	0.892
iDrug [25]	AUPRC	0.167 ± 0.027	0.250 ± 0.027	0.070 ± 0.009	0.086 ± 0.004	0.143
NIMCGCN [26]	AUROC	0.821 ± 0.011	0.827 ± 0.017	0.777 ± 0.012	0.843 ± 0.001	0.817
NIMCGCN [26]	AUPRC	0.123 ± 0.028	0.174 ± 0.071	0.087 ± 0.010	0.117 ± 0.002	0.125
DRHGCN [27]	AUROC	0.948 ± 0.011	0.964 ± 0.005	0.961 ± 0.006	0.851 ± 0.007	0.931
DRHGCN [27]	AUPRC	0.490 ± 0.041	0.580 ± 0.035	0.384 ± 0.022	0.498 ± 0.012	0.488
DRWBNCF [28]	AUROC	0.923 ± 0.013	0.941 ± 0.011	0.935 ± 0.011	0.824 ± 0.005	0.906
DRWBNCF [28]	AUPRC	0.484 ± 0.027	0.559 ± 0.021	0.349 ± 0.034	0.419 ± 0.006	0.453
PSCE (ours)	AUROC	0.953 ± 0.014	0.964 ± 0.011	0.952 ± 0.016	0.877 ± 0.004	0.936
PSCE (ours)	AUPRC	0.535 ± 0.036	0.582 ± 0.028	0.443 ± 0.032	0.568 ± 0.008	0.532

Table 2. Ablation study using the AUROC metric on the PFEM module and D3TC module. The bold marker indicates the best performance.

Setting		Performance with AUROC Metric on Datasets (Mean ± Sd)
PFEM	D3TC	Gdataset	Cdataset	LRSSL	Ldataset	Avg
√		0.922 ± 0.015	0.945 ± 0.009	0.932 ± 0.014	0.850 ± 0.008	0.912
	√	0.924 ± 0.008	0.946 ± 0.014	0.940 ± 0.011	0.866 ± 0.005	0.919
√	√	0.953 ± 0.014	0.964 ± 0.011	0.952 ± 0.016	0.877 ± 0.004	0.936

Table 3. Ablation study using the AUPRC metric on the PFEM module and D3TC module. The bold marker indicates the best performance.

Setting		Performance with AUPRC Metric on Datasets (Mean ± Sd)
PFEM	D3TC	Gdataset	Cdataset	LRSSL	Ldataset	Avg
√		0.396 ± 0.027	0.458 ± 0.016	0.382 ± 0.018	0.513 ± 0.011	0.437
	√	0.453 ± 0.044	0.488 ± 0.033	0.401 ± 0.025	0.541 ± 0.009	0.470
√	√	0.535 ± 0.036	0.582 ± 0.028	0.443 ± 0.032	0.568 ± 0.008	0.532

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Lu, R.; Liang, Y.; Lin, J.; Chen, Y. Graph Neural Network-Based Modeling with Subcategory Exploration for Drug Repositioning. Electronics 2024, 13, 3835. https://doi.org/10.3390/electronics13193835

AMA Style

Lu R, Liang Y, Lin J, Chen Y. Graph Neural Network-Based Modeling with Subcategory Exploration for Drug Repositioning. Electronics. 2024; 13(19):3835. https://doi.org/10.3390/electronics13193835

Chicago/Turabian Style

Lu, Rong, Yong Liang, Jiatai Lin, and Yuqiang Chen. 2024. "Graph Neural Network-Based Modeling with Subcategory Exploration for Drug Repositioning" Electronics 13, no. 19: 3835. https://doi.org/10.3390/electronics13193835

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Article metric data becomes available approximately 24 hours after publication online.

Article Menu

Graph Neural Network-Based Modeling with Subcategory Exploration for Drug Repositioning

Abstract

1. Introduction

2. Materials and Methods

2.1. Datasets

2.2. Overview

2.3. Model Architecture

2.4. Prototype-Based Feature-Enhancement Mechanism

2.5. Drug–Disease Dual-Task Classification Head

3. Results

3.1. Implementation Details

3.2. Evaluation Metrics

3.3. Comparison with Existing Methods

3.4. Ablation Study on the Proposed PFEM and D3TC

4. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI