1. Introduction
B-cell epitopes are regions on the surface of antigen molecules that are specifically recognized and bound by antibodies, playing a critical role in immune responses. Accurate prediction of B-cell epitopes (BCEs) is essential for understanding antigen–antibody interactions and holds significant promise for vaccine design, immunodiagnostics, and therapeutic antibody development [1]. However, traditional experimental techniques such as X-ray crystallography and nuclear magnetic resonance (NMR) spectroscopy, which can provide precise three-dimensional structural information, are often time-consuming, expensive, and unsuitable for large-scale applications [2]. Consequently, developing efficient and accurate computational methods for B-cell epitope prediction has become a focal point of research in this field.
Early computational approaches can be broadly categorized into sequence-based and structure-based methods. Sequence-based methods, such as BcePred, rely on physicochemical properties and statistical learning models trained on linear peptide sequences [3]. However, these methods are inherently limited as they fail to account for the three-dimensional structure of proteins, making them insufficient for predicting conformational epitopes. To address this limitation, structure-based methods like ElliPro utilize features such as ellipsoid approximation and protrusion index calculations to identify epitope locations [4]. While these approaches leverage spatial information, they often depend on manually designed heuristics, which restrict their generalizability across diverse antigen structures.
Deep learning has recently driven substantial advancements in biomedical research [5]. Unlike traditional machine learning techniques, deep learning can automatically extract complex, nonlinear features from large datasets, offering distinct advantages for epitope prediction. Early machine learning models, such as XGBoost and logistic regression, were capable of identifying patterns in known antigen–antibody complexes but struggled to capture the intricate spatial and nonlinear interactions between protein residues [6]. Deep learning methods such as convolutional neural networks (CNNs) and graph neural networks (GNNs) are naturally suited to handling spatial relationships in protein 3D structures, with GNNs in particular effectively capturing spatial dependencies and chemical interactions between residues, thus increasing the accuracy and generalizability of epitope prediction [7,8].
The incorporation of deep learning, particularly graph-based models, into epitope prediction has substantially improved model performance by better representing the structural complexity of proteins [9]. Furthermore, generative models such as variational autoencoders (VAEs) have demonstrated significant potential in representation learning. In particular, the vector-quantized VAE (VQ-VAE) is effective at capturing the underlying structural features of proteins by learning discrete representations [10]. A similar idea of discretizing protein structures has also been explored in Foldseek, where the 3Di alphabet encodes local structural motifs to enable rapid structural searches [11].
Building upon these advances, we propose a novel B-cell epitope prediction model, the Graph-based Epitope Prediction Network (GraphEPN). This model combines the strengths of a VQ-VAE variant and a graph transformer, employing a two-stage training strategy to fully harness their capabilities. In the first stage, the VQ-VAE variant is pre-trained on a large-scale protein graph dataset to learn high-quality residue representations, yielding both discrete representations of amino acid microenvironments and continuous structural embeddings that provide a comprehensive feature set for downstream tasks. In the second stage, the graph transformer leverages the pre-trained VQ-VAE by using its fixed encoder and codebook to map protein graph nodes (residues) into continuous and discrete feature representations. These representations, combined with edge features, allow the model to capture long-range interactions and local dependencies among residues [12]. To the best of our knowledge, this is the first time that such a deep learning framework has been applied to B-cell epitope prediction in structural bioinformatics. The model fully integrates node and edge features, exploiting the graph-based information inherent in protein structures and thereby improving feature representation for epitope prediction. Experimental results demonstrate that GraphEPN outperforms existing methods across multiple datasets, excelling in particular at predicting epitopes in complex protein structures with superior accuracy and robustness.
2. Materials and Methods
2.1. Datasets
This study utilizes three independent datasets to pre-train the VQ-VAE, train the graph transformer model, and conduct independent evaluations.
SAbDab_1323 Dataset: SAbDab_1323 is derived from SAbDab [13] and comprises 1323 nonredundant protein chains, each containing at least one epitope residue. This dataset is employed to pre-train the VQ-VAE model, focusing on learning critical residue representations involved in antibody interactions. The resulting representations provide refined feature vectors crucial for downstream tasks, enhancing the model’s ability to capture essential structural information.
SAbDab_665 Dataset: This dataset, also obtained from SAbDab, contains 665 protein chains annotated with BCEs [14]. BCEs are antigen residues that interact with antibodies; here, a residue is labeled as an epitope if any of its heavy atoms lies within 4.0 Å of the antibody. CD-HIT clustering was applied to reduce sequence identity to below 70%, ensuring sequence independence between the pretraining and training datasets. Only antigen chains with at least one epitope residue were retained for model training.
Blind_42 Dataset: The Blind_42 dataset consists of 42 nonredundant protein structures specifically designated for independent testing. Sequence identity between this dataset and SAbDab_1323 and SAbDab_665 was reduced to less than 30% via BLAST+ (version 2.14.0) for sequence alignment and homology filtering. The Blind_42 dataset exhibits significant label imbalance, with only 7.08% of the 12,570 samples labeled as positive (epitope residues), whereas negative samples outnumber positive samples by approximately 13.12 times. This dataset evaluates the model’s generalizability to unseen data and its robustness under highly imbalanced conditions.
2.2. Data Preprocessing and Graph Construction
This study extracts node and edge features from protein residues and constructs 3D graph representations of protein structures based on these features.
2.2.1. Node Feature Extraction
The extracted node features include physicochemical properties, backbone torsion angles (PHI and PSI), relative solvent accessible surface area (rASA), and secondary structure types. These features provide local structural information for each residue.
Given that the Cα coordinate of residue $i$ is $\mathbf{c}_i \in \mathbb{R}^{3}$, its physicochemical properties are represented as follows:
$$\mathbf{p}_i \in \mathbb{R}^{d_p},$$
where $d_p$ denotes the dimension of the physicochemical properties. The secondary structure (SS) types, backbone torsion angles $(\phi_i, \psi_i)$, and rASA values are extracted via the DSSP tool [15,16]. Specifically, SS types are encoded using a one-hot representation, backbone torsion angles $(\phi_i, \psi_i)$ are derived from atomic coordinates, and rASA is computed by normalizing the absolute solvent accessible surface area against standard reference values for each amino acid. The node feature vector for residue $i$ is then represented as follows:
$$\mathbf{x}_i = \left[\, \mathbf{p}_i \,\|\, \mathbf{s}_i \,\|\, \phi_i \,\|\, \psi_i \,\|\, r_i \,\right],$$
where $\mathbf{s}_i$ represents the one-hot encoding of the secondary structure type of residue $i$, $\phi_i$ and $\psi_i$ represent the backbone torsion angles, $r_i$ represents the relative solvent-accessible surface area, and $\|$ denotes concatenation.
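The following is a minimal sketch of how such node features can be assembled with Biopython's DSSP wrapper, assuming an installed mkdssp binary; AAP_TABLE is a hypothetical placeholder for the per-residue physicochemical property vectors and is not part of the published implementation.

```python
import numpy as np
from Bio.PDB import PDBParser
from Bio.PDB.DSSP import DSSP

SS_TYPES = list("HBEGITS-")        # 8-state DSSP secondary-structure codes
AAP_TABLE = {}                     # hypothetical: one-letter residue code -> property vector

def node_features(pdb_path, chain_id, aap_dim=7):
    model = PDBParser(QUIET=True).get_structure("antigen", pdb_path)[0]
    dssp = DSSP(model, pdb_path)   # runs the external mkdssp program
    feats = []
    for key in dssp.keys():
        chain, _ = key
        if chain != chain_id:
            continue
        rec = dssp[key]            # (index, aa, ss, rASA, phi, psi, ...)
        aa, ss, rasa, phi, psi = rec[1], rec[2], rec[3], rec[4], rec[5]
        rasa = float(rasa) if rasa != "NA" else 0.0
        ss_onehot = np.eye(len(SS_TYPES))[SS_TYPES.index(ss if ss in SS_TYPES else "-")]
        aap = AAP_TABLE.get(aa, np.zeros(aap_dim))
        feats.append(np.concatenate([aap, ss_onehot, [phi, psi, rasa]]))
    return np.asarray(feats, dtype=np.float32)
```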
2.2.2. Edge Feature Extraction
Residue interactions are modeled as edges between nodes [17]. For each pair of residues $i$ and $j$, the Euclidean distance is calculated as follows:
$$d_{ij} = \left\| \mathbf{c}_i - \mathbf{c}_j \right\|_2 .$$
To better capture variations in distances, radial basis functions (RBFs) are used to encode the distances:
$$\mathrm{RBF}_k(d_{ij}) = \exp\!\left( -\frac{\left( d_{ij} - \mu_k \right)^2}{2\sigma_k^2} \right), \quad k = 1, \dots, K,$$
where $\mu_k$ represents the center of the $k$-th RBF, $\sigma_k$ is its standard deviation, and $K$ is the total number of RBFs.
In addition to distance features, the relative orientation and rotation between residues are calculated and represented via quaternions:
$$\mathbf{q}_{ij} = \left( q_w, q_x, q_y, q_z \right),$$
where $q_w$ is the real (scalar) part and $q_x$, $q_y$, $q_z$ are the imaginary parts. Finally, the edge feature vector $\mathbf{e}_{ij}$ is constructed by concatenating the RBF-encoded distance features and the quaternion-based geometric information:
$$\mathbf{e}_{ij} = \left[\, \mathrm{RBF}(d_{ij}) \,\|\, \mathbf{q}_{ij} \,\right].$$
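A minimal sketch of these edge features is shown below. The RBF centers, the shared width, and the construction of a per-residue local frame from the backbone N, Cα, and C atoms are common conventions assumed here for illustration; they are not necessarily the authors' exact choices.

```python
import numpy as np
from scipy.spatial.transform import Rotation

def rbf_encode(d, d_min=0.0, d_max=20.0, num_rbf=16, sigma=None):
    centers = np.linspace(d_min, d_max, num_rbf)        # mu_k
    sigma = sigma or (d_max - d_min) / num_rbf          # shared sigma_k
    return np.exp(-((d - centers) ** 2) / (2.0 * sigma ** 2))

def local_frame(n, ca, c):
    """Orthonormal frame of a residue built from its backbone atom coordinates."""
    x = (c - ca) / np.linalg.norm(c - ca)
    v = n - ca
    y = v - np.dot(v, x) * x
    y /= np.linalg.norm(y)
    z = np.cross(x, y)
    return np.stack([x, y, z], axis=1)                  # columns are the frame axes

def edge_features(res_i, res_j):
    """res_* are dicts with 'N', 'CA', 'C' coordinates as numpy arrays."""
    d_ij = np.linalg.norm(res_i["CA"] - res_j["CA"])
    R_i = local_frame(res_i["N"], res_i["CA"], res_i["C"])
    R_j = local_frame(res_j["N"], res_j["CA"], res_j["C"])
    q = Rotation.from_matrix(R_i.T @ R_j).as_quat()     # SciPy order: (x, y, z, w)
    return np.concatenate([rbf_encode(d_ij), q]).astype(np.float32)
```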
2.2.3. Protein 3D Graph Construction
After extracting node and edge features, we construct a graph model based on the 3D structure of the protein [18]. Each residue in the protein is represented as a node in the graph, and interactions between residues are represented as edges. Given the set of residues in a protein $R = \{ r_1, r_2, \dots, r_N \}$, the graph $G = (V, E)$ is constructed, where $V$ denotes the set of nodes and $E$ denotes the set of edges. An edge is added between residues $i$ and $j$ if the Euclidean distance $d_{ij}$ between them is less than or equal to a predefined threshold $\delta$. The formal representation of the graph is as follows:
$$E = \left\{ (i, j) \;\middle|\; d_{ij} \le \delta,\; 1 \le i, j \le N,\; i \ne j \right\}.$$
Here, $N$ represents the number of residues in the protein, and $\delta$ is the distance threshold. Finally, the complete 3D protein graph is constructed via the DGL framework [18]. In DGL, we define nodes based on the residues and add edges between residue pairs that satisfy the distance threshold: we calculate the Euclidean distance between all residue pairs, retain only those pairs that meet the threshold criterion, and store these edges as adjacency relationships in the graph. Each edge is further assigned a set of features, including the distance-based RBF encodings and the quaternion-based orientation features. The final graph representation is implemented in DGL, where node features, edge indices, and edge attributes are defined accordingly.
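A minimal sketch of this construction in DGL is given below, assuming precomputed node features, Cα coordinates, and an edge-feature function such as the one sketched above (returning a 1-D torch tensor); the 10 Å threshold is illustrative, as the paper does not state the value of the cut-off.

```python
import dgl
import torch

def build_protein_graph(ca_coords, node_feats, edge_feat_fn, threshold=10.0):
    # pairwise Cα–Cα distances
    dist = torch.cdist(ca_coords, ca_coords)
    # keep residue pairs within the threshold, excluding self-loops
    src, dst = torch.nonzero((dist <= threshold) & (dist > 0), as_tuple=True)
    g = dgl.graph((src, dst), num_nodes=ca_coords.shape[0])
    g.ndata["feat"] = node_feats
    g.edata["feat"] = torch.stack(
        [edge_feat_fn(int(i), int(j)) for i, j in zip(src, dst)]
    )
    return g
```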
2.3. Model Architecture
2.3.1. Overview
This study proposes a deep learning model that combines a VQ-VAE with a graph transformer to improve B-cell epitope prediction performance. The model operates in two stages. First, a VQ-VAE model is pre-trained on the SAbDab_1323 dataset to learn universal representations of protein residues. In the downstream task, the fixed encoder and codebook are applied to the SAbDab_665 dataset to extract continuous and discrete representations. These features, along with edge features, are further processed by a graph transformer to capture complex interactions between residues and predict whether each residue is an epitope (Figure 1a). This two-stage strategy significantly enhances the prediction accuracy and generalizability.
2.3.2. Pre-Trained VQ-VAE Module
To learn discrete latent representations from protein structures, we employ a VQ-VAE. The VQ-VAE model comprises an encoder, a vector quantization layer, and a decoder (Figure 1b) and is trained in an unsupervised manner. The encoder, which is based on a graph convolutional network (GCN), processes node features from the protein graph [19]. The input for each node is its feature vector $\mathbf{x}_i$, and the output is the latent representation $\mathbf{z}_i$. Each GCN layer updates node features by aggregating information from neighboring nodes:
$$\mathbf{h}_i^{(l+1)} = \sigma\!\left( \sum_{j \in \mathcal{N}(i)} \mathbf{W}^{(l)} \mathbf{h}_j^{(l)} + \mathbf{b}^{(l)} \right),$$
where $\mathbf{h}_i^{(l)}$ is the representation of node $i$ at the $l$-th layer, initialized as $\mathbf{h}_i^{(0)} = \mathbf{x}_i$. Here, $\mathcal{N}(i)$ denotes the set of neighboring nodes of $i$, $\mathbf{W}^{(l)}$ is a trainable weight matrix, $\mathbf{b}^{(l)}$ is a bias term, and $\sigma(\cdot)$ is a nonlinear activation function.
The continuous representation $\mathbf{z}_i$ produced by the encoder is passed to the vector quantization layer, which uses a discrete codebook $\{\mathbf{v}_k\}_{k=1}^{K}$ for quantization. Through nearest-neighbor search, $\mathbf{z}_i$ is mapped to the closest vector in the codebook, expressed as follows:
$$\mathbf{z}_i^{q} = \mathbf{v}_{k^\ast}, \quad k^\ast = \arg\min_{k} \left\| \mathbf{z}_i - \mathbf{v}_k \right\|_2 .$$
The quantization loss $\mathcal{L}_{\mathrm{vq}}$ aligns the codebook vectors with the encoder outputs, while a commitment loss $\mathcal{L}_{\mathrm{commit}}$ keeps the encoder outputs close to their assigned codebook entries, promoting stable codebook updates:
$$\mathcal{L}_{\mathrm{vq}} = \left\| \operatorname{sg}\!\left[ \mathbf{z}_i \right] - \mathbf{z}_i^{q} \right\|_2^2 , \qquad \mathcal{L}_{\mathrm{commit}} = \left\| \mathbf{z}_i - \operatorname{sg}\!\left[ \mathbf{z}_i^{q} \right] \right\|_2^2 ,$$
where $\operatorname{sg}[\cdot]$ denotes the stop-gradient operation, which blocks gradient flow through its argument.
The decoder, which is also based on the GCN architecture, decodes the discrete representation $\mathbf{z}_i^{q}$ back into a reconstruction $\hat{\mathbf{x}}_i$ of the original node features. The reconstruction loss $\mathcal{L}_{\mathrm{rec}}$ is expressed as follows:
$$\mathcal{L}_{\mathrm{rec}} = \left\| \mathbf{x}_i - \hat{\mathbf{x}}_i \right\|_2^2 .$$
To enhance robustness, a masking mechanism is introduced during training, with an associated masking loss $\mathcal{L}_{\mathrm{mask}}$.
Finally, the total loss function of the VQ-VAE model is as follows:
$$\mathcal{L}_{\mathrm{total}} = \alpha \mathcal{L}_{\mathrm{rec}} + \beta \left( \mathcal{L}_{\mathrm{vq}} + \mathcal{L}_{\mathrm{commit}} \right) + \lambda \mathcal{L}_{\mathrm{mask}},$$
where $\alpha$, $\beta$, and $\lambda$ are hyperparameters used to balance the contributions of each loss term.
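The quantization step and its two losses follow the standard VQ-VAE formulation; a minimal PyTorch sketch is shown below, with the straight-through estimator used to pass gradients through the non-differentiable codebook lookup. The codebook size and dimension are illustrative, not values reported in the paper.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class VectorQuantizer(nn.Module):
    def __init__(self, num_codes=512, code_dim=128):
        super().__init__()
        self.codebook = nn.Embedding(num_codes, code_dim)
        nn.init.uniform_(self.codebook.weight, -1.0 / num_codes, 1.0 / num_codes)

    def forward(self, z):                               # z: (num_nodes, code_dim)
        d = torch.cdist(z, self.codebook.weight)        # distances to all code vectors
        idx = d.argmin(dim=1)                           # nearest-neighbour code index
        z_q = self.codebook(idx)
        codebook_loss = F.mse_loss(z_q, z.detach())     # pulls codes toward encoder outputs
        commit_loss = F.mse_loss(z, z_q.detach())       # keeps encoder close to its code
        z_q = z + (z_q - z).detach()                    # straight-through estimator
        return z_q, idx, codebook_loss + commit_loss

# total objective as described above:
# loss = alpha * recon_loss + beta * vq_loss + lam * mask_loss
```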
2.3.3. Graph Attention-Based Epitope Prediction Module
In the downstream task of epitope prediction, the pre-trained VQ-VAE model is utilized with its encoder and codebook parameters kept fixed. On the new dataset, the pre-trained VQ-VAE encodes and quantizes the protein graph to generate, for each node, the continuous representation $\mathbf{z}_i$ and the quantized representation $\mathbf{z}_i^{q}$. These representations are concatenated to form the final node feature $\mathbf{h}_i = [\, \mathbf{z}_i \,\|\, \mathbf{z}_i^{q} \,]$, which is further processed via a graph transformer model to capture the complex relationships between residues (Figure 1c). In each layer, the model updates each node's representation by combining the node features $\mathbf{h}_i$ and edge features $\mathbf{e}_{ij}$ through a multi-head graph attention network (GAT) mechanism [20]:
$$\mathbf{h}_i' = \sigma\!\left( \sum_{j \in \mathcal{N}(i)} \alpha_{ij} \, \mathbf{W} \mathbf{h}_j \right),$$
where $\mathbf{h}_i'$ is the updated feature of node $i$, $\mathbf{W}$ is a trainable weight matrix, and $\alpha_{ij}$ is the attention coefficient, which is calculated as follows:
$$\alpha_{ij} = \frac{\exp\!\left( \mathrm{LeakyReLU}\!\left( \mathbf{a}^{\top} \left[ \mathbf{W}\mathbf{h}_i \,\|\, \mathbf{W}\mathbf{h}_j \,\|\, \mathbf{W}_e \mathbf{e}_{ij} \right] \right) \right)}{\sum_{k \in \mathcal{N}(i)} \exp\!\left( \mathrm{LeakyReLU}\!\left( \mathbf{a}^{\top} \left[ \mathbf{W}\mathbf{h}_i \,\|\, \mathbf{W}\mathbf{h}_k \,\|\, \mathbf{W}_e \mathbf{e}_{ik} \right] \right) \right)},$$
where $\mathbf{a}$ is a learnable attention vector and $\mathbf{W}_e$ projects the edge features.
GAT dynamically assigns attention coefficients to each neighboring residue, allowing the model to focus on the most informative ones. By incorporating edge features, which encode spatial relationships between residues, the model effectively captures structural dependencies, enhancing epitope prediction accuracy.
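A minimal single-head sketch of such an edge-aware attention layer, written with DGL message passing, is shown below; it mirrors the attention formulation above but is an illustrative implementation under our own naming, not the authors' code (a multi-head layer would run several of these in parallel and concatenate the outputs).

```python
import torch
import torch.nn as nn
import torch.nn.functional as F
import dgl.function as fn
from dgl.nn.functional import edge_softmax

class EdgeGATLayer(nn.Module):
    def __init__(self, node_dim, edge_dim, out_dim):
        super().__init__()
        self.W = nn.Linear(node_dim, out_dim, bias=False)
        self.We = nn.Linear(edge_dim, out_dim, bias=False)
        self.attn = nn.Linear(3 * out_dim, 1, bias=False)

    def forward(self, g, h, e):
        with g.local_scope():
            g.ndata["z"] = self.W(h)
            g.edata["ez"] = self.We(e)
            # attention logits from source node, destination node, and edge features
            g.apply_edges(lambda edges: {"s": self.attn(torch.cat(
                [edges.src["z"], edges.dst["z"], edges.data["ez"]], dim=-1))})
            # softmax over each node's incoming edges (its neighbourhood)
            g.edata["a"] = edge_softmax(g, F.leaky_relu(g.edata["s"]))
            # attention-weighted aggregation of neighbour features
            g.update_all(fn.u_mul_e("z", "a", "m"), fn.sum("m", "h_new"))
            return F.elu(g.ndata["h_new"])
```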
Using multi-head attention, the model independently computes the relationships between nodes in different representation subspaces. The graph transformer is composed of multiple stacked transformer layers, each consisting of a multi-head GAT mechanism, residual connections, and normalization layers. The output of each layer is combined with its input through residual connections and enhanced by layer normalization to improve training stability. Finally, the model's output layer applies a linear transformation followed by a sigmoid activation to produce the prediction probability $\hat{y}_i$ of each node being an epitope:
$$\hat{y}_i = \mathrm{sigmoid}\!\left( \mathbf{W}_o \mathbf{h}_i + b_o \right),$$
where $\mathbf{W}_o$ is a trainable weight matrix, $b_o$ is a bias term, and $\hat{y}_i$ is the predicted probability of node $i$ being an epitope.
To improve the model's adaptability to imbalanced data, we combine binary cross-entropy (BCE) loss and Dice loss as the training objective. The Dice loss improves performance by emphasizing the prediction of the minority class, which in this case is the epitope residues. The final loss function is as follows:
$$\mathcal{L} = w_{\mathrm{BCE}} \, \mathcal{L}_{\mathrm{BCE}} + w_{\mathrm{Dice}} \, \mathcal{L}_{\mathrm{Dice}},$$
where $w_{\mathrm{BCE}}$ and $w_{\mathrm{Dice}}$ are weighting coefficients for the two terms.
By jointly optimizing these two loss functions, the graph transformer model effectively captures both local and global information in protein structures, achieving superior performance in epitope prediction tasks.
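A minimal PyTorch sketch of this combined objective is given below; the smoothing constant in the Dice term and the default equal weighting are illustrative choices, not values reported by the authors.

```python
import torch
import torch.nn.functional as F

def dice_loss(probs, targets, eps=1.0):
    # soft Dice loss over per-residue probabilities and binary labels
    inter = (probs * targets).sum()
    return 1.0 - (2.0 * inter + eps) / (probs.sum() + targets.sum() + eps)

def epitope_loss(logits, targets, w_bce=1.0, w_dice=1.0):
    probs = torch.sigmoid(logits)
    bce = F.binary_cross_entropy_with_logits(logits, targets)
    return w_bce * bce + w_dice * dice_loss(probs, targets)
```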
2.4. Model Training and Evaluation
This study adopts a two-stage training strategy. First, the VQ-VAE model is trained in an unsupervised manner on the SAbDab_1323 dataset to learn the discrete and continuous representations of protein residues. The trained VQ-VAE model is subsequently utilized for encoding features from the SAbDab_665 dataset, and the concatenated features are employed for supervised epitope prediction via the graph transformer model.
During the VQ-VAE pretraining phase, the model optimizes the learned representations by minimizing both the reconstruction loss and the quantization loss. A masking mechanism is incorporated, where certain node features are randomly masked, compelling the model to recover these features via neighborhood information. The total loss function is controlled by three key parameters, α, β, and λ, which determine the relative contributions of the reconstruction loss, vector quantization loss, and masking loss, respectively. Their values are empirically determined through grid search and set as follows: α = 1.0, β = 0.25, and λ = 1.5, ensuring a balance between feature preservation and generalization. The Adam optimizer is employed for training, with weight decay applied to prevent overfitting [21]. The optimizer parameters are set as follows: learning rate = 0.01, weight decay = 0.0001, betas = (0.9, 0.999). The learning rate follows a cosine annealing schedule to improve convergence stability. The batch size is set to 64, balancing computational efficiency and generalization performance.
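Restated as code, the pretraining configuration looks roughly as follows; the number of epochs, the schedule length T_max, and the model.total_loss helper are assumptions for illustration, since they are not reported in the paper.

```python
import torch

def pretrain_vqvae(model, loader, epochs=100):
    optimizer = torch.optim.Adam(model.parameters(), lr=0.01,
                                 weight_decay=1e-4, betas=(0.9, 0.999))
    scheduler = torch.optim.lr_scheduler.CosineAnnealingLR(optimizer, T_max=epochs)
    for _ in range(epochs):
        for graphs in loader:                   # batches of 64 protein graphs
            # hypothetical helper returning alpha*L_rec + beta*L_vq + lambda*L_mask
            loss = model.total_loss(graphs)
            optimizer.zero_grad()
            loss.backward()
            optimizer.step()
        scheduler.step()
```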
In the training phase of the graph transformer, multiple loss functions, including the binary cross-entropy loss and the Dice loss, are jointly optimized to mitigate the class imbalance inherent in the dataset [22,23]. The Adam optimizer is employed with the same parameter settings as in the VQ-VAE training phase, and a ReduceLROnPlateau scheduler dynamically adjusts the learning rate based on validation loss trends to ensure stable convergence throughout training. Given the underrepresentation of epitope residues in the data, the loss function weights are adjusted to enhance the model's ability to recognize the minority class. The number of hidden units is set to 128, the batch size to 32, and the dropout rate to 0.4 to mitigate overfitting while maintaining expressiveness. To assess the generalization capability of the model, 5-fold cross-validation is employed [24]. In each fold, the dataset is split into a training set, a validation set, and a test set. The model is trained on the training set, the validation set is used for tuning hyperparameters such as the learning rate, batch size, number of attention heads, hidden units, and dropout rate, and the performance is then evaluated on the test set.
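A minimal sketch of the fold rotation and the plateau-based learning-rate schedule is given below; train_one_epoch and evaluate are hypothetical helpers (the inner validation split and hyperparameter tuning are folded into them), and the scheduler's factor and patience are illustrative.

```python
import torch
from sklearn.model_selection import KFold

def cross_validate(dataset, build_model, epochs=50):
    kf = KFold(n_splits=5, shuffle=True, random_state=42)
    scores = []
    for train_idx, test_idx in kf.split(dataset):
        model = build_model()
        optimizer = torch.optim.Adam(model.parameters(), lr=0.01,
                                     weight_decay=1e-4, betas=(0.9, 0.999))
        scheduler = torch.optim.lr_scheduler.ReduceLROnPlateau(
            optimizer, mode="min", factor=0.5, patience=5)
        for _ in range(epochs):
            val_loss = train_one_epoch(model, dataset, train_idx, optimizer)  # hypothetical
            scheduler.step(val_loss)            # reduce LR when validation loss plateaus
        scores.append(evaluate(model, dataset, test_idx))                     # hypothetical
    return scores
```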
After model training, an evaluation is conducted on the independent blind test dataset, Blind_42. Owing to the class imbalance in the data, multiple evaluation metrics are employed to assess the model's performance, including the F1-score, Matthews correlation coefficient (MCC), balanced accuracy (BACC), area under the ROC curve (AUC), and area under the precision–recall curve (AUPRC) [25]. The F1-score measures the harmonic mean of precision and recall, making it particularly suitable for imbalanced datasets. MCC considers all four elements of the confusion matrix, providing a reliable assessment even when the class distribution is skewed. BACC compensates for class imbalance by computing the mean recall of both classes. AUC quantifies the model's ability to distinguish between positive and negative samples across decision thresholds, while AUPRC focuses on the precision–recall trade-off, which is particularly crucial for epitope prediction given the typically low prevalence of positive samples. As threshold-independent metrics, AUC and AUPRC serve as the primary evaluation criteria.
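All five metrics are available in scikit-learn; the sketch below computes them from binary labels and predicted probabilities. The 0.5 cut-off shown for the threshold-dependent metrics is a placeholder, since in practice the threshold is selected from the precision–recall curve as described in Section 3.3.

```python
import numpy as np
from sklearn.metrics import (f1_score, matthews_corrcoef, balanced_accuracy_score,
                             roc_auc_score, average_precision_score)

def evaluate_predictions(y_true, y_prob, threshold=0.5):
    y_pred = (np.asarray(y_prob) >= threshold).astype(int)
    return {
        "F1":    f1_score(y_true, y_pred),
        "MCC":   matthews_corrcoef(y_true, y_pred),
        "BACC":  balanced_accuracy_score(y_true, y_pred),
        "AUC":   roc_auc_score(y_true, y_prob),
        "AUPRC": average_precision_score(y_true, y_prob),
    }
```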
3. Results
3.1. Feature Engineering
In the baseline model of this study, amino acid property (AAP) features, while contributing to epitope prediction, were not the sole critical factor. To comprehensively evaluate the impact of other features, we systematically analyzed the effects of spatial features and DSSP features on epitope prediction via the SAbDab_665 dataset and explored how these feature combinations could enhance model performance.
First, we evaluated the impact of spatial features. These spatial features were utilized as edge features in the graph structure, effectively capturing spatial relationships between amino acid residues, including precise distance and directional information [26]. For the test set, the addition of spatial features alone improved the model's AUC from 0.735 to 0.802 and the AUPRC from 0.312 to 0.389, as shown in Table 1. This substantial performance improvement underscores the critical role of spatial relationships between residues, when represented as edge features, in enhancing the accuracy of epitope prediction.
Next, we examined the contribution of DSSP features. DSSP features provide valuable information regarding protein secondary structure and solvent accessibility [27]. Upon integrating DSSP features into the model, the AUC increased to 0.785, and the AUPRC rose to 0.346. These results emphasize the importance of local structural information in epitope identification.
More importantly, we assessed the model's performance under the combined influence of spatial and DSSP features. The results demonstrate that the multi-feature fusion model substantially outperformed the single-feature models, with the AUC increasing from the baseline value of 0.735 to 0.829 and the AUPRC improving from 0.312 to 0.433. This outcome validates the hypothesis that the synergistic effect of multidimensional features, particularly the incorporation of spatial information as edge features, can more comprehensively capture the complexity of epitopes, thereby enhancing the model's predictive ability [28].
Based on these experimental results, our final model integrates a comprehensive combination of AAP, spatial, and DSSP features. We assert that this multidimensional feature engineering approach, especially the encoding of spatial information as edge features in the graph structure, not only captures the complexity of epitopes more holistically but also significantly improves the accuracy and robustness of the predictions. This integrated model provides a robust foundation for subsequent in-depth analyses and rigorous evaluations, offering a powerful tool for advancing the exploration of cutting-edge questions in the field of epitope prediction.
3.2. Internal Validation
This study conducted a systematic evaluation of the performance of the GraphEPN model through rigorous internal validation. The internal validation employed a five-fold cross-validation strategy, where the dataset was partitioned into five subsets, with one subset used for validation and the remaining four subsets used for training in each iteration. This alternating validation approach ensures the robustness of the model’s performance across different data splits.
To comprehensively assess model performance, we used the AUC and the AUPRC as the primary evaluation metrics. Additionally, to evaluate the relative performance of GraphEPN compared with other commonly used machine learning methods, we compared it with elastic net, gradient boosting, XGBoost, and logistic regression models [29,30,31,32]. Figures S1 and S2 (Additional File) illustrate the performance of each model across the five-fold cross-validation, whereas Figure 2a,b highlights the AUC and AUPRC results for GraphEPN in each fold.
To ensure the robustness of the evaluation, the average values across the five folds were taken as the final evaluation metrics. The results show that GraphEPN outperforms all the other methods in terms of both the average AUC and the average AUPRC, with values of 82.9% and 45.8%, respectively. Compared with the second-best model, XGBoost (with an AUC of 79.2% and an AUPRC of 38.9%), GraphEPN achieves improvements of 3.6% and 6.6% in these two metrics, respectively.
3.3. Comparison with Peer Methods
To evaluate the performance of GraphEPN, we compared it with several mainstream B-cell conformational epitope prediction methods, including SEMA 2.0, BepiPred 3.0, Ellipro, and SEPPA 3.0. These methods are widely used in the field of epitope prediction, each with its strengths, providing valuable benchmarks for comparison. The evaluation metrics included the AUC, AUPRC, F1-score, MCC, and BACC, with the AUC and AUPRC serving as the primary metrics.
SEMA 2.0 is an advanced web platform for predicting conformational B-cell epitopes via large pre-trained protein language models (PLMs). It integrates sequence-based and structure-based prediction, identifies N-glycosylation sites, and compares antigen structures for enhanced immunogenic analysis [33].
BepiPred-3.0 is a sequence-based B-cell epitope prediction tool that uses protein language model embeddings (ESM-2) to improve the prediction accuracy for both linear and conformational epitopes. It also incorporates additional input variables and a refined epitope annotation strategy for enhanced performance [34].
ElliPro is a web tool that predicts antibody epitopes on the basis of the geometric properties of protein structures, combining residue clustering with visualization tools [4].
SEPPA 3.0 is an advanced tool for predicting B-cell epitopes that integrates glycosylation-related features and microenvironmental information to increase the prediction accuracy for glycoprotein antigens [35].
In our comparative analysis, we evaluated the performance of each method using the independent Blind_42 test dataset, calculating the AUC and AUPRC scores. The optimal thresholds were selected based on precision–recall curves, and additional metrics such as the F1-score, MCC, and BACC were computed.
Figure S3 presents a comprehensive heatmap of all evaluation metrics for each method, visualizing the comparative performance and highlighting GraphEPN's consistent superiority across multiple metrics. As shown in Table 2, GraphEPN outperforms the majority of the peer methods across several key evaluation metrics. Specifically, it demonstrates superior performance in AUC (Figure 2c) and AUPRC (Figure 2d), underscoring its exceptional generalization capacity and robustness.
3.4. Case Study
To demonstrate the effectiveness of GraphEPN, we analyzed a representative protein structure from the independent test set, chain A of PDB entry 6AD8 (6ad8_A). The performance of GraphEPN was compared against SEMA 2.0, BepiPred 3.0, ElliPro, and SEPPA 3.0.
As shown in Figure 3, GraphEPN accurately predicted most epitope residues and minimized false positives in non-epitope regions. Its predictions closely matched the annotated ground truth, particularly in regions with complex spatial configurations. In comparison, methods like ElliPro, SEPPA 3.0, and BepiPred 3.0 struggled to achieve the same level of precision, often missing residues in structurally intricate areas or producing higher false positive rates.
To further compare the predictions across different methods at the sequence level, we provide a sequence-based visualization in Figure S4. This additional analysis offers a complementary perspective to Figure 3 by allowing for a direct comparison of predicted epitope positions along the sequence. The integration of sequence- and structure-based visualizations helps to better illustrate the strengths and limitations of each method.
GraphEPN’s superior performance stems from its ability to combine discrete and continuous feature representations with structural information, enabling it to capture long-range dependencies and local interactions effectively. This advantage is evident in its ability to provide more precise and reliable predictions, even for challenging protein structures.
This case highlights GraphEPN’s robustness and accuracy, showcasing its potential as a valuable tool for epitope prediction, with practical applications in vaccine design and therapeutic antibody development.
3.5. Ablation Study
To assess the impact of the VQ-VAE and of different attention mechanisms within the graph transformer architecture, we performed ablation experiments on the SAbDab_665 dataset. The experimental setup consisted of four configurations: (1) the full model, which combines the VQ-VAE and GAT modules to capture discrete features and local dependencies effectively; (2) VQ-VAE + multi-head self-attention (MSA), which replaces GAT with MSA to examine the effect of the attention mechanism while preserving feature quantization; (3) direct node features + GAT, which omits the VQ-VAE while retaining GAT to focus on local relationships learned from the raw node features; and (4) direct node features + MSA, which omits both the VQ-VAE and GAT, relying solely on the self-attention mechanism. The experimental results are summarized in Table 3. The full model, which combines VQ-VAE and GAT, achieves the highest performance, demonstrating that the discrete feature representations and the local dependencies captured by GAT are essential for this task. Configurations without the VQ-VAE or with MSA replacing GAT exhibited performance degradation, further reinforcing the importance of discrete representations and of the GAT module in capturing local interactions.
3.6. Visualization of the Epitope Prediction Results
To conduct a comprehensive analysis of the epitope prediction results from the GraphEPN model, we employed advanced visualization techniques to effectively present the predicted epitope probabilities and integrate them with protein sequences and 3D structures. This integration not only enhances the interpretability of the predictions but also provides clear insights for guiding subsequent experimental investigations. For each protein chain, the predicted epitope probabilities are represented through bar charts (Figure 4b), where the height of each bar corresponds to the prediction score of the respective residue. A default threshold of 0.6 is applied to identify potential epitope regions. Additionally, the prediction scores are embedded within the PDB files by substituting the B-factor field, facilitating direct visualization of the epitope distributions via 3D structure visualization tools (Figure 4a). For each protein chain, two primary output files are generated: one containing the bar chart of epitope scores and the other containing an updated PDB file with the embedded prediction scores.
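A minimal sketch of the B-factor substitution step with Biopython is shown below; the scores argument (a mapping from residue sequence number to predicted probability) is an assumed interface for illustration, not the authors' exact output format.

```python
from Bio.PDB import PDBParser, PDBIO

def write_scores_to_bfactor(pdb_in, pdb_out, chain_id, scores):
    structure = PDBParser(QUIET=True).get_structure("antigen", pdb_in)
    for residue in structure[0][chain_id]:
        score = scores.get(residue.get_id()[1], 0.0)   # residue sequence number
        for atom in residue:
            atom.set_bfactor(score)                    # viewers can then color by B-factor
    io = PDBIO()
    io.set_structure(structure)
    io.save(pdb_out)
```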
4. Conclusions
In this study, we present a novel B-cell epitope prediction model, GraphEPN, which integrates the VQ-VAE and graph transformer to capture both the local and global structural features of proteins effectively. By leveraging VQ-VAE for discrete residue representations and employing a graph transformer to model complex residue interactions, GraphEPN achieves superior predictive performance across multiple datasets, significantly improving the accuracy and generalizability of epitope prediction.
The experimental results indicate that GraphEPN excels across various evaluation metrics, confirming its ability to capture essential protein structural information and model intricate residue interactions. By incorporating multiple features and combining graph neural networks with attention mechanisms, the model builds a comprehensive representation of epitope regions and yields reliable predictions. Furthermore, the two-stage training strategy enables the model to capture both local structural features and long-range dependencies within the protein structure.
Nevertheless, the current predictive accuracy is still limited by the availability of experimental structural data. Given the scarcity of antigen–antibody complex structures, some of the predicted epitopes may not necessarily be false positives but rather unannotated potential true positives. Additionally, the computational cost associated with integrating VQ-VAE and graph transformers remains a challenge, necessitating further optimization to enhance efficiency. Future work will focus on improving feature representations through contrastive learning and exploring lightweight graph neural networks to reduce computational overhead. Moreover, expanding the dataset by incorporating predicted structures from AlphaFold could improve generalizability, particularly for antigens lacking experimental structural data.
In conclusion, GraphEPN offers a robust and effective tool for B-cell epitope prediction, with significant potential for applications in vaccine development and antibody design.