Article

Temporal Knowledge Graph Reasoning Based on Entity Relationship Similarity Perception

School of Information and Communication Engineering, Hainan University, Haikou 570228, China
*
Author to whom correspondence should be addressed.
Electronics 2024, 13(12), 2417; https://doi.org/10.3390/electronics13122417
Submission received: 16 May 2024 / Revised: 8 June 2024 / Accepted: 19 June 2024 / Published: 20 June 2024

Abstract

Temporal knowledge graphs (TKGs) model facts dynamically along the temporal dimension and are widely used in various fields. However, existing reasoning models often fail to consider the similarity features between entity relationships and static attributes, making it difficult for them to effectively handle these temporal attributes. As a result, these models have limitations in dealing with previously unseen entities that appear over time and with the implicit associations of static attributes between entities. To address this issue, we propose a temporal knowledge graph reasoning model based on Entity Relationship Similarity Perception, known as ERSP. This model employs a similarity measurement method to capture the similarity features of entity relationships and static attributes, and then fuses these features to generate structural representations. Finally, we provide a decoder with entity relationship representation, static attribute representation, and structural representation information to form a quadruple. Experiments conducted on five common benchmark datasets show that ERSP surpasses the majority of TKG reasoning methods.

1. Introduction

Knowledge graphs (KGs) are widely used in multiple fields, including information retrieval, intelligent recommendation, question-answering systems, and natural language processing [1]. However, the challenges of maintaining timeliness and data consistency have made research into knowledge graphs increasingly arduous. Current research primarily focuses on reasoning over static knowledge graphs. However, in real life, information is dynamically changing and data have timeliness, which poses a significant challenge to the dynamic reasoning of knowledge graphs.
A knowledge graph is a knowledge base that stores factual information in a graph structure, where each fact is represented as a triplet $(e_s, r, e_o)$, where $e_s$ (subject entities) and $e_o$ (object entities) represent nodes (entities), and $r$ represents the type of edge (relation) connecting $e_s$ and $e_o$. Knowledge graphs are generally presented in a static form. However, in real life, the relations between entities change over time. To more accurately describe this dynamism, some researchers have introduced the concept of temporal knowledge graphs (TKGs). A temporal knowledge graph is a type of knowledge graph used to describe dynamic facts. It expands information by combining each triplet $(e_s, r, e_o)$ with a timestamp to form a quadruple $(e_s, r, e_o, t)$, where $t$ denotes a timestamp, which represents the specific period during which the fact occurred. For example, as shown in Figure 1, (Donald Trump, President, United States, 2017/1~2021/1) represents that Donald Trump served as President of the United States from January 2017 to January 2021. The main research focus of this article is to solve the extrapolation problem in temporal knowledge graph reasoning, which includes predictions about entities and relations. Solving the problem of extrapolation is of great significance for multiple practical applications, such as predicting social relations, disaster management, the induction of event processes, and medical research [2]. More methods of application can be found in [3,4,5].
To accurately predict future events, we need to gain insight into the patterns of development between historical events. At each timestamp, the relations between entities are constructed into a complex structural network through the interaction of various concurrent facts. This network evolves continuously, providing opportunities to deeply explore the correlation of historical events through their mutual influence, thereby improving the accuracy of predicting future events.
For example, consider querying (?, President Elected, United States, 2017/1~2021/1). We can query a series of related historical adjacent facts and similar facts. As shown in Figure 1, among these facts, the possible people involved are Joe Biden, Donald Trump, and so on. Considering both neighboring and similar facts narrows down the predicted outcome, in this example to Barack Obama, Donald Trump, and Joe Biden. In addition, as shown in Figure 2, we can reveal some important clues by observing adjacent and similar facts in history. For example, in January 2017, Trump signed an executive order, issuing a “ban” involving seven countries; in May 2018, he announced withdrawal from the Iran Nuclear Agreement signed by the six parties. In 2019, Kim Jong Un and Donald Trump held a meeting, the first ever between the sitting leaders of the two countries. In 2020, Trump announced that the United States officially withdrew from the Paris Agreement. These historical facts not only reflect Trump’s behavior, but also foreshadow his future movements to some extent. By comprehensively considering this historical information, we can reveal the behavioral trends and preferences of entities and relations, providing important clues for predicting future events.
In recent years, some studies have attempted to obtain relevant historical information from different queries through heuristic methods. However, these methods did not consider the similarity between entity relationships or between static attributes. For example, CyGNet [6] utilizes global historical information to efficiently model repetitive patterns present in temporal knowledge graphs to predict information about entities and relations at future timestamps. TANGO [7] extends the idea of neural ordinary differential equations to multi-relational graph convolutional networks, encoding temporal and structural information as continuous-time dynamic embeddings. TITer [8], a temporal-path-based reinforcement learning model, utilizes a relative time-coding function to obtain temporal information and uses time-shaped rewards to guide model learning. CEN [9] uses a KG sequence encoder based on relational graph neural networks and a length-aware convolutional-neural-network-based evolutionary representation decoder to learn evolutionary patterns from historical KG sequences of different lengths. Although these methods integrate and predict entity relationship information, they overlook the similarity between entity relationships and static attributes, which often has a significant impact on TKG reasoning.
In this work, we propose a representation learning model based on Entity Relationship Similarity Perception, or ERSP, for modeling and reasoning over temporal knowledge graph sequences. Specifically, the model captures the evolutionary representations of entities and relations in TKGs at different timestamps through multiple components. The main components are as follows: (1) The entity-aware component, which models the entity information at each timestamp in the TKG. (2) The relationship similarity-aware component, which models the relations and their similarity feature information in the TKG at each timestamp. (3) The static attribute similarity-aware component, which constrains the static attributes of entities and mines the similarity features of static attributes.
In summary, the main contributions of this paper are as follows:
  • We propose a novel ERSP, which is a representation learning model based on a TKG. This model integrates the entity-aware component, relationship similarity-aware component, and static attribute similarity-aware component, thus fully utilizing the similarity features between entity relationships and static attributes in historical facts. By modeling these features, our model has significant advantages in handling unseen entities and can solve the problem of hidden associations between static attributes of facts, thereby further enhancing the model’s predictive ability for entity relationships.
  • To our knowledge, this is the first time that similarity features of historical facts and static attribute information have been integrated into TKG reasoning.
  • We conducted extensive experiments on five commonly used TKG datasets and the results showed the excellent performance of ERSP in both entity prediction and relation-prediction tasks.

2. Related Work

In recent years, researchers have divided the integration of time information into knowledge graph reasoning into two main settings: interpolation and extrapolation [2]. In the interpolation setting, the model, given known events and facts within a timestamp range $[t_0, t_T]$, predicts facts that may exist within this time range but have not yet been observed. This means that the model needs to fill in the gaps within this period based on existing historical information. In the extrapolation setting, the model knows the historical events and facts within a timestamp range of $[t_0, t_T]$ and predicts facts that may occur after $t_T$, that is, from timestamp $t_{T+1}$ onward. This requires the model to have an understanding of the evolution patterns of events in historical information in order to infer possible facts that may occur in the future [10].
TKG Reasoning under Interpolation Settings: TTransE [11] extends TransE [12] with temporal consistency information as a constraint. TA-DistMult [13] embeds entities and relations into the complex domain using matrix factorization. TA-TransE learns a time-aware representation of relations through long short-term memory recurrent neural networks and distance minimization. TNTComplEx [14] generates embedded representations of timestamps through the complex factorization of fourth-order tensors. However, these models cannot capture representations of unseen timestamps and are usually not applicable in extrapolation settings.
TKG Reasoning under Extrapolation Settings: RE-NET [15] models historical facts as subgraph sequences using a neighborhood aggregator and a cyclic event encoder. xERTE [16] builds interpretable reasoning graphs using subgraph sampling techniques to integrate entity, relation and temporal information in the reasoning graph. TANGO-Tucker [7] introduces Tucker decomposition to model complex relations of entities, relations, and time. RE-GCN [17] comprehensively models historical information by capturing structural dependencies of historical facts and sequential patterns of neighboring facts. GHT [18] designed two Transformer modules based on the Hawkes process for capturing structural and temporal information, respectively. ReGAT [19] used RNN and GNN to jointly encode temporal and structural event information from historical and concurrent events. However, none of the above methods consider similarity features of entity relationships or similarity features of static attributes, making it difficult to handle these temporal attributes effectively.
Similarity Modeling of Entity Relationships: This involves establishing a similarity model for entity relationships in the knowledge graph to mine potential associations in the knowledge graph. SiGMa [20] uses a greedy matching method based on similarity propagation to model structural information and similarity measures of entity attributes in the relational graph. The HistSim and DisNGram methods [21] achieve the similarity matching of entities through character-level similarity and threshold calculations. Paris [22] measures the degree of matching between entities and relations based on probability estimation. CEAFF [23] uses reinforcement learning models to align entities collectively and utilizes representative features to capture the similarity between entities in heterogeneous knowledge graphs. ClusterEA [24] is an EA model based on GNN that aligns entities between large-scale knowledge graphs through random training and standardized similarity. However, these methods have not been applied in the field of temporal knowledge graph extrapolation, and do not take into account the similarity features of static attributes.
Therefore, the above models cannot handle previously unseen entities that appear over time or the implicit associations of static attributes between facts. In contrast, ERSP adopts a novel approach to modeling the entire sequence of temporal knowledge graphs as a whole, which not only fully considers the similarity features of entity relationships, but also synthesizes the similarity features of static attributes of entities. This comprehensive modeling can greatly improve the accuracy and effectiveness of entity and relation prediction.

3. Preliminaries

In this section, we elaborate on TKGs and TKG reasoning problems, and summarize the relevant symbols used in this work, as shown in Table 1.
Definition 1.
Temporal knowledge graph: A temporal KG is a multi-relational graph that changes over time, with edges marked with timestamps. A TKG can be represented as a quadruple $G = (H, R, M, T)$, where $H$ is the entity set, $R$ is the relation set, $M$ is the fact set, and $T$ is the time set. Each fact in $M$ is represented as a quadruple $(E_s, r, E_o, t)$.
Definition 2.
Temporal knowledge graph reasoning: Temporal knowledge graph inference is the process of prediction using the temporal patterns and dependencies present in the temporal knowledge graph. This process can be divided into two main tasks: entity prediction and relation prediction.
Entity prediction: Entity prediction aims to predict missing entities at a future point in time based on a sequence of historical knowledge graphs. For example, given a query of the form $(?, r, E_o, t+1)$ or $(E_s, r, ?, t+1)$, we can infer the missing subject entity $E_s$ or object entity $E_o$.
Relation prediction: Relation prediction is the inference of missing relations between entities at a certain point in the future based on the same historical sequence. For example, given a query of the form $(E_s, ?, E_o, t+1)$, we can infer the missing relation $r$.
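The quadruple representation and the extrapolation-style queries above can be illustrated with a small sketch. The toy facts, the `Quad` type, and the helper names `history` and `object_candidates` are hypothetical and introduced only for illustration; the lookup shown is a naive history filter, not the ERSP model.

```python
from collections import namedtuple

# A fact (E_s, r, E_o, t); field names are illustrative assumptions.
Quad = namedtuple("Quad", ["subject", "relation", "object", "time"])

# Toy fact set M with integer timestamps (made-up data).
facts = [
    Quad("A", "visits", "B", 0),
    Quad("B", "meets", "C", 1),
    Quad("A", "visits", "C", 2),
]

def history(facts, t):
    """Facts strictly before time t: the only evidence available in extrapolation."""
    return [q for q in facts if q.time < t]

def object_candidates(facts, subject, relation, t):
    """Candidate answers for the query (subject, relation, ?, t) taken from history."""
    return {q.object for q in history(facts, t)
            if q.subject == subject and q.relation == relation}

# Query (A, visits, ?, 3) using only facts observed before t = 3.
candidates = object_candidates(facts, "A", "visits", 3)
```

A learned model replaces this set lookup with a score for every entity, so that entities never seen with the query pair can still be ranked.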
Definition 3.
Similarity measurement: Similarity measurement is a key tool for evaluating the similarity between two objects and is widely used in machine learning and data mining. Common similarity measurement methods include cosine similarity and Euclidean distance, among others.
Cosine similarity: Cosine similarity is used to measure the directional similarity between two vectors, and its formula is
$$\cos(m, n) = \frac{\sum_{i=1}^{k} m_i n_i}{\lVert m \rVert \, \lVert n \rVert} = \frac{\sum_{i=1}^{k} m_i \cdot n_i}{\sqrt{\sum_{i=1}^{k} m_i^2} \, \sqrt{\sum_{i=1}^{k} n_i^2}} \tag{1}$$
where $m$ and $n$ are two vectors; $m_i$ and $n_i$ represent the $i$th element in vectors $m$ and $n$, respectively; $\lVert m \rVert$ and $\lVert n \rVert$ represent the modulus of vector $m$ and vector $n$, respectively; and $k$ represents the dimension of a vector, which is the number of elements it has.
Euclidean distance: Euclidean distance is used to measure the straight-line distance between two vectors in multidimensional space, and its formula is
$$d(m, n) = \sqrt{\sum_{i=1}^{k} (m_i - n_i)^2} \tag{2}$$
where $m$ and $n$ are two vectors; $m_i$ and $n_i$ are the $i$th element in vectors $m$ and $n$; and $k$ represents the dimension of a vector, which is the number of elements it has.
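As a quick numerical check of the two measures in Definition 3, the following minimal NumPy sketch computes both. The function names are illustrative choices, not identifiers from the paper.

```python
import numpy as np

def cosine_similarity(m, n):
    """Dot product of m and n divided by the product of their Euclidean norms."""
    m = np.asarray(m, dtype=float)
    n = np.asarray(n, dtype=float)
    return float(np.dot(m, n) / (np.linalg.norm(m) * np.linalg.norm(n)))

def euclidean_distance(m, n):
    """Square root of the summed squared element-wise differences."""
    m = np.asarray(m, dtype=float)
    n = np.asarray(n, dtype=float)
    return float(np.sqrt(np.sum((m - n) ** 2)))

# Parallel vectors have cosine similarity 1 regardless of their length,
# while their Euclidean distance still reflects the difference in magnitude.
sim_parallel = cosine_similarity([1.0, 2.0], [2.0, 4.0])
dist_3_4 = euclidean_distance([0.0, 0.0], [3.0, 4.0])
```

Cosine similarity ignores vector length and compares direction only, which is why it is the natural choice when embeddings of different norms should still count as similar.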

4. The Proposed Model: ERSP

4.1. Model Overview

The overall framework of ERSP consists of three key components: the entity-aware component, the relationship similarity-aware component, and the static attribute similarity-aware component, as illustrated in Figure 3. The entity-aware component includes the graph convolutional network (GCN) structure and the adaptive time gate network structure. The goal of the GCN structure is to capture the feature dependency relations of entities in the temporal knowledge graph at each timestamp through aggregation. The adaptive time gate network structure is used to obtain the evolutionary representation of entities at each timestamp. The relationship similarity-aware component is composed of mean pooling and relationship-aware gating units. The relationship-aware gating unit is based on the relationship similarity vector and the gated recurrent neural network unit, which can not only obtain similarity features of relations in historical facts, but also further capture the evolutionary representation of relations. The static attribute similarity-aware component combines the static embedding and evolutionary embedding of entities, incorporating static attributes into the evolutionary embedding and further obtaining the similarity of their static attributes.

4.2. Entity-Aware Component

4.2.1. Graph Convolutional GCN Network Structure

To capture the feature dependency relations between concurrent facts, we adopt a graph convolutional network (GCN) structure [25]. The GCN model is used to capture the associations between entities and relations in a multi-relational graph. Specifically, at timestamp $t$ of the temporal knowledge graph, the embedding of object entity $o$ in layer $\eta + 1$ is obtained by aggregating, through a message-passing framework, the layer-$\eta$ information of its subject entities. This is expressed as
$$h_{o,t}^{\eta+1} = \Phi\left( \sum_{(s,r,o) \in E_t} \frac{1}{k} W_r^{\eta} \left( h_{s,t}^{\eta} + r_t \right) + W_o^{\eta} h_{o,t}^{\eta} \right) \tag{3}$$
where $\Phi(\cdot)$ represents the ReLU activation function [26], $k$ is a constant, $W_r^{\eta}$ and $W_o^{\eta}$ represent the weight matrices related to the relation $r$ and object entity $o$, respectively, $h_{s,t}^{\eta}$ and $h_{o,t}^{\eta}$ represent the embeddings of entities $s$ and $o$ in the $\eta$ layer, respectively, and $r_t$ represents the embedding of relation $r$ in the $\eta$ layer.
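The aggregation step can be sketched in NumPy as follows. This is a simplified illustration under stated assumptions, not the authors' implementation: one weight matrix per relation is stored in the array `W_r`, a single matrix `W_o` handles the self-loop term, the self-loop term is added once per entity outside the neighbor sum, and the function name `gcn_layer` is hypothetical.

```python
import numpy as np

def relu(x):
    return np.maximum(x, 0.0)

def gcn_layer(h, rel, edges, W_r, W_o, k=1.0):
    """One message-passing layer over the facts of a single timestamp.

    h:     (num_entities, d) entity embeddings at layer eta
    rel:   (num_relations, d) relation embeddings
    edges: list of (s, r, o) index triples
    W_r:   (num_relations, d, d) relation-specific weights (assumed layout)
    W_o:   (d, d) self-loop weight
    k:     normalization constant
    """
    agg = np.zeros_like(h)
    for s, r, o in edges:
        # Message from subject s along relation r, accumulated at object o.
        agg[o] += (W_r[r] @ (h[s] + rel[r])) / k
    # Self-loop term keeps each entity's own layer-eta information.
    return relu(agg + h @ W_o.T)

h = np.array([[1.0, 0.0], [0.0, 1.0]])
rel = np.array([[0.5, 0.5]])
edges = [(0, 0, 1)]
W_r = np.stack([np.eye(2)])
W_o = np.eye(2)
out = gcn_layer(h, rel, edges, W_r, W_o, k=1.0)
```

With identity weights, entity 1 receives the message `h[0] + rel[0]` on top of its own embedding, while entity 0, which has no incoming edge, keeps its embedding unchanged.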

4.2.2. Adaptive Time Gate Network Structure

For entity $o$, its information may change between different time steps, which may lead to information loss when processing entity information. To address this issue and better capture the temporal correlation of entity information between adjacent time steps, we introduce an adaptive time gate network structure (ATGN) that can dynamically adjust the transmission and updating of entity information at different time steps. The adaptive gating mechanism controls how the entity information after graph convolution aggregation is fused with the entity information of the previous time step to generate an updated entity representation of the current time step. Namely,
$$H_t = \mathrm{Adapt\_Gate}\left( H_{t-1}, H_t^{GCN} \right) \tag{4}$$
where $H_{t-1}$ represents the entity embedding matrix at timestamp $t-1$, and $H_t^{GCN}$ represents the entity embedding matrix after graph convolution aggregation. The Adapt_Gate structure is composed of the Update Gate $C_t$ and the final state update $O_t$, which together control the dynamic adjustment of entity information. Specifically, this is expressed as follows:
$$C_t = \mathrm{sigm}\left( W_c X_{t-1} + b_r \right) \tag{5}$$
where $C_t$ represents the output of the Update Gate; $\mathrm{sigm}(\cdot)$ represents the sigmoid function; $W_c$ represents the weight coefficient matrix of the Update Gate, which dynamically changes entity information by adjusting the weights; $b_r$ represents a bias matrix used to adjust the opening degree of the update gate; and $X_{t-1}$ represents the hidden state of the previous time step, which is the entity embedding matrix.
$$O_t = C_t \odot X_t + (1 - C_t) \odot X_{t-1} \tag{6}$$
where $O_t$ represents the output of the final state update, which is the updated entity embedding matrix; $\odot$ represents the element-wise product; and $X_t$ represents the hidden state, which is the entity embedding matrix after graph convolution aggregation.
Through the adaptive time gate network structure, the model can better handle changes in entity information at different time steps, ensuring that the model can flexibly capture feature dependency relations between entities at different timestamps, further improving the modeling ability of entity time correlation.
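The gating behaviour described above can be sketched as follows. This is a minimal illustration assuming row-major embedding matrices (one entity per row), so the gate is computed as `X_prev @ W_c + b_r`; the function name `adapt_gate` and the toy values are hypothetical.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def adapt_gate(X_prev, X_gcn, W_c, b_r):
    """Fuse the previous-step embeddings with the GCN-aggregated embeddings.

    X_prev: (num_entities, d) entity embeddings at t-1
    X_gcn:  (num_entities, d) embeddings after graph convolution at t
    W_c:    (d, d) update-gate weights; b_r: (d,) update-gate bias
    """
    C = sigmoid(X_prev @ W_c + b_r)           # update gate, values in (0, 1)
    return C * X_gcn + (1.0 - C) * X_prev     # element-wise convex combination

X_prev = np.array([[0.0, 2.0]])
X_gcn = np.array([[2.0, 0.0]])
# Zero weights and bias give a gate of 0.5, i.e. a plain average.
balanced = adapt_gate(X_prev, X_gcn, np.zeros((2, 2)), np.zeros(2))
# A large positive bias opens the gate, so the new information dominates.
opened = adapt_gate(X_prev, X_gcn, np.zeros((2, 2)), np.full(2, 50.0))
```

Because the output is a convex combination, every updated coordinate lies between its previous value and its freshly aggregated value, which is what limits abrupt changes between adjacent time steps.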

4.3. Relationship Similarity-Aware Component

To capture the temporal correlation of relation information between adjacent timestamps, the model adopts a combination of mean pooling and relationship-aware gating units to gradually update the evolutionary representation of relations. The specific mean pooling operation is as follows:
$$r_t = \mathrm{MeanPooling}\left( H_{t-1}, M_{r,t};\, r \right) \tag{7}$$
where $H_{t-1}$ represents the entity embedding matrix at timestamp $t-1$, $M_{r,t}$ represents the entity set connected to the relation $r$ at timestamp $t$, and $r_t$ is the mean pooling calculation of the entity embedding matrix $H_{t-1}$ from the previous time step and the entity set $M_{r,t}$.
Subsequently, the relation embedding matrix $R_{t-1}$ at timestamp $t-1$ and the $R_t$ obtained from the mean pooling operation are used to update $R_t$ through the relation-aware gating unit (Relation-GateCell). The relation-aware gating unit is based on the similarity measurement method to extract the similarity features of relationships in order to better capture the evolution of relationships in the temporal knowledge graph. Specifically, we use the following formula to represent the update process of the relation:
$$R_t = \mathrm{Relation\text{-}GateCell}\left( R_{t-1}, R_t \right) \tag{8}$$
where $R_t$ is composed of the set of relation vectors $r_t$, $r_t \in R_t$.
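The mean pooling step can be sketched as below. The text does not spell out how the relation embedding $r$ enters the operation; this sketch assumes, as in related RE-GCN-style models, that the pooled entity vector is concatenated with the relation's own embedding. That reading, the zero vector used when no entity is connected, and the name `mean_pool_relation` are all assumptions.

```python
import numpy as np

def mean_pool_relation(H_prev, entity_ids, r_emb):
    """Pool the previous-step embeddings of entities linked to a relation.

    H_prev:     (num_entities, d) entity embeddings at t-1
    entity_ids: indices of the entities connected to relation r at time t
    r_emb:      (d,) embedding of relation r
    Returns a (2d,) vector: [mean of selected rows ; r_emb] (assumed layout).
    """
    ids = list(entity_ids)
    if ids:
        pooled = H_prev[ids].mean(axis=0)
    else:
        # Assumption: a relation with no connected entity gets a zero context.
        pooled = np.zeros(H_prev.shape[1])
    return np.concatenate([pooled, r_emb])

H_prev = np.array([[1.0, 1.0], [3.0, 5.0], [10.0, 10.0]])
r_emb = np.array([7.0, 8.0])
pooled_rel = mean_pool_relation(H_prev, [0, 1], r_emb)
empty_rel = mean_pool_relation(H_prev, [], r_emb)
```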
Previous models often overlook the similarity features of relations, so we adopt the Relation-GateCell structure, which is a relation-aware gating unit. This structure is based on a relation similarity matrix and the gated recurrent neural network unit (PReLU-GateCell), which not only captures the relation similarity in historical facts, but also further captures the evolutionary representation of relations. The specific calculation of relation similarity is as follows:
$$h = \Phi\left( FC_1(R_t) \right) \tag{9}$$
$$H_r = \Phi\left( FC_2(h) \right) \tag{10}$$
where $FC_1$ and $FC_2$ represent fully connected layers; $\Phi$ represents the ReLU activation function; $h$ represents the calculated representation of the hidden layer; and $H_r$ denotes the mapping of the hidden layer representation to the final output, representing the relation similarity matrix. Based on the relation similarity matrix, the updated relation features are as follows:
$$R_t = \mathrm{PReLU\text{-}GateCell}\left( R_{t-1}, H_r \right) \tag{11}$$
The traditional GRU structure may lead to gradient vanishing due to the stacking of historical KG sequences. To address this issue, a gated recurrent neural network unit (PReLU-GateCell) with learnable parameters is adopted, which is obtained by improving the traditional RNN structure. The PReLU-GateCell structure consists of four parts, namely the input gate, forget gate, temporary hidden state, and hidden state update. Specifically,
$$i_t = \mathrm{sigm}\left( W_i \left[ x_t, h_{t-1} \right] \right) \tag{12}$$
where $i_t$ represents the output of the input gate; $\mathrm{sigm}(\cdot)$ is the sigmoid activation function; $W_i$ represents the weight matrix; and $x_t$ and $h_{t-1}$ represent the input at the current timestamp $t$ and the hidden state at the previous timestamp, respectively. The input gate $i_t$ is responsible for controlling the storage of new information in the hidden state.
$$f_t = \mathrm{sigm}\left( W_f \left[ x_t, h_{t-1} \right] \right) \tag{13}$$
where $f_t$ and $W_f$ represent the output of the forget gate and the weight matrix, respectively. The forget gate is responsible for controlling how much information from the previous hidden state should be forgotten in the current time step.
$$\tilde{h}_t = \mathrm{PReLU}\left( W_h \left[ x_t, h_{t-1} \right] \right) \tag{14}$$
where $\tilde{h}_t$ is the output of the temporary hidden state; $\mathrm{PReLU}(\cdot)$ denotes the activation function; and $W_h$ denotes the weight matrix. Unlike the standard ReLU, PReLU allows input values less than zero to pass through instead of directly zeroing them, which can effectively alleviate the problem of gradient vanishing. Specifically,
$$\mathrm{PReLU}(x) = \begin{cases} x, & x \geq 0 \\ \lambda x, & x < 0 \end{cases} \tag{15}$$
where $x$ represents the input value and $\lambda$ is a parameter used to control the slope of the negative value part. The final hidden state $z_t$ is obtained through a weighted combination of the input gate, forget gate, and temporary hidden state. Specifically,
$$z_t = f_t \odot h_{t-1} + i_t \odot \tilde{h}_t \tag{16}$$
where $z_t$ represents the output of the hidden state update and $\odot$ represents the element-wise product. The final hidden state determines how old and new information are combined.
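A single step of such a gated cell can be sketched as follows. This is an illustrative forward pass only, assuming that the input and previous hidden state are concatenated before each linear map and that the slope $\lambda$ is passed in as a fixed number (in the model it is learnable). The names `prelu` and `prelu_gate_cell` are hypothetical.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def prelu(x, lam):
    """Identity for non-negative inputs, slope lam for negative inputs."""
    return np.where(x >= 0, x, lam * x)

def prelu_gate_cell(x_t, h_prev, W_i, W_f, W_h, lam=0.25):
    """One update of the gated cell.

    x_t: (d_x,) current input; h_prev: (d_h,) previous hidden state
    W_i, W_f, W_h: (d_x + d_h, d_h) weights for the input gate, the forget
    gate, and the temporary hidden state (assumed layout).
    """
    xh = np.concatenate([x_t, h_prev])
    i_t = sigmoid(xh @ W_i)             # how much new information to store
    f_t = sigmoid(xh @ W_f)             # how much old state to keep
    h_tilde = prelu(xh @ W_h, lam)      # candidate state, negatives scaled not zeroed
    return f_t * h_prev + i_t * h_tilde

x_t = np.array([1.0, -1.0])
h_prev = np.array([2.0, 4.0])
zeros = np.zeros((4, 2))
# With all-zero weights both gates equal 0.5 and the candidate state is 0,
# so the cell simply halves the previous hidden state.
z_t = prelu_gate_cell(x_t, h_prev, zeros, zeros, zeros)
neg_pass = prelu(np.array([-2.0, 3.0]), 0.1)
```

The nonzero slope on the negative side is what keeps a gradient flowing for negative pre-activations, which is the property the text relies on to mitigate vanishing gradients.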

4.4. Static Attribute Similarity-Aware Component

In temporal knowledge graphs, static attributes describe the fixed characteristics of entities, such as the political system of a country and the type of organization, which remain unchanged over time. Although these attributes remain constant, they may have a profound impact on the behavior and relations of entities. Therefore, studying the similarity characteristics of static attributes is of great significance for gaining a deeper understanding of entity characteristics and relations, as well as predicting future events. This type of research helps to reveal patterns and regularities of entity behavior, supporting the precise analysis of temporal knowledge graphs. By delving deeper into the temporal correlation of static attributes, we can more accurately capture the essential characteristics of entities and events in the temporal knowledge graph, providing strong support and reference for decision-making, predictive analysis, and other fields.
The static graph similarity constraint mechanism (SGC-Sim) is adopted to capture the similarity features of static attributes. This mechanism is based on similarity measurement methods, which can more comprehensively capture the static attribute similarity between entities and improve the modeling ability of the model for entity relationships. First, the static graph embedding is defined as follows:
$$h_i^s = \Phi\left( \sum_{(r^s, j),\, (i, r^s, j) \in R^s} \frac{1}{k_i} W_{r^s}\, h_i^s(j) \right) \tag{17}$$
where $\Phi(\cdot)$ represents the ReLU activation function; $r^s$ represents the relation in static graphs; $R^s$ represents the set of relations under the timestamp; $k_i$ is a constant; $W_{r^s}$ represents the relation matrix; $h_i^s(j)$ represents the randomly initialized input embedding matrix; and $h_i^s$ represents the output embedding matrix.
Subsequently, the similarity between evolutionary embeddings and static embeddings is measured using the similarity matrix $H_s$. This similarity calculation can ensure consistency between the dynamic evolution and static characteristics of entities, providing a more accurate entity representation for the model.
$$H_s = \frac{E_s E_e^{T}}{\lVert E_s \rVert_2 \times \lVert E_e \rVert_2} \tag{18}$$
where $H_s$ represents the similarity matrix; $E_s$ and $E_e$ represent the static embedding matrix and evolutionary embedding matrix, respectively; $T$ represents the transpose; and $\lVert E_s \rVert_2$ and $\lVert E_e \rVert_2$ represent the $L_2$ norm of the $E_s$ matrix and the $E_e$ matrix, respectively.
To obtain information with high similarity values for static attributes, the $topk$ method is used for selection. Specifically,
$$V = topk\left( H_s, k+1, d \right) \tag{19}$$
where $V$ represents the similarity value of static attributes; $H_s$ is the similarity matrix; $k+1$ represents selecting the $k+1$ highest-ranked similarity values; and $d$ represents the dimension along which values are selected (one-dimensional by default). The $topk$ method selects the first $k$ maximum or minimum values from an array or tensor.
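The similarity matrix and the top-k selection can be sketched in NumPy as below. The sketch assumes that the $L_2$ norms are taken row by row, so that each entry of $H_s$ is the cosine similarity between one static embedding and one evolutionary embedding; the function names, the small `eps` guard against division by zero, and the toy matrices are illustrative.

```python
import numpy as np

def cosine_similarity_matrix(E_s, E_e, eps=1e-12):
    """Pairwise cosine similarity between rows of E_s and rows of E_e."""
    E_s_unit = E_s / (np.linalg.norm(E_s, axis=1, keepdims=True) + eps)
    E_e_unit = E_e / (np.linalg.norm(E_e, axis=1, keepdims=True) + eps)
    return E_s_unit @ E_e_unit.T

def topk_values(H_s, k, axis=1):
    """The k largest values along the given axis, in descending order."""
    sorted_desc = -np.sort(-H_s, axis=axis)
    return np.take(sorted_desc, np.arange(k), axis=axis)

E_s = np.array([[1.0, 0.0], [0.0, 1.0]])   # static embeddings (toy values)
E_e = np.array([[1.0, 0.0], [1.0, 1.0]])   # evolutionary embeddings (toy values)
H_s = cosine_similarity_matrix(E_s, E_e)
V = topk_values(H_s, k=1)
```

Deep learning frameworks provide an equivalent built-in (for example `torch.topk`), which also returns the indices of the selected values.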
Finally, the loss of the static attribute similarity-aware component at historical step $x$ is defined in the following way:
$$L_s^x = \sum_{i=0}^{H-1} \max\left\{ \cos\phi_x - V,\, 0 \right\} \tag{20}$$
where $H$ represents the number of entities in the static graph, and $\phi_x$ represents the angle between evolutionary embeddings and static embeddings of the same entity, which can be adjusted. Constraining this angle ensures that the dynamic evolution of the entity does not deviate excessively from its static characteristics.
Then, the loss of the static attribute similarity-aware component is
$$L_s = \sum_{x=0}^{n} L_s^x \tag{21}$$
where $n$ represents the number of historical event steps.
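The effect of the angle constraint can be illustrated with a small sketch. It assumes the hinge form $\max\{\cos\phi_x - V, 0\}$ summed over entities, with one selected similarity value per entity and the angle given in degrees; the function names and the toy values are hypothetical.

```python
import numpy as np

def static_step_loss(V, phi_deg):
    """Hinge penalty for entities whose similarity falls below cos(phi)."""
    threshold = np.cos(np.deg2rad(phi_deg))
    return float(np.sum(np.maximum(threshold - np.asarray(V, dtype=float), 0.0)))

def static_loss(V_per_step, phi_per_step):
    """Sum of the step losses over all historical steps."""
    return sum(static_step_loss(V, phi) for V, phi in zip(V_per_step, phi_per_step))

# Entity 0 matches its static embedding (similarity 1), entity 1 is orthogonal (0).
step = static_step_loss([1.0, 0.0], phi_deg=60.0)
# A looser angle at the second step tolerates more drift, so the penalty shrinks.
total = static_loss([[1.0, 0.0], [1.0, 0.0]], [60.0, 90.0])
```

Only entities whose similarity drops below the threshold contribute, so the constraint is inactive for embeddings that stay within the allowed angle of their static counterparts.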

4.5. Decoder and Training

4.5.1. Scoring Function

Research has shown that graph convolutional networks (GCNs) using convolutional scoring functions have significant performance advantages in temporal knowledge graph reasoning tasks [27]. To capture the evolutionary characteristics of entities and relations implied in historical facts, the ConvTransE decoder is used in this study [17]. By modeling entities and relations through a decoder, the probability vectors of entities and relations can be obtained, which are
$$P_{score}^{E} = \mathrm{sigm}\left( H_t\, \mathrm{ConvTransE}(s_t, r_t) \right) \tag{22}$$
$$P_{score}^{r} = \mathrm{sigm}\left( R_t\, \mathrm{ConvTransE}(s_t, o_t) \right) \tag{23}$$
where $\mathrm{sigm}(\cdot)$ is the sigmoid function, $H_t$ and $R_t$ represent the entity embedding matrix and the relation embedding matrix at timestamp $t$, respectively, and $s_t$, $r_t$, $o_t$ represent the embeddings of the subject entity $s$, relation $r$, and object entity $o$ in $H_t$ and $R_t$, respectively.

4.5.2. Model Learning

The goal of the model is to predict changes in entities and relations over future time periods. This is a multi-label learning task, where each label represents a possible entity or relation. Based on given historical facts, the model assigns probability scores to each entity and relation, and predicts entities and relations by maximizing the score of the events that actually occur. Specifically, the loss function for entity prediction tasks is
$$L_e = \sum_{i=0}^{H-1} f_e\, P_{score}^{E} \tag{24}$$
where $L_e$ represents the loss of entity, $H$ represents the total number of entities, $f_e$ represents the entity vector, and $P_{score}^{E}$ represents the score probability of the entity.
Similarly, the loss function for the relation prediction task is
$$L_r = \sum_{i=0}^{R-1} f_r\, P_{score}^{r} \tag{25}$$
where $L_r$ represents the loss of the relation, $R$ represents the total number of relations, $f_r$ denotes the relation vector, and $P_{score}^{r}$ represents the score probability of the relation.
These two temporal reasoning tasks are conducted within a multi-task learning framework. Therefore, the final loss is defined as follows:
$$L = \alpha L_e + \beta L_r + L_s \tag{26}$$
where $\alpha$ is the parameter that controls the entity loss and $\beta$ is the parameter that controls the relation loss. The detailed reasoning process is shown in Algorithm 1.
Algorithm 1: Reasoning algorithm of ERSP
Input: Historical graph sequence $G = (H, R, M, T)$, max_epoch
Output: The loss of the time reasoning task
$H, R, T$ = Init()
for $i = 1$ to max_epoch do
     for $s$ in $H$ do
         Generate structural entity embeddings $h_{o,t}^{\eta+1}$ ▷Equation (3)
         Generate the latest entity representation $H_t$ ▷Equation (4), including Equations (5) and (6)
         Compute the probability vector $P_{score}^{E}$ of entity ▷Equation (22)
         Compute the loss $L_e$ of entity ▷Equation (24)
     end
     for $r$ in $R$ do
         Generate relation embeddings $r_t$ ▷Equation (7)
         Generate the relation representation $R_t$ ▷Equation (8), including Equations (9)–(16)
             Compute relation similarity matrix $H_r$ ▷Equations (9) and (10)
             Update relation representation $R_t$ ▷Equation (11), including Equations (12)–(16)
         Compute the probability vector $P_{score}^{r}$ of relation ▷Equation (23)
         Compute the loss $L_r$ of relation ▷Equation (25)
     end
     for $t$ in $T$ do
         Generate static attribute embeddings $h_i^s$ ▷Equation (17)
         Compute the similarity matrix $H_s$ ▷Equation (18)
         Select the high similarity values $V$ ▷Equation (19)
         Compute the loss $L_s$ of static attributes ▷Equation (21)
     end
     Compute the loss of the time reasoning task $L = \alpha L_e + \beta L_r + L_s$ ▷Equation (26)
end
return $L$

5. Experiments

5.1. Datasets

We used five typical TKG datasets to evaluate the performance of the ERSP model on TKG reasoning tasks: ICEWS14 [13], ICEWS18 [15], ICEWS05-15 [13], YAGO [28], and GDELT [29].
ICEWS14 is drawn from the Integrated Crisis Early Warning System (ICEWS) and contains political events that occurred in 2014. ICEWS18, also from ICEWS, contains political events that occurred between 1 January 2018 and 31 October 2018. ICEWS05-15 is the long-term ICEWS dataset, covering events that occurred between 2005 and 2015. The YAGO dataset is extracted from YAGO3 and contains temporal information. The GDELT dataset is a global event database. Together, these datasets cover different fields and periods, which lets us evaluate the ERSP model in different contexts. More datasets can be found in [30].

5.2. Evaluation Metrics

To evaluate the performance of the ERSP model on TKG reasoning tasks, we adopt two standard metrics: mean reciprocal rank (MRR) [31] and Hits@N [31]. MRR is the average of the reciprocal ranks that the model assigns to the correct answers, and it is the most typical metric for TKG reasoning tasks. Hits@N is the percentage of queries for which the correct entity appears among the model's top N predictions. We report Hits@1, Hits@3, and Hits@10. For both metrics, higher values indicate more accurate results. More evaluation metrics can be found in [32].
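As a minimal sketch of how these two metrics are computed from a list of ranks, consider the following. The function names and the example ranks are illustrative assumptions, and the sketch does not cover how candidates are filtered before ranking.

```python
def rank_of_target(scores, target_index):
    # Rank 1 means the target has the highest score; a candidate only
    # pushes the target down if its score is strictly higher.
    target_score = scores[target_index]
    return 1 + sum(1 for s in scores if s > target_score)


def mean_reciprocal_rank(ranks):
    # Average of 1 / rank over all test queries.
    return sum(1.0 / r for r in ranks) / len(ranks)


def hits_at_n(ranks, n):
    # Fraction of queries whose correct answer is ranked in the top n.
    return sum(1 for r in ranks if r <= n) / len(ranks)


# Hypothetical ranks of the correct entity for three queries.
ranks = [1, 2, 4]
print("MRR    :", 100 * mean_reciprocal_rank(ranks))
print("Hits@1 :", 100 * hits_at_n(ranks, 1))
print("Hits@3 :", 100 * hits_at_n(ranks, 3))
print("Hits@10:", 100 * hits_at_n(ranks, 10))
```

The values are multiplied by 100 here because the result tables report both metrics as percentages.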

5.3. Baselines

We compared the performance of ERSP with several classical models from recent years. These fall into three classes: static TKG reasoning models, interpolated TKG reasoning models, and extrapolated TKG reasoning models.

5.3.1. Static TKG Reasoning Models

  • DistMult: DistMult [34] learns entity and relation embedding information using bilinear functions.
  • ComplEx: ComplEx [33] introduces a complex-valued embedding space to handle asymmetric relations in a KG.
  • RotatE: RotatE [35] represents each relation as a rotation from the head entity to the tail entity.
  • ConvE: ConvE [36] applies the convolutional operations of a CNN to the head entity embedding and the relation embedding.
  • ConvTransE: ConvTransE [37] extends the convolutional neural network (CNN) idea to the TransE model.
  • R-GCN: R-GCN [25] is based on a message-passing GCN framework, processing the structural data of multiple relations in a KG.
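The scoring functions behind the first three baselines in this list can be sketched in a few lines. This is a simplified illustration with hypothetical function names and toy embeddings, not the reference implementations: DistMult uses a bilinear product with a diagonal relation matrix, ComplEx takes the real part of a product with the conjugated tail, and RotatE measures how far the rotated head lands from the tail (here as a sum of element-wise moduli, one common choice of norm).

```python
import cmath
import math


def distmult_score(head, relation, tail):
    # Bilinear score with a diagonal relation matrix: sum_i h_i * r_i * t_i.
    return sum(h * r * t for h, r, t in zip(head, relation, tail))


def complex_score(head, relation, tail):
    # Re(sum_i h_i * r_i * conj(t_i)) with complex-valued embeddings.
    total = sum(h * r * t.conjugate() for h, r, t in zip(head, relation, tail))
    return total.real


def rotate_score(head, phases, tail):
    # Each relation element is a unit-modulus rotation e^{i * theta};
    # the score is the negative distance between the rotated head and the tail.
    return -sum(abs(h * cmath.exp(1j * theta) - t)
                for h, theta, t in zip(head, phases, tail))


# DistMult is symmetric in head and tail.
print(distmult_score([1, 2], [3, 4], [5, 6]), distmult_score([5, 6], [3, 4], [1, 2]))

# ComplEx can score (h, r, t) and (t, r, h) differently.
h, r, t = [1 + 1j], [1j], [1 - 1j]
print(complex_score(h, r, t), complex_score(t, r, h))

# Rotating 1 by 90 degrees gives i, so this triple has distance close to zero.
print(rotate_score([1 + 0j], [math.pi / 2], [1j]))
```

The example makes the difference visible: swapping head and tail leaves the DistMult score unchanged, whereas the ComplEx score changes sign, which is why ComplEx can model asymmetric relations.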

5.3.2. Interpolated TKG Reasoning Models

  • HyTE: HyTE [38] learns a hyperplane-based, time-aware knowledge graph embedding that incorporates time information into the entity-relation space.
  • TTransE: TTransE [11] integrates temporal information into the embedding vector of entity and relation.
  • TA-DistMult: TA-DistMult [13] adopts a recurrent neural network to learn the time-aware representation of relations.
  • DE-SimplE: DE-SimplE [39] extends the SimplE [40] model based on diachronic embedding (DE).
  • TNTComplEx: TNTComplEx [14] extends the ComplEx model based on fourth-order tensor canonical decomposition.

5.3.3. Extrapolated TKG Reasoning Models

  • CyGNet: CyGNet [6] analyzes repeated historical facts and predicts future facts through a time-aware copy-generation mechanism.
  • RE-NET: RE-NET [15] uses a recurrent event encoder to capture global and local features.
  • TANGO: TANGO-DistMult and TANGO-Tucker [7] apply the idea of neural ordinary differential equations to multi-relational graphs, and calculate the final results with the score functions of DistMult and Tucker, respectively.
  • RE-GCN: RE-GCN [17] captures the structural-dependent features and the sequential patterns of facts in the KG utilizing relation-aware GCN and gate-recurrent components, respectively.
  • xERTE: xERTE [16] utilizes the temporal relation attention mechanism to extract the causal features of temporal multi-relational data.
  • GHT: GHT [18] captures temporal evolutionary information and transient structural information in KGs through a Transformer.
  • rGalT: rGalT [41] utilizes an auto-encoder structure to capture the interaction between historical facts and predicted facts.
  • ReGAT: ReGAT [19] encodes and models historical facts and concurrent facts based on the attention mechanism.
  • PPT: PPT [42] converts temporal knowledge graph completion into a task for a pre-trained language model with prompts, in order to capture semantic information.

5.4. Implementation Details

For all datasets, the entity and relation embedding dimensions are set to 200.
For the ICEWS14, ICEWS05-15, ICEWS18, YAGO, and GDELT datasets, we set the optimal local history length m to 6, 2, 6, 2, and 12, respectively. For all datasets, we set the dropout rate of each layer to 0.2 and the number of GCN layers to 2. For the ConvTransE decoder, the number of kernels is set to 50 and the kernel size is set to 2 × 3 for all datasets. For parameter learning, we use the Adam optimizer [43] with a learning rate of 0.001.
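For reproducibility, the settings above can be collected in a single configuration object. The sketch below is only one possible layout; the class name `ErspConfig`, the helper `make_config`, and the field names are hypothetical, while the values are the ones stated in this section.

```python
from dataclasses import dataclass

# Local history length m per dataset, as stated in this section.
HISTORY_LENGTH = {
    "ICEWS14": 6,
    "ICEWS05-15": 2,
    "ICEWS18": 6,
    "YAGO": 2,
    "GDELT": 12,
}


@dataclass(frozen=True)
class ErspConfig:  # hypothetical name, not taken from the authors' code
    dataset: str
    history_length: int
    embedding_dim: int = 200          # entity and relation dimension
    dropout: float = 0.2              # applied to each layer
    gcn_layers: int = 2
    decoder_kernels: int = 50         # ConvTransE decoder
    decoder_kernel_size: tuple = (2, 3)
    optimizer: str = "adam"
    learning_rate: float = 0.001


def make_config(dataset):
    # Only the history length depends on the dataset.
    return ErspConfig(dataset=dataset, history_length=HISTORY_LENGTH[dataset])


config = make_config("GDELT")
print(config)
```

Keeping the per-dataset history length in one mapping makes it clear that it is the only hyperparameter reported here that varies across datasets.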

5.5. Results

5.5.1. Results of Entity Prediction

The experimental results of the entity prediction task are shown in Table 2 and Table 3. ERSP consistently outperforms all baseline models on the five benchmark datasets. In particular, ERSP outperforms the latest extrapolated baseline, PPT, because it considers the similarity features of entity relationships and also integrates the similarity features of the static attributes of entities. As a result, ERSP is stronger than models that consider only the features of entity relationships, such as RE-NET, TANGO, GHT, and PPT. RE-NET uses recurrent neural networks to capture the entity relationship features of temporally adjacent facts. ERSP outperforms RE-NET because its adaptive time gate network structure lets the model transmit and update entity information across time steps and flexibly capture the feature dependencies between entities. GHT, which is based on a Transformer and a neural temporal point process, uses an attention mechanism and a relational continuous-time encoding function to learn entity and relation representations. ERSP outperforms GHT because its static graph similarity constraint mechanism captures the static attribute similarity between entities more comprehensively, which improves entity prediction.
The classical static baseline RotatE represents entity relationships through rotation operations. It is designed primarily for static knowledge graphs and has relatively limited modeling capability for temporal knowledge graphs. Moreover, according to Figure 4, the performance of RotatE declines when the dataset contains a large number of missing entities or relations. In contrast, ERSP performs well on temporal knowledge graphs that contain a large number of missing entities and relations. It captures the similarity features of entity relationships and also models the similarity features of static attributes, showing superior performance on each benchmark dataset. ERSP performs particularly well on datasets with static graphs, such as the ICEWS datasets. Therefore, ERSP is more widely applicable to temporal knowledge graphs.
According to the results in Table 2 and Table 3, the performance improvement of ERSP is particularly significant on the ICEWS14, ICEWS18, and ICEWS05-15 datasets, which have a large number of timestamps. This further confirms the effectiveness of the static graph similarity constraint mechanism in modeling the similarity features of static attributes. In addition, on ICEWS05-15, GDELT, and other datasets containing a large number of facts, ERSP significantly improves performance, further demonstrating the effectiveness of considering the similarity features of entity relationships.

5.5.2. Results of Relation Prediction

For relation prediction, since some models are not designed for this task, we choose only the temporal models that are suitable for it. As shown in Table 4, ERSP outperforms all baseline models on the relation-prediction task. Previous models, including the classical baseline RE-GCN, often overlook the similarity features of relations. By combining the relational similarity vector with a gated recurrent unit, ERSP captures both the similarity features of relations in historical facts and the evolutionary representation of relations. In addition, the gated recurrent unit effectively alleviates the vanishing gradient problem. Therefore, ERSP has an advantage in relation-prediction tasks.
When faced with datasets containing more relations, such as ICEWS14 and ICEWS05-15, the performance of ERSP improves significantly, further validating the results observed in the entity prediction task. This indicates that ERSP has greater advantages in dealing with data with more complex relational structures.
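To make the role of the gated recurrent unit more concrete, the following is a minimal sketch of one step of a standard GRU cell. It is not the relation-aware gating unit of ERSP, which additionally uses relation similarity; the parameter names, the dictionary layout, and the toy values are assumptions for illustration, and the state update follows one common convention for the update gate.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def matvec(matrix, vector):
    # Plain matrix-vector product on nested lists.
    return [sum(w * v for w, v in zip(row, vector)) for row in matrix]


def gate(W, U, b, x, h, activation):
    # activation(W x + U h + b), applied element-wise.
    return [activation(a + c + d) for a, c, d in zip(matvec(W, x), matvec(U, h), b)]


def gru_step(x, h, p):
    # p holds hypothetical parameters W_*, U_*, b_* for the update gate (z),
    # the reset gate (r), and the candidate state (h).
    z = gate(p["W_z"], p["U_z"], p["b_z"], x, h, sigmoid)
    r = gate(p["W_r"], p["U_r"], p["b_r"], x, h, sigmoid)
    reset_h = [ri * hi for ri, hi in zip(r, h)]
    candidate = gate(p["W_h"], p["U_h"], p["b_h"], x, reset_h, math.tanh)
    # Convex combination of the old state and the candidate state.
    return [(1.0 - zi) * hi + zi * ci for zi, hi, ci in zip(z, h, candidate)]


# With all-zero parameters, every gate is 0.5 and the candidate is 0,
# so the new state is half of the previous state.
zeros = [[0.0, 0.0], [0.0, 0.0]]
params = {name: zeros for name in ("W_z", "U_z", "W_r", "U_r", "W_h", "U_h")}
params.update({name: [0.0, 0.0] for name in ("b_z", "b_r", "b_h")})
new_state = gru_step([1.0, 2.0], [0.4, -0.2], params)
print(new_state)
```

The last line of `gru_step` shows why gating helps against vanishing gradients: part of the previous state is carried forward through a direct additive path instead of being passed repeatedly through a squashing nonlinearity.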

5.5.3. Comparison of Different Embedding Dimensions

To examine the influence of the embedding dimension, we conduct experiments on the ERSP model using the ICEWS14 dataset with dimension settings $n \in \{100, 200, 300, 400, 500\}$, while keeping the other hyperparameters unchanged. As shown in Figure 5, ERSP maintains excellent performance in both low and high dimensions. As the embedding dimension increases, performance initially improves, and then begins to decline after a critical point. Although higher embedding dimensions can improve model performance, excessive dimensions also bring additional computational costs.

5.5.4. Comparison of Different History Lengths

We also investigate the impact of the history length on TKG inference and plot the results for history lengths ranging from 1 to 10. As shown in Figure 6, the overall performance of the ERSP model increases as the history length gradually increases, clearly indicating that historical information is effective for inference tasks. However, when the history length is too long, redundant information from different timestamps may be introduced, resulting in unnecessary computational overhead during learning.
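The notion of a history length can be illustrated with a short sketch that groups facts into per-timestamp snapshots and selects the most recent ones before a query time. The function names and the toy quadruples are hypothetical; this shows only the windowing idea, not the ERSP data pipeline.

```python
from collections import defaultdict


def group_by_timestamp(quadruples):
    # Turn (subject, relation, object, timestamp) facts into snapshots.
    snapshots = defaultdict(list)
    for subject, relation, obj, timestamp in quadruples:
        snapshots[timestamp].append((subject, relation, obj))
    return dict(snapshots)


def history_window(snapshots, query_time, length):
    # Return the `length` most recent snapshots strictly before query_time,
    # ordered from oldest to newest.
    if length <= 0:
        return []
    earlier = sorted(t for t in snapshots if t < query_time)
    return [snapshots[t] for t in earlier[-length:]]


# Toy facts with integer timestamps.
facts = [
    ("a", "r1", "b", 0),
    ("b", "r2", "c", 1),
    ("a", "r1", "c", 2),
    ("c", "r2", "a", 3),
]
snapshots = group_by_timestamp(facts)
window = history_window(snapshots, query_time=3, length=2)
print(window)
```

A longer window gives the model more evidence per query, but each additional snapshot must also be encoded, which is the computational trade-off described above.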

5.5.5. Ablation Study

To better understand how the different model components affect reasoning, we conducted an ablation study on the YAGO, GDELT, ICEWS14, ICEWS05-15, and ICEWS18 datasets.
Table 5 shows the ablation results for the different modules in our model. We find that the static graph similarity constraint mechanism (SGSC) has the most significant impact on performance. Removing SGSC significantly decreases the performance on four datasets, emphasizing the importance of extracting the similarity features of entity static attributes for prediction. In addition, when the relation-aware gating unit (RGU) and the adaptive time gate network structure (ATGN) are removed, the decrease on all datasets is slight compared with that caused by removing SGSC. The relation-aware gating unit combines the gated recurrent unit with relationship similarity, which makes it easier to capture the evolution of entity relationships in the temporal knowledge graph. The adaptive time gate network structure helps the ERSP model better understand the dynamic changes of entities during temporal evolution, and flexibly captures the temporal correlation and feature dependence of entity information between adjacent time steps.
These results further show that capturing more comprehensive similarity features and dynamic changes in entity relationships, as well as the similarity features of static attributes, is helpful for prediction.

5.6. Future Directions

We have demonstrated the superiority of ERSP in reasoning and prediction tasks for TKGs. In future work, this idea can be applied to other TKG-based tasks, such as knowledge graph-based Q&A systems, healthcare systems, and intelligent recommendation systems. It can also be extended by combining large graph models (LGMs) and transferable learning [45]. Large graph models learn general knowledge from a large amount of graph data and combine deep learning methods to handle more complex graph structures and features. Transferable graph learning can be combined with multimodal data to achieve more comprehensive knowledge representation and processing. Both directions have very broad application scenarios.
For this model, we will try to address two challenges, one concerning data and one concerning methods: (1) Sparsity of relational data. For sparse temporal knowledge graph datasets, we will introduce more auxiliary information or external data sources to overcome the problem of insufficient data volume. (2) High model complexity. We will focus on methods for improving computational efficiency and try to adopt more efficient graph convolution network structures or parameter optimization techniques to reduce computational complexity.

6. Conclusions

In this paper, we introduce the ERSP model, a reasoning approach for temporal knowledge graphs that focuses on perceiving the similarity of relationships between entities. The evolution of entities and relations is learned by comprehensively capturing the similarity features of entity relationships. ERSP also incorporates the similarity features of entity static attributes (such as entity types) into the evolutionary representation, and uses the evolutionary representation at the final timestamp, combined with a scoring function, for temporal reasoning. Experiments on five benchmark datasets show that ERSP is effective and significantly superior in entity prediction and relation prediction.

Author Contributions

Conceptualization, S.F. and C.Z.; methodology, S.F. and C.Z.; software, S.F., C.Z. and Q.L.; validation, C.Z. and X.J.; formal analysis, C.Z., Q.L. and X.J.; resources, X.J.; writing—original draft preparation, S.F. and C.Z.; writing—review and editing, S.F., C.Z., Q.L. and M.H.; project administration, S.F.; funding acquisition, M.H. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the following funding sources: the National Natural Science Foundation of China (Grant No. 62241202), the National Key Research and Development Program (Grant No. 2021ZD0111000), and the Key Research and Development Program of the Ministry of Science and Technology (Grant No. 2021ZD0111002).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data are contained within the article.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Yan, S.; Tian, C.; Zhang, Z.; Xu, G. LollipopE: Bi-centered lollipop embedding for complex logic query on knowledge graph. Neural Netw. 2024, 175, 106277.
  2. Dong, H.; Wang, P.; Xiao, M.; Ning, Z.; Wang, P.; Zhou, Y. Temporal inductive path neural network for temporal knowledge graph reasoning. Artif. Intell. 2024, 329, 104085.
  3. Sun, X.; Cheng, H.; Li, J.; Liu, B.; Guan, J. All in one: Multi-task prompting for graph neural networks. In Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Long Beach, CA, USA, 6–10 August 2023.
  4. Sun, X.; Cheng, H.; Liu, B.; Li, J.; Chen, H.; Xu, G.; Yin, H. Self-supervised hypergraph representation learning for sociological analysis. IEEE Trans. Knowl. Data Eng. 2023, 35, 11860–11871.
  5. Sun, X.; Cheng, H.; Dong, H.; Qiao, B.; Qin, S.; Lin, Q. Counter-Empirical Attacking based on Adversarial Reinforcement Learning for Time-Relevant Scoring System. IEEE Trans. Knowl. Data Eng. 2023, 1–12.
  6. Zhu, C.; Chen, M.; Fan, C.; Cheng, G.; Zhang, Y. Learning from history: Modeling temporal knowledge graphs with sequential copy-generation networks. Proc. AAAI Conf. Artif. Intell. 2021, 35, 4732–4740.
  7. Han, Z.; Ding, Z.; Ma, Y.; Gu, Y.; Tresp, V. Learning neural ordinary equations for forecasting future links on temporal knowledge graphs. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, Punta Cana, Dominican Republic, 7–11 November 2021; pp. 8352–8364.
  8. Sun, H.; Zhong, J.; Ma, Y.; Han, Z.; He, K. Timetraveler: Reinforcement learning for temporal knowledge graph forecasting. arXiv 2021, arXiv:2109.04101.
  9. Li, Z.; Guan, S.; Jin, X.; Peng, W.; Lyu, Y.; Zhu, Y.; Bai, L.; Li, W.; Guo, J.; Cheng, X. Complex evolutional pattern learning for temporal knowledge graph reasoning. arXiv 2022, arXiv:2203.07782.
  10. Mei, X.; Yang, L.; Jiang, Z.; Cai, X.; Gao, D.; Han, J.; Pan, S. An Inductive Reasoning Model based on Interpretable Logical Rules over temporal knowledge graph. Neural Netw. 2024, 174, 106219.
  11. Leblay, J.; Chekol, M.W. Deriving validity time in knowledge graph. In Proceedings of the Companion Proceedings of the The Web Conference 2018, Lyon, France, 23–27 April 2018; pp. 1771–1776.
  12. Bordes, A.; Usunier, N.; Garcia-Duran, A.; Weston, J.; Yakhnenko, O. Translating embeddings for modeling multi-relational data. In Proceedings of the 26th International Conference on Neural Information Processing Systems, Red Hook, NY, USA, 5–10 December 2013.
  13. García-Durán, A.; Dumančić, S.; Niepert, M. Learning sequence encoders for temporal knowledge graph completion. arXiv 2018, arXiv:1809.03202.
  14. Lacroix, T.; Obozinski, G.; Usunier, N. Tensor decompositions for temporal knowledge base completion. arXiv 2020, arXiv:2004.04926.
  15. Jin, W.; Qu, M.; Jin, X.; Ren, X. Recurrent event network: Autoregressive structure reasoning over temporal knowledge graphs. arXiv 2019, arXiv:1904.05530.
  16. Han, Z.; Chen, P.; Ma, Y.; Tresp, V. Explainable subgraph reasoning for forecasting on temporal knowledge graphs. In Proceedings of the International Conference on Learning Representations, Addis Ababa, Ethiopia, 30 April 2020.
  17. Li, Z.; Jin, X.; Li, W.; Guan, S.; Guo, J.; Shen, H.; Wang, Y.; Cheng, X. Temporal knowledge graph reasoning based on evolutional representation learning. In Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, Virtual Event, 11–15 July 2021; pp. 408–417.
  18. Sun, H.; Geng, S.; Zhong, J.; Hu, H.; He, K. Graph hawkes transformer for extrapolated reasoning on temporal knowledge graphs. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, Abu Dhabi, United Arab Emirates, 7–11 December 2022.
  19. Li, Z.; Feng, S.; Shi, J.; Zhou, Y.; Liao, Y.; Yang, Y.; Li, Y.; Yu, N.; Shao, X. Future Event Prediction Based on Temporal Knowledge Graph Embedding. Comput. Syst. Sci. Eng. 2023, 44, 2411–2423.
  20. Lacoste-Julien, S.; Palla, K.; Davies, A.; Kasneci, G.; Graepel, T.; Ghahramani, Z. Sigma: Simple greedy matching for aligning large knowledge bases. In Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Chicago, IL, USA, 11–13 August 2013.
  21. Song, D.; Luo, Y.; Heflin, J. Linking heterogeneous data in the semantic web using scalable and domain-independent candidate selection. IEEE Trans. Knowl. Data Eng. 2016, 29, 143–156.
  22. Suchanek, F.M.; Abiteboul, S.; Senellart, P. Paris: Probabilistic alignment of relations, instances, and schema. arXiv 2011, arXiv:1111.7164.
  23. Zeng, W.; Zhao, X.; Tang, J.; Lin, X.; Groth, P. Reinforcement learning–based collective entity alignment with adaptive features. ACM Trans. Inf. Syst. 2021, 39, 1–31.
  24. Gao, Y.; Liu, X.; Wu, J.; Li, T.; Wang, P.; Chen, L. Clusterea: Scalable entity alignment with stochastic training and normalized mini-batch similarities. In Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, New York, NY, USA, 14–18 August 2022.
  25. Sun, X.; Zhang, J.; Wu, X.; Cheng, H.; Xiong, Y.; Li, J. Graph prompt learning: A comprehensive survey and beyond. arXiv 2023, arXiv:2311.16534.
  26. Korkmaz, G.; Cadena, J.; Kuhlman, C.J.; Marathe, A.; Vullikanti, A.; Ramakrishnan, N. Combining heterogeneous data sources for civil unrest forecasting. In Proceedings of the 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, Paris, France, 25–28 August 2015; pp. 258–265.
  27. Vashishth, S.; Sanyal, S.; Nitin, V.; Talukdar, P. Composition-based multi-relational graph convolutional networks. arXiv 2019, arXiv:1911.03082.
  28. Mahdisoltani, F.; Biega, J.; Suchanek, F.M. Yago3: A Knowledge Base from Multilingual Wikipedias; CIDR; HAL CCSD: Montpellier, France, 2013.
  29. Leetaru, K.; Schrodt, P.A. Gdelt: Global Data on Events, Location, and Tone, 1979–2012; ISA Annual Convention; Citeseer: Princeton, NJ, USA, 2013; Volume 2, pp. 1–49.
  30. Sun, X.; Yin, H.; Liu, B.; Chen, H.; Meng, Q.; Han, W.; Cao, J. Multi-level hyperedge distillation for social linking prediction on sparsely observed networks. In Proceedings of the Web Conference, Ljubljana, Slovenia, 19–23 April 2021.
  31. Bollen, J.; Mao, H.; Zeng, X. Twitter mood predicts the stock market. J. Comput. Sci. 2011, 2, 1–8.
  32. Li, Y.; Sun, X.; Chen, H.; Zhang, S.; Yang, Y.; Xu, G. Attention Is Not the Only Choice: Counterfactual Reasoning for Path-Based Explainable Recommendation. arXiv 2024, arXiv:2401.05744.
  33. Trouillon, T.; Welbl, J.; Riedel, S.; Gaussier, É.; Bouchard, G. Complex embeddings for simple link prediction. In Proceedings of the International Conference on Machine Learning, Phoenix, AZ, USA, 12–17 February 2016; pp. 2071–2080.
  34. Yang, B.; Yih, W.T.; He, X.; Gao, J.; Deng, L. Embedding entities and relations for learning and reasoning in knowledge bases. arXiv 2014, arXiv:1412.6575.
  35. Sun, Z.; Deng, Z.H.; Nie, J.Y.; Tang, J. Rotate: Knowledge graph embedding by relational rotation in complex space. arXiv 2019, arXiv:1902.10197.
  36. Dettmers, T.; Minervini, P.; Stenetorp, P.; Riedel, S. Convolutional 2d knowledge graph embeddings. In Proceedings of the AAAI Conference on Artificial Intelligence, New Orleans, LA, USA, 2–7 February 2018; Volume 32.
  37. Shang, C.; Tang, Y.; Huang, J.; Bi, J.; He, X.; Zhou, B. End-to-end structure-aware convolutional networks for knowledge base completion. Proc. AAAI Conf. Artif. Intell. 2019, 33, 3060–3067.
  38. Dasgupta, S.S.; Ray, S.N.; Talukdar, P. Hyte: Hyperplane-based temporally aware knowledge graph embedding. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, 31 October–4 November 2018; pp. 2001–2011.
  39. Goel, R.; Kazemi, S.M.; Brubaker, M.; Poupart, P. Diachronic embedding for temporal knowledge graph completion. Proc. AAAI Conf. Artif. Intell. 2020, 34, 3988–3995.
  40. Kazemi, S.M.; Poole, D. Simple embedding for link prediction in knowledge graphs. Adv. Neural Inf. Process. Syst. 2018, 31.
  41. Gao, Y.; Feng, L.; Kan, Z.; Han, Y.; Qiao, L.; Li, D. Modeling Precursors for Temporal Knowledge Graph Reasoning via Auto-encoder Structure. In Proceedings of the 31st International Joint Conference on Artificial Intelligence Main Track, Vienna, Austria, 23–29 July 2022.
  42. Xu, W.; Liu, B.; Peng, M.; Jia, X.; Peng, M. Pre-trained language model with prompts for temporal knowledge graph completion. arXiv 2023, arXiv:2305.07912.
  43. Kingma, D.P.; Ba, J. Adam: A method for stochastic optimization. arXiv 2014, arXiv:1412.6980.
  44. Liu, K.; Zhao, F.; Xu, G.; Wang, X.; Jin, H. Temporal knowledge graph reasoning via time-distributed representation learning. In Proceedings of the 2022 IEEE International Conference on Data Mining (ICDM), Orlando, FL, USA, 28 November–1 December 2022.
  45. Sun, X.; Yin, H.; Liu, B.; Meng, Q.; Cao, J.; Zhou, A.; Chen, H. Structure learning via meta-hyperedge for dynamic rumor detection. IEEE Trans. Knowl. Data Eng. 2022, 35, 9128–9139.
Figure 1. An example of a temporal knowledge graph.
Figure 2. Example of different historical facts associated with a query from GDELT. Different arrows indicate similarity information.
Figure 3. An illustrative diagram of the proposed ERSP model. The RS component represents the relationship similarity-aware component. The SAS component represents the static attribute similarity-aware component. ATGN represents the adaptive time gate network. SGC-Sim represents the static graph Similarity constraint.
Figure 4. Performance (in percentage) of the entity prediction task with ICEWS14 and ICEWS05-15.
Figure 5. Performance (in percentage) of various embedding dimensions with ICEWS14.
Figure 6. Performance (in percentage) of different history length settings with ICEWS14.
Table 1. Symbols and descriptions.
| Symbol | Description |
| --- | --- |
| $G$ | A temporal knowledge graph. |
| $H$ | The entity set in the TKG. |
| $R$ | The relation set in the TKG. |
| $M$ | The fact set. |
| $E_s$, $E_o$ | Subject entity, object entity. |
| $W_r^{\eta}$ | The weight matrix related to the relation $r$. |
| $W_o^{\eta}$ | The weight matrix related to the object entity $o$. |
| $E_s$ | The static embedding matrix. |
| $E_e$ | The evolutionary embedding matrix. |
| $H_s$ | The similarity matrix. |
| $\Phi(\cdot)$ | The activation function. |
| $|H|$ | The total number of entities. |
| $|R|$ | The total number of relations. |
| $P_{score}^{E}$ | The score probability of the entity. |
| $P_{score}^{r}$ | The scoring probability of the relation. |
Table 2. Performance (in percentage) of the entity prediction task with ICEWS14, ICEWS05-15, and ICEWS18. (Higher values indicate better performance.)
| Model | ICE14 MRR | ICE14 H@1 | ICE14 H@3 | ICE14 H@10 | ICE05-15 MRR | ICE05-15 H@1 | ICE05-15 H@3 | ICE05-15 H@10 | ICE18 MRR | ICE18 H@1 | ICE18 H@3 | ICE18 H@10 |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| DistMult ¹ | 20.32 | 6.13 | 27.59 | 46.61 | 19.91 | 5.63 | 27.22 | 47.33 | 13.86 | 5.61 | 15.22 | 31.26 |
| ComplEx ¹ | 22.61 | 9.88 | 28.93 | 47.57 | 20.26 | 6.66 | 26.43 | 47.31 | 15.45 | 8.04 | 17.19 | 30.73 |
| R-GCN ¹ | 28.03 | 19.42 | 31.95 | 44.83 | 27.13 | 18.83 | 30.41 | 43.16 | 15.05 | 8.13 | 16.49 | 29.00 |
| ConvE ¹ | 30.30 | 21.30 | 34.42 | 47.89 | 31.40 | 21.56 | 35.70 | 50.96 | 22.81 | 13.63 | 25.83 | 41.43 |
| Conv-TransE ¹ | 31.50 | 22.46 | 34.98 | 50.03 | 30.28 | 20.79 | 33.80 | 49.95 | 23.22 | 14.26 | 26.13 | 41.34 |
| RotatE ¹ | 25.71 | 16.41 | 29.01 | 45.16 | 19.01 | 10.42 | 21.35 | 36.92 | 14.53 | 6.47 | 15.78 | 31.86 |
| HyTE ¹ | 16.78 | 2.13 | 24.84 | 43.94 | 16.05 | 6.53 | 20.20 | 34.72 | 7.41 | 3.10 | 7.33 | 16.01 |
| TTransE ¹ | 12.86 | 3.14 | 15.72 | 33.65 | 16.53 | 5.51 | 20.77 | 39.26 | 8.44 | 1.85 | 8.95 | 22.38 |
| TA-DistMult ¹ | 26.22 | 16.83 | 29.72 | 45.23 | 27.51 | 17.57 | 31.46 | 47.32 | 16.42 | 8.60 | 18.13 | 32.51 |
| DE-SimplE ² | 32.67 | 24.43 | 35.69 | 49.11 | 35.02 | 25.91 | 38.99 | 52.75 | 19.30 | 11.53 | 21.86 | 34.80 |
| TNT-ComplEx ² | 32.12 | 23.35 | 36.03 | 49.13 | 27.54 | 9.52 | 30.80 | 42.86 | 21.23 | 13.28 | 24.02 | 36.91 |
| CyGNet ¹ | 34.68 | 25.35 | 38.88 | 53.16 | 35.46 | 25.44 | 40.20 | 54.47 | 24.98 | 15.54 | 28.58 | 43.54 |
| RE-NET ¹ | 35.77 | 25.99 | 40.10 | 54.87 | 36.86 | 26.24 | 41.85 | 57.60 | 26.17 | 16.43 | 29.89 | 44.37 |
| TANGO-DistMult | 22.87 | 14.22 | 25.43 | 40.32 | 40.23 | 30.53 | 44.95 | 59.05 | 26.21 | 16.92 | 29.77 | 44.41 |
| TANGO-Tucker | 24.36 | 15.12 | 27.15 | 43.07 | 41.82 | 31.10 | 47.55 | 62.19 | 24.36 | 15.12 | 27.15 | 43.07 |
| RE-GCN ¹ | 41.25 | 30.46 | 46.26 | 62.05 | 45.61 | 34.43 | 51.85 | 66.64 | 30.55 | 20.00 | 34.73 | 51.46 |
| xERTE ¹ | 32.23 | 24.29 | 24.29 | 24.29 | 38.07 | 28.45 | 43.92 | 57.62 | 27.98 | 19.26 | 32.43 | 46.00 |
| GHT | 37.40 | 27.77 | 41.66 | 56.19 | 41.50 | 30.79 | 46.85 | 62.73 | 27.40 | 18.08 | 30.76 | 45.76 |
| rGalT | 38.33 | 28.57 | 42.86 | 58.13 | 38.89 | 27.58 | 44.19 | 59.10 | 27.88 | 18.01 | 31.59 | 47.02 |
| PPT | 38.42 | 28.94 | 42.50 | 57.01 | 38.85 | 28.57 | 43.35 | 58.63 | 26.63 | 16.94 | 30.64 | 45.43 |
| ERSP | 42.65 | 31.88 | 47.99 | 63.64 | 47.10 | 35.68 | 53.42 | 68.70 | 31.17 | 20.45 | 35.39 | 52.39 |
¹ The results are taken from [44]. ² The results are taken from [18]. Other results are taken from the original papers.
Table 3. Performance (in percentage) of the entity prediction task with YAGO and GDELT.
| Model | GDELT MRR | GDELT H@1 | GDELT H@3 | GDELT H@10 | YAGO MRR | YAGO H@3 | YAGO H@10 |
| --- | --- | --- | --- | --- | --- | --- | --- |
| DistMult ¹ | 8.61 | 3.91 | 8.27 | 17.04 | 44.05 | 49.70 | 59.94 |
| ComplEx ¹ | 9.84 | 5.17 | 9.58 | 18.23 | 44.09 | 49.57 | 59.64 |
| R-GCN ¹ | 12.17 | 7.40 | 12.37 | 20.63 | 20.25 | 24.01 | 37.30 |
| ConvE ¹ | 18.37 | 11.29 | 19.36 | 32.13 | 41.22 | 47.03 | 59.90 |
| Conv-TransE ¹ | 19.07 | 11.85 | 20.32 | 33.14 | 46.67 | 52.22 | 62.52 |
| RotatE ¹ | 3.62 | 0.52 | 2.26 | 8.37 | 42.08 | 46.77 | 59.39 |
| HyTE ¹ | 6.69 | 0.01 | 7.57 | 19.06 | 14.42 | 39.73 | 46.98 |
| TTransE ¹ | 5.53 | 0.46 | 4.97 | 15.37 | 26.10 | 36.28 | 47.73 |
| TA-DistMult ¹ | 10.34 | 4.44 | 10.44 | 21.63 | 44.98 | 50.64 | 61.11 |
| RGCRN ² | 18.63 | 11.53 | 19.80 | 32.42 | 43.71 | 48.53 | 56.98 |
| CyGNet ¹ | 18.05 | 11.13 | 19.11 | 31.50 | 46.72 | 52.48 | 61.52 |
| RE-NET ¹ | 19.60 | 12.03 | 20.56 | 33.89 | 46.81 | 52.71 | 61.93 |
| TANGO-DistMult | - | - | - | - | 49.49 | 55.42 | 63.74 |
| TANGO-Tucker | - | - | - | - | 49.31 | 55.12 | 63.73 |
| RE-GCN ¹ | 19.31 | 11.99 | 20.61 | 33.59 | 62.50 | 70.24 | 81.55 |
| rGalT | 19.56 | 12.11 | 20.89 | 34.15 | 51.45 | 57.76 | 68.31 |
| RE-GAT | 19.11 | 11.80 | 20.44 | 33.34 | - | - | - |
| ERSP | 19.83 | 12.25 | 21.27 | 34.63 | 63.89 | 72.78 | 84.18 |
¹ The results are taken from [44]. ² The results are taken from [17]. Other results are taken from the original papers.
Table 4. Performance (in percentage) of the relation-prediction task with ICEWS18, ICEWS14, ICEWS05-15, YAGO, and GDELT.
| Model | ICE18 | ICE14 | ICE05-15 | YAGO | GDELT |
| --- | --- | --- | --- | --- | --- |
| ConvE ¹ | 37.73 | 38.80 | 37.89 | 91.33 | 18.84 |
| ConvTransE ¹ | 38.00 | 38.40 | 38.26 | 90.98 | 18.97 |
| RGCRN ¹ | 37.14 | 38.04 | 38.37 | 90.18 | 18.58 |
| RE-GCN ¹ | 40.53 | 41.06 | 40.63 | 93.85 | 19.22 |
| ERSP | 41.17 | 41.14 | 41.20 | 94.08 | 19.61 |
¹ The results are taken from [17].
Table 5. Ablation studies on the ERSP model. RGU is the relation-aware gating unit, SGSC is the static graph similarity constraint mechanism, and ATGN is the adaptive time gate network structure.
| Model | ICE18 | ICE14 | ICE05-15 | YAGO | GDELT |
| --- | --- | --- | --- | --- | --- |
| RGU | 31.13 | 41.62 | 46.83 | 63.89 | 19.87 |
| SGSC | 31.28 | 42.13 | 47.00 | 63.83 | 19.86 |
| RGU + SGSC | 31.12 | 41.99 | 47.00 | 63.78 | 19.86 |
| RGU + ATGN | 30.98 | 42.01 | 46.51 | 63.79 | 19.91 |
| SGSC + ATGN | 31.20 | 42.30 | 46.91 | 63.78 | 19.89 |
| ERSP | 31.19 | 42.65 | 47.10 | 63.89 | 19.83 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Feng, S.; Zhou, C.; Liu, Q.; Ji, X.; Huang, M. Temporal Knowledge Graph Reasoning Based on Entity Relationship Similarity Perception. Electronics 2024, 13, 2417. https://doi.org/10.3390/electronics13122417
