Next Article in Journal
DIPA: Adversarial Attack on DNNs by Dropping Information and Pixel-Level Attack on Attention
Next Article in Special Issue
Construction of Knowledge Graphs: Current State and Challenges
Previous Article in Journal
The Development of a Prototype Solution for Collecting Information on Cycling and Hiking Trail Users
Previous Article in Special Issue
Intelligent Text Mining for Ontological Knowledge Graph Refinement and Patent Portfolio Analysis—Case Study of Net-Zero Data Center Innovation Management
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Review

Use of Knowledge Graphs for Construction Safety Management: A Systematic Literature Review

Department of Civil and Environmental Engineering, Hongik University, Seoul 04066, Republic of Korea
*
Author to whom correspondence should be addressed.
Information 2024, 15(7), 390; https://doi.org/10.3390/info15070390
Submission received: 12 April 2024 / Revised: 21 June 2024 / Accepted: 28 June 2024 / Published: 3 July 2024
(This article belongs to the Special Issue Knowledge Graph Technology and its Applications II)

Abstract

:
Effective safety management is crucial in the construction industry. The growing interest in employing Knowledge Graphs (KGs) for safety management in construction is driven by the need for efficient computing-aided safety practices. This paper systematically reviews the literature related to automating safety management processes through knowledge base systems, focusing on the creation and utilization of KGs for construction safety. It captures current methodologies for developing and using KGs in construction safety management, outlining the techniques for each phase of KG development, including scope identification, integration of external data, ontological modeling, data extraction, and KG completion. This provides structured guidance on building a KG for safety management. Moreover, this paper discusses the challenges and limitations that hinder the wider adoption of KGs in construction safety management, leading to the identification of goals and considerations for future research.

1. Introduction

The construction industry is recognized globally as one of the most hazardous sectors. Persistent safety challenges are highlighted by stark fatality statistics. In 2020, fatalities included 1008 in the USA [1], 27 in the UK [2], and 794 in China [3]. The occurrence of injuries and occupational diseases is similarly alarming. The US construction sector reported 174,000 nonfatal injuries and illnesses in 2020 [2]. In the UK, the construction industry averaged 59,000 injuries annually between 2019 and 2021, with a notable increase following the coronavirus pandemic [4].
Both academia and industry have advanced a range of safety management theories and methods, covering performance measurement, event documentation, policy formulation, and risk evaluation [5]. With particular interests in technological advancements for safety, researchers like Fang et al. [6] and Ye et al. [7] have introduced computer vision and intelligent platforms for hazard identification and response. These automation-oriented approaches to safety management, grounded in digital technology, necessitate enhanced gathering, processing, and analyzing of safety data from diverse sources, including sensors [8], accident reports [9], and surveillance systems [10].
The rise of digitalization and data science has broadened the sources of safety information. Technologies like Virtual Reality (VR), Geographic Information System (GIS), 3D & 4D Computer-Aided Design (CAD), Building Information Modeling (BIM), and the Internet of Things (IoT) have significantly diversified the data supporting safety management in construction [11]. While these varied techniques offer vast opportunities for collecting various types of safety-related information, the diversity and volume of data presented new challenges: (1) incompatible file formats—VR, CAD, or BIM IFC file formats vary across software, complicating data integration and analysis; (2) knowledge entity ambiguity—the different uses of construction terms, such as “Toolbox” in “Toolbox Meeting” or “Toolbox Talks”, which denote brief safety meetings, may cause confusion in processing such textual data; (3) information redundancy—safety regulations, frequently updated by authorities or project management teams, can render some data obsolete or redundant; (4) interoperability issues—employing multiple tools for creating and processing safety-related data complicates gaining comprehensive insights, especially due to interoperability issues among software programs and platforms [12].
Knowledge Graph (KG) technology offers a method for representing and integrating such heterogeneous safety-related data. A KG is structured as a semantic graph comprising nodes (or vertices) and edges, which encapsulate and denote data from various sources. Nodes can symbolize concepts or entities, such as workers, tools, or environmental factors, while edges represent the relationships among them, such as operations, commands, or causation [13], in the context of construction safety. For instance, as Figure 1 depicts, a regulation about working on a roof could be represented in a KG as follows. Rule 1 requires fall protection (including harnesses, guardrails, lanyards, and safety nets) and PPE (helmet and non-slip footwear). The relationship “requires” links Rule 1 to these safety measures. A worker, linked by the “equipped” relationship, must use this equipment. The “supervise” relationship connects the safety manager to the worker, ensuring compliance. Finally, the worker is linked by the “work” relationship to the task “Work on roof”. This KG structure clarifies the dependencies and requirements for safe roof work. By organizing data into graphs, a KG also enables data retrieval based on semantic relationships between specific data. Also, this method aids in uncovering insights from datasets that are not apparent in the original data [14].
The appeal of Knowledge Graphs (KGs) has garnered widespread interest. Commercial IT giants like Google, IBM, Amazon, and LinkedIn have each developed their own proprietary KGs [15,16]. Moreover, there are various open-source KGs available, such as DBpedia, Google’s Knowledge Graph, and YAGO, drawing on general knowledge and official databases [17]. Several researchers proposed the use of KGs in health and safety management in multiple domains. Wu et al. [18] integrated Bayesian networks with KGs for fault prognosis in high-speed railway traction devices. In aviation, KGs have been utilized to analyze risk reports and assist in flight [19,20]. Wang et al. [21] created a KG for analyzing railway electrical accidents. Furthermore, KGs have supported safety management in electrical grids, including incident analysis [22], fault handling [23], and safety planning [24]. Notably, the development of KGs in the healthcare domain accelerated due to the COVID-19 pandemic, aiding in the integration of information from various organizations and identifying transmission patterns [25].
In the domain of construction safety management in particular, several researchers also proposed developing Knowledge Graphs (KGs) for organizing different types of data. Jiang et al. [26] conducted a case study on developing a KG for safety standards in the construction industry. Chen et al. [27] proposed to integrate linguistic and visual information about on-site occupational hazards, extracted through AI models, into a KG. Preliminary experiments have also explored converting Industry Foundation Classes (IFC) from BIM models to develop safety ontologies and map them onto graphs [28]. As such, research on KGs within construction safety management is emerging, but there is a lack of research that systematically analyzes the application, development, trends, and challenges of using KGs in the construction safety management domain.
Against this backdrop, this paper aims to thoroughly review the application of Knowledge Graphs (KGs) in the construction safety management domain and to propose a detailed, step-by-step process for developing KGs specifically for this domain. More specifically, this review sets out with the following objectives to fulfill the study’s aim:
(1)
To identify global trends and research focal points within scientific publications on KGs in construction safety management;
(2)
To categorize current research efforts in applying KGs in the domain.
(3)
To model the methods employed at each stage of constructing KGs for construction safety management.
The rest of the paper is organized as follows. The next section details the methodology for the literature retrieval process. The third section presents a scientometric analysis of 139 publications selected for this review paper. The fourth section critically examines the categorization, construction procedures, and applications of KGs. The fifth section summarizes the main methods used in KG development and highlights five major issues impeding the use of KGs in the construction safety management domain. The conclusion is presented in the sixth section.

2. Selection of Papers for Review

Web of Science (WoS) and Scopus are two renowned and extensively utilized databases in the engineering field. Previous studies have indicated that, while Web of Science offers rich metadata for analysis, Scopus boasts a wider array of literature resources [29]. To ensure a comprehensive database for preliminary analysis, the authors opted for these two databases. The search timeframe was set from 2000 onwards to encompass the relevant literature and developments in the field [30], as the application of KGs in safety management domain was scarce before then.
The search strategy encompassed three themes: KG, safety, and construction. For the safety theme, terms like “accident”, “incident”, “job hazard”, “risk OR risk management”, “safety OR safety management”, and “occupational health” were identified. The construction theme included keywords such as “construction”, “civil engineering”, and “construction industry”, while “knowledge graph” was deemed sufficient to represent its own theme. As such, the final search formula executed in Scopus was ALL (“knowledge graph”) AND TITLE-ABS-KEY (“accident” OR “incident” OR “risk” OR “safety” OR “occupational health”) AND TITLE-ABS-KEY (“construction” OR “civil engineering” OR “construction industry”). Adjustments were made to the search syntax for application in WoS such that the papers relevant to all three themes are searched in the database.
As a result of the initial search, a total of 368 publications were retrieved. Subsequent filtering was conducted to eliminate duplicates across the databases. Following the PRISMA guideline [31], the contents were meticulously reviewed to discard papers unrelated to the topic, resulting in 139 papers suitable for review, as presented in Figure 2 (the research project was registered in Open Science Framework). These papers span various publication years and categories. The bulk of the publications were in journals (67.7%), while conference proceedings accounted for the remaining 32.3%. An upward trend in publications began in 2016, with a significant increase noted in 2022, and the momentum remained high into 2023 compared to earlier years. Table 1 summarizes the distribution over the years and the categories of the publications selected for review.
Furthermore, we identified journals that have published more than two papers on this topic in recent years. The top eight journals accounted for 24.5% of the total papers, with “Automation in Construction”, “Advanced Engineering Informatics”, and “Buildings” emerging as the most prolific in terms of publications on KGs in construction safety management.

3. Scientometric Analysis

The scientometric approach is a statistical method that elucidates the knowledge structure of a specific domain by analyzing extensive literature sources. Constructing a comprehensive knowledge structure from large-scale bibliographic data is both time-consuming and challenging for researchers. In review studies, mapping and clustering techniques prove useful for revealing relationships and characteristics among articles, keywords, authors, and journals [32]. To support scientometric analysis, various tools have been developed, including Bibexcel, VOSviewer, and CiteSpace [33]. This study employed CiteSpace 6.2R4, a robust Java-based software that is capable of processing data from multiple sources to generate networks of heterogeneous nodes [34].
Utilizing CiteSpace, the study extracted the co-occurrence of keywords directly from the bibliographic data. The criteria for filtering papers were defined by a modified g-index with a scale factor value of 50. Figure 3 showcases the co-occurrence links among various keywords, with the size of the label font and node corresponding to the frequency of publications containing the keyword. The six most-frequent keywords identified were “knowledge graph”, “deep learning”, “system”, “ontology”, “management”, and “risk assessment”. The centrality of the knowledge network’s nodes was calculated to assess their role within the network. The analysis revealed that the five keywords with the highest connectivity were “deep learning”, “data mining”, “BIM”, “construction method”, and “design”, indicating their significant influence on other nodes within the network. The keywords could be categorized into two groups: the first group pertains to techniques, including “deep learning”, “data mining”, “BIM”, “named entity recognition”, and “ontology”, among others; the second group relates to the field’s application, featuring keywords such as “risk assessment”, “construction safety”, “knowledge representation”, and “design”.
Cluster analysis is frequently employed to extract themes and research insights from extensive bibliographic data [32]. In this study, the modularity Q value of the keyword network is 0.7887, suggesting that the network can be effectively segmented into clusters for more detailed analysis (Q value quantifies the strength of how a network can be divided into distinct communities [35]). Figure 4 illustrates eight clusters identified. Furthermore, this paper applied silhouette score (range from −1 to 1) to evaluate the quality of clustering; a closer value to 1 indicates a clear clustering [36]. The silhouette score for these clusters exceeds 0.827, affirming their internal homogeneity and suitability for further investigation. The cluster analysis reveals that construction site-related terms are a focal point in the papers, while deep learning and complex system analysis are frequently linked with the topic as methodical approaches. Moreover, the representation of information through Knowledge Graphs appears to be a primary goal across the publications.
Overall, the analysis of keyword co-occurrence and cluster formation within the selected papers highlights two principal research dimensions. The first concerns artificial intelligence (AI) techniques, featuring keywords like “deep learning”, “name entity recognition”, and “data mining” and clusters like “deep neural network”. The second concerns safety management themes, with keywords such as “BIM” and “risk assessment” and clusters like “accident scenario” and “healthy operation index”. KGs provides a structured representation of information possibly from multiple sources by connecting data points through nodes and edges. Structured information makes AI algorithms such as natural language processing and machine learning easier to understand the relationships of information and the context of data. On the other hand, KGs enable semantic search by both the meanings and the relationships of words, not just word matching, which enriches the information retrieval capabilities of AI systems. The trend towards leveraging AI for knowledge acquisition and utilizing KGs in construction safety management, which is consistent with the direction of AI development.

4. Categories of KGs for Construction Safety Management

This section is structured into three subsections. The first subsection delineates the categories of KGs in safety management, shedding light on the dominant research perspectives in academia. The second subsection provides a comprehensive overview of the processes and methods required to develop a KG in the context of safety management. The third and final subsection outlines the application scenarios of KGs in construction safety management research, demonstrating the current trends in their use.

4.1. The Categories of Knowledge Graphs in Safety Management

KGs in safety management can be classified from several angles. A notable approach is the incorporation of time series elements into KGs, leading to their classification into static and dynamic categories. Furthermore, the data source, whether singular or multiple, impacts data-processing methods, graph scope, and information diversity, resulting in single-source and fusion-source KG categories. From a usage standpoint, KGs are differentiated into domain-specific KGs and general KGs, depending on whether their entities and edges are confined to a specific domain or encompass a broader range. In the field of safety management, the focus is often on developing and analyzing domain-specific KGs rather than general ones. Consequently, the first two classifications are explored in further detail below.
Time consideration plays a vital role in differentiating between static and dynamic KGs. In conventional KGs, relationships among nodes are usually depicted without regard for temporal changes. However, in the safety management domain, time is an essential factor due to the dynamic nature of accident occurrence scenarios. Recent research has delved into two primary methods for incorporating time elements into KGs. The first method involves incorporating time information directly into the definition of entities and relations in the KG. Leblay et al. [37] introduced a KG where time intervals are used to represent relationships between entities, along with utilizing event timelines to forecast future connections, thus providing enhanced understanding of event relationships evolving over time. Moreover, several studies have embedded timestamps into nodes, enabling these KGs to capture temporal changes effectively and facilitate dynamic analysis [38]. The second method updates the graph contents at regular intervals. Duan et al. [39] developed a crowd congestion KG to depict real-time evacuation scenarios, extracting data from sensors and videos and employing a dynamic algorithm to refresh node attributes at each time interval.
The differentiation between single-source KGs and fusion-source KGs lies in their data integration process. Single-source KGs rely on homogeneous data types, such as standardized accident reports, official statistical reports, and field logs. These data are consistent and standardized, facilitating straightforward retrieval and processing from the respective sources. KGs based on a single source are, therefore, typically more concise and clearer due to this uniformity. In contrast, fusion-source KGs have gained more attention as access to a variety of data types has increased, including videos, event news, meteorological data, and safety regulations. For purely textual data, researchers employ natural language processing techniques to extract and convert related data into a uniform format [38,40,41]. Researchers have explored the use of computer vision alongside natural language processing to manage different data types [27,42]. However, the integration of varied data sources through KGs in the safety management domain is still in its early stages.

4.2. Application Scenarios of KG

Most use cases of KGs in the construction safety management domain are primarily for data representation and storage, which are considered initial phases of KG utilization [43,44,45,46]. However, several pioneering researchers have recently explored a more advanced use of KGs in this domain:
(1)
Creation of Accident Evolution Networks for Key Factor Identification: KGs have been used to depict risk factors as entities and connect different risk factors with directed or undirected edges, forming a network that models accident evolution. The underlying idea is that identifying critical risk factors along potential accident paths before an incident can facilitate targeted preventive measures to avert accident escalation and recurrence [40];
(2)
KG-Based Construction Safety Recommendation System: Adherence to safety regulations is essential for the competence of construction practitioners. However, gaining knowledge of these regulations often requires significant time and educational investment. Some studies have focused on extracting regulatory requirements into KGs for application in safety scenarios, aiding in comprehension and compliance [47];
(3)
Entity-Based Risk Assessment: Utilizing the structural characteristics of KGs, algorithms that compute the shortest path and node connections can facilitate risk assessments [48]. For example, in their study, hazard-related entities such as incident cause, consequence, time, location, and speed were extracted as nodes, with causation serving as links to construct KGs. Consequently, nodes closer to the incident’s nodes indicate a higher risk and occurrence associated with these nodes. Network path techniques such as closeness centrality have been employed to quantify the impact of entity occurrences, with thresholds set to evaluate entities’ risk levels. Direct causation and indirect causation elements were assessed as active and passive causal closeness to calculate the probability of accident occurrence.
These applications highlight ongoing efforts to leverage KGs for advanced analysis and problem-solving in construction safety management, pointing toward a future where KGs play a central role in enhancing safety measures and reducing risk.

4.3. Trends in KG Research in the Construction Safety Management Domain

From a technological perspective, KGs are increasingly being integrated with natural language processing, machine learning, and recommender [49]. Specifically, research in construction safety management by leveraging KGs is advancing in three main directions:
(1)
Transformation of data into knowledge: By transforming construction data into nodes and edges, the semantic connections between construction management elements become clearer. For example, risk analysts decompose project entities into nodes and their interactions into links to illustrate complex relationships [6,27]. Most publications employ KGs as a method to bolster their findings, allowing for concise expression that facilitates easy understanding and modification by others;
(2)
KG as a database for complex AI models: KGs are constructed as a base layer to support generative AI systems, such as large language models (LLM) and multi-modal models, such that AI algorithms can access, retrieve, and process real-world facts and data in the most efficient way. KGs, with their inherent machine-readable structure, are suited as input databases. Consequently, developing safety management KGs by grounding a generative AI system to assist in construction safety management tasks (e.g., [50,51]) has recently emerged as a research direction;
(3)
New dimensional knowledge discovery: KGs are a part of graph science, making graph attributes and theories directly applicable. Node metrics like degree, centrality, and clustering indicate the significance of nodes and their relation to other nodes. Thus, the development of KGs not only aids in data structuring but also paves the way for gaining new insights from the knowledge base through graph science principles [30,52].
These three research directions highlight the technological trends in advancing KGs in the construction safety management domain, underscoring their role in clarifying complex relationships among project data, enhancing AI model development, and discovering insights from knowledge bases.

5. Analysis of KG Development Process

This section provides a thorough examination of the KG development process as found in the construction safety management literature, essentially discussing the state-of-the-art practices for constructing KGs within this domain. It begins by outlining the most common process for developing KGs. Then, it delves into a comprehensive description of the methods employed in KG development, as discovered through the literature review.

5.1. Overview of KG Development Process

There is no standard procedure for building Knowledge Graphs (KGs) in either computer science or the construction safety domain. The literature analysis shows that, at a high level, the development of a KG for construction safety management typically follows the process depicted in Figure 5.
The first step in developing a Knowledge Graph for construction safety management involves integrating both internal and external project data. In addition to utilizing safety files created on-site and used by stakeholders, some researchers suggest that incorporating external knowledge not directly related to the project is essential for broadening the knowledge base. This external information may include safety data from official publications, similar companies, academic research, and industry management bureaus [53]. Accessing a wider range of information resources aids in refining technical terminology, enhancing semantic context, and enriching domain-specific knowledge.
Ontological templates function as foundational blueprints for constructing a KG, with their design primarily aimed at knowledge sharing and enabling semantic interoperability. This ensures that various parties and computer systems can exchange and repurpose information seamlessly, which is a core principle of KGs. An ontology delineates the intrinsic attributes of entities and their interrelations through a structured set of classes. To minimize information redundancy and complexity in knowledge formats, external data are typically adapted to fit the predefined ontological structure. For example, Wang et al. [53] integrated risk management data from an existing ontology library; Zhu and Luo [54] merged a comprehensive public KG into their specifically designed safety rule KG to enhance information accuracy; and Yang and Liao [38] incorporated The People’s Daily Corpus in the form of semi-structured text data to broaden the knowledge base.
Following the ontological templates, data extraction focuses on formatting the information into nodes and edges, the fundamental elements of KGs. When two triplets share a common node, they form preliminary KGs through node–edge–node relationships. However, overly large and sparse KGs can lead to inefficient use of storage resources and other issues. Common problems include repetitive triplets that convey the same meaning in various forms and a lack of connections between nodes, leading to disconnected segments within the graph. The process of completion checking is crucial for identifying such issues. If problems are detected, graph completion techniques are applied; if not, the KG construction is deemed complete.

5.2. Methods Used in Developing KGs

5.2.1. Scope Identification

In defining the scope for KG development, it is essential to consider the problems that the KG aims to solve and the needs of stakeholders involved in the study, given the limited resources and priority of issues. The domain of safety management inherently encompasses extensive data due to its multifaceted nature [27]. The safety knowledge base in a construction project includes written and graphic documents, as well as raw data from equipment or devices [6]. The vast quantity of data and the varied authority of data sources necessitate filtering to establish a valid foundation [26]. Therefore, articulating the problem statement and prioritizing safety knowledge information based on specific needs are crucial steps in clearly defining the scope of analysis [53].

5.2.2. Ontological Template Development

Developing a safety ontology can follow a top-down approach; for instance, Fang et al. [55] defined classes such as “time, space, event, thing, part, attribute, and attribute-value” based on reports of hazardous events, while Simone et al. [56] conceptualized classes like “event, apparatus, substance, activity, barrier, and people” from near-miss reports. This categorization takes into account relevant standards, specifications, codes, and regulations. The next step involves defining subjects, relationships (predicates), and objects to construct a basic ontology model for each class.
Alternatively, an ontology can be developed from a bottom-up perspective. Leveraging big data and advanced data-processing techniques, valuable insights can be extracted from unstructured data, allowing for the construction of ontologies based on the attributes of safety information. For example, researchers attempted to automate the development of a railway safety risk management ontology using statistical methods and k-means clustering [53]. Additionally, tools like the Jena inference engine assist in ontology construction through various algorithms [38].
As ontological theory in construction safety management evolves, the development of ontologies increasingly adopts hybrid approaches. Chen and Bria’s review [57] indicates that ontology-building methods and their effectiveness have diversified, providing a robust foundation for advancing knowledge graph development.

5.2.3. Data Extraction

In the data extraction phase of ontology development, the process involves navigating through data and filtering out triplets, which can be approached in two ways: extracting entities and relationships separately or using models designed to identify both simultaneously. The choice of approach largely depends on the format of the safety data source. For structured formats like JSON, CSV, RDF, etc. from safety organizations such as OSHA, KOSHA, and SWA, data can be straightforwardly transformed into triplets using data processing tools or packages. For instance, construction accident fatality CSV files from OSHA can be processed into triplets like “<accident_name><caused><number_of_deaths>” after appropriate filtering.
However, unstructured data sources such as news articles, regulations, field notes, and semi-structured sources like Wikipedia necessitate additional steps for processing. For these types of data, the foundational approach involves labeling both entities and relationships to extract them in a manner that aligns with the ontologies. To facilitate this, researchers have utilized a variety of statistical tools and AI models, encompassing machine learning and deep learning models. Table 2 summarizes the common categories of extraction and models employed in the recent literature over the past three years.
As indicated in Table 2, raw data in the construction safety management domain often encompass both documents and pictures, necessitating the use of both computer vision (CV) and natural language processing (NLP) techniques for data extraction. Models such as Mask RCNN are frequently utilized in CV to simultaneously identify entities and spatial relationships, effectively meeting ontological needs. However, deep learning models from the Recurrent Neural Network (RNN) family, including LSTM and GRU, which are typically employed for NLP tasks, might only capture entities or relationships. In instances where entities are missing, they can be supplemented through integration with CV models [59]. Additionally, missing relationships can be annotated based on the attributes of entities. For example, missing comparative, inclusive, or judgmental statements (“higher”, “includes”, “is”, etc.), can be indicated in ontological templates [22,48].
The use of deep learning models for extracting safety information has grown significantly with the advancement of AI technology. LSTM (Long Short-Term Memory), an enhanced version of the RNN model, is adept at capturing information across lengthy sequences of input words or graphical elements [67]. Conversely, Mask RCNN (Mask Region Convolutional Neural Network) is capable of segmenting images, detecting objects, and assessing spatial distances, making it ideal for extracting safety triplets. The introduction of the transformer model has further optimized and facilitated the integration of text and visual data [68]. For capturing contextualized representations of words and sentences, the application of Bidirectional Encoder Representations from Transformers (BERT) has set new benchmarks in NLP [50]. Moreover, the potential of Graph Convolutional Networks (GCN) in analyzing complex relationships within graph-based data suggests promising applications in conjunction with KGs [64].

5.2.4. Knowledge Graph Completion

Extracted triplets are uploaded to KG storage or a Database Management System (DBMS) to generate a preliminary KG. Automatically extracted triplets may include redundant, incomplete, or erroneous elements. Consequently, before utilizing the KG, it is essential to refine, augment, and adjust the initial content. The following four strategies are commonly employed to enhance the completeness of KGs:
(1)
Manual Verification: Common-sense and logical errors are identified and corrected through expert review. This method ensures accuracy by addressing errors that automated systems might overlook [68];
(2)
Filtering: Using filter functions or conditional operations available in most graph-based DBMSs, such as the “MATCH” function in Neo4j, allows users to identify and remove redundant or conflicting entries effectively;
(3)
Terminology Consolidation: Yin et al. [58] designed an entity library to standardize terminology, thereby preventing conflicts and duplications. This library provides a unique representation for each concept and ensures that multiple entities within the same sentence are separated and linked individually;
(4)
Reasoning-Based Graph Completion: This method involves generating new links or connections. For instance, from the relationship <A><BELONG_TO><B><CAUSE><C>, a new link <A><CAUSE><C> can be derived. For more intricate scenarios, graph algorithms are utilized to dynamically predict relationships [38]. Additionally, in the construction safety management domain, various sophisticated approaches are employed for KG reasoning and inference, including logical inference, the development of entity and relation embeddings, and the use of statistical relational learning to refine the network [30].

6. Discussion—Issues and Potential Solutions

6.1. Main Issues with Applying KGs in the Construction Safety Management Domain

A KG integrates data from multiple sources, often with varying formats and schemas that are not as standardized as IFC files from BIM software. For instance, on-site project log documents may employ terminologies that differ from those used in construction standard documents. Additionally, establishing links between on-site CCTV-based image or video data and construction accident records can be problematic due to the absence of a unified data schema, which would include data definitions, labels, and relations. These discrepancies significantly complicate the construction of high-quality KGs. Extracting meaningful information from unstructured data sources is a challenging task, markedly more so than from structured data. Although advances in natural language processing and image processing have opened new possibilities for effectively extracting information from text and images, defining conditions to filter out noisy data remains complex. Consequently, automating the data cleaning process in safety management is both crucial and challenging.
The scientometric analysis of this research shows that regulation-related documents constitute a significant portion of safety management data and serve as crucial references in management practices. The typical language structure of clauses in safety management is often imperative, advising or specifying certain actions within specific contexts. The absence of subjects and the complexity of activation conditions in these clauses compromise the accuracy of automated information extraction, thus complicating the formation of effective rule triplets and the construction of a clear KG. For instance, the OSHA Standard [69] states, “The employer shall assess the workplace to determine if hazards are present, which necessitate the use of personal protective equipment (PPE)”. The complex conditions specified make it challenging to construct effective triplets from this standard.
The foundational elements of a KG are triplets, yet these may sometimes inadequately convey the full meaning of safety management semantics, particularly when generated through automatic extraction. There are two primary limitations in the representational capability of triplets. First, the standard <subject-predicate-object> format often fails to capture all essential elements of safety information. For instance, the statement “workers cannot access the restraint area” appears straightforward but requires additional details to specify the type of workers, effective duration, and exceptions such as “unless…” to clearly define safety actions under specific conditions. Second, ontological models may struggle with complex relationships [57]. For example, a construction project might be associated with multiple safety plans, each containing various procedures. This can lead to complex many-to-many relationships, posing challenges in establishing clear, direct relationships between ontological elements.
Furthermore, professional jargon within the construction domain not only affects text extraction accuracy, but can also lead to KG misunderstandings. For example, “Retrofit” might be used as a relationship in a KG. In construction safety management, it refers to the process of updating existing buildings to meet new safety standards, whereas, in a general context, it simply means adding new components to an existing structure. Overcoming this ambiguity requires precise labeling of nodes and relationships to avoid misinterpretation. Currently, there are limited automated methods for achieving this clarity. Moreover, most KGs use a duplication removal function based on syntax rather than semantics, which can lead to the erroneous deletion of ambiguous nodes and edges.
The applications of KGs in the construction safety management domain necessitate timely additions, deletions, and updates of project information to the graph’s content due to the dynamic nature of construction works and their environments. Additionally, the continuous accumulation of safety information requires ongoing updates in the KGs. However, the development of a dynamic KG that is capable of adapting to data updates is still in its early stages. While some studies have used multiple KGs to represent the progression of events, these graphs are constructed independently based on information from different time points and lack continuity [70].

6.2. Proposed Solutions Based on Current Technologies

In order to address the abovementioned issues, firstly, the data preprocessing step can be optimized through knowledge standardization and/or the use of large language models. For structured data from various sources, researchers and project managers could establish knowledge-sharing standards that regulate terminology, data transmission length, format, and the granularity of information transmission. This would dictate what knowledge should be shared within a construction project and how. Additionally, large language models have shown exceptional capabilities in information extraction and filtering. By inputting domain-specific data, construction safety researchers and practitioners could fine-tune a model to help filter out irrelevant information and use prompting engineering to customize outputs into Resource Description Framework (RDF) or JavaScript Object Notation (JSON) formats for input into a KG.
To address the challenge of inadequate KG representation in complex conditions, developing multiple KGs for different aspects of a task could be an approach. If representing the entire safety management process of a project is challenging, it could be segmented into subdivisions and sub-items, with multiple KGs developed for each. This allows each KG to contain a more focused body of information, functioning effectively as a module. Conflicting terms or triplets could be localized within these modules to prevent misrepresentation of the entire project information. For complicated sentences, converting them into multiple triplets within a KG and organizing them into a hierarchical KG could be a solution.
Generative models for graphs offer a promising approach to facilitate evolution within construction safety management KGs. By developing a deep learning model based on existing graph data, these models can generate a new graph that closely resembles the actual data structure found in construction projects. This capability is expected to be particularly valuable in environments where safety conditions and regulatory requirements frequently change, requiring KGs to adapt and update continuously. Furthermore, by comparing the generated KG with one constructed from real-world construction data, researchers can enhance model performance, ensuring that the KG remains effective in representing complex safety management knowledge and solutions. This approach not only mitigates the challenges of outdated information but also simplifies the management of extensive datasets typically encountered in construction safety management, thereby maintaining the integrity and utility of the safety management system.

7. Conclusions

This paper provides a systematic literature review on the topic of the creation and utilization of KGs for construction safety management. A total of 139 relevant publications were sourced from the Scopus and Web of Science databases. The research involved co-occurrence and cluster analysis of keywords to identify prevalent topics. The research also involved categorizing KGs used in the construction safety management domain such as static versus dynamic KGs, single versus multiple source KGs, and domain-specific versus general KGs. The development process of KGs was also examined. The paper detailed the methods used in each phase of KG development, such as scope identification, ontological model construction, data extraction, and completion. The main contributions of this study are as follows:
(1)
Identification of research trends: Charted the evolution and current state of KG applications in the construction safety management domain, highlighting predominant themes, technological integrations, and methodological approaches;
(2)
Methodological framework: Developed a structured methodology for creating KGs tailored to construction safety, covering essential stages like scope identification, ontological modeling, data extraction, and KG completion;
(3)
Categorization of KGs: Identified and categorized various types of KGs in the domain, such as static vs. dynamic KGs and single vs. multiple source KGs, providing clarity on their applicability and utility;
(4)
Integration techniques: Discussed the integration of KGs with advanced data science techniques such as graph-based learning algorithms, which enhance the depth and effectiveness of safety data analysis.
The application of KGs in construction safety management is rapidly evolving. Beyond risk factor identification, safety recommendation systems, and risk assessments, new application directions are emerging. However, the application and development of KGs in construction safety management face several challenges, including the complexity of data cleaning, difficulty of extracting graph structures from unstructured data sources such as texts, the insufficient representativeness of ontology models, handling language ambiguities, and the high demand for dynamicity of KGs. This paper indicates that various data science techniques offer promising solutions to these issues, suggesting that KG applications could benefit from integrating large language models, a common data environment, and graph deep learning models. Based on the analysis results, three future research directions are identified as follows and these future directions will be pursued in the authors’ future research:
(1)
Integration of KGs with graph-based learning algorithms: Graph theory has not been fully exploited within the safety domain. Graph-based deep learning models, such as Graph Convolutional Networks and Graph Neural Networks, could be instrumental in conducting deeper, more effective analyses of construction safety knowledge data. These models are well-suited for analyzing complex interdependencies within safety management systems, potentially leading to more sophisticated predictive safety models;
(2)
Integration of KGs into a common data environment for construction projects: The common data environment in the construction industry serves as a centralized platform for managing and sharing data across all stages of a construction project. Integrating KGs into this environment can enhance the manageability and analysis of construction data, supporting dynamic updates and information sharing. This integration can help in maintaining up-to-date safety protocols and ensuring compliance with safety regulations, thus enhancing overall project safety management;
(3)
Dynamic KGs for real-time safety management: The construction industry requires KGs that can adapt to rapid changes in the environment and project conditions. Developing KGs that incorporate time-series data and that can evolve in real time could significantly improve the responsiveness of safety management systems. Future research should explore methods to integrate real-time data updates into KGs, allowing for continuous adaptation and learning. This could involve developing new frameworks or algorithms that enable KGs to dynamically update and predict potential safety issues based on ongoing construction activities.

Author Contributions

Conceptualization, S.A. and F.K.; methodology, F.K.; software, F.K.; validation, S.A.; formal analysis, S.A.; investigation, F.K.; data curation, F.K.; writing—original draft preparation, F.K.; writing—review and editing, S.A.; visualization, F.K.; supervision, S.A.; project administration, S.A.; funding acquisition, S.A. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Research Foundation of Korea Grant funded by the Korean Government (No. NRF-2022R1F1A1074448) and 2023 Hongik University Research Fund.

Data Availability Statement

Data will be made available on request.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. U.S. Bureau of Labor Statistics. A Look at Workplace Deaths, Injuries, and Illnesses on Workers’ Memorial Day. 2022. Available online: https://www.bls.gov/opub/ted/2022/a-look-at-workplace-deaths-injuries-and-illnesses-on-workers-memorial-day.htm (accessed on 25 July 2023).
  2. Health and Safety Executive. 2020–21 Names and Details of Fatalities. 2022. Available online: https://www.hse.gov.uk/foi/fatalities/2020-21.htm (accessed on 25 July 2023).
  3. Ministry of Housing and Urban-Rural Development of the People’s Republic of China. Notice from the General Office of the Ministry of Housing and Urban-Rural Development on the Production Safety Accidents in Housing and Municipal Engineering in 2020. 2022. Available online: https://www.mohurd.gov.cn/gongkai/zhengce/zhengcefilelib/202210/20221026_768565.html (accessed on 25 July 2023). (In Chinese)
  4. Office for National Statistics (ONS), Released 28 November 2023, ONS Website, Article, Construction Statistics, Great Britain. 2022. Available online: https://www.ons.gov.uk/businessindustryandtrade/constructionindustry/articles/constructionstatistics/2022#:~:text=excluding%20self%2Demployment-,The%20number%20of%20construction%2Drelated%20employees%20(excluding%20self%2Demployment,Scotland%20both%20increased%20by%202.0%25 (accessed on 30 November 2023).
  5. Holt, A.S.J. Principles of Construction Safety; John Wiley & Sons: Hoboken, NJ, USA, 2008; ISBN 978-0-470-68024-7. [Google Scholar]
  6. Fang, W.; Ding, L.; Love, P.E.D.; Luo, H.; Li, H.; Peña-Mora, F.; Zhong, B.; Zhou, C. Computer vision applications in construction safety assurance. Autom. Constr. 2020, 110, 103013. [Google Scholar] [CrossRef]
  7. Ye, Z.; Ye, Y.; Zhang, C.; Zhang, Z.; Li, W.; Wang, X.; Wang, L.; Wang, L. A digital twin approach for tunnel construction safety early warning and management. Comput. Ind. 2023, 144, 103783. [Google Scholar] [CrossRef]
  8. Kong, F.; Ahn, S.; Seo, J.; Kim, T.W.; Huang, Y. Beacon-Based Individualized Hazard Alarm System for Construction Sites: An Experimental Study on Sensor Deployment. Appl. Sci. 2021, 11, 11654. [Google Scholar] [CrossRef]
  9. Ale, B.J.M.; Bellamy, L.J.; Baksteen, H.; Damen, M.; Goossens, L.H.J.; Hale, A.R.; Mud, M.; Oh, J.; Papazoglou, I.A.; Whiston, J.Y. Accidents in the construction industry in the Netherlands: An analysis of accident reports using Storybuilder. Reliab. Eng. Syst. Saf. 2008, 93, 1523–1533. [Google Scholar] [CrossRef]
  10. Tran, S.V.-T.; Nguyen, T.L.; Chi, H.-L.; Lee, D.; Park, C. Generative planning for construction safety surveillance camera installation in 4D BIM environment. Autom. Constr. 2022, 134, 104103. [Google Scholar] [CrossRef]
  11. Jin, R.; Zou, P.X.W.; Piroozfar, P.; Wood, H.; Yang, Y.; Yan, L.; Han, Y. A science mapping approach based review of construction safety research. Saf. Sci. 2019, 113, 285–297. [Google Scholar] [CrossRef]
  12. Zhou, Z.; Goh, Y.M.; Shen, L. Overview and Analysis of Ontology Studies Supporting Development of the Construction Industry. J. Comput. Civ. Eng. 2016, 30, 04016026. [Google Scholar] [CrossRef]
  13. Ehrlinger, L.; Wolfram, W. Towards a Definition of Knowledge Graphs. Semant. Posters Demos SuCCESS 2016, 48, 2. [Google Scholar]
  14. Ji, S.; Pan, S.; Cambria, E.; Marttinen, P.; Yu, P.S. A Survey on Knowledge Graphs: Representation, Acquisition, and Applications. IEEE Trans. Neural Netw. Learn. Syst. 2022, 33, 494–514. [Google Scholar] [CrossRef]
  15. Chen, X.; Jia, S.; Xiang, Y. A review: Knowledge reasoning over knowledge graph. Expert Syst. Appl. 2020, 141, 112948. [Google Scholar] [CrossRef]
  16. Li, X.; Lyu, M.; Wang, Z.; Chen, C.-H.; Zheng, P. Exploiting knowledge graphs in industrial products and services: A survey of key aspects, challenges, and future perspectives. Comput. Ind. 2021, 129, 103449. [Google Scholar] [CrossRef]
  17. Gutierrez, C.; Sequeda, J.F. Knowledge graphs. Commun. ACM 2021, 64, 96–104. [Google Scholar] [CrossRef]
  18. Wu, Y.; Jiang, B.; Lu, N.; Zhou, Y. Bayesian network based fault prognosis via bond graph modeling of high-speed railway traction device. Math. Probl. Eng. 2015, 2015, 321872. [Google Scholar] [CrossRef]
  19. Cheng, Y.; Jiao, Y.; Wei, W.; Wu, Z. Research on construction method of knowledge graph in the civil aviation security field. In Proceedings of the 2019 IEEE 1st International Conference on Civil Aviation Safety and Information Technology (ICCASIT), Kunming, China, 17–19 October 2019; pp. 556–559. [Google Scholar] [CrossRef]
  20. Zhao, Q.; Li, Q.; Wen, J. Construction and application research of knowledge graph in aviation risk field. MATEC Web Conf. 2018, 151, 05003. [Google Scholar] [CrossRef]
  21. Wang, X.; Wang, J.; Han, J. Knowledge graph construction for railway electrical accident analysis. In Proceedings of the 2019 International Conference on Machine Learning, Big Data and Business Intelligence (MLBDBI), Taiyuan, China, 8–10 November 2019; Institute of Electrical and Electronics Engineers Inc.: Piscataway, NJ, USA, 2019; pp. 214–219. [Google Scholar] [CrossRef]
  22. Liu, C.; Yang, S. A text mining-based approach for understanding Chinese railway incidents caused by electromagnetic interference. Eng. Appl. Artif. Intell. 2023, 117, 105598. [Google Scholar] [CrossRef]
  23. Xiao, N.; Peng, B.; Li, X.; Wu, J.; Lou, J.; Si, Y. Research on the construction and implementation of power grid fault handling knowledge graphs. Energy Rep. 2023, 9, 182–189. [Google Scholar] [CrossRef]
  24. Jiang, Z.; Long, H. Knowledge Graph Construction for Electric Substation Based on Multiple-source Data. In Proceedings of the 2022 IEEE 5th International Electrical and Energy Conference (CIEEC), Nangjing, China, 27–29 May 2022; Institute of Electrical and Electronics Engineers Inc.: Piscataway, NJ, USA, 2022; pp. 4826–4830. [Google Scholar] [CrossRef]
  25. Chatterjee, A.; Nardi, C.; Oberije, C.; Lambin, P. Knowledge Graphs for COVID-19: An Exploratory Review of the Current Landscape. J. Pers. Med. 2021, 11, 300. [Google Scholar] [CrossRef]
  26. Jiang, Y.; Gao, X.; Su, W.; Li, J. Systematic knowledge management of construction safety standards based on knowledge graphs: A case study in China. Int. J. Environ. Res. Public. Health 2021, 18, 692. [Google Scholar] [CrossRef]
  27. Chen, S.; Demachi, K.; Dong, F. Graph-based linguistic and visual information integration for on-site occupational hazards identification. Autom. Constr. 2022, 137, 104191. [Google Scholar] [CrossRef]
  28. Farghaly, K.; Soman, R.K.; Collinge, W.; Mosleh, M.H.; Manu, P.; Cheung, C.M. Construction safety ontology development and alignment with industry foundation classes (IFC). J. Inf. Technol. Constr. 2022, 27, 94–108. [Google Scholar] [CrossRef]
  29. Cai, S.; Ma, Z.; Skibniewski, M.J.; Bao, S. Construction automation and robotics for high-rise buildings over the past decades: A comprehensive review. Adv. Eng. Inform. 2019, 42, 100989. [Google Scholar] [CrossRef]
  30. Yan, J.; Wang, C.; Cheng, W.; Gao, M.; Zhou, A. A retrospective of knowledge graphs. Front. Comput. Sci. 2018, 12, 55–74. [Google Scholar] [CrossRef]
  31. Page, M.J.; McKenzie, J.E.; Bossuyt, P.M.; Boutron, I.; Hoffmann, T.C.; Mulrow, C.D.; Shamseer, L.; Tetzlaff, J.M.; Akl, E.A.; Brennan, S.E.; et al. The PRISMA 2020 statement: An updated guideline for reporting systematic reviews. BMJ 2021, 372, n71. [Google Scholar] [CrossRef]
  32. Waltman, L.; van Eck, N.J.; Noyons, E.C.M. A unified approach to mapping and clustering of bibliometric networks. J. Inform. 2010, 4, 629–635. [Google Scholar] [CrossRef]
  33. Jayasree, V.; Baby, M.D. Scientometrics: Tools, Techniques and Software for Analysis. Indian J. Inf. Sources Serv. 2019, 9, 116–121. [Google Scholar] [CrossRef]
  34. Chen, C.; Song, M. Visualizing a field of research: A methodology of systematic scientometric reviews. PLoS ONE 2019, 14, e0223994. [Google Scholar] [CrossRef] [PubMed]
  35. Newman, M.E.J. Modularity and community structure in networks. Proc. Natl. Acad. Sci. USA 2006, 103, 8577–8582. [Google Scholar] [CrossRef]
  36. Rousseeuw, P.J. Silhouettes: A graphical aid to the interpretation and validation of cluster analysis. J. Comput. Appl. Math. 1987, 20, 53–65. [Google Scholar] [CrossRef]
  37. Leblay, J.; Chekol, M.W. Deriving Validity Time in Knowledge Graph. In Proceedings of the Companion Proceedings of the Web Conference 2018, Lyon, France, 23–27 April 2018; International World Wide Web Conferences Steering Committee: Geneva, Switzerland, 2018; pp. 1771–1776. [Google Scholar] [CrossRef]
  38. Yang, B.; Liao, Y.-M. Research on enterprise risk knowledge graph based on multi-source data fusion. Neural Comput. Appl. 2022, 34, 2569–2582. [Google Scholar] [CrossRef]
  39. Duan, J.; Liu, H.; Gong, W.; Lyu, L. Crowd Evacuation Under Real Data: A Crowd Congestion Control Method Based on Sensors and Knowledge Graph. IEEE Sens. J. 2023, 23, 8923–8931. [Google Scholar] [CrossRef]
  40. Chen, C.; Hu, J.; Han, Z.; Chen, Y.; Xiao, S. Knowledge graph based early warning method for accident evolution in natural gas pipeline station abroad for harsh environmental conditions. Qinghua Daxue Xuebao/J. Tsinghua Univ. 2022, 62, 1081–1087. [Google Scholar] [CrossRef]
  41. Tang, W.; Zhang, X.; Feng, D.; Wang, Y.; Ye, P.; Qu, H. Knowledge graph of alpine skiing events: A focus on meteorological conditions. PLoS ONE 2022, 17, e0274164. [Google Scholar] [CrossRef]
  42. Ding, Y.; Liu, M.; Luo, X. Safety compliance checking of construction behaviors using visual question answering. Autom. Constr. 2022, 144, 104580. [Google Scholar] [CrossRef]
  43. Huang, Y.; Yin, P.; Zhou, G.; Liu, P.; Tang, Y.; Li, W. Construction of public safety knowledge graphs. In Proceedings of the 2020 International Conference on Computer, Information and Telecommunication Systems (CITS), Hangzhou, China, 5–7 October 2020; Obaidat, M.S., Hsiao, K.-F., Nicopolitidis, P., Cascado-Caballero, D., Eds.; Institute of Electrical and Electronics Engineers Inc.: Piscataway, NJ, USA, 2020. [Google Scholar] [CrossRef]
  44. Hughes, P.; Robinson, R.; Figueres-Esteban, M.; van Gulijk, C. Extracting safety information from multi-lingual accident reports using an ontology-based approach. Saf. Sci. 2019, 118, 288–297. [Google Scholar] [CrossRef]
  45. Mao, S.; Zhao, Y.; Chen, J.; Wang, B.; Tang, Y. Development of process safety knowledge graph: A Case study on delayed coking process. Comput. Chem. Eng. 2020, 143, 107094. [Google Scholar] [CrossRef]
  46. Wang, H.; Zhu, H.; Bai, Y. Construction of causality event evolutionary graph of aviation accident. In Proceedings of the 2019 5th International Conference on Transportation Information and Safety (ICTIS), Liverpool, UK, 14–17 July 2019; Institute of Electrical and Electronics Engineers Inc.: Piscataway, NJ, USA, 2019; pp. 692–697. [Google Scholar] [CrossRef]
  47. Kim, W.; Lee, J.; Ahn, S. Proposal of a Workflow Model for Generating a Construction Safety Guideline Knowledge Graph. J. Constr. Autom. Robot. 2022, 1, 46–54. [Google Scholar] [CrossRef]
  48. Liu, C.; Yang, S. Using Text Mining to Establish Knowledge Graph from Accident/Incident Reports in Risk Assessment. Expert Syst. Appl. 2022, 207, 117991. [Google Scholar] [CrossRef]
  49. Tiddi, I.; Schlobach, S. Knowledge graphs as tools for explainable machine learning: A survey. Artif. Intell. 2022, 302, 103627. [Google Scholar] [CrossRef]
  50. Devlin, J.; Chang, M.-W.; Lee, K.; Toutanova, K. BERT: Pre-Training of Deep Bidirectional Transformers for Language Understanding. Available online: https://arxiv.org/abs/1810.04805v2 (accessed on 19 July 2023).
  51. Liu, J.; Luo, H.; Liu, H. Deep learning-based data analytics for safety in construction. Autom. Constr. 2022, 140, 104302. [Google Scholar] [CrossRef]
  52. Vermuyten, H.; Belien, J.; De Boeck, L.; Reniers, G.; Wauters, T. A review of optimisation models for pedestrian evacuation and design problems. Saf. Sci. 2016, 87, 167–178. [Google Scholar] [CrossRef]
  53. Wang, Y.; Wang, Y.; Ye, Y. A Novel Method for Constructing Knowledge Graph of Railway Safety Risk. In Proceedings of the ACM International Conference on Computer Science and Application Engineering, Online, 21–23 October 2022; Emrouznejad, A., Ed.; Association for Computing Machinery: New York, NY, USA, 2022. [Google Scholar] [CrossRef]
  54. Zhu, Y.; Luo, X. A Knowledge Graph for Automated Construction Workers’ Safety Violation Identification. In Proceedings of the 39th International Symposium on Automation and Robotics in Construction (ISARC 2022), Bogota, Colombia, 12–15 July 2022; pp. 312–319. [Google Scholar]
  55. Fang, W.; Ma, L.; Love, P.E.D.; Luo, H.; Ding, L.; Zhou, A. Knowledge graph for identifying hazards on construction sites: Integrating computer vision with ontology. Autom. Constr. 2020, 119, 103310. [Google Scholar] [CrossRef]
  56. Simone, F.; Ansaldi, S.M.; Agnello, P.; Patriarca, R. Industrial safety management in the digital era: Constructing a knowledge graph from near misses. Comput. Ind. 2023, 146, 103849. [Google Scholar] [CrossRef]
  57. Chen, W.T.; Bria, T.A. A Review of Ontology-Based Safety Management in Construction. Sustainability 2023, 15, 413. [Google Scholar] [CrossRef]
  58. Yin, Z.; Shi, L.; Yuan, Y.; Tan, X.; Xu, S. A Study on a Knowledge Graph Construction Method of Safety Reports for Process Industries. Processes 2023, 11, 146. [Google Scholar] [CrossRef]
  59. Wang, X.; El-Gohary, N. Deep learning-based relation extraction and knowledge graph-based representation of construction safety requirements. Autom. Constr. 2023, 147, 104696. [Google Scholar] [CrossRef]
  60. Li, Y.; Wei, H.; Han, Z.; Jiang, N.; Wang, W.; Huang, J. Computer Vision-Based Hazard Identification of Construction Site Using Visual Relationship Detection and Ontology. Buildings 2022, 12, 857. [Google Scholar] [CrossRef]
  61. Liu, X.; Jing, X.; Zhu, Q.; Du, W.; Wang, X. Automatic Construction Hazard Identification Integrating On-Site Scene Graphs with Information Extraction in Outfield Test. Buildings 2023, 13, 377. [Google Scholar] [CrossRef]
  62. Wang, Y.; Xiao, B.; Bouferguene, A.; Al-Hussein, M.; Li, H. Vision-based method for semantic information extraction in construction by integrating deep learning object detection and image captioning. Adv. Eng. Inform. 2022, 53, 101699. [Google Scholar] [CrossRef]
  63. Wu, H.; Zhong, B.; Li, H.; Love, P.; Pan, X.; Zhao, N. Combining computer vision with semantic reasoning for on-site safety management in construction. J. Build. Eng. 2021, 42, 103036. [Google Scholar] [CrossRef]
  64. Pan, X.; Zhong, B.; Wang, Y.; Shen, L. Identification of accident-injury type and bodypart factors from construction accident reports: A graph-based deep learning framework. Adv. Eng. Inform. 2022, 54, 101752. [Google Scholar] [CrossRef]
  65. Zhang, L.; Wang, J.; Wang, Y.; Sun, H.; Zhao, X. Automatic construction site hazard identification integrating construction scene graphs with BERT based domain knowledge. Autom. Constr. 2022, 142, 104535. [Google Scholar] [CrossRef]
  66. Staudemeyer, R.C.; Morris, E.R. Understanding LSTM—A Tutorial into Long Short-Term Memory Recurrent Neural Networks. Available online: https://arxiv.org/abs/1909.09586v1 (accessed on 19 July 2023).
  67. Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.N.; Kaiser, Ł.; Polosukhin, I. Attention is All you Need. In Proceedings of the Advances in Neural Information Processing Systems, Long Beach, CA, USA, 4–9 December 2019; Curran Associates, Inc.: Red Hook, NY, USA, 2017; Volume 30. [Google Scholar]
  68. Peng, F.-L.; Qiao, Y.-K.; Yang, C. Building a knowledge graph for operational hazard management of utility tunnels. Expert Syst. Appl. 2023, 223, 119901. [Google Scholar] [CrossRef]
  69. OSHA 1910.132; General Requirements. U.S. Department of Labor: Washington, DC, USA, 18 November 2016.
  70. Ren, H.; Chen, Z.; Jiang, Z.; Yang, C.; Gui, W. An Industrial Multilevel Knowledge Graph-Based Local-Global Monitoring for Plant-Wide Processes. IEEE Trans. Instrum. Meas. 2021, 70, 3526715. [Google Scholar] [CrossRef]
Figure 1. Example of KG about text-format construction safety regulation.
Figure 1. Example of KG about text-format construction safety regulation.
Information 15 00390 g001
Figure 2. Prisma flow diagram.
Figure 2. Prisma flow diagram.
Information 15 00390 g002
Figure 3. Keyword co-occurrence networks.
Figure 3. Keyword co-occurrence networks.
Information 15 00390 g003
Figure 4. Keyword clusters.
Figure 4. Keyword clusters.
Information 15 00390 g004
Figure 5. Process of KG development.
Figure 5. Process of KG development.
Information 15 00390 g005
Table 1. Publication statistics (2016–2023).
Table 1. Publication statistics (2016–2023).
Source Total20192020202120222023
Top-8 Most Productive JournalsAutomation in Construction1423153
Advanced Engineering Informatics712031
Buildings500032
Journal of Management in Engineering200002
Computer in Industry200011
Engineering Applications of Artificial Intelligence200002
Expert System with Applications200011
Mathematical Problems in Engineering100010
Total Journals -9341173536
Proceedings -43939166
Total
Publications
-1361314165142
Table 2. Data extraction details from publications from the last 3 years.
Table 2. Data extraction details from publications from the last 3 years.
Publication YearPublication TitleRaw Data Extraction ApproachModels
2023A study on a KG construction method of safety reports for process ndustries [58]Safety reportsEntities & RelationshipsBERT-BiLSTM-CRF-TFIDF
2023Deep learning-based relation extraction and knowledge graph-based representation of construction safety requirements [59]OSHA sectionsEntities & RelationshipsAttention-based CNN-perceptron + Attention-based BiLSTM-perceptron
2023Industrial safety management in the digital era: Constructing a knowledge graph from near misses [56]Near missing reportsEntities & Relationshipsdid not specified
2023Building a knowledge graph for operational hazard management of utility tunnels [60]Normative documents + Hazard description text + Control measure description textEntitiesBiLSTM-CRF
2023Automatic construction hazard identification integrating on-site scene graphs with information extraction in outfield test [61]On-site scene graphs + Chinese safety regulation datasetEntities & RelationshipsBERT-BIEO-Mask RCNN
2023A text mining-based approach for understanding Chinese railway incidents caused by electromagnetic interference [22]Chinese railway incidents reportsEntitiesCNN-BiLSTM-BERT
2022Automatic construction site hazard identification, integrating construction scene graphs with BERT based domain knowledge [62]On-site scene graphs Entities & RelationshipsMask RCNN-Transformer-C BERT
2022Vision-based method for semantic information extraction in construction by integrating deep learning object detection and image captioning [63]On-site scene graphs + Safety regulationsEntities & RelationshipsMask RCNN- Attention-based LSTM
2022A novel method for constructing knowledge graph of railway safety risk [53]Railway safety documents + Railway safety risk text documents + Public works dataEntities & RelationshipsCNN
2022Identification of accident-injury type and bodypart factors from construction accident reports: A graph-based deep learning framework [64]Accident reportsEntities & RelationshipsGCN + Co-occurrence network
2022Using text mining to establish knowledge graph from accident/incident reports in risk assessment [48]Accident/incident reportsEntitiesHMM-BiLSTM-CRF
2022Computer vision-based hazard identification of construction site using visual relationship detection and ontology [65]Visual Relationship DatasetEntities & RelationshipsVisual Translation Embedding
2021Combining computer vision with semantic reasoning for on-site safety management in construction [66]On-site scene graphs Entities & RelationshipsMask RCNN
2020Knowledge graph for identifying hazards on construction sites: Integrating computer vision with ontology [55] Safety hazard reports + related imagesEntities & RelationshipsMask RCNN
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Kong, F.; Ahn, S. Use of Knowledge Graphs for Construction Safety Management: A Systematic Literature Review. Information 2024, 15, 390. https://doi.org/10.3390/info15070390

AMA Style

Kong F, Ahn S. Use of Knowledge Graphs for Construction Safety Management: A Systematic Literature Review. Information. 2024; 15(7):390. https://doi.org/10.3390/info15070390

Chicago/Turabian Style

Kong, Fansheng, and Seungjun Ahn. 2024. "Use of Knowledge Graphs for Construction Safety Management: A Systematic Literature Review" Information 15, no. 7: 390. https://doi.org/10.3390/info15070390

APA Style

Kong, F., & Ahn, S. (2024). Use of Knowledge Graphs for Construction Safety Management: A Systematic Literature Review. Information, 15(7), 390. https://doi.org/10.3390/info15070390

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop