A Novel Approach for the Analysis of Ship Pollution Accidents Using Knowledge Graph
Abstract
:1. Introduction
1.1. Context and Motivations
- Although there are literature records of past ship pollution accidents, these data may be incomplete, so much so that we need more information investigation and data collection to ensure the completeness and accuracy of these data.
- Ship pollution accidents are usually distributed in many sea areas with different jurisdictional areas, and relevant information may be scattered in different accident databases. Therefore, it is necessary to integrate this cross-regional accident information into a database through the method of a knowledge graph in order to facilitate more effective analyses.
- There are shortcomings in the research on the application and expansion of ship pollution accidents. At present, knowledge graph technology is rarely used in the literature to analyze ship pollution accidents.
1.2. Research Questions and Contributions
- We obtain data information from multiple sources, such as public databases, government reports, and news reports, integrate and clean them up, and use natural language processing techniques to clean and standardize the data to ensure consistency and accuracy.
- We use natural language processing and machine learning techniques to extract relationships between entities from the text data. For example, information about the type of pollution accident, occurrence time, location, pollution type, oil leakage amount, etc., is extracted, and then we use this information to construct a knowledge graph.
- We use visualization tools to present the constructed knowledge graph and use Cypher language for case retrieval analysis. Through this method, users can intuitively understand the relevant information of ship pollution accidents and conduct further analysis and mining. Finally, we use centrality algorithms to analyze the causes of ship pollution accidents and provide relevant suggestions.
2. Literature Review
2.1. Ship Pollution Incidents
2.2. Knowledge Graphs
3. The Fundamental Theory of Knowledge Graphs
3.1. Basic Concepts
3.2. Construction Process
3.3. Knowledge Storage Method
4. Design Framework of Knowledge Graph for Marine Ship Pollution Accidents
4.1. Pattern Layer
4.2. Data Layer
4.3. Management Application Layer
4.4. Data Source Validation
4.5. Entity Recognition Implementation Based on Rule Matching
5. Construction Process of Marine Ship Pollution Accident Knowledge Graph
5.1. Text Vectorization
- (1)
- Skip-Gram Model
- (2)
- CBOW Model
5.2. Knowledge Extraction
5.2.1. Entity Extraction Based on BERT–BiLSTM–CRF
- (1)
- BERT Layer
- (2)
- BiLSTM Layer
- (3)
- CRF Layer
5.2.2. Relationship Extraction Based on BiLSTM–CRF
5.2.3. Experimental Environment
5.2.4. Model Parameter Configuration Table
5.2.5. Model Evaluation Metrics for Extraction
5.3. Knowledge Graph Visualization
5.4. Query Performance Response Time Sensitivity Test
6. Accident Case Retrieval and Causal Analysis
6.1. Accident Case Retrieval
6.2. Accident Cause Analysis
- (1)
- Direct causes: According to the table, the centrality and causality degrees of “lack of lookout”, “failure to make a comprehensive assessment of the situation and collision risk”, and “failure to fulfill the obligation of the give-way ship” are ranked at the top, indicating that these three nodes are the most important in the knowledge graph. Their influence is also ranked at the top, indicating that the node has the highest degree of influence on other nodes and is least affected by other influences. In view of these reasons, crew safety training, management duty, and focus at work should be strengthened to reduce accidents.
- (2)
- Indirect causes: According to the table, the centrality and causality degree of indirect causes caused by direct causes such as “not equipped with a sufficient number of qualified crew members”, “crew certificates do not meet the grade requirements”, and “safety management system fails to operate effectively” are ranked at the top, and the influence value is high, which shows that indirect causes also play a decisive role in the occurrence of pollution accidents. Therefore, in response to these problems, the management of shipping companies, supervision by law enforcement departments, and crew training should be strengthened to prevent similar accidents from happening again.
- (3)
- Objective causes: According to the table, the affectedness degree of accidents caused by factors such as “poor visibility” and “stormy weather” is 0, but the centrality is greater than 1, indicating that they are the main objective factors leading to pollution accidents. However, the causality degree is less than 0, indicating that they do not dominate a large number of pollution accidents. In view of these objective factors, it is necessary to conduct a careful weather and route analysis before the ship sails and choose a suitable and safe route to minimize the occurrence of accidents caused by objective factors.
7. Discussion
8. Conclusions
8.1. Management Significance
- Visualizing the relationship between ship pollution accidents. Knowledge graph visualization can visually display key nodes and relationships in ship pollution accidents, such as ship parameters, pollution types, oil spills, etc. Management personnel can also quickly understand the overall information and development trends of pollution accidents through visual graphs, helping them better handle pollution accidents.
- Case retrieval and experience sharing. Management personnel can easily discover similar cases of ship pollution accidents in history and learn relevant experiences and lessons by using the case retrieval function through knowledge graphs. This helps managers to develop more timely response measures and emergency plans, improving their ability and efficiency in responding to ship pollution accidents.
- Centrality algorithm analysis. By using centrality algorithms to analyze nodes in a knowledge graph, managers can identify nodes that hold significant importance and influence in the graph. These nodes may be ships with frequent pollution accidents, key nodes where pollution accidents occur, etc. Managers can focus on monitoring and managing these key nodes to improve the monitoring and response capabilities of ship pollution accidents.
- Decision support and policy making. Based on the visualization and analysis results of the knowledge graph, managers can formulate management policies and response measures for ship pollution accidents more scientifically. By deeply understanding the key factors and influencing factors of pollution accidents, managers can formulate targeted policies and measures to improve the prevention and control of ship pollution accidents.
- Data-driven management decisions. Knowledge graph visualization and analysis provide managers with data-based decision support, enabling them to make management decisions more rationally and scientifically. Through in-depth analysis of data in the knowledge graph, managers can adjust and optimize management strategies in a timely manner to improve management efficiency and decision-making.
8.2. Limitations and Research Prospects
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Michel, J.; Fingas, M. Oil Spills: Causes, consequences, prevention, and countermeasures. Foss. Fuels Curr. Status Future Dir. 2016, 1, 159–201. [Google Scholar]
- Irannezhad, M.; Ahmadi, B.; Liu, J.; Chen, D.; Matthews, J.H. Global water security: A shining star in the dark sky of achieving the sustainable development goals. Sustain. Horiz. 2022, 1, 100005. [Google Scholar] [CrossRef]
- Deng, J.; Liu, S.; Shu, Y.; Hu, Y.; Xie, C.; Zeng, X. Risk evolution and prevention and control strategies of maritime accidents in China’s coastal areas based on complex network models. Ocean Coast. Manag. 2023, 237, 106527. [Google Scholar] [CrossRef]
- Chen, J.; Zhang, S.; Xu, L.; Wan, Z.; Fei, Y.; Zheng, T. Identification of key factors of ship detention under Port State Control. Mar. Policy 2019, 102, 21–27. [Google Scholar] [CrossRef]
- Heij, C.; Bijwaard, G.E.; Knapp, S. Ship inspection strategies: Effects on maritime safety and environmental protection. Transp. Res. Part D Transp. Environ. 2011, 16, 42–48. [Google Scholar] [CrossRef]
- Zhang, W.; Li, C.; Chen, J.; Wan, Z.; Shu, Y.; Song, L.; Xu, L.; Di, Z. Governance of global vessel-source marine oil spills: Characteristics and refreshed strategies. Ocean Coast. Manag. 2021, 213, 105874. [Google Scholar] [CrossRef]
- Wang, Z.; Zhou, Y.; Wang, T. Dynamic Risk Assessment of Oil Spill Accident on Offshore Platform Based on the Bayesian Network. IEEE Trans. Eng. Manag. 2023, 71, 9188–9201. [Google Scholar] [CrossRef]
- Han, M.; Fan, C.; Huang, S.; Hu, K.; Fan, E. An n-valued neutrosophic set method for the assessment of an offshore oil spill risk. Water Sci. Technol. 2023, 87, 1643–1659. [Google Scholar] [CrossRef]
- Sevgili, C.; Fiskin, R.; Cakir, E. A data-driven Bayesian Network model for oil spill occurrence prediction using tankship accidents. J. Clean. Prod. 2022, 370, 133478. [Google Scholar] [CrossRef]
- Jing, S.; Yujun, T.; Lili, R. Frequency and Causes of Oil Spill Accidents from Ships and Storage Tanks in Quanzhou, China. J. Resour. Ecol. 2023, 14, 391–398. [Google Scholar] [CrossRef]
- Tang, C.; Yi, Y.; Yang, Z.; Sun, J. Risk analysis of emergent water pollution accidents based on a Bayesian Network. J. Environ. Manag. 2016, 165, 199–205. [Google Scholar] [CrossRef]
- Hou, D.; Ge, X.; Huang, P.; Zhang, G.; Loáiciga, H. A real-time, dynamic early-warning model based on uncertainty analysis and risk assessment for sudden water pollution accidents. Environ. Sci. Pollut. Res. 2014, 21, 8878–8892. [Google Scholar] [CrossRef]
- Zheng, H.; Lei, X.; Shang, Y.; Duan, Y.; Kong, L.; Jiang, Y.; Wang, H. Sudden water pollution accidents and reservoir emergency operations: Impact analysis at Danjiangkou Reservoir. Environ. Technol. 2018, 39, 787–803. [Google Scholar] [CrossRef] [PubMed]
- Dong, L.; Liu, J.; Du, X.; Dai, C.; Liu, R. Simulation-based risk analysis of water pollution accidents combining multi-stressors and multi-receptors in a coastal watershed. Ecol. Indic. 2018, 92, 161–170. [Google Scholar] [CrossRef]
- Huang, D.; Liang, T.; Hu, S.; Loughney, S.; Wang, J. Characteristics analysis of intercontinental sea accidents using weighted association rule mining: Evidence from the Mediterranean Sea and Black Sea. Ocean Eng. 2023, 287, 115839. [Google Scholar] [CrossRef]
- Sathish, T.; Maheswari, S.U.; Balaji, V.; Nirupama, P.; Panchal, H.; Li, Z.; Tlili, I. Coastal pollution analysis for environmental health and ecological safety using deep learning technique. Adv. Eng. Softw. 2023, 179, 103441. [Google Scholar] [CrossRef]
- Hogan, A.; Blomqvist, E.; Cochez, M.; d’Amato, C.; Melo, G.D.; Gutierrez, C.; Zimmermann, A. Knowledge graphs. ACM Comput. Surv. (Csur) 2021, 54, 1–37. [Google Scholar] [CrossRef]
- Ji, S.; Pan, S.; Cambria, E.; Marttinen, P.; Philip, S.Y. A survey on knowledge graphs: Representation, acquisition, and applications. IEEE Trans. Neural Netw. Learn. Syst. 2021, 33, 494–514. [Google Scholar] [CrossRef]
- Kaplar, A.; Stošović, M.; Kaplar, A.; Brković, V.; Naumović, R.; Kovačević, A. Evaluation of clinical named entity recognition methods for Serbian electronic health records. Int. J. Med. Inform. 2022, 164, 104805. [Google Scholar] [CrossRef] [PubMed]
- Wang, W.; Bi, B.; Yan, M.; Wu, C.; Bao, Z.; Xia, J.; Si, L. Structbert: Incorporating language structures into pre-training for deep language understanding. arXiv 2019, arXiv:1908.04577. [Google Scholar]
- Li, J.; Sun, A.; Han, J.; Li, C. A Survey on Deep Learning for Named Entity Recognition. IEEE Trans. Knowl. Data Eng. 2020, 34, 50–70. [Google Scholar] [CrossRef]
- Li, Z.; Li, Q.; Zou, X.; Ren, J. Causality extraction based on self-attentive BiLSTM-CRF with transferred embeddings. Neurocomputing 2021, 423, 207–219. [Google Scholar] [CrossRef]
- Mikolov, T.; Chen, K.; Corrado, G.; Dean, J. Efficient estimation of word representations in vector space. arXiv 2013, arXiv:1301.3781. [Google Scholar]
- Liu, Y.; Wei, S.; Huang, H.; Lai, Q.; Li, M.; Guan, L. Naming entity recognition of citrus pests and diseases based on the BERT-BiLSTM-CRF model. Expert Syst. Appl. 2023, 234, 121103. [Google Scholar] [CrossRef]
- Liu, J.; Schmid, F.; Li, K.; Zheng, W. A knowledge graph-based approach for exploring railway operational accidents. Reliab. Eng. Syst. Saf. 2021, 207, 107352. [Google Scholar] [CrossRef]
- Liu, C.; Zhang, X.; Xu, Y.; Xiang, B.; Gan, L.; Shu, Y. Knowledge graph for maritime pollution regulations based on deep learning methods. Ocean Coast. Manag. 2023, 242, 106679. [Google Scholar] [CrossRef]
- Gan, L.; Ye, B.; Huang, Z.; Xu, Y.; Chen, Q.; Shu, Y. Knowledge graph construction based on ship collision accident reports to improve maritime traffic safety. Ocean Coast. Manag. 2023, 240, 106660. [Google Scholar] [CrossRef]
- Xie, C.; Zhang, L.; Zhong, Z. A Novel Method for Constructing Spatiotemporal Knowledge Graph for Maritime Ship Activities. Electronics 2023, 12, 3205. [Google Scholar] [CrossRef]
- Yan, J.; Gao, Q.; Yu, Y.; Chen, L.; Xu, Z.; Chen, J. Combining knowledge graph with deep adversarial network for water quality prediction. Environ. Sci. Pollut. Res. 2022, 30, 10360–10376. [Google Scholar] [CrossRef]
- Liu, X.; Zhang, Y.; Zou, H.; Wang, F.; Cheng, X.; Wu, W.; Liu, X.; Li, Y. Multi-source knowledge graph reasoning for ocean oil spill detection from satellite SAR images. Int. J. Appl. Earth Obs. Geoinf. 2023, 116, 103153. [Google Scholar] [CrossRef]
- Wan, H.; Fu, S.; Zhang, M.; Xiao, Y. A Semantic Network Method for the Identification of Ship’s Illegal Behaviors Using Knowledge Graphs: A Case Study on Fake Ship License Plates. J. Mar. Sci. Eng. 2023, 11, 1906. [Google Scholar] [CrossRef]
- Wan, H.; Xiao, Y.; Xu, W.; Fu, S.; Gao, P. A Cognitive Approach for Identification of Ship Illegal Behaviors by Using Knowledge Graph. In Proceedings of the 2023 7th International Conference on Transportation Information and Safety (ICTIS) IEEE, Xi’an, China, 4–6 August 2023; pp. 732–737. [Google Scholar]
- Zhang, L.; Hou, M.; Chen, A.; Zhong, H.; Ogg, J.G.; Zheng, D. Construction of a fluvial facies knowledge graph and its application in sedimentary facies identification. Geosci. Front. 2023, 14, 101521. [Google Scholar] [CrossRef]
Parent Company | Open Source | Free | Max Nodes | Visualization Tool | Operating System | Application Scenarios | |
---|---|---|---|---|---|---|---|
Neo4j | NeoTechnology | Yes | Yes | 32 billion | Yes | Windows, Linux, OSX, Solaris | Artificial intelligence, fraud detection, knowledge graphs |
Core Concept | Entity | Relationship | Relationship Description |
---|---|---|---|
Accident type (operational or accidental) | Pollution accident title | Has title | Describes the relationship between the pollution accident and the pollution accident report title |
Location | Takes place in | Describes the relationship between the pollution accident and the location where it occurred | |
Grade of accident | Has grade | Describes the relationship between the pollution accident and its severity grade | |
Consequence of accident | Has consequence | Describes the relationship between the pollution accident and its consequences | |
Accident cause | Has cause | Describes the relationship between the pollution accident and its causes | |
Description | Has summary | Describes the relationship between the pollution accident and a brief description | |
Administrative advice | Has advice information | Describes the relationship between the pollution accident and administrative advice | |
Responsibility identification | Has responsibility | Describes the relationship between the pollution accident and the confirmation of responsibility by authorities | |
Involved ships | Loss | Has ship loss | Result of the damage to the ship involved in the accident |
Casualties | Has ship casualties | Result of casualties caused by the ship involved in the accident | |
Pollution | Has oil spill volume | Result of oil spill pollution caused by the ship involved in the accident | |
Pollutant | Ship pollutant type | Relationship between the ship and various pollutants carried | |
Accident cause Accident type | Makes mistakes with | Some ships make mistakes for certain reasons | |
Accident type | Ship accident type | Types of accidents that some ships have | |
Accident cause | Involved ships | Involves ship name | The cause of the accident is attributed to certain ships |
Cause category | Involves reason category | The category of the cause is determined by the accident cause | |
Accident losses | Grade of accident | Determines involvement | Loss determines the severity of the accident |
Pollution accident | Involves ship loss | The loss is the loss of a pollution accident | |
Grade of accident | Accident results | Involves result | The grade is determined by the result of the pollution accident |
Proper Nouns | Synonyms | Proper Nouns | Synonyms |
---|---|---|---|
Vessel name | Ship name | Ship width | Width |
Port of registry | Port of registration, registry port | Cargo capacity | Reference cargo capacity |
Gross tonnage | Total tonnage | Ship type | Vessel type, |
Number of engines | Engine quantity, number of hosts | Hull material | material type, ship material, ship quality |
Engine power | Power, total power, total engine power | Shipbuilding plant | Shipyard construction, shipyard |
Captain | Shipowner | Ship owner, owner |
Knowledge | Category |
---|---|
Jingwei Oil 3 | Ship Name (Attribute) |
115.00 m | Ship Length (Attribute) |
15.00 m | Ship Width (Attribute) |
4054 tons | Gross Tonnage (Attribute) |
2151 KW | Main Engine Power (Attribute) |
Oil Tanker | Ship Type (Entity) |
COSCO SHIPPING Co., Ltd. | Ship Owner (Entity) |
Category | Configuration | |
---|---|---|
OS | Windows10, 64 bits | |
CPU | Intel Corel i12-12400F@3.60 GHz*8 | |
Computer Configuration | Memory | 16G |
Graphics Card | Ge Force GTX 1660S 6 GB | |
IDE | Pycharm | |
Programming Environment | Python-version | 3.6.9 |
Torch-version | 1.0.0 | |
Cuda-version | 10.0 |
Parameter Name | Parameter Value |
---|---|
Hidden size | 200 |
Batch size | 2 |
Epoch | 120 |
Optimizer | Adam |
Learning rate | 0.01 |
Pretrain_embedding | True |
Enbedding_dim | 300 |
Dropout | 0.5 |
Model | Accuracy (%) | Recall (%) | F1 Score (%) |
---|---|---|---|
BiLSTM | 61.58% | 62.45% | 60.81% |
BERT–BiLSTM | 72.48% | 75.59% | 73.51% |
BERT–BiLSTM–CRF | 82.47% | 83.69% | 85.48% |
BERT–CRF | 68.00% | 65.00% | 65.00% |
Limit the Number of Results Returned | Number of Entities and Relationships | Response Time (s) |
---|---|---|
MATCH p = ()-->() RETURN p LIMIT 25 | Displaying 32 entities, 25 relationships. | 0.61 s |
MATCH p = ()-->() RETURN p LIMIT 100 | Displaying 144 entities, 103 relationships. | 1.29 s |
MATCH p = ()-->() RETURN p LIMIT 500 | Displaying 300 entities, 238 relationships. | 1.95 s |
MATCH p = ()-->() RETURN p LIMIT 1000 | Displaying 1125 entities, 1864 relationships. | 3.14 s |
MATCH p = ()-->() RETURN p LIMIT 3000 | Displaying 2951 entities, 3825 relationships. | 9.22 s |
MATCH p = ()-->() RETURN p LIMIT 5793 | Displaying 3928 entities, 5793 relationships. | 28.62 s |
Cause | Influence | Affectedness | Centrality | Causality |
---|---|---|---|---|
Lack of lookout | 4.93 | 0.02 | 4.95 | 4.91 |
Failure to make a full estimation of the situation and collision risk | 4.35 | 0.27 | 4.62 | 4.08 |
Failure to fulfill the obligation of giving way to ships | 3.89 | 0.54 | 4.43 | 3.35 |
Failure to fulfill the obligation of the standby ship | 3.21 | 0.45 | 3.66 | 2.76 |
Failure to sound the horn as required | 3.06 | 0.32 | 3.38 | 2.74 |
Poor visibility | 2.63 | 0.00 | 2.63 | 2.63 |
Lack of a sufficient number of qualified crew members | 2.61 | 0.56 | 3.17 | 2.05 |
Ineffective operation of the safety management system | 2.49 | 0.51 | 3.00 | 1.98 |
Crew certificates do not meet grade requirements | 1.87 | 0.89 | 2.76 | 0.98 |
Violation of navigation rules/navigation/anchoring/operational requirements | 1.75 | 1.54 | 3.29 | 0.21 |
High navigation density/complex traffic flow | 1.62 | 1.32 | 2.94 | 0.30 |
Lack of valid ship certificates | 1.21 | 1.27 | 2.48 | −0.06 |
Lack of crew safety training | 0.95 | 1.28 | 2.23 | −0.33 |
Failure to execute safe speed | 0.73 | 1.19 | 1.92 | −0.46 |
Driver negligence | 0.36 | 1.21 | 1.57 | −0.85 |
Contrary to common practices of seafarers/failure to drive cautiously | 0.29 | 0.85 | 1.14 | −0.56 |
Lack of concentration | 0.21 | 0.99 | 1.20 | −0.78 |
Navigation information not displayed/incorrectly displayed | 0.18 | 1.11 | 1.29 | −0.93 |
No effective communication between ships | 0.05 | 1.02 | 1.07 | −0.97 |
Stormy weather | 1.01 | 0.00 | 1.01 | −1.01 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Hu, J.; Zhou, W.; Zheng, P.; Liu, G. A Novel Approach for the Analysis of Ship Pollution Accidents Using Knowledge Graph. Sustainability 2024, 16, 5296. https://doi.org/10.3390/su16135296
Hu J, Zhou W, Zheng P, Liu G. A Novel Approach for the Analysis of Ship Pollution Accidents Using Knowledge Graph. Sustainability. 2024; 16(13):5296. https://doi.org/10.3390/su16135296
Chicago/Turabian StyleHu, Junlin, Weixiang Zhou, Pengjun Zheng, and Guiyun Liu. 2024. "A Novel Approach for the Analysis of Ship Pollution Accidents Using Knowledge Graph" Sustainability 16, no. 13: 5296. https://doi.org/10.3390/su16135296
APA StyleHu, J., Zhou, W., Zheng, P., & Liu, G. (2024). A Novel Approach for the Analysis of Ship Pollution Accidents Using Knowledge Graph. Sustainability, 16(13), 5296. https://doi.org/10.3390/su16135296