Research on the Prediction of Sustainable Safety Production in Building Construction Based on Text Data
Abstract
:1. Introduction
2. Materials and Methods
2.1. Preparation of High-Quality Data
2.1.1. Text Segmentation Process
- The Trie tree segmentation model was established. The Trie tree data structure consists of root and multilayered leaf nodes. The root node stores no data, while the leaf nodes each store one character. The retrieval process starts from the root node and sequentially moves through the leaf nodes, with the characters connected to form a complete word. The number inside the leaf node indicates how frequently the word appears in the corpus.
- A directed acyclic graph (DAG) of the corpus was established. Through the rapid scanning process of the Trie tree, all possible combinations of each word in the corpus are traversed, the potential combinations are obtained, and a DAG is formed from the nodes and multiple links.
- The maximum probability of the segmentation path based on the word frequency was obtained. Finally, unlogged words are subdivided using the HMM model. By considering sentences as observable states, the state transfer process is described through the random analysis of states and observational states. The set of state values corresponding to a sentence is denoted as (B, M, E, S), representing the beginning position (Begin), middle position (Middle), end position (End), and independent word formation (Single), respectively. Sentence annotation is performed using these four states. After extensive corpus training, we can determine the dependency relationships between word states and observations and simulate the sequence prediction problem using three HMM model parameters: transition probability, emission probability, and initial hidden state probability. The Viterbi decoding algorithm, with model parameters and observation states as inputs, finds the most likely hidden state sequence, thereby implementing an HMM-based segmentation algorithm to process text containing unregistered words. As an example, consider the statement “Inadequate execution of duties by supervisory personnel”. The corresponding word tag sequence is “BMMEBEBEBESBE”, where each word constitutes an unregistered word. The corresponding word label sequence is “BMMEBEBEBESBE”, where each word constitutes an observation sequence, and the word position labeling constitutes a state sequence. The optimal segmentation result is obtained by combining the HMM model and Viterbi algorithm derived from the training. Additionally, when using the Jieba package for Chinese text segmentation without constructing and loading a complete custom dictionary, the segmentation results are mostly binary word strings. Two statistical measures, mutual information, and information entropy, were introduced to overcome the limitations of dictionary-based segmentation in specific domains and accurately recognize words with complete meanings. Mutual information assesses the likelihood of word formation and evaluates the independence and semantic integrity of multi-word expressions by counting the frequency of simultaneous occurrence of adjacent word combinations. Information entropy measures the degree of freedom of candidate words. When the information entropy value on both sides of the candidate word is higher, the likelihood of independent word formation increases. For example, the algorithm implementation process can be described as follows: there are ordered neighboring words A and B, and the mutual information between A and B is expressed as the probability value I(A,B), reflecting the degree of association. If I(A,B) is greater than the set threshold, A and B are considered to co-occur. After multiple calculations to find the combined word string AB, the frequency of the occurrence of the left and right neighboring words of AB is identified, and the left and right neighboring word information entropy values are calculated according to the information entropy formula. By simultaneously eliminating boundary uncertainty, we obtained feature words containing rich semantic information.
2.1.2. Keyword Extraction Based on TF-IDF
2.1.3. Multiple Covariance Analysis of Accident Attribute Variables
2.2. Risk Prediction Model Construction Based on the XGBoost Algorithm
2.2.1. XGBoost Algorithm Construction Process
- Design the objective function. The regularization term is added in the middle of the prediction error as its objective Function (3), where is the sample, is the prediction error of the sample, and is the complexity function of the tree and the regularization term of the objective function.
- Evaluate the error second-order Taylor expansion. The second-order Taylor approximation expansion of the objective function uses first-order and second-order derivatives to calculate the evaluation error for learning to generate . The objective function under the iteration can be expressed as Equation (4).
- Find the optimal splitting point. The complexity function of the tree (i.e., the regularization term of the objective function) is related to the number of leaf nodes . The value of leaf nodes of the tree is the number of leaves penalizing the regular term, which is used to limit the regression tree to producing branches and has the effect of pruning, and is the leaf node weight penalizing the common term, which plays a role in reducing the impact of overfitting. The structure can be expressed as (5). All the samples classified to the leaf node can be formed into a sample set using the mathematical expression . For a fixed structure , it can be obtained through quadratic derivation that the optimal weight of the leaf node is (6).
2.2.2. Model Optimization and Performance Evaluation Metrics
- Hyperparameter tuning
- Performance Evaluation
2.3. Interpretability Analysis of the Modeling Results
3. Case Study
3.1. Data Collection and Processing
3.1.1. Text Segmentation Process
3.1.2. Keyword Extraction
3.1.3. Multiple Covariance Analysis of Accident Attribute Variables
3.2. Predictive Model Construction Based on XGBoost
3.3. Interpretability Analysis of Modeling Results
4. Discussion
5. Conclusions
5.1. Contributions
5.2. Limitations
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Gandomi, A.; Haider, M. Beyond the hype: Big data concepts, methods, and analytics. Int. J. Inf. Manag. 2015, 35, 137–144. [Google Scholar] [CrossRef]
- Nawaz, A.; Waqar, A.; Shah, S.A.R.; Sajid, M.; Khalid, M.I. An innovative framework for risk management in construction projects in developing countries: Evidence from Pakistan. Risks 2019, 7, 24. [Google Scholar] [CrossRef]
- Hwang, B.-G.; Shan, M.; Phuah, S.L. Safety in green building construction projects in Singapore: Performance, critical issues, and improvement solutions. KSCE J. Civ. Eng. 2018, 22, 447–458. [Google Scholar] [CrossRef]
- Tonetto, M.S.; Saurin, T.A. Choosing fall protection systems in construction sites: Coping with complex rather than complicated systems. Saf. Sci. 2021, 143, 105412. [Google Scholar] [CrossRef]
- Wang, J.; Zou, P.X.; Li, P.P. Critical factors and paths influencing construction workers’ safety risk tolerances. Accid. Anal. Prev. 2016, 93, 267–279. [Google Scholar] [CrossRef]
- Kumar, K.S.; Narayanan, R. Review on construction risk and development of risk management procedural index–A case study from Chennai construction sector. Mater. Today Proc. 2021, 43, 1141–1146. [Google Scholar] [CrossRef]
- Tian, Z.; Chen, Q.; Zhang, T. A method for assessing the crossed risk of construction safety. Saf. Sci. 2022, 146, 105531. [Google Scholar] [CrossRef]
- Alkaissy, M.; Arashpour, M.; Ashuri, B.; Bai, Y.; Hosseini, R. Safety management in construction: 20 years of risk modeling. Saf. Sci. 2020, 129, 104805. [Google Scholar] [CrossRef]
- Afzal, M.; Shafiq, M.T.; Al Jassmi, H. Improving construction safety with virtual-design construction technologies—A review. J. Inf. Technol. Constr. 2021, 26, 319–340. [Google Scholar] [CrossRef]
- Yap, J.B.H.; Lee, W.K. Analysing the underlying factors affecting safety performance in building construction. Prod. Plan. Control 2020, 31, 1061–1076. [Google Scholar] [CrossRef]
- Kukoyi, P.O.; Adebowale, O.J. Impediments to construction safety improvement. J. Eng. Proj. Prod. Manag. 2021, 11, 207–214. [Google Scholar]
- Sanni-Anibire, M.O.; Mahmoud, A.S.; Hassanain, M.A.; Salami, B.A. A risk assessment approach for enhancing construction safety performance. Saf. Sci. 2020, 121, 15–29. [Google Scholar] [CrossRef]
- Darko, A.; Chan, A.P.; Yang, Y.; Tetteh, M.O. Building information modeling (BIM)-based modular integrated construction risk management–Critical survey and future needs. Comput. Ind. 2020, 123, 103327. [Google Scholar] [CrossRef]
- Zhou, Z.; Wei, L.; Yuan, J.; Cui, J.; Zhang, Z.; Zhuo, W.; Lin, D. Construction safety management in the data-rich era: A hybrid review based upon three perspectives of nature of dataset, machine learning approach, and research topic. Adv. Eng. Inform. 2023, 58, 102144. [Google Scholar] [CrossRef]
- Pagell, M.; Johnston, D.; Veltri, A.; Klassen, R.; Biehl, M. Is safe production an oxymoron? Prod. Oper. Manag. 2014, 23, 1161–1175. [Google Scholar] [CrossRef]
- Ghousi, R.; Khanzadi, M.; Atashgah, K.M. A flexible method of building construction safety risk assessment and investigating financial aspects of safety program. Int. J. Optim. Civ. Eng. 2018, 8, 433–452. [Google Scholar]
- Manzoor, B.; Othman, I.; Manzoor, M. Evaluating the critical safety factors causing accidents in high-rise building projects. Ain Shams Eng. J. 2021, 12, 2485–2492. [Google Scholar] [CrossRef]
- Chen, H.; Mao, Y.; Xu, Y.; Wang, R. The impact of wearable devices on the construction safety of building workers: A systematic review. Sustainability 2023, 15, 11165. [Google Scholar] [CrossRef]
- Sofwan, N.M.; Zaini, A.A.; Mahayuddin, S.A. Preliminary study on the identification of safety risks factors in the high rise building construction. J. Teknol. 2016, 78, 8505. [Google Scholar] [CrossRef]
- Hu, W.; Xia, W.; Li, Y.; Jiang, J.; Li, G.; Chen, Y. An intelligent identification method of safety risk while drilling in gas drilling. Pet. Explor. Dev. 2022, 49, 428–437. [Google Scholar] [CrossRef]
- Macêdo, J.B.; Moura, M.d.C.; Aichele, D.; Lins, I.D. Identification of risk features using text mining and BERT-based models: Application to an oil refinery. Process Saf. Environ. Prot. 2022, 158, 382–399. [Google Scholar] [CrossRef]
- Zhang, S.; Teizer, J.; Pradhananga, N.; Eastman, C.M. Workforce location tracking to model, visualize and analyze workspace requirements in building information models for construction safety planning. Autom. Constr. 2015, 60, 74–86. [Google Scholar] [CrossRef]
- Aziz, A.; Ahmed, S.; Khan, F.I. An ontology-based methodology for hazard identification and causation analysis. Process Saf. Environ. Prot. 2019, 123, 87–98. [Google Scholar] [CrossRef]
- Chartres, N.; Bero, L.A.; Norris, S.L. A review of methods used for hazard identification and risk assessment of environmental hazards. Environ. Int. 2019, 123, 231–239. [Google Scholar] [CrossRef] [PubMed]
- Martin, P.; Bladier, C.; Meek, B.; Bruyere, O.; Feinblatt, E.; Touvier, M.; Watier, L.; Makowski, D. Weight of evidence for hazard identification: A critical review of the literature. Environ. Health Perspect. 2018, 126, 076001. [Google Scholar] [CrossRef] [PubMed]
- Mihić, M. Classification of construction hazards for a universal hazard identification methodology. J. Civ. Eng. Manag. 2020, 26, 147–159. [Google Scholar] [CrossRef]
- Martin, P.; Bladier, C.; Meek, B.; Bruyere, O.; Feinblatt, E.; Touvier, M.; Watier, L.; Makowski, D. Hazard identification, risk assessment and risk control (HIRARC) accidents at power plant. In Proceedings of the MATEC Web of Conferences, Amsterdam, The Netherlands, 23–25 March 2016; EDP Sciences: Les Ulis, France, 2016. [Google Scholar]
- Mihić, M.; Cerić, A.; Završki, I. Developing construction hazard database for automated hazard identification process. Teh. Vjesn. 2018, 25, 1761–1769. [Google Scholar]
- Aven, T. Risk assessment and risk management: Review of recent advances on their foundation. Eur. J. Oper. Res. 2016, 253, 1–13. [Google Scholar] [CrossRef]
- Gambrill, E.; Shlonsky, A. Risk assessment in context. Child. Youth Serv. Rev. 2000, 225, 813–837. [Google Scholar] [CrossRef]
- Jannadi, O.A.; Almishari, S. Risk assessment in construction. J. Constr. Eng. Manag. 2003, 129, 492–500. [Google Scholar] [CrossRef]
- Hegde, J.; Rokseth, B. Applications of machine learning methods for engineering risk assessment—A review. Saf. Sci. 2020, 122, 104492. [Google Scholar] [CrossRef]
- Mohandes, S.R.; Zhang, X. Developing a Holistic Occupational Health and Safety risk assessment model: An application to a case of sustainable construction project. J. Clean. Prod. 2021, 291, 125934. [Google Scholar] [CrossRef]
- Wuni, I.Y.; Shen, G.Q.; Osei-Kyei, R.; Agyeman-Yeboah, S. Modelling the critical risk factors for modular integrated construction projects. Int. J. Constr. Manag. 2022, 22, 2013–2026. [Google Scholar] [CrossRef]
- Luo, X.; Li, X.; Goh, Y.M.; Song; Liu, Q. Application of machinelearning technology for occupational accident severityprediction in the case of construction collapse accidents. Saf. Sci. 2023, 163, 106138. [Google Scholar] [CrossRef]
- Kang, K.; Ryu, H. Predicting types of occupational accidents at construction sites in Korea using random forest model. Saf. Sci. 2019, 120, 226–236. [Google Scholar] [CrossRef]
- Chen, F.; Wang, J.; Deng, Y. Road safety risk evaluation by means of improved entropy TOPSIS–RSR. Saf. Sci. 2015, 79, 39–54. [Google Scholar] [CrossRef]
- Karasan, A.; Ilbahar, E.; Cebi, S.; Kahraman, C. A new risk assessment approach: Safety and Critical Effect Analysis (SCEA) and its extension with Pythagorean fuzzy sets. Saf. Sci. 2018, 108, 173–187. [Google Scholar] [CrossRef]
- Koulinas, G.; Marhavilas, P.; Demesouka, O.; Vavatsikos, A.; Koulouriotis, D. Risk analysis and assessment in the worksites using the fuzzy-analytical hierarchy process and a quantitative technique—A case study for the Greek construction sector. Saf. Sci. 2019, 112, 96–104. [Google Scholar] [CrossRef]
- Zavari, M.; Shahhosseini, V.; Ardeshir, A.; Sebt, M.H. Multi-objective optimization of dynamic construction site layout using BIM and GIS. J. Build. Eng. 2022, 52, 104518. [Google Scholar] [CrossRef]
- Kouabenan, D.R.; Ngueutsa, R.; Mbaye, S. Safety climate, perceived risk, and involvement in safety management. Saf. Sci. 2015, 77, 72–79. [Google Scholar] [CrossRef]
- Gunderson, D.E.; Gloeckner, G.W. Superintendent competencies and attributes required for success: A national study comparing construction professionals’ opinions. Int. J. Constr. Educ. Res. 2011, 7, 294–311. [Google Scholar] [CrossRef]
- Post, C.; Latu, I.M.; Belkin, L.Y. A female leadership trust advantage in times of crisis: Under what conditions? Psychol. Women Q. 2019, 43, 215–231. [Google Scholar] [CrossRef]
- Moshood, T.; Adeleke, A.; Nawanir, G.; Mahmud, F. Ranking of human factors affecting contractors’ risk attitudes in the Malaysian construction industry. Soc. Sci. Humanit. Open 2020, 2, 100064. [Google Scholar] [CrossRef]
- Zhou, Z.; Goh, Y.M.; Shi, Q.; Qi, H.; Liu, S. Data-driven determination of collapse accident patterns for the mitigation of safety risks at metro construction sites. Tunn. Undergr. Space Technol. 2022, 127, 104616. [Google Scholar] [CrossRef]
- Toh, Y.Z.; Goh, Y.M.; Guo, B.H. Knowledge, attitude, and practice of design for safety: Multiple stakeholders in the Singapore construction industry. J. Constr. Eng. Manag. 2017, 143, 04016131. [Google Scholar] [CrossRef]
- Hossain, M.A.; Abbott, E.L.S.; Chua, D.K.H.; Qui, N.T.; Goh, Y.M. Design-for-safety knowledge library for BIM-integrated safety risk reviews. Autom. Constr. 2018, 94, 290–302. [Google Scholar] [CrossRef]
- Zhou, W.; Whyte, J.; Sacks, R. Construction safety and digital design: A review. Autom. Constr. 2012, 22, 102–111. [Google Scholar] [CrossRef]
- Che Ibrahim, C.K.I.; Belayutham, S.; Mohammad, M.Z. Prevention through design (PtD) education for future civil engineers in Malaysia: Current state, challenges, and way forward. J. Civ. Eng. Educ. 2021, 147, 05020007. [Google Scholar] [CrossRef]
- Poghosyan, A.; Manu, P.; Mahdjoubi, L.; Gibb, A.G.F.; Behm, M.; Mahamadu, A.-M. Design for safety implementation factors: A literature review. J. Eng. Des. Technol. 2018, 16, 783–797. [Google Scholar] [CrossRef]
- Goh, Y.M.; Chua, S. Knowledge, attitude and practices for design for safety: A study on civil & structural engineers. Accid. Anal. Prev. 2016, 93, 260–266. [Google Scholar]
- Ibrahim, C.K.I.C.; Manu, P.; Belayutham, S.; Mahamadu, A.-M.; Antwi-Afari, M.F. Design for safety (DfS) practice in construction engineering and management research: A review of current trends and future directions. J. Build. Eng. 2022, 52, 104352. [Google Scholar] [CrossRef]
- Teo, A.L.E.; Ofori, G.; Tjandra, I.K.; Kim, H. Design for safety: Theoretical framework of the safety aspect of BIM system to determine the safety index. Constr. Econ. Build. 2016, 16, 1–18. [Google Scholar] [CrossRef]
- Tymvios, N.; Gambatese, J.A. Perceptions about design for construction worker safety: Viewpoints from contractors, designers, and university facility owners. J. Constr. Eng. Manag. 2016, 142, 04015078. [Google Scholar] [CrossRef]
Jieba segmentation results | operation, construction, accident, production, cause, site, indirect cause, supervision, management, construction site, installation, training, inspection, dismantling, violation of regulations |
Segmentation results after combining statistics optimization | operators, construction site, operating procedures, timely detection, poor safety awareness, work at height, unauthorized operation, construction program, safety management personnel, safety production, chaotic safety management, operation site, safety management system |
Serial Number | Keywords | TF-IDF | Serial Number | Keywords | TF-IDF |
---|---|---|---|---|---|
1 | Operator | 4.579 | 35 | Construction permit | 1.156 |
2 | Design plan | 3.581 | 36 | Warning signs | 1.147 |
3 | Construction site | 2.898 | 37 | Safety management system | 1.139 |
4 | Management aspects | 2.753 | 38 | Inadequate staffing | 1.127 |
5 | Operating procedures | 2.591 | 39 | Unauthorized command | 1.116 |
6 | Site investigation | 2.439 | 40 | Repair and maintenance | 0.178 |
7 | Timely discovery | 1.965 | 41 | Unauthorized operation | 0.169 |
8 | Operating area | 1.941 | 42 | Illegal contracting | 0.154 |
9 | Limited company | 1.911 | 43 | Failure to wear a safety helmet | 0.091 |
10 | Unqualified | 1.899 | 44 | Light safety | 0.090 |
11 | Slow safety awareness | 1.852 | 45 | Disorderly safety management | 0.089 |
12 | Emergency response plan | 1.785 | 46 | Engineering design | 0.085 |
13 | Safety education and training | 1.763 | 47 | Supervision and administration | 0.076 |
14 | Measures taken | 1.751 | 48 | Supervision contracts | 0.072 |
15 | Site management chaos | 1.726 | 49 | Operator’s certificates | 0.071 |
16 | Fatal accidents | 1.692 | 50 | Violation of labor discipline | 0.069 |
17 | Unauthorized operation | 1.683 | 51 | Safety supervision | 0.068 |
18 | Safety confirmation | 1.669 | 52 | Supervisory work for safety | 0.065 |
19 | Crane equipment | 1.651 | 53 | Unreasonable organization | 0.064 |
20 | Lack of supervision | 1.648 | 54 | Technical programs | 0.062 |
21 | Safety inspections | 1.592 | 55 | On-site management personnel | 0.059 |
22 | Poor safety awareness | 1.572 | 56 | Personal protective equipment | 0.058 |
23 | Hidden dangers investigation | 1.568 | 57 | Safety briefing | 0.044 |
24 | Safety rules and regulations | 1.534 | 58 | Illegal subcontracting | 0.041 |
25 | Safety protection | 1.531 | 59 | Conscientious implementation | 0.039 |
26 | Organization and coordination | 1.519 | 60 | Operation qualification certificate | 0.037 |
27 | Management system | 1.486 | 61 | Safety protective equipment | 0.036 |
28 | Strict supervision | 1.451 | 62 | Training and assessment | 0.036 |
29 | Responsible accidents | 1.392 | 63 | Operation management | 0.034 |
30 | Licensed work | 1.363 | 64 | Subcontracting units | 0.033 |
31 | Unlicensed work | 1.357 | 65 | Emergency disposal | 0.033 |
32 | Management procedures | 1.296 | 66 | Special equipment safety | 0.032 |
33 | Laws and regulations | 1.271 | 67 | Blind command | 0.032 |
34 | Program design | 1.188 | 68 | Wearing helmets | 0.030 |
Risk Factor Category | Risk Factor |
---|---|
Human Factor | Violation of operating regulations (HF1) |
Contravention of technical safety standards (HF2) | |
Formwork erection and dismantling not conforming to specifications (HF3) | |
Improper design of the construction process (HF4) | |
Failure to strictly comply with safety regulations (HF5) | |
Failure to follow the construction design program (HF6) | |
Incomplete inspection of potential safety hazards (HF7) | |
Failure to deal with hidden safety hazards promptly (HF8) | |
Inadequate site investigation and monitoring (HF9) | |
Operators on duty without certification (HF10) | |
Inadequate safety awareness (HF11) | |
Inadequate personal protection for operators (HF12) | |
Unauthorized change in the construction organization design (HF13) | |
Facility Factors | Substandard quality of the construction materials (FF1) |
Inadequate inspection of the equipment (FF2) | |
Improper maintenance management of the special equipment (FF3) | |
Inadequate implementation of the site safety protection measures (FF4) | |
Inadequate provision of safety protection equipment (FF5) | |
Environmental Factors | Harsh natural climate (EF1) |
Complex geological conditions (EF2) | |
Poor geographic location (EF3) | |
Poor operating space environment (EF4) | |
Management Factors | Failure to implement a system of responsibility for work safety (MF1) |
Failure to implement geographic safety supervision responsibilities (MF2) | |
Neglect of safety issues by project management (MF3) | |
Incomplete safety management regulatory system (MF4) | |
Inadequate safety management organizational structure (MF5) | |
Inadequate execution of duties by safety management personnel (MF6) | |
Noncompliance of safety management personnel (MF7) | |
Inadequate on-site safety supervision (MF8) | |
Lack of safety training and education (MF9) | |
Failure to supervise the rectification and review of hidden accident hazards (MF10) | |
Lack of supervisory organization or personnel (MF11) | |
Inadequate execution of duties by supervisory personnel (MF12) | |
Inadequate safety inspection (MF13) | |
Illegal subcontracting and sub-subcontracting behavior (MF14) | |
Incomplete emergency response mechanism (MF15) | |
Failure to formulate an emergency response plan for work safety (MF16) | |
Incomplete safety technical instructions (MF17) | |
Failure to formulate a particular safety construction plan (MF18) | |
Particular program without expert assessment (MF19) | |
Construction organization and management are chaotic and disordered (MF20) | |
Inadequate investment of safety funds (MF21) |
Experiments | Accuracy | Precision | Recall | F1-Score | Severity |
---|---|---|---|---|---|
a | 0.81 | 0.71 | 0.84 | 0.77 | 0 |
0.73 | 0.67 | 0.71 | 1 | ||
0.97 | 0.96 | 0.96 | 2 | ||
0.80 | 0.82 | 0.81 | Mean | ||
b | 0.77 | 0.72 | 0.71 | 0.73 | 0 |
0.73 | 0.69 | 0.69 | 1 | ||
0.91 | 0.89 | 0.91 | 2 | ||
0.79 | 0.76 | 0.78 | Mean | ||
c | 0.86 | 0.81 | 0.83 | 0.86 | 0 |
0.79 | 0.8 | 0.78 | 1 | ||
0.98 | 0.97 | 0.95 | 2 | ||
0.86 | 0.87 | 0.86 | Mean |
Test | Test Scenario | Accuracy | Precision | Recall | F1-Score |
---|---|---|---|---|---|
XGBoost | a | 0.81 | 0.71 | 0.84 | 0.77 |
b | 0.77 | 0.73 | 0.67 | 0.71 | |
c | 0.86 | 0.97 | 0.96 | 0.96 | |
LightGBM | a | 0.74 | 0.72 | 0.72 | 0.73 |
b | 0.77 | 0.73 | 0.77 | 0.71 | |
c | 0.83 | 0.81 | 0.83 | 0.84 | |
GBDT | a | 0.69 | 0.62 | 0.69 | 0.67 |
b | 0.72 | 0.66 | 0.71 | 0.69 | |
c | 0.74 | 0.67 | 0.73 | 0.77 | |
Random forest | a | 0.73 | 0.70 | 0.76 | 0.76 |
b | 0.76 | 0.73 | 0.71 | 0.75 | |
c | 0.82 | 0.78 | 0.80 | 0.80 |
Test Scenario(c) | Model Parameters |
---|---|
XGBoost | n_estimator = 100, eta = 0.1, min_child_weight = 1, max_depth = 2, gamma = 0.3, subsample = 0.1, colsample_bytree = 0.2, lambda = 0.3 |
LightGBM | num_leaves = 18, min_data_in_leaf = 1, max_depth = 2, min_split_gain = 0.3, subsample = 0.1, colsample_bytree = 0.2, reg_lambda = 0.3 |
GBDT | n_estimators = 100, min_samples_split = 2, max_depth = 2, min_samples_leaf = 1, subsample = 0.1, max_features = 0.2 |
Random forest | n_estimators = 100, max_depth = 2, min_samples_split = 2, min_samples_leaf = 1, max_features = 0.2 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Fan, J.; Wang, D.; Liu, P.; Xu, J. Research on the Prediction of Sustainable Safety Production in Building Construction Based on Text Data. Sustainability 2024, 16, 5081. https://doi.org/10.3390/su16125081
Fan J, Wang D, Liu P, Xu J. Research on the Prediction of Sustainable Safety Production in Building Construction Based on Text Data. Sustainability. 2024; 16(12):5081. https://doi.org/10.3390/su16125081
Chicago/Turabian StyleFan, Jifei, Daopeng Wang, Ping Liu, and Jiaming Xu. 2024. "Research on the Prediction of Sustainable Safety Production in Building Construction Based on Text Data" Sustainability 16, no. 12: 5081. https://doi.org/10.3390/su16125081
APA StyleFan, J., Wang, D., Liu, P., & Xu, J. (2024). Research on the Prediction of Sustainable Safety Production in Building Construction Based on Text Data. Sustainability, 16(12), 5081. https://doi.org/10.3390/su16125081