Assessing the Accuracy of ChatGPT Use for Risk Management in Construction Projects
Abstract
:1. Introduction
- Understanding the appropriate usage areas along with the usage limitations of ChatGPT in the risk decision-making process, which provides insights for construction companies that want to be digital pioneers in the industry.
- Understanding the accuracy of ChatGPT in the risk decision-making process, which can enable decision-makers to build resilience through technology-driven risk management in business operations.
- Additionally, as one of the pioneer studies in the AI-driven risk management domain, the study indubitably contributes to the body of risk management knowledge with its findings related to ChatGPT’s performance in each risk management sub-process for different construction project types.
2. Literature Review
2.1. Artificial Intelligence and the Subfields Used in Risk Management
2.2. Literature Review of AI Use for Risk Management in Construction Projects
2.3. Literature Review on ChatGPT Use for Risk Management in Construction Projects
3. Research Methodology
3.1. Determination of KPIs for Assessing the Accuracy of ChatGPT Use in Risk Management Process
3.2. Preparation of Questionnaire
3.3. Data Collection
3.4. Data Analysis
4. Results and Discussion
4.1. Evaluation of ChatGPT’s Performance in Risk Identification Sub-Process
- Evaluation of ChatGPT’s performance in KPI-1: The experts were additionally encouraged to identify a set of 20 risk factors specific to the construction project for their focus group session. A comparison is presented in Table 5.
FGS-1 | Risk Factors Identified by ChatGPT | Risk Factors Identified by Experts |
(√) 1. Political instability and changes in government policies. (√) 2. Regulatory and legal risks, including changes in legislation and regulations. (√) 3. Delays in obtaining necessary permits and approvals. (√) 4. Currency exchange rate fluctuations. (x) 5. Economic downturn or recession. (√) 6. Construction and infrastructure risks, such as delays, cost overruns, and quality issues. (√) 7. Environmental risks, including potential impact on ecosystems and local communities. (√) 8. Public opposition and protests against the project. (√) 9. Land acquisition and resettlement issues. (√) 10. Financing risks, such as difficulty in securing funding or changes in interest rates. (x) 11. Force majeure events, such as natural disasters or acts of terrorism. (√) 12. Traffic and demand risks, including lower-than-expected usage or changes in travel patterns. (x) 13. Technology and operational risks, such as system failures or cybersecurity threats. (x) 14. Performance and maintenance risks, including the need for costly repairs or upgrades. (√) 15. Dispute resolution and contract enforcement risks. (√) 16. Inflation and cost escalation. (√) 17. Revenue and cash flow risks, including lower than projected toll or fare revenues. (√) 18. Market and competition risks, such as changes in market conditions or new competitors. (x) 19. Reputation and public image risks, including negative media coverage or customer dissatisfaction. (x) 20. Exit strategy risks, such as difficulty in selling or transferring the project at the end of the concession period. | (√) 1. Public opposition (x) 2. Termination of concessionaire by public institution. (√) 3. Failure/delay in obtaining permits/approvals. (√) 4. Nationalization/expropriation. (√) 5. Financial risks associated with project funding (inflation, interest rate fluctuation, foreign exchange fluctuation, taxation, etc.). (√) 6. Market risks associated with tariff and demand changes, etc. (√) 7. Environmental concerns (environmental impact of the project, adverse site conditions, weather conditions). (x) 8. Project objective/purpose differences among stakeholders (lack of commitment among stakeholders, etc.). (x) 9. Purchasing guarantees by public institution. (x) 10. Payment mechanisms (√) 11. Inadequate contract management (inadequacy of plans and specifications, inadequacy in contract enforcement). (x) 12. Integration between design and construction phases. (√) 13. Inadequate government regulations (unstable policies, lack of regulation regarding PPP). (√) 14. Completion risk in terms of delays in project timeline. (x) 15. Excessive design variations and construction changes. (x) 16. Wrong estimation of cost trade-offs. (√) 17. Revenue (income) risk. (√) 18. Social and political instability in host countries. (x) 19. Improper risk-allocation decisions. (x) 20. Inadequate relationships with employer (public). | |
FGS-2 | Risk factors identified by ChatGPT | Risk factors identified by experts |
(√) 1. Political instability and changes in government policies. (√) 2. Regulatory and legal uncertainties. (√) 3. Delays in obtaining necessary permits and approvals. (√) 4. Fluctuations in energy prices and market conditions. (√) 5. Currency exchange rate fluctuations. (√) 6. Changes in tax and tariff policies. (√) 7. Environmental and social impact concerns. (x) 8. Volatility in fuel prices and availability. (√) 9. Construction delays and cost overruns. (√) 10. Technical and operational risks. (√) 11. Supply chain disruptions. (√) 12. Natural disasters and extreme weather events. (√) 13. Security and geopolitical risks. (√) 14. Labor disputes and strikes. (√) 15. Inadequate infrastructure and grid connection challenges. (√) 16. Project financing and funding risks. (x) 17. Counterparty risks, including non-payment or contract breaches. (x) 18. Public opposition and community resistance. (√) 19. Changes in energy policies and regulations. (x) 20. Inadequate risk management and mitigation strategies. | (√) 1. Completion risk in terms of delays in project timeline. (√) 2. Increased costs due to unforeseen circumstances (Construction cost overrun + Operation cost overrun). (√) 3. Changes in government regulations (unstable renewable energy policies; changes in tax and tariff policies). (√) 4. Environmental concerns (environmental impact of the project, adverse site conditions, weather conditions). (x) 5. Health and safety concerns. (√) 6. Technological challenges. (√) 7. Social and political instability in host countries. (√) 8. Financial risks associated with project funding (inflation, interest rate fluctuation, foreign exchange fluctuation, taxation, etc.). (√) 9. Market risks associated with energy commodity prices, and demand change, etc. (√) 10. Disruption in supply chain due to force majeure events. (√) 11. Risk of terrorism and strikes. (x) 12. Liquidity risks based on non-existence of secondary market and long payback period. (x) 13. Credit risk based on default of renewable energy projects. (√) 14. Risks based on non-existence of required infrastructure. (x) 15. Insufficient project finance supervision. (x) 16. Inability of concessionaire. (x) 17. Imperfect law and supervision system. (x) 18. Inadequate contract management (inadequacy of plans and specifications, contract enforcement). (√) 19. Delay in project approvals and permits. (x) 20. Lack of support infrastructures. |
- Evaluation of ChatGPT’s performance in KPI-2: The general performance of ChatGPT for generating relationships among the identified risks was found to be sufficient in each focus group session. It was seen that ChatGPT considered cause-and-effect relationships or influences on each other, leading to potential correlations or dependencies. However, in some cases, ChatGPT failed to broaden the generated relationships among the identified risks with solid explanations.
- Evaluation of ChatGPT’s performance in KPI-3: ChatGPT gave a breakdown of the identified risks by using the classification that it provided previously. However, it failed to visualize this information in a visual representation as a hierarchical structure or flowchart.
- Evaluation of ChatGPT’s performance in KPI-4: The ability of ChatGPT in recognizing and assessing the potential risks in the context of new variables was found to be high. In each trial, it gave a new set of risk factors by taking into account the specific variables.
4.2. Evaluation of ChatGPT’s Performance in Risk Analysis Sub-Processes
4.3. Evaluation of ChatGPT’s Performance in Risk Response Sub-Processes
- Evaluation of ChatGPT’s performance in KPI-6: In both trials, the experts found ChatGPT’s ability to “provide a relevant risk response” as very low. The experts especially emphasized that ChatGPT generally neglected more possible effects and did not suggest any proper risk mitigation measures for these circumstances. There was consensus that ChatGPT suggested more common actions that did not reflect a detailed plan. In addition, especially in trial 1, when the underlying reasons for choosing the risk response strategy was asked, it was seen that ChatGPT started to give inconsistent answers in such a way that it switched to its previous answers each time the same question was requisitioned.
- Evaluation of ChatGPT’s performance in KPI-7: It was seen that in each focus group session, ChatGPT provided more accurate responses in providing proper risk allocation decisions. It also gave clear and well-defined reasons for how it provided these allocation decisions, which builds up trust for its user(s).
- Evaluation of ChatGPT’s performance in KPI-8: In each focus group session, ChatGPT’s performance for generating contingent response strategies for the prioritized risks in the context of a certain emerging situation was found to be very high. It provided clear and well-defined explanations of how the suggested contingent response strategies ensured project continuity.
4.4. Evaluation of ChatGPT’s Performance in Risk Monitoring Sub-Processes
- Evaluation of ChatGPT’s performance in KPI-9: In FGS-1, ChatGPT’s performance in providing supportive suggestions of how to monitor risk factors was found to be reasonable and proactive. The experts emphasized that project managers can effectively monitor the identified risk factors by implementing ChatGPT’s suggestions. They also agreed that the suggested risk monitoring approach may enable project managers to take timely actions to mitigate the impact and ensure the overall success of PPP-type transportation projects. On the other hand, in FGS-2, ChatGPT’s performance in providing supportive suggestions on how to monitor risk factors was found to be not as successful as ChatGPT’s performance of FGS-1, as the suggestions made by ChatGPT provided guidance from a narrower perspective. The experts emphasized that they should be broadened, with more inclusive and explanatory suggestions. That way, project managers can stay informed, anticipate potential challenges, and take more appropriate actions.
- Evaluation of ChatGPT’s performance in KPI-10: There were no additional questions for ChatGPT for this KPI. However, for the experts, the whole conversation history revealed insight into whether the gathered responses of ChatGPT were flexible in terms of fitting the specific needs and goals of the construction project. There was consensus that, to a certain degree, ChatGPT customized its responses to fit a specific need/goal/emerging situation from a narrower perspective. However, it is believed that as more precise information/data was sustained for ChatGPT, it had the potential to customize its outputs to fit the specific needs and goals of the construction project.
- Evaluation of ChatGPT’s performance in KPI-11: An incoherent situation was witnessed. In trial 1, ChatGPT created a template of an informative risk report for project managers by informing in each sub-heading what kind of information/data should be inserted into this template. Although it served as a good example for project managers by revealing what kind of information a risk report should contain, ChatGPT did not create an actual risk report in the context of its conversation history. User(s) should customize the content at the end. Then, in trial 2, ChatGPT created a template of an informative risk report for project managers in the context of its conversation history. It presented a summary on the risk assessment of the top risk factors, risk response strategies, and recommendations. It also gave a summary in the conclusion sub-heading. Another important deficiency in streamlining risk reporting and communication was the issue of generating a text-based report without visual representation.
4.5. Evaluation of Tool Features of ChatGPT’s Performance
- Evaluation of ChatGPT’s performance in KPI-12: It can be said that ChatGPT did not give consistent responses all the time. Even though it provided more accurate answers in trial 2 in comparison to trial 1, there were many times ChatGPT gave answers inconsistent with its previous answers. Therefore, the user tried to clarify the situation with follow-up questions within the conversation. ChatGPT can also make mistakes in basic calculations, which can lead decision makers to act with overlooked outputs if the mistakes and/or inconsistency escape their attention. With respect to this, the general performance of ChatGPT in providing consistent responses to similar questions or inputs was unfortunately found to be ineffective by the experts in each focus group session.
- Evaluation of ChatGPT’s performance in KPI-13: As an overall review, ChatGPT uses clear language. It is easy to understand ChatGPT’s responses, but it fails sometimes in sustaining detailed answers for specific circumstances.
- Evaluation of ChatGPT’s performance in KPI-14: In each trial, it was seen that ChatGPT’s performance in generating strategies based on risk allocation decisions for newly defined situations was satisfactory. However, its general performance in learning and adapting to new information seemed to be not so satisfying since it was observed by the experts that, in some cases, ChatGPT did not learn from the information given during the conversation. As a result, users tried to clarify the situation with follow-up questions within the conversation.
- Evaluation of ChatGPT’s performance in KPI-15: With this aim, a set of five questions (one question from each of the five sub-processes was chosen) was asked to ChatGPT in the author’s native language. The logic behind executing a conversation in a different language was to assess the ability of ChatGPT to handle multi-language input. It can be said that ChatGPT’s performance in handling input in different languages was high, but its performance in sustaining precise output in multiple languages was very low. This is the reason ChatGPT provided totally different responses when the same questions were asked in English and in another language.
- Evaluation of ChatGPT’s performance in KPI-16: Users can use the chosen ChatGPT platform for free without needing to log in. Users are expected to enter their questions into a chat box in text format. From this perspective, the chosen ChatGPT platform is very easy to use. Through the conversation, if there is a misspelling, it allows the users to correct it. Additionally, there is a sidebar that allows the users to access their chat logs. Users can continue to carry on their conversation whenever they want. On the other hand, users are not allowed to insert or extract visual information/data (jpeg, png, dwg, etc.). The conversation history cannot be saved as a Word document or in pdf. format for the chosen platform. One of the deficiencies of ChatGPT in risk management was found to be related to data representation. In risk management, frequently used data representation techniques include “probability and impact matrix” and “hierarchical charts”. As a text-based tool, ChatGPT did not provide templates, matrix, hierarchical charts, etc. with visual presentation. Therefore, users have to convert text-based information/responses to better suit their specific needs. When it is considered that a picture paints a thousand words, ChatGPT lacks in using the power of visuals.
- Evaluation of ChatGPT’s performance in KPI-17: With this aim, a set of five questions (one question from each of the five sub-processes was chosen) was asked to the same ChatGPT platform that was used in the previous trials on a mobile device. For the trial, which was carried out on the mobile device, it provided the exact same answers as that carried out on a desktop computer.
- Evaluation of ChatGPT’s performance in KPI-18: Through all trials, ChatGPT always underlined that as an AI language model, its responses were generated based on a mixture of licensed data, data created by human trainers, and publicly available data. In addition, it had not been directly trained on specific industry reports, best practices, or regulatory frameworks. Through all trials, ChatGPT also did not refer to any best practices in the industry, and did not mention the exact names of any industry standards, regulatory framework related to PPP-type transportation or energy project that can be used by the professions dealing with risk management. It only referred to risk management standards, such ISO 31000 [41] and COSO ERM [53], from the risk management perspective. As a conclusion, all these inefficiencies make ChatGPT’s compliance with industry standards and best practices questionable.
- Evaluation of ChatGPT’s performance in KPI-19: Based on the performed trials and focus group evaluations, it can be inferred that ChatGPT generally provided general insights in terms of commonly known information and widely accepted practices. But if more precise and specific information was provided, it has the potential to give accurate answers in the context of complex scenarios. For example, in trial 1, ChatGPT was asked to perform a risk analysis using the decision tree method with a given complex scenario, as follows, and then asked to detect the riskier seller. “The contractor first assesses the options available regarding the outsourcing of the construction materials. Here he has two options. He can either outsource from a native seller or an overseas seller. As per each option, there are two potential outcomes. While the native seller would allow the contractor to personally inspect, it is costlier. On the other hand, the overseas seller might be cheaper, but the travel expenses won’t allow inspection of materials. While the Expected Monetary Value (EMV) for native sellers is 80,000 with an 80 percent chance of success. When the EMV of the overseas seller is calculated, with merely a 50 percent chance of success, the loss EMV is 15,000.” ChatGPT did a pretty good job of analyzing both situations using a decision tree. It constructed a decision tree based on the information provided and detected that the native seller option would be less risky compared to the overseas seller option.
5. Conclusions
5.1. Implications for Researchers
5.2. Implications for Construction Industry
5.3. Limitations and Directions for Future Research
Supplementary Materials
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Al-Bahar, J.; Crandall, K.C. Systematic Risk Management Approach for Construction Projects. J. Constr. Eng. Manag. 1990, 116, 533–546. [Google Scholar] [CrossRef]
- Akintoye, A.; MacLeod, M.J. Risk Analysis and Management in Construction. Int. Journey Proj. Manag. 1997, 15, 31–38. [Google Scholar] [CrossRef]
- Nobanee, H.; Al Hamadi, F.Y.; Abdulaziz, F.A.; Abukarsh, L.S.; Alqahtani, A.F.; AlSubaey, S.K.; Almansoori, H.A. A bibliometric analysis of sustainability and risk management. Sustainability 2021, 13, 3277. [Google Scholar] [CrossRef]
- Wang, S.Q.; Dulaimi, M.F.; Aguria, M.Y. Risk Management Framework for Construction Projects in Developing Countries. Constr. Manag. Econ. 2004, 22, 237–252. [Google Scholar] [CrossRef]
- Cardona, O.D. The need for rethinking the concepts of vulnerability and risk from a holistic perspective: A necessary review and criticism for effective risk management. In Mapping Vulnerability; Routledge: London, UK, 2013; pp. 37–51. [Google Scholar]
- Auth, G.; Jöhnk, J.; Wiecha, D.A. A Conceptual Framework for Applying Artificial Intelligence in Project Management. In Proceedings of the IEEE 23rd Conference on Business Informatics (CBI), Bolzano, Italy, 1–3 September 2021; pp. 161–170. [Google Scholar]
- Regona, M.; Yigitcanlar, T.; Xia, B.; Li, R.Y.M. Opportunities and adoption challenges of AI in the construction industry: A PRISMA review. J. Open Innov. Technol. Mark. Complex. 2022, 8, 45. [Google Scholar] [CrossRef]
- Choi, S.J.; Choi, S.W.; Kim, J.H.; Lee, E. AI and Text-Mining Applications for Analyzing Contractor’s Risk in Invitation to Bid (ITB) and Contracts for Engineering Procurement and Construction (EPC) Projects. Energies 2021, 14, 4632. [Google Scholar] [CrossRef]
- Khodabakhshian, A.; Puolitaival, T.; Kestle, L. Deterministic and Probabilistic Risk Management Approaches in Construction Projects: A Systematic Literature Review and Comparative Analysis. Buildings 2023, 13, 1312. [Google Scholar] [CrossRef]
- Chenya, L.; Aminudin, E.; Mohd, S.; Yap, L.S. Intelligent risk management in construction projects: Systematic Literature Review. IEEE Access 2022, 10, 72936–72954. [Google Scholar] [CrossRef]
- Afzal, F.; Yunfei, S.; Nazir, M.; Bhatti, S.M. A review of artificial intelligence-based risk assessment methods for capturing complexity-risk interdependencies: Cost overrun in construction projects. Int. J. Manag. Proj. Bus. 2021, 14, 300–328. [Google Scholar] [CrossRef]
- Pan, Y.; Zhang, L. Roles of artificial intelligence in construction engineering and management: A critical review and future trends. Autom. Constr. 2021, 122, 103517. [Google Scholar] [CrossRef]
- Basaif, A.A.; Alashwal, A.M.; Mohd-Rahim, F.A.; Karim, S.B.A.; Loo, S.C. Technology awareness of artificial intelligence (AI) application for risk analysis in construction projects. Malays. Constr. Res. J. 2020, 9, 182–195. [Google Scholar]
- Al-Mhdawi, M.K.S.; Qazi, A.; Alzarrad, A.; Dacre, N.; Rahimian, F.; Buniya, M.K.; Zhang, H. Expert Evaluation of ChatGPT Performance for Risk Management Process Based on ISO 31000 Standard. 2023. Available online: https://papers.ssrn.com/sol3/papers.cfm?abstract_id=4504409 (accessed on 17 August 2023).
- Weng, J.C. Putting Intellectual Robots to Work: Implementing Generative AI Tools in Project Management; NYU SPS Applied Analytics Laboratory: New York, NY, USA, 2023; Available online: https://archive.nyu.edu/handle/2451/69531 (accessed on 17 August 2023).
- Schwarz, I.J.; Sánchez, I.P.M. Implementation of Artificial Intelligence into Risk Management Decision-Making Processes in Construction Projects; Universität der Bundeswehr München, Institut für Baubetrieb: Neubiberg, Germany, 2015; pp. 357–378. [Google Scholar]
- Aziz, S.; Dowling, M. Machine learning and AI for risk management. In Disrupting Finance: FinTech and Strategy in the 21st Century; Springer: Berlin/Heidelberg, Germany, 2019; pp. 33–50. [Google Scholar]
- Baryannis, G.; Validi, S.; Dani, S.; Antoniou, G. Supply chain risk management and artificial intelligence: State of the art and future research directions. Int. J. Prod. Res. 2019, 57, 2179–2202. [Google Scholar] [CrossRef]
- Gevaert, C.M.; Carman, M.; Rosman, B.; Georgiadou, Y.; Soden, R. Fairness and accountability of AI in disaster risk management: Opportunities and challenges. Patterns 2021, 2, 100363. [Google Scholar] [CrossRef]
- Khatib, E.; ZM, R.; Al-Nakeeb, A. The effect of AI on project and risk management in health care industry projects in the United Arab Emirates (UAE). Int. J. Appl. Eng. Res. 2021, 6, 1–9. [Google Scholar]
- Rodríguez-Espíndola, O.; Chowdhury, S.; Dey, P.K.; Albores, P.; Emrouznejad, A. Analysis of the adoption of emergent technologies for risk management in the era of digital manufacturing. Technol. Forecast. Soc. Chang. 2022, 178, 121562. [Google Scholar] [CrossRef]
- Abioye, S.O.; Oyedele, L.O.; Akanbi, L.; Ajayi, A.; Delgado, J.M.D.; Bilal, M.; Ahmed, A. Artificial intelligence in the construction industry: A review of present status, opportunities and future challenges. J. Build. Eng. 2021, 44, 103299. [Google Scholar] [CrossRef]
- Prieto, S.A.; Mengiste, E.T.; García de Soto, B. Investigating the use of ChatGPT for the scheduling of construction projects. Buildings 2023, 13, 857. [Google Scholar] [CrossRef]
- Franceschini, F.; Maisano, D.; Mastrogiacomo, L. Empirical analysis and classification of database errors in Scopus and Web of Science. J. Informetr. 2016, 10, 933–953. [Google Scholar] [CrossRef]
- Akinosho, T.D.; Oyedele, L.O.; Bilal, M.; Ajayi, A.O.; Delgado, M.D.; Akinade, O.O.; Ahmed, A.A. Deep learning in the construction industry: A review of present status and future innovations. J. Build. Eng. 2020, 32, 101827. [Google Scholar] [CrossRef]
- Locatelli, M.; Seghezzi, E.; Pellegrini, L.; Tagliabue, L.C.; Di Giuda, G.M. Exploring natural language processing in construction and integration with building information modeling: A scientometric analysis. Buildings 2021, 11, 583. [Google Scholar] [CrossRef]
- Erfani, A.; Cui, Q. Predictive risk modeling for major transportation projects using historical data. Autom. Constr. 2022, 139, 104301. [Google Scholar] [CrossRef]
- Tirmizi, S.A.A.; Arif, F. Conceptual Approach for the Use of Artificial Intelligence for Contractual Risk Assessment in Infrastructure Projects. Eng. Proc. 2022, 22, 12. [Google Scholar]
- Bigham, G.F.; Adamtey, S.; Onsarigo, L.; Jha, N. Artificial Intelligence for Construction Safety: Mitigation of the Risk of Fall. In Proceedings of the SAI Intelligent Systems Conference, London, UK, 6–7 September 2018. [Google Scholar]
- Poh, C.Q.; Ubeynarayana, C.U.; Goh, Y.M. Safety leading indicators for construction sites: A machine learning approach. Autom. Constr. 2018, 93, 375–386. [Google Scholar] [CrossRef]
- Koc, K.; Ekmekcioğlu, Ö.; Gurgun, A.P. Integrating feature engineering, genetic algorithm and tree-based machine learning methods to predict the post-accident disability status of construction workers. Autom. Constr. 2021, 131, 103896. [Google Scholar] [CrossRef]
- Yaseen, Z.M.; Ali, Z.H.; Salih, S.Q.; Al-Ansari, N. Prediction of risk delay in construction projects using a hybrid artificial intelligence model. Sustainability 2020, 12, 1514. [Google Scholar] [CrossRef]
- Phasha, C. The Impact of Artificial Intelligence on Cost Overruns and Risk Management in Construction Project Management. Ph.D. Thesis, University of Johannesburg, Johannesburg, South Africa, 2022. [Google Scholar]
- Cheng, M.Y.; Peng, H.S.; Wu, Y.W.; Chen, T.L. Estimate at completion for construction projects using evolutionary support vector machine inference model. Autom. Constr. 2010, 19, 619–629. [Google Scholar] [CrossRef]
- Awad, A.; Fayek, A.R. A decision support system for contractor prequalification for surety bonding. Autom. Constr. 2012, 21, 89–98. [Google Scholar] [CrossRef]
- Forbes, D.R.; Smith, S.D.; Horner, R.M.W. The selection of risk management techniques using case-based reasoning. Civ. Eng. Environ. Syst. 2010, 27, 107–121. [Google Scholar] [CrossRef]
- Xue, Z.; Xu, C.; Xu, X. Application of ChatGPT in natural disaster prevention and reduction. Nat. Hazards Res. 2023, 3, 556–562. [Google Scholar] [CrossRef]
- Uddin, S.J.; Albert, A.; Ovid, A.; Alsharef, A. Leveraging ChatGPT to Aid Construction Hazard Recognition and Support Safety Education and Training. Sustainability 2023, 15, 7121. [Google Scholar] [CrossRef]
- Hofert, M. Assessing ChatGPT’s Proficiency in Quantitative Risk Management. 2023. Available online: https://papers.ssrn.com/sol3/papers.cfm?abstract_id=4444104 (accessed on 17 August 2023).
- Klepo, M.S.; Knežević, D.; Knežević, T.; Meštrović, H. Artificial Intelligence in Risk Management System on Infrastructure Projects. In Creative Construction e-Conference 2023; Budapest University of Technology and Economics: Budapest, Hungary, 2023; pp. 208–214. [Google Scholar]
- ISO 31000-2018; Risk Management—Guidelines. International Organization for Standardization: Geneva, Switzerland, 2018.
- PMBOK. A Guide to the Project Management Body of Knowledge (PMBOK Guide)—Seventh Edition and the Standard for Project Management; Project Management Institute, Inc.: Newtown Square, PA, USA, 2021. [Google Scholar]
- PMBOK. A Guide to the Project Management Body of Knowledge (PMBOK Guide)—Sixth Edition and the Standard for Project Management; Project Management Institute, Inc.: Newtown Square, PA, USA, 2017. [Google Scholar]
- Hillson, D. Use a risk breakdown structure (RBS) to understand your risks. In Proceedings of the Project Management Institute Annual Seminars & Symposium, San Antonio, TX, USA, 7–8 October 2002. [Google Scholar]
- Abednego, M.P.; Ogunlana, S.O. Good project governance for proper risk allocation in public–private partnerships in Indonesia. Int. J. Proj. Manag. 2006, 24, 622–634. [Google Scholar] [CrossRef]
- Jin, X.H.; Zhang, G. Modelling optimal risk allocation in PPP projects using artificial neural networks. Int. J. Proj. Manag. 2011, 29, 591–603. [Google Scholar] [CrossRef]
- Ameyaw, E.E.; Chan, A.P. Risk allocation in public-private partnership water supply projects in Ghana. Constr. Manag. Econ. 2015, 33, 187–208. [Google Scholar] [CrossRef]
- Yang, W.; Dai, D. Concession Decision Model of BOT Projects Based on a Real Options Approach. In Proceedings of the International Conference on Management Science and Engineering, Lille, France, 6–7 October 2006; pp. 307–312. [Google Scholar]
- Chan, I.Y.S.; Leung, M.Y.; Yu, S.S.W. Managing the Stress of Hong Kong Expatriate Construction Professionals in Mainland China: Focus Group Study Exploring Individual Coping Strategies and Organizational Support. J. Constr. Eng. Manag. 2012, 138, 1150–1160. [Google Scholar] [CrossRef]
- Wibeck, V.; Dahlgren, M.A.; Öberg, G. Learning in Focus Groups An Analytical Dimension For Enhancing Focus Group Research. Qual. Res. 2007, 7, 249–267. [Google Scholar] [CrossRef]
- Saaty, T.L. Decision Making—The Analytic Hierarchy and Network Processes (AHP/ANP). J. Syst. Sci. Syst. Eng. 2004, 13, 1–35. [Google Scholar] [CrossRef]
- Ishizaka, A.; Labib, A. Review of the Main Developments in the Analytic Hierarchy Process. Expert Syst. Appl. 2011, 38, 14336–14345. [Google Scholar] [CrossRef]
- COSO ERM-2017; COSO Enterprise Risk Management-Integrating with Strategy and Performance. Committee of Sponsoring Organizations: Englewood Cliffs, NJ, USA, 2017.
- Heimann, J.F. Contingency planning as a necessity. In Proceedings of the Project Management Institute Annual Seminars & Symposium, Houston, TX, USA, 7–16 September 2000. [Google Scholar]
Machine Learning | Computer Vision | Knowledge-Based SYSTEMS (KBS) | Optimization |
---|---|---|---|
|
|
|
|
Robotics | Natural Language Processing (NLP) | Automated Planning and Scheduling | |
|
|
|
Risk Identification | KPI-1 | Accuracy of risk identification | The ability of ChatGPT to effectively recognize and assess potential risks that could impact the construction project. |
KPI-2 | Accuracy of generating relationships among identified risks | The extent to which ChatGPT can capture complexity–risk interdependencies and correlate identified risks in terms of their interactions. | |
KPI-3 | Ability to generate Risk Breakdown Structure (RBS) | The extent to which ChatGPT can break down the risks of a project, as a hierarchical outline of risk. | |
KPI-4 | Ability to generate new risk(s) in correspondence with new circumstances | The ability of ChatGPT to recognize and assess potential risks in terms of new circumstances and/or emerging trends within the construction industry. | |
Risk Analysis | KPI-5 | Ability for risk assessment and prioritization | The degree to which ChatGPT consistently evaluates and prioritizes risks in accordance with the construction project’s objectives, considering factors such as probability and impact. |
Risk Response | KPI-6 | Ability to provide relevant risk responses | The ability of ChatGPT to propose relevant and effective risk response strategies (such as escalation, avoidance, transfer, mitigation, or acceptance) that align with the specific requirements of the construction project. |
KPI-7 | Ability to provide proper risk allocation decisions | The extent to which ChatGPT can specify the accurate stakeholder that should undertake the risk based on industry trends, project-specific factors, etc. | |
KPI-8 | Ability to generate contingent response strategies (mitigation strategies) | Risk mitigation refers to the risk handling strategy used to eliminate or lessen the likelihood and/or consequence of a risk. In this sense, this pertains to the extent to which ChatGPT can aid project managers in generating contingent response strategies for prioritized risks (such as contingency plans, removal of high-risk elements of scope from the project, etc.), providing a proactive approach to addressing potential issues. | |
Risk Monitoring | KPI-9 | Ability to provide supportive suggestions for how to monitor risk | The extent to which ChatGPT can support project managers in monitoring identified risks, assessing the efficacy of risk response strategies, and providing recommendations for adjustments when necessary. |
KPI-10 | Flexibility to customize risk management processes | The degree to which ChatGPT outputs can be tailored and customized to meet the specific requirements and objectives of the construction project. | |
KPI-11 | Streamlining risk reporting and communication | The extent to which ChatGPT can produce concise and informative risk reports for stakeholders, ensuring they are consistently informed about the project’s risk profile. | |
Risk Management Tool Features | KPI-12 | Consistency of responses | The degree of consistency in ChatGPT’s responses when presented with similar questions or inputs. |
KPI-13 | Clarity of communication | The degree of clarity with which ChatGPT can effectively communicate its responses to the user, encompassing factors such as language choice and the level of detail provided. | |
KPI-14 | Ability to learn and adapt to new information | The degree of ChatGPT’s ability to assimilate new information and adapt its responses accordingly. | |
KPI-15 | Ability to handle multi-language input | The degree of ChatGPT’s ability to process input in various languages, which can be valuable for international construction projects. | |
KPI-16 | Ease of use | The degree of user-friendliness and intuitiveness in ChatGPT, facilitating its easy utilization by project team members for risk management purposes. | |
KPI-17 | Compatibility with different devices and platforms | The degree of compatibility of ChatGPT with various devices and platforms, including desktop computers, mobile devices, or cloud-based platforms. | |
KPI-18 | Compliance with industry standards and best practices | The degree of alignment between ChatGPT’s risk management processes, and industry standards, and best practices specific to the construction industry. | |
KPI-19 | Ability to generate data with complex scenarios | The extent to which ChatGPT can give accurate answers in the context of a complex scenario. |
Respondents | Profession | Academic Background | Position | Experience (on Yearly Basis) |
---|---|---|---|---|
Respondent 1 | Civil Engineer | Civil Engineering, Ph.D. | Project manager | 12 years |
Respondent 2 | Civil Engineer | Construction Management, Ph.D. | Academician | 10 years |
Focus Group Session (FGS) | Expert | Position | Industrial Experience | Experience in PPP Projects | Academic Background |
---|---|---|---|---|---|
FGS-1 for PPP Transportation Project | Expert 1 | Project Manager | 12 years | 7 years | Civil Engineering, M.Sc. |
Expert 2 | Technical Manager | 8 years | 3 years | Construction Management, M.Sc. | |
Expert 3 | Planning and Cost Control Executive Manager | 12 years | 5 years | Civil Engineering, Ph.D. | |
Expert 4 | Project Manager | 10 years | 5 years | Civil Engineering, B.Sc. | |
Expert 5 | Deputy Director Contracts and Administrative | 15 years | 9 years | Civil Engineering, M.Sc. | |
FGS-2 for PPP Energy Project | Expert 1 | Project Manager | 7 years | 4 years | Civil Engineering, M.Sc. |
Expert 2 | Contract Manager | 10 years | 3 years | Construction Management, Ph.D. | |
Expert 3 | Project Manager | 13 years | 7 years | Construction Management, Ph.D. | |
Expert 4 | Managing Director | 8 years | 6 years | Construction Management, M.Sc. | |
Expert 5 | General Manager | 16 years | 12 years | Civil Engineering, M.Sc. |
FGS-1 | FGS-2 | ||||
---|---|---|---|---|---|
Group Decision Score | Overall Score | Group Decision Score | Overall Score | ||
KPI-1 | 3/7 | 3.75 (10) | KPI-1 | 5/7 | 4.25 (10) |
KPI-2 | 4/7 | KPI-2 | 5/7 | ||
KPI-3 | 2/7 | KPI-3 | 2/7 | ||
KPI-4 | 6/7 | KPI-4 | 5/7 |
FGS-1 | FGS-2 | ||||
---|---|---|---|---|---|
Group Decision Score | Overall Score | Group Decision Score | Overall Score | ||
KPI-5 | 1/7 | 1 (16) | KPI-5 | 2/7 | 2 (16) |
FGS-1 | FGS-2 | ||||
---|---|---|---|---|---|
Group Decision Score | Overall Score | Group Decision Score | Overall Score | ||
KPI-6 | 2/7 | 4.33 (6) | KPI-6 | 3/7 | 5.33 (6) |
KPI-7 | 5/7 | KPI-7 | 6/7 | ||
KPI-8 | 6/7 | KPI-8 | 7/7 |
FGS-1 | FGS-2 | ||||
---|---|---|---|---|---|
Group Decision Score | Overall Score | Group Decision Score | Overall Score | ||
KPI-9 | 6/7 | 4.33 (4) | KPI-9 | 4/7 | 4 (4) |
KPI-10 | 3/7 | KPI-10 | 3/7 | ||
KPI-11 | 4/7 | KPI-11 | 5/7 |
Performance of ChatGPT’ Tool Features Based on Author’s Experience | ||||
---|---|---|---|---|
KPI-12 | 3/7 | KPI-16 | 6/7 | 4.5 |
KPI-13 | 5/7 | KPI-17 | 7/7 | |
KPI-14 | 3/7 | KPI-18 | 2/7 | |
KPI-15 | 6/7 | KPI-19 | 4/7 |
Risk Identification | Risk Analysis | Risk Response | Risk Monitoring | Risk Management Tool Features | ||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Group Decision Score | Overall Score | Group Decision Score | Overall Score | Group Decision Score | Overall Score | Group Decision Score | Overall Score | Overall Score | ||||||||
FGS-1 | KPI-1 | 3/7 | 3.75 (10) | KPI-5 | 1/7 | 1 (16) | KPI-6 | 2/7 | 4.33 (6) | KPI-9 | 6/7 | 4.33 (4) | Based on author’s experience | KPI-12 | 3/7 | 4.5 |
KPI-2 | 4/7 | KPI-7 | 5/7 | KPI-10 | 3/7 | KPI-13 | 5/7 | |||||||||
KPI-3 | 2/7 | KPI-8 | 6/7 | KPI-11 | 4/7 | KPI-14 | 3/7 | |||||||||
KPI-4 | 6/7 | KPI-15 | 6/7 | |||||||||||||
FGS-2 | KPI-1 | 5/7 | 4.25 (10) | KPI-5 | 2/7 | 2 (16) | KPI-6 | 3/7 | 5.33 (6) | KPI-9 | 4/7 | 4 (4) | KPI-16 | 6/7 | ||
KPI-2 | 5/7 | KPI-7 | 6/7 | KPI-10 | 3/7 | KPI-17 | 7/7 | |||||||||
KPI-3 | 2/7 | KPI-8 | 7/7 | KPI-11 | 5/7 | KPI-18 | 2/7 | |||||||||
KPI-4 | 5/7 | KPI-19 | 4/7 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Aladağ, H. Assessing the Accuracy of ChatGPT Use for Risk Management in Construction Projects. Sustainability 2023, 15, 16071. https://doi.org/10.3390/su152216071
Aladağ H. Assessing the Accuracy of ChatGPT Use for Risk Management in Construction Projects. Sustainability. 2023; 15(22):16071. https://doi.org/10.3390/su152216071
Chicago/Turabian StyleAladağ, Hande. 2023. "Assessing the Accuracy of ChatGPT Use for Risk Management in Construction Projects" Sustainability 15, no. 22: 16071. https://doi.org/10.3390/su152216071
APA StyleAladağ, H. (2023). Assessing the Accuracy of ChatGPT Use for Risk Management in Construction Projects. Sustainability, 15(22), 16071. https://doi.org/10.3390/su152216071