A Retrieval-Augmented Generation Approach for Data-Driven Energy Infrastructure Digital Twins
Abstract
:Highlights
- Novel data-driven and knowledge-based energy digital-twin framework integrating a Retrieval-Augmented Generation (RAG) approach.
- Prototype applied to a real-world scenario involving the management of high-voltage energy infrastructures, showcasing the framework feasibility and effectiveness in operational environments.
- Improved management of energy infrastructures enhancing the ability to predict future conditions and prescribe more informed and data-driven decisions in asset maintenance.
- Exploitation of a conversational virtual assistant to interact with users, improving the accessibility, interpretability, and usability of complex data for decision-makers.
Abstract
1. Introduction
- RA1. Enhancing grid management through DTs and data-driven approaches: By examining the integration of digital twins and innovative data visualization techniques with data-driven methods, the research highlights how this combination improves the monitoring of energy infrastructures. This vision aims to enable more accurate real-time assessments and failure prevention procedures.
- RA2. Leveraging Large Language Models (LLMs) for decision support: The study explores the challenges and benefits of utilizing LLMs in energy infrastructure. Key challenges include data integration and data modeling and ensuring accuracy in domain-specific tasks in order to obtain enhanced decision-making capabilities and improved automation.
- RA3. Integrating Retrieval-Augmented Generation (RAG) approaches with Knowledge Graphs (KGs): The exploitation of a Resource Description Framework (RDF) [10] for data modeling in RAG is also a focus of this study. By structuring domain-specific knowledge, a KG improves the performance of RAG-based LLMs in delivering real-time, accurate information, thereby increasing the effectiveness of decision support systems.
- a reference architecture for energy infrastructure management and monitoring, integrating digital twins and data-driven methods;
- a Natural User Interface (NUI) extending conventional visualization and interaction paradigms with direct object retrieval and manipulation by means of a combination of gestures and a conversational virtual assistant;
- annotation of DT models and data streams into a RDF KG, exploited for RAG to provide real-time domain information to a general-purpose foundation LLM powering the conversational agent;
- characterization of each architectural module by means of a mapping with Commercial-Off-The-Shelf (COTS) components to enhance feasibility while simultaneously lowering development costs and time to market;
- a cloud-based prototype implementation integrating open-source software technologies and tools;
- a case study regarding a section of the high-voltage network in central Italy, showcasing key value propositions of the approach.
2. Background
2.1. Emerging Technologies and Digital Twins for Energy Infrastructures
2.2. Related Work
3. Framework Architecture
3.1. Sensor Networks and Communication
- Traditional sensors: These generate signals enabling the digital twin to acquire operational and environmental data from real-world physical assets. They provide direct measurements of the variables related to the analyzed physical process, such as temperature and voltage.
- Process Analytical Technology (PAT) sensors: Similar to traditional sensors, PAT sensors record data from sampling points. Collected data cannot be directly interpreted as with traditional sensors. Typically, statistical and real-time observation models are employed to interpret the signal. A near-infrared spectrometer is an example of PAT sensor.
- Actuators: These control and manipulate physical elements or systems in response to signals or commands received from control units. They usually translate digital or analog signals into mechanical, electrical, hydraulic, or pneumatic actions to effect changes in the physical environment.
- Gateways: These receive data locally from the sensors and perform protocol translation tasks, converting data from diverse protocols into a standardized format. Gateways may also perform preliminary data processing tasks, such as filtering, normalization, and data cleaning, to enhance the quality and integrity of the collected data. This processing helps mitigate noise and reduce bandwidth consumption. They implement authentication mechanisms and encryption protocols and access control policies to enforce security measures and data protection.
- Controllers: These handle communication tasks between actuator devices and other network components, such as gateways. They receive the control actions from the gateways, forward messages to the actuators connected to the physical assets, and manage task assignment to facilitate reliable and efficient communication within the energy network.
3.2. Data Management and Storage Techniques
- Digital Twins Definition Language (DTDL) ontologies for energy grid (https://github.com/Azure/opendigitaltwins-energygrid, accessed on 8 September 2024) and smart cities (https://github.com/Azure/opendigitaltwins-smartcities, accessed on 8 September 2024), specifically proposed for modeling DT solutions, including the monitoring of grid and urban assets (concepts in blue in Figure 3);
- Smart Energy Aware Systems (SEAS) ontology (https://w3id.org/seas/, accessed on 8 September 2024) [43], designed as a set of simple core ontology patterns modeling multiple engineering-related concepts and properties of the energy ecosystem (concepts in violet in Figure 3);
- Procedure Execution ontology (PEP) (https://w3id.org/pep/, accessed on 8 September 2024), including the properties used to describe procedures, outputs, and results related to metering activities;
- DBpedia [44] resources (DBR) and GeoNames (GN) ontology (https://www.geonames.org/ontology/, accessed on 8 September 2024) have been exploited to model elements of the IT network and the relationships between asset locations (concepts in yellow in Figure 3).
3.3. Data Processing and Analytics
- Fault detection and diagnosis: identify possible malfunctions of the energy grid and monitor the health and performance of grid assets such as transformers, generators, or substations;
- Preventive maintenance: based on grid equipment structural and behavioral models, analyze status data series and predict when maintenance should be performed to prevent failures and minimize downtime;
- Load forecasting and balancing: predict future electricity demand based on historical data and balance the load across different parts of the grid to ensure stable operation and prevent overloading or congestion;
- Demand response optimization: optimize demand response programs to manage electricity consumption during peak periods or in response to pricing signals.
- Batch processing for large volumes of data and group of transactions. Usually, data is collected, inserted, and processed to produce results automatically. Tasks are scheduled over even medium to long periods of time;
- Real-time processing for tasks requiring a rapid response and performed by simple systems managing rapid transactions. Mainly used in environments where many events occur in a short time.
- What-if analysis [49]: one or more parameters of the reference scenario are expected to be changed and, after the execution of multiple simulation sessions, results are stored to allow comparisons and sensitivity analysis;
- Goal-seeking analysis [50]: a simulation objective is defined in terms of observable variables or Key Performance Indicators (KPIs), and the system will provide support in defining scenarios aimed to identify and evaluate suitable ways to obtain the specific result.
3.4. Data Visualization and Digital Twins
- 3D models representing cyber-physical objects. Users can access, view, and modify assets through their 3D virtual representations;
- Interactive dashboards and business intelligence applications used to monitor in real-time the whole system and show the results of data processing, relevant KPIs, and warnings received in the presence of dangerous conditions;
- NUIs that enable users to interact with the platform using intuitive movements and gestures [51]. Natural User Interface are designed to enhance user experience by reducing the learning curve and enabling a seamless human–machine integration.
3.5. System Monitoring and Security
- Access control: multi-factor authentication and role-based access control functionalities, based on the OpenID Connect (OIDC) [54] protocol, are implemented to ensure that only authorized personnel can access sensitive data and critical infrastructure components such as substations, control systems, and datacenters;
- Continuous monitoring: a service orchestrator aggregates and analyzes network traffic and log data collected from various grid components, enabling security teams to identify performance issues, suspicious activities, or potential security breaches;
- Vulnerability management: regular vulnerability assessments and patch management procedures are conducted to identify and remediate security issues in grid components and software applications. This includes monitoring for security updates and patches released by vendors and promptly applying them to mitigate potential risks;
- Incident response planning: defining strict procedures, roles, and responsibilities for effectively managing and mitigating security incidents when they occur;
- Compliance and regulations: ensuring compliance with industry standards is essential for maintaining the security and reliability of the power grid. Compliance requirements include conducting regular security audits, implementing security controls, and reporting security incidents to regulatory authorities.
4. Case Study: Enhancing Grid Reliability Through Digital Twins
4.1. Prototype Implementation
4.2. Key Functionality Indicators
- (RA1) Enhancing grid management through DTs and data-driven approaches:
- a.
- real-time data integration to provide updates and assessments about the grid infrastructure and to represent the asset status in a intuitive and easily interpretable format;
- b.
- DT replication of real-world infrastructure in terms of asset behaviors and system processes;
- c.
- failure and anomaly prevention to detect irregularities within the grid infrastructure that could lead to performance issues;
- d.
- creation of a virtual representation of physical grid components that mirrors real-time operational data and enables remote control functionalities.
- (RA2) Leveraging LLMs for decision support:
- a.
- exploitation of pre-trained models on domain-specific tasks requiring specialized language, terminology, and concepts of the energy sector;
- b.
- understanding and generating textual content related to energy infrastructures for responding to complex, multi-faceted queries related to grid management;
- c.
- provide comprehensible explanations related to the generated content and insights, increasing operator trust and understanding of its outputs;
- d.
- LLM integration into existing operational workflows and automation systems within the energy infrastructure, in order to interact with other data management tools and platforms.
- (RA3) Integrating RAG approaches with knowledge graphs:
- a.
- reference data model for annotating data in the knowledge graph, ensuring it accurately represents real-world relationships and supports relevant queries;
- b.
- integration of data from various sources (e.g., IoT sensors, databases, log files) into the KG, expanding the scope of the RAG framework;
- c.
- provide contextually relevant and domain-specific results, improving the accuracy of generated responses and reducing errors.
4.3. Case Study Analysis
- System administrators access the web platform and display the whole energy grid, composed of power plants, substations, lines and poles, as depicted in Figure 7. For each power station, geographic coordinates are extracted from OpenStreetMap (OSM) [57] and queried via the Overpass API (https://wiki.openstreetmap.org/wiki/Overpass_API) (RA1.a). All data related to the grid infrastructure are stored in the reference KG (RA3.a, RA3.b).
- In the presence of warnings detected through ML algorithms, a red icon is shown on the map to highlight potential anomalies on the specific node (RA1.c).
- Each item can be selected in order to display detailed information and relevant properties, as well as alarm messages associated with the devices, hosted by a plant or asset. A list of available actions enabled on the platform is also shown, enabling the user to remotely interact with network-connected actuators. These actions provide direct remote control over the physical devices in the system, enabling tasks such as resetting equipment, adjusting set points, or initiating maintenance procedures (RA1.a, RA1.d).
- The system also integrates 2D models (Figure 8a), presenting selectable zones allowing users to open further representations of the same asset (RA1.b).
- Users interact with 3D models (Figure 8b) through a gesture-based NUI to perform actions like rotating the model, zooming in/out on specific components, or highlighting potential maintenance areas. Models respond dynamically, providing a realistic and immersive experience for exploring data regarding equipment conditions, structural components, and spatial relationships (RA1.b).
- The green button placed in the bottom right corner of the user interface opens a virtual assistant box (Figure 7). By means of simple requests in natural language (e.g., ”Show me detailed information about recent alarms at plant X”), administrators receive useful information about monitored devices and detected events, including alarm type, reference timestamp, affected components, and suggested actions for resolution (RA2.a, RA2.b).
- After reviewing the alarm information, in case of a critical issue requiring immediate or preventive maintenance, system administrators notify technicians. A warning message is automatically composed, indicating the nature of the issue and the urgency for maintenance action, and sent to the maintenance operators responsible for the plant (RA2.c, RA2.d).
- Once the maintenance task is successfully completed, operators close the issue and provide notes or additional insights gained during the process. The platform receives the confirmation, updates the maintenance record in real-time, and notifies administrators or maintenance coordinators (RA1.a, RA2.d).
- Information collected from maintenance completions are annotated to enrich the RDF-based KG (RA3.a) with valuable insights and contextual data useful for future analysis (RA1.c). Data are analyzed periodically to identify recurring issues, improve maintenance procedures, and optimize responses provided by the RAG-based solutions (RA3.c).
5. Discussions
5.1. Platform Evaluation
- Infrastructure assesses how each DT framework leverages computational resources and manages physical components within the infrastructure.
- Data Intelligence examines the platform capabilities in terms of data analysis to extract knowledge from data generated within the framework.
- Cybersecurity assesses the capabilities of each framework for monitoring and protecting against threats, to ensure data integrity and confidentiality within the DT environments.
- Visualization details the human–machine interface solutions of each proposal, including support for interactive GUIs, GIS maps, 3D models, and NUIs.
5.2. Challenges and Opportunities
- The design of the UI plays an important role in facilitating user interaction with the DT platform in order to create intuitive and user-friendly interfaces, providing relevant information, insights, and predictions. UIs should show information in a way that aligns with the users’ cognitive processes, facilitating quicker and more accurate decision-making in response to dynamic energy scenarios. Tailoring the interface to the specific needs and expertise of energy management professionals ensures efficient use and minimizes the learning curve associated with adopting new technologies [64]
- Increasing trust in EDT systems is paramount for their successful adoption. Human operators must have confidence in the accuracy and reliability of the DT’s representations. Research must increase focus on developing validation mechanisms, transparent communication of uncertainties, and incorporating user feedback to make these complex models more understandable to (non-expert) users and enhance trust in the system [65].
- The complexity of EDT platforms often requires comprehensive training and education programs for end-users. Energy management operators and decision-makers need to be familiar with the capabilities and functionalities of the DT. Ongoing training programs help users harness the full potential of the technology, make informed decisions, and troubleshoot issues effectively [66].
- As DTs collect and process large amounts of data, cybersecurity considerations become crucial, aiming to ensure responsible data use, privacy protection, and compliance with relevant regulations. In particular, EDTs must be treated as critical systems in which security issues need to be considered in terms of confidentiality, integrity, and availability of both data and resources, along with privacy issues with respect to entities as well as location and status of assets [67].
6. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Joseph, A.; Balachandra, P. Smart grid to energy internet: A systematic review of transitioning electricity systems. IEEE Access 2020, 8, 215787–215805. [Google Scholar] [CrossRef]
- Cronin, J.; Anandarajah, G.; Dessens, O. Climate change impacts on the energy system: A review of trends and gaps. Clim. Change 2018, 151, 79–93. [Google Scholar] [CrossRef]
- Schleussner, C.F.; Rogelj, J.; Schaeffer, M.; Lissner, T.; Licker, R.; Fischer, E.M.; Knutti, R.; Levermann, A.; Frieler, K.; Hare, W. Science and policy characteristics of the Paris Agreement temperature goal. Nat. Clim. Change 2016, 6, 827–835. [Google Scholar] [CrossRef]
- Dileep, G. A survey on smart grid technologies and applications. Renew. Energy 2020, 146, 2589–2625. [Google Scholar] [CrossRef]
- Statista. Smart Grid Market Value Worldwide 2022–2028. 2023. Available online: https://www.statista.com/study/111848/smart-grids-worldwide/ (accessed on 8 September 2024).
- Alotaibi, I.; Abido, M.A.; Khalid, M.; Savkin, A.V. A comprehensive review of recent advances in smart grids: A sustainable future with renewable energy resources. Energies 2020, 13, 6269. [Google Scholar] [CrossRef]
- Alasali, F.; Itradat, A.; Abu Ghalyon, S.; Abudayyeh, M.; El-Naily, N.; Hayajneh, A.M.; AlMajali, A. Smart Grid Resilience for Grid-Connected PV and Protection Systems under Cyber Threats. Smart Cities 2023, 7, 51–77. [Google Scholar] [CrossRef]
- Do Amaral, J.; Dos Santos, C.; Montevechi, J.; De Queiroz, A. Energy Digital Twin Applications: A Review. Renew. Sustain. Energy Rev. 2023, 188, 113891. [Google Scholar] [CrossRef]
- Manickam, R.; Vollmar, J.; Prabhakar, G.M. User Experience–Digital Twin Maturity Model (UX-DTMM). In Proceedings of the International Conference on Research into Design, Bangalore, India, 9–11 January 2023; pp. 877–889. [Google Scholar]
- Cyganiak, R.; Wood, D.; Lanthaler, M. RDF 1.1 Concepts and Abstract Syntax. W3C Recommendation, W3C. 2014. Available online: https://www.w3.org/TR/rdf11-concepts/ (accessed on 8 September 2024).
- Hossein Motlagh, N.; Mohammadrezaei, M.; Hunt, J.; Zakeri, B. Internet of Things (IoT) and the energy sector. Energies 2020, 13, 494. [Google Scholar] [CrossRef]
- Ahmadzadeh, S.; Parr, G.; Zhao, W. A review on communication aspects of demand response management for future 5G IoT-based smart grids. IEEE Access 2021, 9, 77555–77571. [Google Scholar] [CrossRef]
- Entezari, A.; Aslani, A.; Zahedi, R.; Noorollahi, Y. Artificial intelligence and machine learning in energy systems: A bibliographic perspective. Energy Strategy Rev. 2023, 45, 101017. [Google Scholar] [CrossRef]
- Omitaomu, O.A.; Niu, H. Artificial intelligence techniques in smart grid: A survey. Smart Cities 2021, 4, 548–568. [Google Scholar] [CrossRef]
- Cao, K.; Liu, Y.; Meng, G.; Sun, Q. An overview on edge computing research. IEEE Access 2020, 8, 85714–85728. [Google Scholar] [CrossRef]
- Minh, Q.N.; Nguyen, V.H.; Quy, V.K.; Ngoc, L.A.; Chehri, A.; Jeon, G. Edge Computing for IoT-Enabled Smart Grid: The Future of Energy. Energies 2022, 15, 6140. [Google Scholar] [CrossRef]
- Arcas, G.I.; Cioara, T.; Anghel, I.; Lazea, D.; Hangan, A. Edge Offloading in Smart Grid. Smart Cities 2024, 7, 680–711. [Google Scholar] [CrossRef]
- Ruta, M.; Scioscia, F.; Loseto, G.; Pinto, A.; Di Sciascio, E. Machine learning in the Internet of Things: A semantic-enhanced approach. Semant. Web 2019, 10, 183–204. [Google Scholar] [CrossRef]
- Dhaou, I.B. Design and Implementation of an Internet-of-Things-Enabled Smart Meter and Smart Plug for Home-Energy-Management System. Electronics 2023, 12, 4041. [Google Scholar] [CrossRef]
- Loconte, D.; Ieva, S.; Pinto, A.; Loseto, G.; Scioscia, F.; Ruta, M. Expanding the cloud-to-edge continuum to the IoT in serverless federated learning. Future Gener. Comput. Syst. 2024, 155, 447–462. [Google Scholar] [CrossRef]
- Aljadani, N.; Gazdar, T. A novel security architecture for WSN-based applications in smart grid. Smart Cities 2022, 5, 633–649. [Google Scholar] [CrossRef]
- Xu, Z.; Salehi Shahraki, A.; Rudolph, C. Blockchain-Based Malicious Behaviour Management Scheme for Smart Grids. Smart Cities 2023, 6, 3005–3031. [Google Scholar] [CrossRef]
- Yu, W.; Patros, P.; Young, B.; Klinac, E.; Walmsley, T.G. Energy digital twin technology for industrial energy management: Classification, challenges and future. Renew. Sustain. Energy Rev. 2022, 161, 112407. [Google Scholar] [CrossRef]
- Singh, M.; Fuenmayor, E.; Hinchy, E.P.; Qiao, Y.; Murray, N.; Devine, D. Digital twin: Origin to future. Appl. Syst. Innov. 2021, 4, 36. [Google Scholar] [CrossRef]
- Dietz, M.; Pernul, G. Digital twin: Empowering enterprises towards a system-of-systems approach. Bus. Inf. Syst. Eng. 2020, 62, 179–184. [Google Scholar] [CrossRef]
- Liu, S.; Lu, Y.; Shen, X.; Bao, J. A digital thread-driven distributed collaboration mechanism between digital twin manufacturing units. J. Manuf. Syst. 2023, 68, 145–159. [Google Scholar] [CrossRef]
- Gourisetti, S.N.G.; Bhadra, S.; Sebastian-Cardenas, D.J.; Touhiduzzaman, M.; Ahmed, O. A Theoretical Open Architecture Framework and Technology Stack for Digital Twins in Energy Sector Applications. Energies 2023, 16, 4853. [Google Scholar] [CrossRef]
- Liao, H.; Zhou, Z.; Liu, N.; Zhang, Y.; Xu, G.; Wang, Z.; Mumtaz, S. Cloud-Edge-Device Collaborative Reliable and Communication-Efficient Digital Twin for Low-Carbon Electrical Equipment Management. IEEE Trans. Ind. Inform. 2023, 19, 1715–1724. [Google Scholar] [CrossRef]
- Saad, A.; Faddel, S.; Mohammed, O. IoT-Based Digital Twin for Energy Cyber-Physical Systems: Design and Implementation. Energies 2020, 13, 4762. [Google Scholar] [CrossRef]
- Brosinsky, C.; Westermann, D.; Krebs, R. Recent and prospective developments in power system control centers: Adapting the digital twin technology for application in power system control centers. In Proceedings of the 2018 IEEE International Energy Conference (ENERGYCON), Limassol, Cyprus, 3–7 June 2018; pp. 1–6. [Google Scholar]
- Huang, J.; Zhao, L.; Wei, F.; Cao, B. The Application of Digital Twin on Power Industry. IOP Conf. Ser. Earth Environ. Sci. 2021, 647, 012015. [Google Scholar] [CrossRef]
- Ruhe, S.; Schaefer, K.; Branz, S.; Nicolai, S.; Bretschneider, P.; Westermann, D. Design and Implementation of a Hierarchical Digital Twin for Power Systems Using Real-Time Simulation. Electronics 2023, 12, 2747. [Google Scholar] [CrossRef]
- Liu, T.; Yu, H.; Yin, H.; Zhang, Z.; Sui, Z.; Zhu, D.; Gao, L.; Li, Z. Research and Application of Digital Twin Technology in Power Grid Development Business. In Proceedings of the 2021 6th Asia Conference on Power and Electrical Engineering (ACPEE), Chongqing, China, 8–11 April 2021; pp. 383–387. [Google Scholar]
- Zhang, G.; Huo, C.; Zheng, L.; Li, X. An Architecture Based on Digital Twins for Smart Power Distribution System. In Proceedings of the 2020 3rd International Conference on Artificial Intelligence and Big Data (ICAIBD), Chengdu, China, 28–31 May 2020; pp. 29–33. [Google Scholar]
- Uslar, M.; Rohjans, S.; Neureiter, C.; Pröstl Andrén, F.; Velasquez, J.; Steinbrink, C.; Efthymiou, V.; Migliavacca, G.; Horsmanheimo, S.; Brunner, H.; et al. Applying the smart grid architecture model for designing and validating system-of-systems in the power and energy domain: A European perspective. Energies 2019, 12, 258. [Google Scholar] [CrossRef]
- Zhang, J.; Ma, M.; Wang, P.; Sun, X.d. Middleware for the Internet of Things: A survey on requirements, enabling technologies, and solutions. J. Syst. Archit. 2021, 117, 102098. [Google Scholar] [CrossRef]
- Deng, S.; Zhao, H.; Fang, W.; Yin, J.; Dustdar, S.; Zomaya, A.Y. Edge intelligence: The confluence of edge computing and artificial intelligence. IEEE Internet Things J. 2020, 7, 7457–7469. [Google Scholar] [CrossRef]
- Loseto, G.; Scioscia, F.; Ruta, M.; Gramegna, F.; Ieva, S.; Fasciano, C.; Bilenchi, I.; Loconte, D. Osmotic cloud-edge intelligence for IoT-based cyber-physical systems. Sensors 2022, 22, 2166. [Google Scholar] [CrossRef]
- Velepucha, V.; Flores, P. A survey on microservices architecture: Principles, patterns and migration challenges. IEEE Access 2023, 11, 88339–88358. [Google Scholar] [CrossRef]
- Mishra, B.; Kertesz, A. The use of MQTT in M2M and IoT systems: A survey. IEEE Access 2020, 8, 201071–201086. [Google Scholar] [CrossRef]
- Li, L.; Chou, W.; Zhou, W.; Luo, M. Design patterns and extensibility of REST API for networking applications. IEEE Trans. Netw. Serv. Manag. 2016, 13, 154–167. [Google Scholar] [CrossRef]
- Bizer, C.; Heath, T.; Berners-Lee, T. Linked data-the story so far. In Linking the World’s Information: Essays on Tim Berners-Lee’s Invention of the World Wide Web; ACM: New York, NY, USA, 2023; pp. 115–143. [Google Scholar]
- Lefrançois, M. Planned ETSI SAREF Extensions based on the W3C&OGC SOSA/SSN-compatible SEAS Ontology Patterns. In Proceedings of the Workshop on Semantic Interoperability and Standardization in the IoT, SIS-IoT, Amsterdam, The Netherlands, 11–14 September 2017. [Google Scholar]
- Lehmann, J.; Isele, R.; Jakob, M.; Jentzsch, A.; Kontokostas, D.; Mendes, P.N.; Hellmann, S.; Morsey, M.; Van Kleef, P.; Auer, S.; et al. Dbpedia—A large-scale, multilingual knowledge base extracted from wikipedia. Semant. Web 2015, 6, 167–195. [Google Scholar] [CrossRef]
- Sawadogo, P.; Darmont, J. On data lake architectures and metadata management. J. Intell. Inf. Syst. 2021, 56, 97–120. [Google Scholar] [CrossRef]
- Roy, D.; Srivastava, R.; Jat, M.; Karaca, M.S. A complete overview of analytics techniques: Descriptive, predictive, and prescriptive. In Decision Intelligence Analytics and the Implementation of Strategic Business Management; Springer: Cham, Switzerland, 2022; pp. 15–30. [Google Scholar]
- Ahmad, T.; Madonski, R.; Zhang, D.; Huang, C.; Mujeeb, A. Data-driven probabilistic machine learning in sustainable smart energy/smart energy systems: Key developments, challenges, and future research opportunities in the context of smart grid paradigm. Renew. Sustain. Energy Rev. 2022, 160, 112128. [Google Scholar] [CrossRef]
- Alimi, O.A.; Ouahada, K.; Abu-Mahfouz, A.M. A review of machine learning approaches to power system security and stability. IEEE Access 2020, 8, 113512–113531. [Google Scholar] [CrossRef]
- Kegel, L.; Hahmann, M.; Lehner, W. Generating what-if scenarios for time series data. In Proceedings of the 29th International Conference on Scientific and Statistical Database Management, Chicago, IL, USA, 27–29 June 2017; pp. 1–12. [Google Scholar]
- Nguyen, T.N.; Gonzalez, C. Effects of decision complexity in goal-seeking gridworlds: A comparison of instance-based learning and reinforcement learning agents. In Proceedings of the 18th International Conference on Cognitive Modelling, Online, 20 July–1 August 2020. [Google Scholar]
- Jin, Y.; Ma, M.; Zhu, Y. A comparison of natural user interface and graphical user interface for narrative in HMD-based augmented reality. Multimed. Tools Appl. 2022, 81, 5795–5826. [Google Scholar] [CrossRef]
- Pan, S.; Luo, L.; Wang, Y.; Chen, C.; Wang, J.; Wu, X. Unifying large language models and knowledge graphs: A roadmap. IEEE Trans. Knowl. Data Eng. 2024, 36, 3580–3599. [Google Scholar] [CrossRef]
- Gunduz, M.Z.; Das, R. Cyber-security on smart grid: Threats and potential solutions. Comput. Netw. 2020, 169, 107094. [Google Scholar] [CrossRef]
- Sakimura, N.; Bradley, J.; Jones, M.; De Medeiros, B.; Mortimore, C. OpenID Connect Core 1.0 incorporating errata set 2. OpenID Foundation Specification. 2023. Available online: https://openid.net/specs/openid-connect-core-1_0.html (accessed on 21 October 2024).
- Achiam, J.; Adler, S.; Agarwal, S.; Ahmad, L.; Akkaya, I.; Aleman, F.L.; Almeida, D.; Altenschmidt, J.; Altman, S.; Anadkat, S.; et al. Gpt-4 technical report. arXiv 2023, arXiv:2303.08774. [Google Scholar]
- Touvron, H.; Martin, L.; Stone, K.; Albert, P.; Almahairi, A.; Babaei, Y.; Bashlykov, N.; Batra, S.; Bhargava, P.; Bhosale, S.; et al. Llama 2: Open foundation and fine-tuned chat models. arXiv 2023, arXiv:2307.09288. [Google Scholar]
- Haklay, M.; Weber, P. Openstreetmap: User-generated street maps. IEEE Pervasive Comput. 2008, 7, 12–18. [Google Scholar] [CrossRef]
- Adel, A. Unlocking the future: Fostering human–machine collaboration and driving intelligent automation through industry 5.0 in smart cities. Smart Cities 2023, 6, 2742–2782. [Google Scholar] [CrossRef]
- Veichtlbauer, A.; Ortmayer, M.; Heistracher, T. OPC UA integration for field devices. In Proceedings of the 2017 IEEE 15th International Conference on Industrial Informatics (INDIN), Emden, Germany, 24–26 July 2017; pp. 419–424. [Google Scholar]
- Mosteiro-Sanchez, A.; Barcelo, M.; Astorga, J.; Urbieta, A. Securing IIoT using defence-in-depth: Towards an end-to-end secure industry 4.0. J. Manuf. Syst. 2020, 57, 367–378. [Google Scholar] [CrossRef]
- Teerakanok, S.; Uehara, T.; Inomata, A. Migrating to zero trust architecture: Reviews and challenges. Secur. Commun. Netw. 2021, 2021, 1–10. [Google Scholar] [CrossRef]
- Neumann, W.P.; Winkelhaus, S.; Grosse, E.H.; Glock, C.H. Industry 4.0 and the human factor–A systems framework and analysis methodology for successful development. Int. J. Prod. Econ. 2021, 233, 107992. [Google Scholar] [CrossRef]
- Agrawal, A.; Thiel, R.; Jain, P.; Singh, V.; Fischer, M. Digital Twin: Where do humans fit in? Autom. Constr. 2023, 148, 104749. [Google Scholar] [CrossRef]
- Nardo, M.; Forino, D.; Murino, T. The evolution of man–machine interaction: The role of human in Industry 4.0 paradigm. Prod. Manuf. Res. 2020, 8, 20–34. [Google Scholar] [CrossRef]
- Bonney, M.S.; de Angelis, M.; Dal Borgo, M.; Wagg, D.J. Contextualisation of information in digital twin processes. Mech. Syst. Signal Process. 2023, 184, 109657. [Google Scholar] [CrossRef]
- Molino, M.; Cortese, C.G.; Ghislieri, C. The promotion of technology acceptance and work engagement in industry 4.0: From personal resources to information and training. Int. J. Environ. Res. Public Health 2020, 17, 2438. [Google Scholar] [CrossRef] [PubMed]
- Alcaraz, C.; Lopez, J. Digital twin: A comprehensive survey of security threats. IEEE Commun. Surv. Tutor. 2022, 24, 1475–1503. [Google Scholar] [CrossRef]
Entity | Vocabulary | Description |
---|---|---|
Device | DTDL | A device included within the energy infrastructure |
Meter | DTDL | Physical asset used for measuring energy consumption and detecting events of interest |
Electricity Metering | SEAS | Process of measuring and recording electrical energy consumption |
Meter Reading | DTDL | Amount of electricity consumption recorded through a meter |
Alarm | EDT | Specific event related to an issue identified by a device |
Location | DTDL | Physical location of a device or energy asset |
NIC | DBR | Network Interface Controller (NIC) representing a physical connection through which the device interacts with external peripherals |
Computer Network | DBR | Group of interconnected devices capable of communicating with each other |
Substation | DTDL | Electrical station part of a generation, transmission, or distribution system |
Antenna | DBR | Local secondary substation installed to support critical scenarios requiring high data availability in the smart grid |
DWDM | EDT | Dense Wavelength Division Multiplexing (DWDM) system supporting fiber-optic transmissions |
Grid Segment | EDT | Section of a power or communication network |
Line | DTDL | A line composing a telecommunication or power transmission infrastructure |
Pole | DTDL | A utility pole used to support overhead power lines or other public utilities |
DataCenter | DBR | Centralized facility designed to store and process vast amounts of data related to the energy grid |
Server Room | EDT | Air-conditioned room devoted to the continuous operation of computer servers in a datacenter |
Rack | DBR | Physical container designed to house servers, networking devices, and other datacenter equipment |
Feature | GPT-4 | Llama 2 |
---|---|---|
License | commercial | Llama 2 community license |
Input | text and images | text only |
Model size | ∼1.76 T parameters | 7 B, 13 B, 70 B parameters |
Model customization | limited to selected organizations | highly customizable through RAG or fine-tuning procedures |
Model updates | updates not immediately accessible to end users | periodic updates with potential new features |
Tokens per prompt | up to 8192 | up to 2048 |
Integration with other systems | through RESTful API calls | LLM backend required |
Conversational skills | immersive conversations integrating multiple forms of media | fluent and natural-sounding responses for interactive chat experiences |
Privacy concerns | data processed on external servers, potentially subject to data breaches or misuse | data processing occurs on-premises, reducing exposure to third-party entities |
EDT Platforms | [27] | [28] | [29] | [30] | [31] | [33] | [34] | This Work |
---|---|---|---|---|---|---|---|---|
Infrastructure | ||||||||
Cloud Infrastructure | ✓ | ✓ | ✓ | ✗ | ✓ | ✓ | ✓ | ✓ |
Edge Computing | ✗ | ✓ | ✓ | ✗ | ✶ | ✗ | ✓ | ✓ |
IoT Sensor Networks | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ |
Data Lake | ✶ | ✶ | ✓ | ✓ | ✓ | ✶ | ✶ | ✓ |
Physical Asset Control | ✓ | ✶ | ✶ | ✶ | ✓ | ✶ | ✓ | ✶ |
Intelligence | ||||||||
Real-time Data Processing | ✓ | ✓ | ✓ | ✓ | ✓ | ✗ | ✓ | ✓ |
Data Analytics and ML | ✓ | ✓ | ✓ | ✶ | ✓ | ✓ | ✓ | ✓ |
Asset Simulation | ✶ | ✓ | ✓ | ✓ | ✓ | ✗ | ✓ | ✶ |
Scenario Forecasting | ✶ | ✶ | ✓ | ✓ | ✓ | ✓ | ✓ | ✶ |
Knowledge Graphs | ✗ | ✗ | ✗ | ✗ | ✶ | ✗ | ✗ | ✓ |
LLM RAG | ✗ | ✗ | ✗ | ✗ | ✗ | ✗ | ✗ | ✓ |
Cybersecurity | ||||||||
System Auditing | ✶ | ✗ | ✓ | ✗ | ✗ | ✗ | ✗ | ✓ |
Access Control | ✓ | ✗ | ✓ | ✗ | ✗ | ✗ | ✗ | ✓ |
Visualization | ||||||||
Interactive GUI | ✓ | ✗ | ✓ | ✓ | ✓ | ✓ | ✗ | ✓ |
GIS Maps | ✶ | ✗ | ✗ | ✶ | ✶ | ✓ | ✶ | ✓ |
3D Models | ✗ | ✗ | ✗ | ✗ | ✶ | ✓ | ✶ | ✓ |
Natural User Interface | ✗ | ✗ | ✗ | ✗ | ✗ | ✗ | ✗ | ✓ |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Ieva, S.; Loconte, D.; Loseto, G.; Ruta, M.; Scioscia, F.; Marche, D.; Notarnicola, M. A Retrieval-Augmented Generation Approach for Data-Driven Energy Infrastructure Digital Twins. Smart Cities 2024, 7, 3095-3120. https://doi.org/10.3390/smartcities7060121
Ieva S, Loconte D, Loseto G, Ruta M, Scioscia F, Marche D, Notarnicola M. A Retrieval-Augmented Generation Approach for Data-Driven Energy Infrastructure Digital Twins. Smart Cities. 2024; 7(6):3095-3120. https://doi.org/10.3390/smartcities7060121
Chicago/Turabian StyleIeva, Saverio, Davide Loconte, Giuseppe Loseto, Michele Ruta, Floriano Scioscia, Davide Marche, and Marianna Notarnicola. 2024. "A Retrieval-Augmented Generation Approach for Data-Driven Energy Infrastructure Digital Twins" Smart Cities 7, no. 6: 3095-3120. https://doi.org/10.3390/smartcities7060121
APA StyleIeva, S., Loconte, D., Loseto, G., Ruta, M., Scioscia, F., Marche, D., & Notarnicola, M. (2024). A Retrieval-Augmented Generation Approach for Data-Driven Energy Infrastructure Digital Twins. Smart Cities, 7(6), 3095-3120. https://doi.org/10.3390/smartcities7060121