Integration of Foundation Models and Federated Learning in AIoT-Based Aircraft Health Monitoring Systems
Abstract
:1. Introduction
1.1. Background and Motivation
1.2. Related Works
1.3. Research Gaps, Contributions, and Paper Structure
2. Framework for Collaborative Intelligence in Aircraft Health Monitoring
3. Results
3.1. Aircraft Health Monitoring on the Base of AIoT Technologies
- Sensors installed throughout the aircraft monitor critical subsystems such as engines, avionics, hydraulics, and environmental systems. These sensors collect data on parameters like temperature, pressure, vibration, fuel consumption, and more, which are then transmitted to edge computing devices or cloud systems for further analysis. By gathering continuous data on the health of various systems, these sensors provide real-time insights into potential malfunctions or areas of concern.
- Edge computing plays a pivotal role in real-time data processing within the aircraft. With the vast amounts of data generated by sensors, processing some of these data locally through onboard computing devices reduces the need to transmit large datasets to the cloud. AI models embedded in edge devices perform initial analysis, such as predicting component failure or detecting anomalies in the system. This local processing improves the speed of decision making, ensuring critical actions are taken without delay.
- AIoT has the ability to analyze massive datasets and generate insights with AI-driven analytics. AI algorithms process both historical and real-time data to make predictions about system health, maintenance needs, and operational optimizations. ML models are trained on historical maintenance logs, operational data, and sensor readings to identify patterns of failure and predict future issues. Additionally, cognitive AI systems can interpret unstructured data such as pilot reports or technician notes and correlate this information with sensor data to generate actionable insights.
- Cloud platforms serve as central repositories for processing and storing the large volumes of data generated by aircraft sensors. These platforms provide the computational power required for training AI models and running advanced data analytics. Data lakes store the data from multiple aircraft, allowing for aggregated analysis and continuous improvement of AI models. With centralized data, aviation stakeholders can collaborate to enhance aircraft performance, monitor fleet health, and implement long-term improvements.
- Communication infrastructure is vital for the successful operation of AIoT-based AHMS. Data must be transmitted securely between aircraft and ground stations, and insights must be relayed in real time to relevant stakeholders. Communication networks include satellite communication (SATCOM), which allows data transmission during flights over remote regions, and 5G/6G networks, which provide high-speed connections for data uploads when the aircraft is on the ground. These networks enable seamless data flow between the aircraft, ground systems, and cloud platforms, ensuring that information is always up-to-date and available.
3.2. Analysis of FM, FL, and Integrated Model Approaches in AHMS
3.2.1. Foundation Model Approach in AHMS
3.2.2. Federated Learning Approach in AHMS
- represents the global model parameters after aggregation at iteration ,
- represents the local model parameters on aircraft at iteration ,
- is the weight assigned to each aircraft, proportional to the size of its local dataset ,
- is the total number of participating aircraft.
- Directly replacing their local models with the global model:
- Fine-tuning the global model using their local data:
- Model compression, where only significant changes in the model parameters (i.e., those above a certain threshold) are transmitted, thus reducing the amount of data shared.
- Sparse updates, where instead of sharing all parameters, only a fraction of parameters that have undergone the most significant updates are transmitted.
- Periodic communication, where instead of transmitting updates after every local iteration, updates are shared after a fixed number of local training steps.
- More processes occur in the decentralized component (on aircraft), including model training.
- The centralized component is lighter, focused on aggregation rather than direct model training.
- Communication channels primarily transmit model updates and aggregated models, not raw data.
- There is a more prominent feedback loop between local training and global aggregation.
3.2.3. Integrated (FM with FL) Model Approach
- The hybrid nature of the system, combining centralized foundation model training with decentralized federated learning.
- The dual role of the central component in both FM training and FL aggregation.
- The enhanced role of communication channels, facilitating various types of data and model exchanges.
- The continuous cycle of centralized training, local fine-tuning, global aggregation, and model redistribution.
- It includes both a centralized data lake for FM training and local processing for FL.
- The central component performs both FM training and federated aggregation.
- Communication channels are more diverse, handling raw data, foundation models, local updates, and aggregated models.
- There is a more complex feedback loop involving both centralized and decentralized learning processes.
3.2.4. Loss Function Structure
- 1.
- Universal Loss Function Structure
- Classification tasks (binary for anomaly detection or multi-class for failure modes).
- Regression tasks (continuous health metrics like remaining useful life).
- 2.
- Foundation Models
- Binary cross-entropy for anomaly detection (binary classification).
- Categorical cross-entropy for multi-class failure mode classification.
- Mean squared error (MSE) or mean absolute error (MAE) for predicting continuous metrics.
- 3.
- Federated Learning
- Binary cross-entropy for anomaly detection.
- Categorical cross-entropy for multi-class classification.
- MSE/MAE for regression tasks.
- 4.
- Integrated FM + FL Approach
- is the global loss from the FM trained on the centralized dataset;
- is the local loss for each aircraft, fine-tuning the model using FL; and
- and are weights that control the contribution of global vs. local losses.
- Anomaly detection—binary cross-entropy for detecting abnormal states.
- Failure mode classification—categorical cross-entropy for classifying types of failures.
- Health metric prediction—MSE or MAE for continuous predictions like remaining useful life or wear levels.
3.3. Architecture of an AIoT-Based Aviation Ecosystem Integrating FM and FL Approaches
- Centralized processing domain;
- Client domain;
- Communication domain.
- 1.
- Centralized processing domain
- The FM serves as the core of the system, providing a general-purpose cognitive baseline that is trained on a comprehensive dataset aggregated from various sources within the aviation ecosystem. This model captures broad patterns and relationships that are applicable across different scenarios and aircraft, making it a crucial component for generalized decision-making and predictive analytics.
- The data lake acts as the central repository where all collected data are stored. These data include aircraft sensor readings, maintenance logs, flight operations data, and external data such as weather reports. The data lake enables extensive data analysis and model training, leveraging big data technologies to manage the volume, velocity, and variety of data.
- While the FM provides a strong starting point, the FL component allows for the decentralized fine-tuning of models on individual aircraft. This ensures that the global model benefits from local adaptations, reflecting the unique conditions experienced by different aircraft. FL aggregates the locally fine-tuned models to update the global model periodically, preserving data privacy and reducing communication overhead.
- The AIoT platform integrates these components, enabling seamless interaction between centralized data processing, FM training, and FL. The platform manages the orchestration of these processes, ensuring that data flow smoothly between components and that the system remains responsive and adaptable.
- 2.
- Client domain
- 3.
- Communication domain
- The cloud data center acts as the central hub for storing and processing data, as well as hosting the foundation model and federated learning algorithms.
- The space domain includes communication satellites in GEO, MEO, and LEO orbits, which provide global connectivity for data transmission, especially in remote or airborne operations.
- The air domain encompasses the various aircraft types, including traditional aircraft, UAVs, airships, and balloons. These platforms generate data and require communication links for real-time data transfer and model updates.
- The ground domain consists of ground stations and cellular networks (e.g., 5G/6G) that facilitate communication between airborne platforms and the central data center.
- The IoT level represents the IoT devices embedded within aircraft and ground infrastructure, which collect and transmit data to the AIoT platform for processing.
- 4.
- Integration of models within the architecture
4. Discussion
4.1. Simulation-Based Case Study
4.1.1. Simulation Methodology for AHMS Experiment
- Generating synthetic data for 100 aircraft over 1000 flight hours, including failure patterns for five randomly selected aircraft.
- Training and validating the FM on the aggregated data, then deploying it to all aircraft in the test set.
- Initializing models on each aircraft for FL, performing 100 rounds of federated learning and evaluating the personalized models.
- Training the initial FM for the integrated approach, performing 50 rounds of federated fine-tuning, and evaluating the personalized models.
- Using the trained models from each approach to predict engine failures on a held-out test set and calculating all evaluation metrics.
- Conducting statistical tests to determine the significance of differences between approaches.
- Performing sensitivity analysis by varying key parameters such as the number of aircraft, failure rate, and noise levels to assess the robustness of each approach under different conditions.
4.1.2. Results of Simulation
4.2. Comparative Analysis of FM, FL and Integrated Approaches in AHMS
4.3. Challenges in Integrating FM and FL
4.4. Future Directions of Research in AI-Driven AHMS
5. Conclusions
Funding
Data Availability Statement
Conflicts of Interest
References
- Bommasani, R.; Hudson, D.A.; Adeli, E.; Altman, R.; Arora, S.; von Arx, S.; Bernstein, M.S.; Bohg, J.; Bosselut, A.; Brunskill, E.; et al. On the Opportunities and Risks of Foundation Models. 2021. Available online: https://arxiv.org/abs/2108.07258 (accessed on 5 September 2024).
- Li, X.; Tian, Y.; Ye, P.; Duan, H.; Wang, F.Y. A novel scenarios engineering methodology for foundation models in metaverse. IEEE Trans. Syst. Man Cybern. Syst. 2022, 53, 2148–2159. [Google Scholar] [CrossRef]
- Reed, S.; Zolna, K.; Parisotto, E.; Colmenarejo, S.G.; Novikov, A.; Barth-Maron, G.; Gimenez, M.; Sulsky, Y.; Kay, J.; Springenberg, J.T.; et al. A Generalist Agent. 2022. Available online: https://arxiv.org/abs/2205.06175 (accessed on 5 September 2024).
- Brown, T.; Mann, B.; Ryder, N.; Subbiah, M.; Kaplan, J.; Dhariwal, P.; Neelakantan, A.; Shyam, P.; Sastry, G.; Askell, A.; et al. Language models are fewshot learners. In Proceedings of the 2020 Advances in Neural Information Processing Systems, Online, 6–12 December 2020; Volume 33, pp. 1877–1901. [Google Scholar]
- Bender, E.M.; Gebru, T.; McMillan-Major, A.; Shmitchell, S. On the dangers of stochastic parrots: Can language models be too big? In Proceedings of the 2021 ACM Conference on Fairness, Accountability, and Transparency, Toronto, ON, Canada, 3–10 March 2021; pp. 610–623. [Google Scholar]
- Competition and Markets Authority (CMA). AI Foundation Models: Technical Update Report; CMA: London, UK, 2024. Available online: https://www.gov.uk/government/publications/ai-foundation-models-update-paper (accessed on 5 September 2024).
- Sun, C.; Li, H.; Dui, H.; Hong, S.; Sun, Y.; Song, M.; Cai, D.; Zhang, B.; Wang, Q.; Wang, Y.; et al. A multi-model architecture based on deep learning for aircraft load prediction. Commun Eng. 2023, 2, 47. [Google Scholar] [CrossRef]
- FGVC-Aircraft Benchmark. Available online: https://www.robots.ox.ac.uk/~vgg/data/fgvc-aircraft/ (accessed on 5 September 2024).
- Blumenfeld, J. NASA and IBM Openly Release Geospatial AI Foundation Model for NASA Earth Observation Data; NASA Earthdata: Huntsville, AL, USA, 2023. Available online: https://www.earthdata.nasa.gov/news/impact-ibm-hls-foundation-model (accessed on 5 September 2024).
- Ray, T.; Bhat, A.P.; White, R.T.; Nguyen, V.M.; Pinon Fischer, O.J.; Mavris, D.N. Examining the Potential of Generative Language Models for Aviation Safety Analysis: Case Study and Insights Using the Aviation Safety Reporting System (ASRS). Aerospace 2023, 10, 770. [Google Scholar] [CrossRef]
- McMahan, B.; Moore, E.; Ramage, D.; Hampson, S.; y Arcas, B. Communication-efficient learning of deep networks from decentralized data. In Proceedings of the 20th International Conference on Artificial Intelligence and Statistics, PMLR, Lauderdale, FL, USA, 20–22 April 2017; pp. 1273–1282. [Google Scholar]
- Ogundokun, R.O.; Misra, S.; Maskeliunas, R.; Damasevicius, R. A Review on Federated Learning and Machine Learning Approaches: Categorization, Application Areas, and Blockchain Technology. Information 2022, 13, 263. [Google Scholar] [CrossRef]
- Li, W.; Yang, W.; Jin, G.; Chen, J.; Li, J.; Huang, R.; Chen, Z. Clustering Federated Learning for Bearing Fault Diagnosis in Aerospace Applications with a Self-Attention Mechanism. Aerospace 2022, 9, 516. [Google Scholar] [CrossRef]
- Berghout, T.; Benbouzid, M.; Bentrcia, T.; Lim, W.H.; Amirat, Y. Federated Learning for Condition Monitoring of Industrial Processes: A Review on Fault Diagnosis Methods, Challenges, and Prospects. Electronics 2023, 12, 158. [Google Scholar] [CrossRef]
- Khan, S.; Gaba, G.S.; Gurtov, A. A Federated Learning Based Security for Controller-Pilot Data Link Communication; Linköping University: Linköping, Sweden, 2022; Available online: https://www.icas.org/ICAS_ARCHIVE/ICAS2022/data/papers/ICAS2022_0704_paper.pdf (accessed on 5 September 2024).
- Llasag Rosero, R.H.; Silva, C.; Ribeiro, B. Remaining Useful Life Estimation in Aircraft Components with Federated Learning. PHM Soc. Eur. Conf. 2020, 5, 9. [Google Scholar] [CrossRef]
- Qu, Y.; Dai, H.; Zhuang, Y.; Chen, J.; Dong, C.; Wu, F.; Guo, S. Decentralized Federated Learning for UAV Networks: Architecture, Challenges, and Opportunities. IEEE Netw. 2021, 35, 156–162. [Google Scholar] [CrossRef]
- Doğru, A.; Bouarfa, S.; Arizar, R.; Aydoğan, R. Using Convolutional Neural Networks to Automate Aircraft Maintenance Visual Inspection. Aerospace 2020, 7, 171. [Google Scholar] [CrossRef]
- Abdelghany, E.S.; Farghaly, M.B.; Almalki, M.M.; Sarhan, H.H.; Essa, M.E.-S.M. Machine Learning and IoT Trends for Intelligent Prediction of Aircraft Wing Anti-Icing System Temperature. Aerospace 2023, 10, 676. [Google Scholar] [CrossRef]
- Gao, Z.; Mavris, D.N. Statistics and Machine Learning in Aviation Environmental Impact Analysis: A Survey of Recent Progress. Aerospace 2022, 9, 750. [Google Scholar] [CrossRef]
- Brandoli, B.; de Geus, A.R.; Souza, J.R.; Spadon, G.; Soares, A.; Rodrigues, J.F., Jr.; Komorowski, J.; Matwin, S. Aircraft Fuselage Corrosion Detection Using Artificial Intelligence. Sensors 2021, 21, 4026. [Google Scholar] [CrossRef] [PubMed]
- Yang, R.; Gao, Y.; Wang, H.; Ni, X. Fuzzy Neural Network PID Control Used in Individual Blade Control. Aerospace 2023, 10, 623. [Google Scholar] [CrossRef]
- Wang, Z.; Zhao, Y. Data-Driven Exhaust Gas Temperature Baseline Predictions for Aeroengine Based on Machine Learning Algorithms. Aerospace 2023, 10, 17. [Google Scholar] [CrossRef]
- Chen, J.; Qi, G.; Wang, K. Synergizing Machine Learning and the Aviation Sector in Lithium-Ion Battery Applications: A Review. Energies 2023, 16, 6318. [Google Scholar] [CrossRef]
- Baumann, M.; Koch, C.; Staudacher, S. Application of Neural Networks and Transfer Learning to Turbomachinery Heat Transfer. Aerospace 2022, 9, 49. [Google Scholar] [CrossRef]
- Quadros, J.D.; Khan, S.A.; Aabid, A.; Alam, M.S.; Baig, M. Machine Learning Applications in Modelling and Analysis of Base Pressure in Suddenly Expanded Flows. Aerospace 2021, 8, 318. [Google Scholar] [CrossRef]
- Papakonstantinou, C.; Daramouskas, I.; Lappas, V.; Moulianitis, V.C.; Kostopoulos, V. A Machine Learning Approach for Global Steering Control Moment Gyroscope Clusters. Aerospace 2022, 9, 164. [Google Scholar] [CrossRef]
- Python. Available online: https://www.python.org/downloads/ (accessed on 7 October 2024).
- TensorFlow. An End-to-End Platform for Machine Learning. Available online: https://www.tensorflow.org/ (accessed on 7 October 2024).
- TensorFlow Federated: Machine Learning on Decentralized Data. Available online: https://www.tensorflow.org/federated (accessed on 7 October 2024).
- NumPy. The Fundamental Package for Scientific Computing with Python. Available online: https://numpy.org/ (accessed on 7 October 2024).
- Pandas. Available online: https://pandas.pydata.org/ (accessed on 7 October 2024).
- Matplotlib: Visualization with Python. Available online: https://matplotlib.org/ (accessed on 7 October 2024).
- Seaborn: Statistical Data Visualization. Available online: https://seaborn.pydata.org/ (accessed on 7 October 2024).
Approach | Strengths | Limitations | Suitable Use Cases |
---|---|---|---|
Foundation Model |
|
| Fleets with similar operating conditions where centralized data processing is feasible |
Federated Learning |
|
| Scenarios requiring privacy protection and adaptive learning to diverse environments |
Integrated (FM + FL) |
|
| Large fleets with varied operational conditions and a need for privacy and adaptability |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Kabashkin, I. Integration of Foundation Models and Federated Learning in AIoT-Based Aircraft Health Monitoring Systems. Mathematics 2024, 12, 3428. https://doi.org/10.3390/math12213428
Kabashkin I. Integration of Foundation Models and Federated Learning in AIoT-Based Aircraft Health Monitoring Systems. Mathematics. 2024; 12(21):3428. https://doi.org/10.3390/math12213428
Chicago/Turabian StyleKabashkin, Igor. 2024. "Integration of Foundation Models and Federated Learning in AIoT-Based Aircraft Health Monitoring Systems" Mathematics 12, no. 21: 3428. https://doi.org/10.3390/math12213428