Next Article in Journal
A Novel 3D 2TnC FeRAM Architecture and Operation Scheme with Improved Disturbance for High-Bit-Density Dynamic Random-Access Memory
Next Article in Special Issue
Applications of Machine Learning in Cancer Imaging: A Review of Diagnostic Methods for Six Major Cancer Types
Previous Article in Journal
Observer-Based Fixed-Time Consensus Tracking for a Class of Nonlinear Multi-Agent Systems
Previous Article in Special Issue
Robust Biometric Verification Using Phonocardiogram Fingerprinting and a Multilayer-Perceptron-Based Classifier
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Utilization of Machine Learning and Explainable Artificial Intelligence (XAI) for Fault Prediction and Diagnosis in Wafer Transfer Robot

1
Department of Semiconductor Engineering, Myongji University, Yongin-si 17058, Republic of Korea
2
Department of Information and Communication Engineering, Myongji University, Yongin-si 17058, Republic of Korea
*
Authors to whom correspondence should be addressed.
Electronics 2024, 13(22), 4471; https://doi.org/10.3390/electronics13224471
Submission received: 25 September 2024 / Revised: 28 October 2024 / Accepted: 12 November 2024 / Published: 14 November 2024

Abstract

:
Faults in the wafer transfer robots (WTRs) used in semiconductor manufacturing processes can significantly affect productivity. This study defines high-risk components such as bearing motors, ball screws, timing belts, robot hands, and end effectors, and generates fault data for each component based on Fluke’s law. A stacking classifier was applied for fault prediction and severity classification, and logistic regression was used to identify fault components. Additionally, to analyze the frequency bands affecting each failed component and assess the severity of faults involving two mixed components, a hybrid explainable artificial intelligence (XAI) model combining Shapley additive explanations (SHAP) and local interpretable model-agnostic explanations (LIME) was employed to inform the user about the component causing the fault. This approach demonstrated a high prediction accuracy of 95%, and its integration into real-time monitoring systems is expected to reduce maintenance costs, decrease equipment downtime, and ultimately improve productivity.

1. Introduction

As semiconductor devices continue to miniaturize, the need for precise control over manufacturing equipment has intensified. Component failures in semiconductor equipment can lead to system-wide malfunctions, resulting in significant economic losses and reduced productivity [1,2]. To prevent such issues, the need for a ‘part-level diagnosis’ that enables real-time monitoring and early fault prediction of equipment components has emerged as a crucial research area. Part-level diagnosis refers to the technology that monitors the performance and condition of individual equipment components in real-time, predicts potential failures before they occur, and enables proactive responses [2,3]. This method plays a crucial role in maintaining overall system efficiency and preventing unexpected equipment breakdowns, leading to reduced downtime and increased productivity [4,5,6,7].
Among semiconductor manufacturing equipment, wafer transfer robots (WTRs) play a critical role in safely transporting wafers between various process equipment [8]. However, failures in WTRs can induce vibrations, leading to wafer slippage or dropping, which in turn causes significant process issues, such as reduced wafer uniformity [8,9]. Various approaches have been employed to predict faults in WTRs. Early research efforts used scheduling algorithms to ensure the accuracy of wafer transfer by predicting potential faults. However, these approaches required prior verification of the wafer conditions, presenting a significant limitation [10]. More recent approaches have used neural networks (NNs), which can perform well even with limited data; however, restricted datasets often lead to overfitting, reducing the generalizability of these models [11]. To overcome these limitations, researchers have applied machine learning algorithms such as k-means clustering and density-based spatial clustering of applications with noise (DBSCAN). However, these methods also face challenges, such as difficulty in determining the number of clusters, which limits their adaptability [12]. Other studies have combined Gaussian mixture models (GMMs) with mean-shift (MS) clustering algorithms to model complex fault patterns while accounting for nonlinear data characteristics [13]. Although these approaches have improved fault prediction accuracy, they have primarily focused on prediction without delving into the underlying causes of the faults.
Recently, explainable artificial intelligence (XAI) has been increasingly applied in fault diagnosis research. Brito et al. (2023) applied XAI in rotating machinery fault diagnosis using augmented synthetic data to improve model performance [14]. However, their reliance on synthetic data raises concerns about the applicability of their models to real industrial environments, and the study primarily focused on fault prediction without in-depth analysis of the fault causes. Cohen et al. (2024) employed SHAP-based XAI techniques in clustering applications to improve fault diagnosis and prognosis, but the unsupervised nature of clustering poses challenges to diagnostic accuracy [15]. Similarly, Jang et al. (2023) combined various machine learning methods with XAI to provide insights into fault causes in industrial processes. However, their work lacks the depth of analysis needed to fully explain the underlying causes of the faults [16].
In contrast, this study proposes a distinct approach by adapting SHAP and LIME-based XAI methods to analyze fault characteristics specific to WTRs. Particularly, WTR vibration data were transformed from the time domain to the frequency domain using fast Fourier transformation (FFT) [17] to simulate scenarios where abnormal frequency components appear and increase in intensity beyond the natural frequencies of normal states [18]. Based on these simulated scenarios, fault data were constructed at three severity levels: caution, warning, breakdown. Next, using the constructed fault data, SHAP was employed to extract key frequency features that influence fault indicators across the spectrum, while LIME was applied to analyze faults at specific frequency regions, allowing for a quantitative evaluation of fault levels for each component. Furthermore, LIME values were utilized to individually assess fault levels for each component under mixed-fault scenarios, where faults in two components may occur simultaneously, enabling users to determine maintenance timing based on the fault severity of each component. This approach goes beyond fault prediction to provide practical assistance in maintenance decision-making, thus offering a distinct contribution to WTR fault diagnosis. While previous studies primarily focused on fault prediction accuracy, this research takes a significant step forward by enhancing the interpretability of the fault prediction model. By identifying the root causes of faults, this approach enables better decision-making in semiconductor manufacturing processes. Although synthetic data were used to generate fault scenarios, these scenarios were carefully constructed to closely reflect potential real-world WTR failure modes. This structured approach provides a viable foundation for adapting the model to actual operational data, facilitating its future applicability in industrial settings.
Ultimately, this study presents a robust and interpretable fault diagnosis system that can contribute to productivity improvement, downtime reduction, and overall quality enhancement in semiconductor manufacturing processes, providing a foundational resource for future WTR maintenance strategies.

2. Experiment

2.1. Data Acquisition Using Wafer Transfer Robot (WTR)

In this study, the voltage data were collected using a WTR (RR732) manufactured by Rorze, located in Fukushima, Japan. An acceleration sensor (ADXL335) with three axes (x, y, z) was installed on the top of the robot arm to measure acceleration data associated with the movement of the arm. To ensure accurate data acquisition and prevent interference from unintended movements, a weatherproof ultrasonic sensor and camera were also installed. The experimental setup is illustrated in Figure 1.
The dataset comprised voltage measurements recorded while the WTR arm transported wafers between designated locations. For each axis, 870 data points were collected, yielding a total of 560 datasets. To facilitate the frequency-based analysis, a fast Fourier transformation (FFT) was applied [17,19] to convert the real-time data into the frequency domain, enabling the identification of spectral intensity changes and the emergence of abnormal frequencies. For consistent analysis, FFT data from all three axes were normalized using min–max scaling [20], as show in Equation (1):
x = x x m i n x m a x x m i n
In this work, the FFT analysis focused on identifying key frequencies related to the movements of the WTR arm, particularly within the positive frequency range up to 480 Hz, which was considered for use as normal operation data. However, only x-axis data were deemed reliable for normal operation evaluation, as y-axis and z-axis data lacked the distinctive characteristics needed for accurate assessment. Consequently, only the normalized x-axis data were used in further analysis, as shown in Figure 2d.

2.2. Fault Data Generation

In semiconductor manufacturing, maintaining equipment reliability and stability is crucial, as unexpected failures can result in significant maintenance costs and downtime. Vibration monitoring is a commonly used method for early fault detection in equipment [21]. According to the vibration analysis method developed by Fluke, vibration levels in machinery increase significantly as wear or faults begin to occur [21,22]. This method focuses on detecting the emergence of new frequencies or the amplification of existing frequencies that deviate from the natural vibration patterns of the machinery. These natural frequencies represent the inherent vibration tendencies of each component during operation, and changes in these patterns can indicate defects such as imbalance, misalignment, bearing faults, and gear wear [23]. By detecting these changes, it is possible to continuously assess the condition of the equipment and implement preventive measures to address potential faults at an early stage. This approach offers several benefits including early fault detection, reduced maintenance costs, and maximized equipment uptime.
In this analysis, due to the difficulty in obtaining real-world fault data, we employed a vibration analysis technique based on the Fluke method, using normal operation data to generate synthetic fault data. This approach allowed us to simulate fault scenarios and produce data suitable for early fault detection and root cause analysis. The goal of this study is to develop a system capable of accurately detecting and analyzing potential equipment faults before they occur.
To generate the synthetic fault data, various fault severity levels were defined, with frequency amplification applied according to natural frequencies of each component. The severity of faults was categorized into three levels based on frequency characteristics:
  • Caution: Minor anomalies or vibrations occur around natural frequencies, resulting in a slight amplification of frequency power. Additionally, frequency increases were observed for the other three nonnatural frequencies.
  • Warning: Amplification occurs over a broader frequency range than in the caution stage, with anomalies appearing in additional non-natural frequencies, indicating the need for prompt maintenance.
  • Breakdown: The equipment is in a fully failed state, with a maximum amplification of frequencies, including significant increases in 13 additional non-natural frequencies, signaling the need for immediate shutdown and maintenance.
These stages are illustrated in Figure 3, showing the progression of fault severity in the bearing motor. For each severity level, frequencies closely related to the natural frequencies were randomly selected and amplified. The fault data generated by this method were designed to simulate real fault conditions accurately. Furthermore, mixed-fault data, combining two components, were generated and used in the modeling process to validate the robustness of the proposed approach.

3. Modeling

3.1. Allocation of Natural Frequencies

In situations in which acquiring actual fault data is challenging due to the long time required for equipment failure to occur and the controlled laboratory environment, fault data are generated based on experimental data. To simulate real-world fault scenarios in the WTR, five high-risk components were selected: robot hand, end effector, timing belt, ball screw, and bearing motor [24]. The natural frequencies for each component were established through a review of the relevant studies and literature, reflecting the primary frequency ranges that may occur during actual operation.
  • Robot hand: 7, 16, and 127 Hz
These frequencies were determined based on the experimental data related to the vibration characteristics of a robot hand [25].
  • End effector: 54 and 72 Hz
The natural frequencies of the end effector were established by analyzing its dynamic behavior in various environments, with significant vibration characteristics observed at 54 and 72 Hz [26].
  • Timing belt: 13 and 192 Hz
The frequency range of the timing belt reflects the natural frequencies associated with wear and tension changes [27,28].
  • Ball screw: 156, 160, and 164 Hz
The frequencies of the ball screw were selected to reflect the high-frequency vibrations that can occur during high-speed operations [29].
  • Bearing motor: 254 and 260 Hz
The frequencies of the bearing motor were based on the vibration characteristics related to wear in the high-frequency range [30].
Each component exhibits significant vibration characteristics across distinct frequency ranges. This precise allocation of natural frequencies ensures an accurate simulation of fault scenarios specific to each WTR component.

3.2. Fault Detection and Classification (FDC) Modeling for WTR

In this study, a stacking classifier was employed to develop a fault detection and classification (FDC) system to detect and classify faults occurring in the WTR system. A stacking classifier enhances performance by combining the strengths of multiple base models.
Two main tasks were performed: classification of fault severity (Fault_Class) and classification of the fault component (Fault_Component). In the first stage, the stacking classifier was used to detect faults and classify the fault severity into three levels: Fault_C (caution), Fault_W (warning), Fault_B (breakdown). For samples identified as faults, the faulty components were subsequently classified into nine categories. Out of 803 samples, 560 were normal data, and 243 were fault data generated through simulation. The dataset was divided into 80% training data and 20% test data for model training and evaluation, and values below 0.2 were considered noise and removed.
The stacking classifier utilized logistic regression (LR), random forest (RF), and support vector regression (SVR) [31] as base models to predict faults. The basic principle and schematic of the stacking classifier are illustrated in Figure 4. Each base model generated predictions y ^ i for the input data X :
y ^ 1 = M 1 X ,   y ^ 1 = M 2 X , ,   y ^ n = M n X
where y ^ i represents the predicted output from each base model [32]. The meta-model M m e t a then took the predictions from the base models as input to generate the final prediction:
y ^ = M m e t a ( y ^ 1 ,   y ^ 2 , ,   y ^ n )
By leveraging the strengths of each base model, the stacking classifier significantly improved the accuracy of fault detection and classification [32]. This approach combines the interpretability of logistic regression, the ensemble learning power of random forests, and the non-linear boundary-setting capability of support vector regression to handle various fault scenarios in the WTR system.
However, a significant drawback of stacking is the risk of overfitting because the base learners are trained on the same original data [33]. To prevent overfitting, k-fold cross-validation was applied during the training process. Specifically, the dataset was divided into k = 10 folds, where each fold was used as a validation set once while the remaining folds were used for training. This cross-validation process helped assess the generalizability of the model and mitigated the risk of overfitting by ensuring that the model did not overly rely on any specific subset of the data during training. By averaging the performance across the folds, we obtained a more robust estimate of the accuracy of the model and its ability to generalize to unseen data [33,34].
The performance of the model was evaluated using metrics such as accuracy, precision, and recall. The detailed performance results and analysis are discussed in Section 4: Results and Discussion, where we analyze the classification accuracy across different fault levels and components, along with the practical implications for real WTR scenarios.

3.3. Fault Cause Identification Using XAI

After implementing the FDC system, the next step involved applying methods to identify the frequencies that most significantly influenced the classification of the fault components. This process was critical in providing users with actionable insights for monitoring the operational status of the WTR and making informed maintenance decisions. To achieve this, we employed XAI techniques [35], specifically SHAP and LIME, to analyze the decision-making process of the model [36,37].
SHAP calculates the contribution of each feature to the prediction based on Shapley values, providing a global interpretation of the model. It allows for the evaluation of the average impact of each feature on the model’s prediction SHAP values to be computed using the following equation [36]:
i = s N / { i } S ! N S 1 ! N ! ( v S i v S )
where S is a subset of the feature set N, excluding feature i , and v S represents the model’s prediction for subset S. Using SHAP, we identified the key frequencies that had the greatest impact on fault component predictions. During the creation of the simulated fault data, predefined fault frequencies were extracted and compared with the key frequencies identified by SHAP to assess the model’s reliability. This validation confirmed that the model accurately reflected the importance of amplified fault frequencies in classifying faulty components [37].
To complement the global analysis of SHAP algorithm, we applied the LIME technique to provide a more localized analysis of the frequencies contributing to faults for each component. LIME approximates the model behavior locally using the following linear model:
f x g x = w 0 + w 1 x 1 + w 2 x 2 + + w n x n
where g x is the local linear model and w i represents the weight assigned to each feature x i .
Specifically, LIME was used to quantify the severity of faults for each component within mixed-fault scenarios, where faults in multiple components may occur simultaneously. By leveraging LIME values, we accurately assessed the fault severity of each component in mixed-fault conditions, thereby providing users with clear and precise information regarding the root causes of faults in the WTR system.
The combination of SHAP for global analysis and LIME for local analysis allowed us to provide both a broad understanding of fault causes and detailed insights into specific fault scenarios. This dual approach improved the interpretability of the model, enabling more informed maintenance decisions [36,37].
In Section 4, we present detailed results showing how SHAP and LIME were used to interpret fault causes, and we discuss the practical implications for WTR maintenance in real-world applications. Additionally, the overall modeling procedure can be seen in Figure 5.

4. Result and Discussions

In this study, we developed an FDC system using a stacking classifier to detect and classify faults in the WTR system. The system operates in a two-step process that focuses on primary tasks: first, it identifies the fault severity, determines whether the system is operating normally or if a fault has occurred, and if so, classifies the severity of the fault. Second, when a fault is detected, the system identifies the components responsible for the fault. Fault_Class is divided into three types (caution, warning, and breakdown), whereas Fault_Component consists of nine distinct types (five single components and four mixed components).

4.1. Fault Detection and Fault Level Classification Method

To improve fault detection and fault severity classification in the WTR system, this study applied a stacking classifier. During the training phase, normal data were labeled as “Normal,” while fault data were categorized into three severity levels: caution, warning, and breakdown. This allowed the stacking classifier to differentiate various fault states. The stacking classifier combines the strengths of logistic regression, random forest, and support vector machine (SVM) models to improve detection and classification performance.
Given the risk of overfitting during the stacking classifier training phase, 10-fold cross-validation was applied to ensure robustness and generalizability. This technique helped the model reliably predict fault states without over-relying on specific data subsets. The stacking classifier achieved an overall accuracy of 0.95, effectively distinguishing between normal operations and each fault level (caution, warning, breakdown).
The experimental results demonstrated that the stacking classifier was highly accurate, achieving an overall accuracy of 0.95 in fault severity classification, effectively distinguishing between normal operations and three distinct fault levels. As seen in Table 1, the stacking classifier demonstrated strong performance in distinguishing between normal and fault states, achieving an overall accuracy of 0.95. In particular, the model exhibited high precision (0.96) and recall (0.97) for the breakdown state, underscoring its reliability in detecting severe faults. For the caution state, the model displayed a slightly lower precision (0.75) but a high recall (0.90), suggesting that it is particularly sensitive to early-stage fault indications, even though some false positives were present. This sensitivity to caution-level faults enables the model to capture initial warning signs, making it suitable for preventive maintenance strategies. Regarding the warning state, the model achieved an F1-score (0.85), indicating a balanced performance between precision (0.94) and recall (0.77). Although the slightly lower recall for this state suggests a few warning-level faults may have been missed, overall, the model performed well in distinguishing between fault severity levels, likely due to data distribution and class balance.

4.2. Fault Component Classification

For the samples identified as faulty, the stacking classifier demonstrated strong performance in identifying faulty components, achieving an overall accuracy of 0.95. The model successfully identified specific faulty components, highlighting its ability to efficiently detect and classify multiple fault conditions in the WTR system. As shown in Table 2, the model performed particularly well in identifying faults in components such as the ball screw and robot hand, both of which achieved 1.00 F1-scores, indicating perfect precision and recall. The end effector showed slightly lower performance, with an F1-score of 0.87, possibly due to its more complex behavior in fault scenarios or a lower amount of training data compared to other components. Overall, the model demonstrated robust performance across all components, successfully identifying faults even in mixed-fault scenarios (e.g., robot hand with ball screw, timing belt with bearing motor).
By combining global insights from SHAP and local insights from LIME, the system effectively interprets fault causes and provides detailed insights into specific fault scenarios. This comprehensive analysis enhances model interpretability and boosts its practical applicability for WTR maintenance.

4.3. Frequency Identification of Fault Component Using Hybrid XAI Model

SHAP was employed to assess the WTR fault component classification model and interpret its prediction results by quantifying the contribution of each feature to the model predictions. This interpretability of machine learning models enhances user trust and understanding of model decision. After training the fault detection data using a stacking classifier with logistic regression as the meta-classifier, SHAP was applied to identify the most significant frequencies influencing fault classification. Converting the Shapley values to absolute values provided a clear measure of each importance of features, and as illustrated in Figure 6, the top 20 features were identified as the most influential frequencies in distinguishing fault components. These results provide clear insight into the frequencies crucial for classification, thereby increasing the reliability of fault prediction and root cause analysis for WTR.
In addition, LIME was employed to perform a component-specific, localized analysis, complementing the global feature importance from SHAP by analyzing the influence of individual features on each component. Using absolute LIME values enabled a detailed evaluation of the impact of each frequency on model predictions, especially under specific fault scenarios. This approach makes it possible to distinguish the contribution of key frequencies for each faulty component, providing a better understanding of the root causes and characteristics of each fault. As shown in Figure 7, critical frequencies were identified for key components, including the bearing motor, end effector, robot hand with ball screw, and timing belt with bearing motor, underscoring the effectiveness of LIME in pinpointing fault-specific frequencies.
The combination of SHAP and LIME proved valuable for identifying key frequencies associated with faults, offering both a broad and detailed view of fault causes. However, when analyzing WTR mixed-component fault data, where multiple components might fail simultaneously, determining fault severity for each component posed a challenge. To address this, key fault frequencies were pre-identified in the fault data generation step, allowing validation of the XAI results. In cases with mixed components, this approach allowed matching the key frequencies to their respective fault severity levels, with LIME values further used to assess fault severity across caution, warning, breakdown levels. Table 3, Table 4, Table 5 and Table 6 summarize this analysis, illustrating LIME values for component combinations such as end effector and robot hand, ball screw and timing belt, and timing belt with bearing motor. Higher LIME values in these tables indicate greater fault severity, providing a clear priority for maintenance. This combined approach provides in-depth insights into the severity and root causes of mixed component failures, enabling efficient decision-making for maintenance prioritization and enhancing reliability in the fault prediction process.

5. Conclusions

In this study, we developed an FDC system for WTR used in semiconductor manufacturing processes. By leveraging Fluke theory, fault data were generated for critical high-risk components, including the bearing motor, ball screw, timing belt, robotic hand, and end effector. A stacking classifier was applied to predict faults and classify fault severity, achieving 94% accuracy in fault detection and severity classification. Additionally, logistic regression was employed as the meta-classifier within the stacking model, resulting in a 95% accuracy in fault component classification.
Furthermore, a hybrid XAI model combining SHAP and LIME was utilized to analyze the key frequencies affecting each faulty component and assess fault severity in mixed-component scenarios. For mixed-fault components, LIME values provided fault severity levels for each component, supporting prioritized maintenance decisions. This approach offered a clear understanding of the root causes and severity of each faulty component, even in complex fault scenarios.
Integrating this system with real-time monitoring can significantly reduce maintenance costs, minimize equipment downtime, and ultimately improve productivity. The results of this study markedly improve the reliability of fault prediction in WTR systems and contribute to greater operational stability within semiconductor manufacturing processes.

Author Contributions

Conceptualization, S.J.H. and S.-S.H.; Methodology, S.J.H. and S.-S.H.; Software, J.E.J.; Formal analysis, J.E.J.; Investigation, J.E.J.; Data curation, J.E.J.; Writing—original draft, J.E.J.; Writing—review & editing, S.J.H. and S.-S.H.; Supervision, S.J.H. and S.-S.H.; Funding acquisition, S.-S.H. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korean government (MSIT) (No. 2022R1F1A1076374).

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Radamson, H.H.; He, X.; Zhang, Q.; Liu, J.; Cui, H.; Xiang, J.; Kong, Z.; Xiong, W.; Li, J.; Gao, J.; et al. Miniaturization of CMOS. Micromachines 2019, 10, 293. [Google Scholar] [CrossRef] [PubMed]
  2. Ison, A.; Spanos, C.J. Robust fault detection and fault classification of semiconductor manufacturing equipment. In Proceedings of the International Symposium on Semiconductor Manufacturing, Tokyo, Japan, 2–4 October 1996. [Google Scholar]
  3. Uzsoy, R.; Lee, C.-Y.; Martin-Vega, L.A. A Review of Production Planning and Scheduling Models in the Semiconductor Industry Part I: System Characteristics, Performance Evaluation and Production Planning. IIE Trans. 1992, 24, 47–60. [Google Scholar] [CrossRef]
  4. Zhang, K.; Zhou, W.; Li, T.; Wang, S.; Cheng, X.; Xia, Z.; Miao, X. Improvement of Warpage and Leakage for 3D NAND Flash Memory. Mater. Sci. Semicond. Process. 2024, 176, 108294. [Google Scholar] [CrossRef]
  5. Thieullen, A.; Ouladsine, M.; Pinaton, J. Application of Principal Components Analysis to Improve Fault Detection and Diagnosis on Semiconductor Manufacturing Equipment. In Proceedings of the 2013 European Control Conference (ECC), Zurich, Switzerland, 17–19 July 2013; pp. 1445–1500. [Google Scholar] [CrossRef]
  6. Zvonar, J.G. Predictive Failure Monitoring System for a Mass Flow Controller. U.S. Patent US6205409B1, 20 March 2001. Available online: https://patents.google.com/patent/US6205409B1/en (accessed on 15 January 2024).
  7. Kim, S.H.; Kim, C.Y.; Seol, D.H.; Choi, J.E.; Hong, S.J. Machine Learning-Based Process-Level Fault Detection and Part-Level Fault Classification in Semiconductor Etch Equipment. IEEE Trans. Semicond. Manuf. 2022, 35, 174–185. [Google Scholar] [CrossRef]
  8. Muradore, R.; Fiorini, P. A PLS-Based Statistical Approach for Fault Detection and Isolation of Robotic Manipulators. IEEE Trans. Ind. Electron. 2012, 59, 3167–3175. [Google Scholar] [CrossRef]
  9. Zhao, B.; Zhao, X.; Sun, R.-H.; Zhang, X.-J. Research on Key Technology of Semiconductor Robot Controller. J. Artif. Intell. Robot. 2024, 1, 4–10. [Google Scholar] [CrossRef]
  10. Song, K.-T.; Ou, S.-Q.; Yang, C.-A.; Sun, Y.-X.; Kang, L.-R.; Wang, Z.-Y.; Wang, Y.-S.; Lu, P.-C.; Ko, C.-L.; Chen, Y.H. Scheduling and Control of a Wafer Transfer Robot for Foundry Equipment Innovation Competition. IFAC-Pap. 2019, 52, 627–632. [Google Scholar] [CrossRef]
  11. Kim, H.-G.; Yoon, H.-S.; Yoo, J.-H.; Yoon, H.-I.; Han, S.-S. Development of Predictive Maintenance Technology for Wafer Transfer Robot Using Clustering Algorithm. In Proceedings of the 2019 International Conference on Electronics, Information, and Communication (ICEIC), Auckland, New Zealand, 22–25 January 2019; pp. 1–4. [Google Scholar] [CrossRef]
  12. Yoo, J.-H.; Park, Y.-K.; Han, S.-S. Predictive Maintenance System for Wafer Transport Robot Using K-Means Algorithm and Neural Network Model. Electronics 2022, 11, 1324. [Google Scholar] [CrossRef]
  13. Jeon, J.-E.; Song, W.-S.; Hong, S.-J.; Han, S.-S. Predictive Maintenance System for Wafer Transfer Robot Using Gaussian Mixture Model and Mean-Shift Clustering. Procedia Comput. Sci. 2024, 237, 453–460. [Google Scholar] [CrossRef]
  14. Brito, L.C.; Susto, G.A.; Brito, J.N.; Duarte, M.A.V. Fault Diagnosis using eXplainable AI: A transfer learning-based approach for rotating machinery exploiting augmented synthetic data. Expert Syst. Appl. 2023, 232, 120860. [Google Scholar] [CrossRef]
  15. Cohen, J.; Huan, X.; Ni, J. Shapley-based explainable ai for clustering applications in fault diagnosis and prognosis. J. Intell. Manuf. 2024, 1–16. [Google Scholar] [CrossRef]
  16. Jang, K.; Pilario, K.E.S.; Lee, N.; Moon, I.; Na, J. Explainable artificial intelligence for fault diagnosis of industrial processes. IEEE Trans. Ind. Inform. 2023, 1–8. [Google Scholar] [CrossRef]
  17. Heckbert, P. Fourier Transforms and the Fast Fourier Transform (FFT) Algorithm. Computer Graphics 2, 15-463, Carnegie Mellon University. 1995. Available online: https://www.cs.cmu.edu/afs/andrew/scs/cs/15-463/99/pub/www/notes/fourier/fourier.pdf (accessed on 24 September 2024).
  18. Fluke. Understanding the Effects of Vibration Testing. Available online: https://www.e4ds.com/webinar_tech_dn.asp?idx=58 (accessed on 2 October 2020).
  19. Atoui, I.; Meradi, H.; Boulkroune, R.; Saidi, R.; Grid, A. Fault Detection and Diagnosis in Rotating Machinery by Vibration Monitoring Using FFT and Wavelet Techniques. In Proceedings of the 2013 8th International Workshop on Systems, Signal Processing and Their Applications (WoSSPA), Algiers, Algeria, 12–15 May 2013; pp. 401–406. [Google Scholar] [CrossRef]
  20. Henderi, T.; Wahyuningsih, T.; Rahwanto, E. Comparison of Min-Max Normalization and Z-Score Normalization in the K-Nearest Neighbor Algorithm to Test the Accuracy of Types of Breast Cancer. Int. J. Inform. Inf. Syst. 2021, 4, 13–20. [Google Scholar] [CrossRef]
  21. McFadden, P.D.; Smith, J.D. Vibration Monitoring of Rolling Element Bearings by the High-Frequency Resonance Technique—A Review. Tribol. Int. 1984, 17, 3–10. [Google Scholar] [CrossRef]
  22. McFadden, P.D.; Smith, J.D. Model for the Vibration Produced by a Single-Point Defect in a Rolling Element Bearing. J. Sound Vib. 1984, 96, 69–82. [Google Scholar] [CrossRef]
  23. Shakya, P.; Darpe, A.K.; Kulkarni, M.S. Vibration-Based Fault Diagnosis in Rolling Element Bearings: Ranking of Various Time, Frequency and Time-Frequency Domain Data-Based Damage Identification Parameters. Int. J. Cond. Monit. 2013, 3, 53–62. [Google Scholar] [CrossRef]
  24. Sung, B.-J.; Lee, J.-B. Development of Reliability Assessment Technique for Wafer Transfer Robot. J. Int. Counc. Electr. Eng. 2018, 8, 190–194. [Google Scholar] [CrossRef]
  25. Karlsson, M.; Hörnqvist, F.C. Robot Condition Monitoring and Production Simulation. Master’s Thesis, Luleå University of Technology, Luleå, Sweden, 2018. Available online: https://www.diva-portal.org/smash/get/diva2:1211923/FULLTEXT01.pdf (accessed on 24 September 2024).
  26. Liu, Y.; Xu, M.; Cao, Y. Research, Design, and Experiment of End Effector for Wafer Transfer Robot. Ind. Robot Int. J. 2012, 39, 79–91. [Google Scholar] [CrossRef]
  27. Melas, D.A.; Craighead, I.A.; Esat, I.I. An Experimental Study for Measuring the Natural Frequency of the Timing Belt. Int. J. Veh. Des. 2004, 35, 274–288. [Google Scholar] [CrossRef]
  28. Khazaee, M.; Banakar, A.; Ghobadian, B.; Mirsalim, M.A.; Minaee, S.; Jafari, S.M. Detection of Inappropriate Working Conditions for the Timing Belt in Internal-Combustion Engines Using Vibration Signals and Data Mining. Proc. Inst. Mech. Eng. Part D J. Automob. Eng. 2016, 220, 947–957. [Google Scholar] [CrossRef]
  29. Zhang, J.; Zhang, H.; Du, C.; Zhao, W. Research on the Dynamics of Ball Screw Feed System with High Acceleration. Int. J. Mach. Tools Manuf. 2016, 111, 9–16. [Google Scholar] [CrossRef]
  30. Lin, T.R.; Yu, K.; Tan, J. Condition Monitoring and Fault Diagnosis of Roller Element Bearing. IntechOpen 2017. [Google Scholar] [CrossRef]
  31. Le, T.; Nguyen, V.; Pham, T.; Dinh, M.; Le, T.H. Fuzzy Semi-Supervised Large Margin One-Class Support Vector Machine. In Some Current Advanced Researches on Information and Computer Science in Vietnam; Dang, Q.A., Nguyen, X.H., Le, H.B., Nguyen, V.H., Bao, V.N.Q., Eds.; Springer International Publishing: Cham, Switzerland, 2015; pp. 65–78. [Google Scholar] [CrossRef]
  32. Abro, A.A.; Taşcı, E.; Ugur, A. A Stacking-Based Ensemble Learning Method for Outlier Detection. Balk. J. Electr. Comput. Eng. 2020, 8, 181–185. [Google Scholar] [CrossRef]
  33. Montesinos Lopez, O.A.; Montesinos Lopez, A.; Crossa, J. Overfitting, Model Tuning, and Evaluation of Prediction Performance. In Machine Learning for Decision Sciences with Case Studies in Python; Springer: Cham, Switzerland, 2021; pp. 59–80. [Google Scholar] [CrossRef]
  34. Srivastava, N.; Hinton, G.; Krizhevsky, A.; Sutskever, I.; Salakhutdinov, R. Dropout: A Simple Way to Prevent Neural Networks from Overfitting. J. Mach. Learn. Res. 2014, 15, 1929–1958. Available online: http://jmlr.org/papers/v15/srivastava14a.html (accessed on 24 September 2024).
  35. Sharma, J.; Mittal, M.L.; Soni, G.; Keprate, A. Explainable Artificial Intelligence (XAI) Approaches in Predictive Maintenance: A Review. Recent Pat. Eng. 2024, 18, el70423215860. [Google Scholar] [CrossRef]
  36. Strumbelj, E.; Kononenko, I. Explaining Individual Predictions When Features Are Dependent: More Accurate Approximations to Shapley Values. arXiv 2019, arXiv:1903.10464v3. Available online: https://arxiv.org/abs/1903.10464v3 (accessed on 24 September 2024).
  37. Ribeiro, M.T.; Singh, S.; Guestrin, C. “Why Should I Trust You?”: Explaining the Predictions of Any Classifier. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13–17 August 2016; arXiv:1602.04938. Available online: https://arxiv.org/abs/1602.04938 (accessed on 26 March 2024).
Figure 1. Experimental setup.
Figure 1. Experimental setup.
Electronics 13 04471 g001
Figure 2. The result of data pre-processing using FFT; (a) x -axis; (b) y -axis; (c) z -axis; (d) normalized x -axis.
Figure 2. The result of data pre-processing using FFT; (a) x -axis; (b) y -axis; (c) z -axis; (d) normalized x -axis.
Electronics 13 04471 g002aElectronics 13 04471 g002b
Figure 3. Bearing motor fault data generation; (a) caution; (b) warning; (c) breakdown.
Figure 3. Bearing motor fault data generation; (a) caution; (b) warning; (c) breakdown.
Electronics 13 04471 g003
Figure 4. Stacking classifier; (a) principle of model training; (b) schematic of modeling.
Figure 4. Stacking classifier; (a) principle of model training; (b) schematic of modeling.
Electronics 13 04471 g004
Figure 5. Framework of modeling.
Figure 5. Framework of modeling.
Electronics 13 04471 g005
Figure 6. Identification of affected frequency using SHAP.
Figure 6. Identification of affected frequency using SHAP.
Electronics 13 04471 g006
Figure 7. Identification of affected frequency using LIME; (a) bearing motor; (b) end effector; (c) robot hand with ball screw; (d) timing belt with bearing motor.
Figure 7. Identification of affected frequency using LIME; (a) bearing motor; (b) end effector; (c) robot hand with ball screw; (d) timing belt with bearing motor.
Electronics 13 04471 g007
Table 1. The result of fault detection and fault level classification using stacking classifier.
Table 1. The result of fault detection and fault level classification using stacking classifier.
StatusPrecisionRecallF1-ScoreSupport
Normal0.940.940.9416
Caution0.750.900.7810
Warning0.940.770.8522
Breakdown0.960.970.97113
Overall Accuracy 0.95
Table 2. The result of fault component classification using stacking classifier.
Table 2. The result of fault component classification using stacking classifier.
ComponentPrecisionRecallF1-ScoreSupport
Ball screw1.001.001.0027
Bearing motor0.960.960.9626
End effector0.910.840.8725
Robot hand with Bearing motor1.001.001.027
Robot hand with Ball screw0.901.000.9527
Robot hand0.961.000.9827
Timing belt with Bearing motor1.00.890.9427
Timing belt with Ball screw0.931.000.9627
Timing belt1.000.960.9826
Overall Accuracy 0.95
Table 3. The result of end robot hand with end effector using LIME value.
Table 3. The result of end robot hand with end effector using LIME value.
ComponentStageLIME Value
End effectorBreakdown0.19
Caution0.43
Warning0.40
Robot handBreakdown0.20
Caution0.65
Warning0.45
UnknownUnknown0.25
Table 4. The result of timing belt with ball screw using LIME value.
Table 4. The result of timing belt with ball screw using LIME value.
ComponentStageLIME Value
Ball screwCaution0.52
Timing beltCaution0.43
Warning0.49
UnknownUnknown1.32
Table 5. The result of ball screw with robot hand using LIME value.
Table 5. The result of ball screw with robot hand using LIME value.
ComponentStageLIME Value
Ball screwCaution1.02
Waring0.16
Robot handBreakdown0.34
Caution0.65
UnknownUnknown0.27
Table 6. The result of timing belt with bearing motor using LIME value.
Table 6. The result of timing belt with bearing motor using LIME value.
ComponentStageLIME Value
Bearing motorCaution1.27
Waring0.39
Timing beltBreakdown0.20
Warning0.20
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Jeon, J.E.; Hong, S.J.; Han, S.-S. Utilization of Machine Learning and Explainable Artificial Intelligence (XAI) for Fault Prediction and Diagnosis in Wafer Transfer Robot. Electronics 2024, 13, 4471. https://doi.org/10.3390/electronics13224471

AMA Style

Jeon JE, Hong SJ, Han S-S. Utilization of Machine Learning and Explainable Artificial Intelligence (XAI) for Fault Prediction and Diagnosis in Wafer Transfer Robot. Electronics. 2024; 13(22):4471. https://doi.org/10.3390/electronics13224471

Chicago/Turabian Style

Jeon, Jeong Eun, Sang Jeen Hong, and Seung-Soo Han. 2024. "Utilization of Machine Learning and Explainable Artificial Intelligence (XAI) for Fault Prediction and Diagnosis in Wafer Transfer Robot" Electronics 13, no. 22: 4471. https://doi.org/10.3390/electronics13224471

APA Style

Jeon, J. E., Hong, S. J., & Han, S.-S. (2024). Utilization of Machine Learning and Explainable Artificial Intelligence (XAI) for Fault Prediction and Diagnosis in Wafer Transfer Robot. Electronics, 13(22), 4471. https://doi.org/10.3390/electronics13224471

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop