3.2. PCA Process and Results
To standardize data in
Table 8, average values and standard deviations were calculated at first (see
Table 9). After that the data were standardized using the normalization method. The results can be seen in
Table 10.
PCA was performed afterwards on the data from
Table 10 using Statistical Product and Service Solutions (SPSS) software, and the results are shown in
Table 11. Six components were selected from the initial solution. The total variance of the original variables was calculated, giving the cumulative contribution rate of 100%. Two components with λ exceeding one (2.607 and 1.314) were selected as the most informative, whose contribution rates were found to be 43.455% and 21.906%, respectively, resulting in the cumulative contribution rate of 65.361%. In this case, PCA is satisfactory because most of the information about the parameters of the samples could be successfully extracted.
The coefficients of elements 1 and 2 were afterwards calculated using the regression method in SPSS software. The results are available in
Table 12, where
z1 and
z2 components account for a major proportion. This means that surface stress and vacant pillars exert a significant influence on the TVG damage, which agrees with a practical situation.
According to the data in
Table 12 and Formula (1), the new variables were determined as follows:
In turn, the scores of two principal components were generated by calculating the standardized values from
Table 10 using Formula (8). The results can be found in
Table 13.
3.3. HCA Process and Results
HCA was applied to the damage-inducing parameters of thirteen TVG pieces by using SPSS software.
Table 14 shows the Euclidean distances between the samples. The first minimum Euclidean distance (0.168) was found for samples 3 and 9, meaning that. According to HCA, these can be classified into one kind. The next Euclidian distance (0.185) linked samples 7 and 12.
Table 15 depicts the condensation states of all TVG samples, and the connection statistics between Clusters 1 and 2 were used for clustering. Based on the results from
Table 14, samples 3 and 9 were clustered into one kind with a coefficient of 0.168. Another clustering occurred between samples 7 and 12 with a coefficient of 0.185. The next possible clustering was between samples 7 and 13 as sample 13 joined with sample 7 and sample 12, with the coefficients of 0.192, and so on.
Figure 3 shows a tree diagram as a result of HCA of the selected data. The distances between different kinds were mapped in accordance with numbers 0 to 25. As expected, the samples could be classified into the three groups, if drawing a line at a point 15 on the abscissa. The first kind composed samples 3, 9, 2, and 11. The second was formed by sample 7, 12, 13, 4, and 1, and the third one corresponded to sample 5, 8, 10, and 6.
It is noteworthy that the groups presented above were obtained according to the distances between the corresponding clusters only. In this respect, it is difficult to say which of the kinds is the best. To perform the quality analysis of the data, PCA was involved in the discussion. As seen in
Table 11, the contribution rate of element 1 was obviously the largest, followed by that of element 2. The analysis of scores from
Table 13 for each sample at elements 1 and 2 revealed that element 1 including samples 7, 12, 13, 4, and 1 got the highest score of 0.68759 and even more, which indicated the highest quality as a whole. In turn, the lower scores of the group including samples 3, 9, 2, and 11 and the kind formed by samples 5, 10, 8, and 6 meant the lower quality. However, distinguishing the better group among the latter two kinds is a challenge that requires a further analysis of element 2. While the scores of the group including samples 3, 9, 2, and 11 are above zero, those of the kind with samples 5, 10, 8, and 6 are below zero. Therefore, the quality of the former kind of samples is better. The relevant grade classification of TVG can be found in
Table 16.
Besides, the average values on elements 1 and 2 of the corresponding samples were calculated with respect to the three grades of damage, and the results are shown in
Table 17.
In this respect, evaluating the performance of new TVGs can be implemented through the calculation of Euclidean distances of sample’s two principal components and their average values. Each minimum Euclidean distance between the samples allows them to be automatically classified into a certain kind.
3.4. Evaluation Model Verification
The damage evaluation model of TVG can be summarized according to the evaluation process described above. First, six characteristic parameters were tested for the new samples. Each parameter was then standardized and two principal components were found. After that, Euclidean distances of the average values were calculated for main components, allowing one to extract various grades of the samples. Finally, the sample grade could be classified into the kinds with the minimum distance. The whole process of evaluation and verification is shown in
Figure 4.
Taking a new sample of TVG and testing six characteristic parameters result in:
Z1 = 8,
Z2 = 9,
Z3 = 7,
Z4 = 7,
Z5 = 8, and
Z6 = 8. Standardizing them using data in
Table 9 provides the values:
z1 = 0.41278,
z2 = 1.09529,
z3 = −0.61574,
z4 = −0.09875,
z5 = 0.91877, and
z6 = 0.41279. Putting them into Formula (8) gives two principal components as
p1 = 0.24766,
p2 = −0.61149. Applying Formula (7) ensures Euclidean distances between the new samples’ principal components and the corresponding average values from
Table 17, and the results can be found in
Table 18. According to these data, the new sample is close to Grade I and can thereby be classified with respect to this grade.