In this section, we will review the performance analysis findings of our proposed integrated model, which utilizes ensemble subspace KNN classification based on a BOF model. The experimental settings involved variations of the BOF with different numbers of levels in the vocabulary tree K values, applied with machine learning algorithms using datasets comprising muscle ultrasound images. We extensively examined the extracted features from BOF, demonstrating the best performance based on classification accuracy using the optimal K value. We compared the performance of our BOF dataset with cross-validation using the ensemble subspace KNN algorithm, which is considered a state-of-the-art model for our research.
4.1. BOF Empirical Evaluation
The suggested BOF-based ensemble subspace KNN muscle ultrasound classification model is designed to diagnose DPN. The model undergoes training using labeled ultrasound images of six muscles to construct the BOF codebook and set parameters for the classification model. During the testing stage, we utilized ultrasound images as the basis for our key points detection algorithm. This enabled us to derive SURF descriptors from the recognized key points. We then quantized the most dependable features using a created codebook to produce frequency histograms for the testing images. Ultimately, the pre-existing classification model produced the intended result. The dataset was used to select optimal values using empirical analysis to find the best setting for optimizing the BOF. In this study, we examined the impact of varying the K values for quantization of the visual word (K visual word) on the system’s performance. We conducted experiments using different K values, from 200 to 400, to explore how increasing the number of extracted features from each muscle ultrasound image affects the classification accuracy.
Additionally, we compared two classification models with the proposed ensemble subspace KNN-based classification in the BOF model. The first model involved fine KNN algorithms with a Euclidean distance and a single neighbor with equal distance weight. The second model used SVM with a Gaussian Kernel function and a Kernel scale parameter set to 32.
Table 4 illustrates the empirical analysis of ADM ultrasound images. The highest accuracy of 97.23% was achieved with 400 visual words among the three classifiers using our proposed ensemble subspace KNN algorithm. The base KNN and SVM algorithms exhibit variation as the K value increases alongside our proposed ensemble model.
Table 5,
Table 6,
Table 7,
Table 8 and
Table 9 demonstrate the impact of increasing the number of visual words K from 200 to 400 on the classification models of ultrasound images of TA, RF, BR, AHB, and BB muscle, respectively. This empirical analysis is based on three classification algorithms to indicate the performance of our proposed ensemble model.
The proposed ensemble subspace KNN algorithms demonstrated superior classification accuracy compared to KNN and SVM with a K value of 400 visual words BOF. They achieved performance accuracy of 95.85% for TA muscle, 95.70% for RF muscle, 95.17% for BR muscle, 92.67% for AHB, and 89.00% for BB muscle. In comparison, KNN achieved performance accuracy of 95.26%, 94.16%, 93.13%, 91.7%, and 85.61%, while SVM achieved performance accuracy of 90.12%, 92.41%, 89.27%, 92.12%, and 85.13% for ultrasound images of TA, RF, BR, AHB, and BB muscle, respectively.
Increasing the number of visual words from 200 to 400 positively impacted the performance of ensemble, KNN, and SVM algorithms by improving classification accuracy metrics. The analysis revealed that by increasing the number of visual words K, there was an average improvement in performance accuracy of nearly 5% across the six muscle ultrasounds for the classification algorithms.
4.3. Classification Performance Ensemble Subspace KNN
The classification evaluation results are based on the performance metrics listed in
Table 10 for the feature datasets. We used five-fold cross-validation to ensure the model’s performance on an independent dataset and to effectively identify issues such as overfitting or selection bias to gain valuable insights. This evaluation used an ensemble subspace KNN with the extracted features, employing the BOF with 400 visual words.
The results show that the ensemble-BOF-based classifier for the ADM muscle achieved the highest classification accuracy of 97.23%, while the sensitivity, specificity, precision, F1-score, and AUC were 97.62%, 96.88%, 96.63%, 97.12%, and 99.52%, respectively.
On the other hand, our proposed model achieved the lowest result with BB muscle, with an accuracy of 89.00%, sensitivity of 89.70%, specificity of 88.75%, precision of 85.41%, F1-score of 87.29%, and AUC of 99.62%. Additionally, the evaluation metrics of RF, TA, BR, and AHB achieved an accuracy of 95.7%, 95.85%, 95.17%, and 92.67%, while the AUC was 99.06%, 99.21%, 99.04%, and 92.83%, respectively. The sensitivity, specificity, precision, and F1-score finding for the TA muscle ultrasound images were 94.12%, 97.61%, 97.56%, and 95.81%, and for RF muscle ultrasound images, were 94.80%, 97.09%, 96.96%, and 95.86%, respectively, based on our proposed ensemble model.
Moreover, the sensitivity, specificity, precision, and F1-score finding for the BR muscle ultrasound images were 96.68%, 93.62%, 93.95%, and 95.3%, and for AHB muscle ultrasound images, the values were 89.66%, 95.62%, 95.25%, and 92.37%, respectively. The sensitivity, specificity, precision, F1-score, AUC, and accuracy can vary within a reasonable range across different muscle ultrasound images due to variations in anatomy, physiology, and the pathological effects of DPN on each muscle. Our proposed ensemble-BOF-based classifier can accurately classify cases as either DPN or healthy.
Figure 6 shows the confusion matrix for the binary classifier for each muscle, based on the proposed ensemble-BOF-based model. The numbers in the matrix indicate the count of images per class. Upon five-fold cross-validation, our model consistently predicted DPN images more accurately than healthy/control images. This is due to anatomical changes in the muscle fibers of DPN patients, resulting in increased muscle echogenicity or grayscale representation. These changes will be reflected in the detected features and improve the classification accuracy of DPN cases.
Additionally,
Figure 7 shows the ROC curve for the ensemble subspace KNN classifier based on the muscle ultrasound images coded with 400 visual words for the ADM muscle ultrasound images, where we achieved the highest classification accuracy compared with other muscles.