Augmentation of Soft Partition with a Granular Prototype Based Fuzzy C-Means

Wang, Ruixin; Xu, Kaijie; Wang, Yixi

doi:10.3390/math12111639

Open AccessArticle

Augmentation of Soft Partition with a Granular Prototype Based Fuzzy C-Means

by

Ruixin Wang

¹

,

Kaijie Xu

^2,*

and

Yixi Wang

²

¹

SDU-ANU Joint Science College, Shandong University, Weihai 264209, China

²

School of Electronic Engineering, Xidian University, Xi’an 710071, China

^*

Author to whom correspondence should be addressed.

Mathematics 2024, 12(11), 1639; https://doi.org/10.3390/math12111639

Submission received: 7 May 2024 / Revised: 20 May 2024 / Accepted: 22 May 2024 / Published: 23 May 2024

(This article belongs to the Special Issue New Advances in Data Analytics and Mining)

Download

Browse Figures

Versions Notes

Abstract

:

Clustering is a fundamental cornerstone in unsupervised learning, playing a pivotal role in various data mining techniques. The precise and efficient classification of data stands as a central focus for numerous researchers and practitioners alike. In this study, we design an effective soft partition classification method which refines and extends the prototype of the well-known Fuzzy C-Means clustering algorithm. Specifically, the developed scheme employs membership function to extend the prototypes into a series of granular prototypes, thus achieving a deeper revelation of the structure of the data. This process softly divides the data into core and extended parts. The core part can be succinctly encapsulated through several information granules, whereas the extended part lacks discernible geometry and requires formal descriptors (such as membership formulas). Our objective is to develop information granules that shape the core structure within the dataset, delineate their characteristics, and explore the interaction among these granules that result in their deformation. The granular prototypes become the main component of the information granules and provide an optimization space for traditional prototypes. Subsequently, we apply quantum-behaved particle swarm optimization to identify the optimal partition matrix for the data. This optimized matrix significantly enhances the partition performance of the data. Experimental results provide substantial evidence of the effectiveness of the proposed approach.

Keywords:

soft partition; granular prototypes; granular computing; membership function; Fuzzy C-Means (FCM); information granules

MSC:

68U35

1. Introduction

Fuzzy C-Means (FCM) is a soft clustering technique [1] that extends the traditional K-means (a hard clustering method) to accommodate data that may belong to multiple clusters simultaneously [2]. Unlike hard clustering methods, which assign each datum exclusively to a single cluster [3], FCM assigns membership values to each datum for every cluster, indicating the degree to which the datum belongs to each cluster [4]. This membership value is represented by a fuzzy membership function, which allows for more nuanced and flexible clustering [5].

In FCM, the algorithm iteratively updates prototypes (cluster centroids) and membership values until convergence is reached [6]. The objective function of the FCM algorithm minimizes the total fuzzy membership-weighted squared distances between the data and the prototypes [7]. This iterative process ensures that each datum is assigned to the prototype where it has the highest membership value. FCM finds applications in various fields, particularly in pattern recognition and classification problems [8]. In classification tasks, FCM can be utilized as a soft clustering technique to assign membership probabilities to data across multiple classes [9]. By considering the fuzzy memberships, FCM provides a more nuanced understanding of the data distribution, allowing for better classification accuracy, especially in scenarios where data may belong to multiple classes simultaneously or exhibit overlapping boundaries between classes [10].

There are many advantages of the FCM algorithm. FCM is less sensitive to noise and outliers compared to hard clustering methods like K-means since it assigns membership values rather than definitive cluster assignments [11]. FCM allows for flexible cluster boundaries by accommodating data that belong to multiple clusters to varying degrees, making it suitable for datasets with complex structures and overlapping clusters [12]. Fuzzy membership values provided by FCM offer a more interpretable representation of data’s relationships with clusters, allowing for a more nuanced analysis of data distribution [13]. FCM can handle datasets of varying shapes, densities, and sizes, making it applicable to a wide range of classification tasks across different domains. Overall, FCM offers a powerful approach to classification problems by incorporating fuzzy logic to provide a more flexible and robust clustering solution [14].

Over the last few decades, theoretical research and application efforts surrounding FCM have made great progress. In [15], the authors present a significant advancement in the field of image segmentation by introducing an innovative adaptive FCM method. Using a dynamic cluster center adjustment mechanism based on data density, the algorithm effectively addresses the limitations of traditional FCM in handling diverse and complex image datasets. The experimental results validate the superiority of the algorithm in terms of segmentation accuracy, computational efficiency, and robustness. By incorporating an adaptive-weighted Euclidean distance measure, the algorithm in [16] addresses the limitations of traditional FCM and improves its accuracy and robustness in clustering tasks. The experimental results validate the effectiveness of the proposed enhancement and highlight its potential for various applications requiring accurate and robust clustering techniques, including data analysis, pattern recognition, and image segmentation.

By leveraging the local data relationships captured through the KNN graph, the FCM-KNN algorithm [17] addresses the limitations of traditional FCM and improves its accuracy and robustness in clustering tasks. The experimental results validate the performance of the proposed hybrid approach and highlight its potential for various applications requiring accurate and robust clustering techniques. In [18], a novel adaptation of the FCM clustering algorithm that incorporates density peaks is introduced to improve clustering performance. The authors provide a detailed description of the adaptive FCM algorithm based on density peaks. They explain how the algorithm identifies density peaks and incorporates them into the fuzzy membership and cluster center update steps. They outline the iterative process of the algorithm and discuss its computational complexity. The study significantly advances the field of clustering algorithms and has implications for various applications requiring accurate and robust clustering techniques.

By adjusting prototypes to better represent the underlying data distribution, an enhanced fuzzy clustering method is developed [19]. The study explains how refinements to prototypes are incorporated into the iterative process of FCM, highlighting the adjustments made to better align prototypes with data. The authors compare it with traditional FCM and other state-of-the-art classification and reconstruction methods using benchmark datasets. Evaluation metrics such as classification accuracy, reconstruction errors, and computational efficiency are considered. However, the limitation of this scheme is that it cannot adaptively obtain the upper and lower bounds of the adjusted prototypes, which will increase the parameters designed by the algorithm. In [20], the authors provide a detailed description of a data weighting algorithm and integrate it into the fuzzy clustering process. They explain how the algorithm adjusts prototype optimization based on the weighted contributions of data, emphasizing its adaptability to different datasets. By assigning appropriate weights to data, the algorithm improves the accuracy of prototype optimization and enhances the classification performance of fuzzy clustering algorithms. The experimental results validate the effectiveness of the proposed approach and highlight its potential for various applications requiring accurate and robust clustering techniques. In summary, adjusting the prototype is one of the most direct and effective ways to improve the classification performance of FCM.

The aforementioned studies all rely on the theory of fuzzy sets to attain algorithmic flexibility. Fuzzy-set-based granular computing is a method that combines fuzzy set theory and granular computing theory to address classification problems. In classification algorithms, it offers a more flexible and robust approach, particularly in handling data with high uncertainty and fuzziness. Furthermore, the existing research on fuzzy clustering typically generates prototypes as discrete points which effectively capture the global characteristics of the data but struggle to represent local characteristics. This limitation poses challenges for practical applications. For instance, when designing a classifier, it is difficult to achieve good classification performance through parameter adjustment. Therefore, effectively extending the prototypes can enhance the flexibility of the algorithm and greatly benefit its application.

In this study, we design a granular prototype based Fuzzy C-Means method that can be used to enhance its performance in classification. In the developed scheme, we utilize the membership functions to expand (define) the prototypes into a collection of granular prototypes to achieve a deeper revelation of the structure in the data. At this point, the data are softly divided into core and extended parts. By introducing these granular prototypes into the objective function of Fuzzy C-Means, we obtain a spectrum of potentially available partition matrices. Subsequently, quantum-behaved particle swarm optimization (QPSO) [21,22,23] is employed to determine an optimal partition (matrix) for the data. This optimized matrix enables us to achieve superior classification results, effectively enhancing the classification rate of the dataset. To the best of our knowledge, this marks the first exploration of such an approach. Our experimental results substantiate the efficacy of the proposed scheme.

The paper is structured to systematically address key phases of the design issues. Section 2 provides a concise introduction to Fuzzy C-Means clustering. In Section 3, we elaborate on an enhanced Fuzzy C-Means through refinements of prototypes and apply the quantum-behaved particle swarm optimization to identify the optimal partition matrix for the data. Section 4 presents the findings of our experimental studies. Finally, Section 5 encapsulates our conclusions.

2. A Brief Introduction of Fuzzy C-Means Clustering

Fuzzy C-Means (FCM) clustering is a popular method used for partitioning a dataset into a set of clusters, where each datum may belong to multiple clusters with varying degrees of membership. Unlike traditional crisp clustering methods, which assign each datum to only a single cluster, FCM allows for soft clustering by assigning membership values to each datum for each cluster [2,24,25]. For a dataset X containing N elements in n-dimensional space, the FCM method aims to minimize an objective function known as the fuzzy partition coefficient, which measures the fuzziness or ambiguity of the clustering. The objective function is defined as the following [26]:

J = \sum_{i = 1}^{N} \sum_{j = 1}^{C} u_{i j}^{m} {∥x_{i} - v_{j}∥}^{2}, i = 1,2, \dots, N; j = 1,2, \dots, C

(1)

where

C

is the number of clusters and

u_{i j}

is the membership value of data

x_{i}

in cluster

j

.

m

denotes the weighting exponent

(m > 1)

, also known as the fuzzifier parameter.

v_{j}

is the prototype (centroid) of cluster

j

, and

∥x_{i} - v_{j}∥

is the Euclidean distance between data

x_{i}

and the centroid

v_{j}

.

The FCM algorithm iteratively updates the membership values

u_{i j}

and cluster prototypes

v_{j}

until convergence is achieved. By using Lagrange multipliers, the membership values and the cluster prototypes can be derived in the following equations:

u_{i j} = \frac{1}{\sum_{k = 1}^{c} {(\frac{∥x_{i} - v_{j}∥}{∥x_{i} - v_{k}∥})}^{\frac{2}{m - 1}}}

(2)

v_{j} = \frac{\sum_{i = 1}^{n} u_{i j}^{m} x_{i}}{\sum_{i = 1}^{n} u_{i j}^{m}}

(3)

FCM allows for greater flexibility in clustering compared to crisp clustering algorithms, making it suitable for applications where data may belong to multiple clusters simultaneously or exhibit ambiguity in their cluster assignments. In practical applications, researchers have consistently aimed to improve the performance of this fuzzy partitioning method.

3. Augmentation of Fuzzy C-Means with Granular Prototypes

In granular computing, the process of clustering data through FCM is also referred to as information granulation, representing a granulation mechanism [27]. Information granules are expressed through prototypes and membership matrices. The prototypes are regarded as representative points in the data, and they serve as a condensed representation of data, offering an overview of the data’s characteristics. Typically, the expectation is that the data encapsulated by prototypes occupies a defined region within the feature space, which enhances the description and characterization of the data’s features. However, relying solely on prototypes often fails to effectively capture the structural nuances of the data or unveil its essence within the feature space. To better reflect and describe the data, as well as to characterize its essence more comprehensively, it becomes imperative to extend prototypes within the feature space in order to form granular prototypes.

In classification issues, the space (region) formed by the core data constitutes the granular prototypes (the core parts of the data are covered by the granular prototypes). In this study, we use prototypes and the membership matrix (

{U = [\dots, u}_{i j}, \dots] ϵ R^{C \times N}

) to establish granular prototypes. To facilitate analysis, we write the membership matrix

U

in the following form:

\begin{array}{l} U = [u_{1}^{T}, u_{2}^{T}, \dots, u_{i}^{T}, \dots, u_{N}^{T}] \\ u_{i} = [u_{i 1}, u_{i 2}, \dots, u_{i j}, \dots, u_{i C}] \\ i = 1,2, \dots, N; j = 1,2, \dots, C \end{array}

(4)

It is easy to know that the core of the data can be identified by the subset of data exhibiting the highest membership degrees for each class. We define a membership threshold

τ

, which we use to divide the membership matrix into two parts. Correspondingly, the data are also divided into two parts: the core part and the contour part.

\begin{array}{l} X = \{X_{c o r e}; X_{c o n t o u r}\} = \{x_{i} (u_{i} \geq τ); x_{i} (u_{i} < τ)\} \\ i = 1,2, \dots, N; j = 1,2, \dots, C \end{array}

(5)

Let

Ω_{j}

represent the jth granular prototypes, which can better characterize the structure of data compared with the prototypes. Thus, we can modify the objective function of the FCM to discover the information contained in the data in depth. By introducing the granular prototypes, the objective function can be modified in the following manner:

J = \sum_{i = 1}^{N} \sum_{j = 1}^{C} u_{i j}^{m} {∥x_{i} - Ω_{j}∥}^{2}

(6)

In other words, the traditional prototypes (represented as point sets) are substituted with a group of representative regions. From a numerical calculation perspective, the above modification can also be expressed as the following:

J = \sum_{i = 1}^{N} \sum_{j = 1}^{C} u_{i j}^{m} {∥x_{i} - v_{j} (v_{j} \in Ω_{j})∥}^{2}

(7)

Figure 1 depicts the membership functions alongside the granular prototypes with a membership threshold

τ = 0.9

. In Figure 1b, the region within the yellow wireframe signifies the core part of the data, which corresponds to the granular prototypes. The incorporation of granular prototypes not only confines the prototype to particular regions during the optimization of the objective function but also enables adjustments to the prototype within that region based on user requirements, thereby enhancing the flexibility of the method.

In classification tasks during model training, we employ classification errors (original labels of the data) to supervise the objective function. This helps us derive the best prototype and consequently generate the optimal partition matrix to achieve the required data partitioning. The process of designing a classifier using the developed scheme is summarized as follows.

First, FCM is employed to generate the initial prototypes and membership matrix (partition matrix). Following this, a membership threshold is set and a number of granular prototypes are constructed. The granular prototypes serve as the primary elements of the information granules, offering an optimization space for traditional prototypes. To accommodate users’ classification requirements, the prototypes can be adjusted based on the granular prototypes, and the partition matrix is modified accordingly. To attain the optimal prototypes, the classification error determined by labels of the data is utilized for the model supervision, and quantum particle swarm optimization (QPSO) [28] is employed for prototype optimization. The QPSO stands out for its prowess in global optimization, swift convergence, adaptability to dynamic environments, robustness against noise, and ability to maintain solution diversity. Additionally, its inherent parallelism enables efficient scalability and computation, making it a compelling choice for various optimization tasks.

In the optimization of QPSO, let

V^{(0)} = [v_{1}^{(0)}, \dots, v_{j}^{(0)}, \dots, v_{C}^{(0)}] \in R^{1 \times (n C)}

be the initial prototypes generated through the classical FCM, and we set the initial position of K particles to

V^{(0)}

. The fitness value is obtained by calculating and minimizing the classification error. The personal best position of the

k

th particle is represented as

V_{k}^{p b e s t} = (v_{k 1}^{p b e s t}, {\dots, v}_{k j}^{p b e s t}, \dots, v_{k C}^{p b e s t})

, and the global best position is given by

V_{k}^{g b e s t} = (v_{k 1}^{g b e s t}, {\dots, v}_{k j}^{g b e s t}, \dots, v_{k C}^{g b e s t}), k \in {1,2, \dots, K}

. The mean of the personal best positions of all particles can be computed in the following form:

\begin{array}{l} mbest (t) = \frac{1}{K} \sum_{k = 1}^{N} V_{n}^{p b e s t} (t) \\ = (\frac{1}{K} \sum_{k = 1}^{K} v_{k 1}^{p b e s t} (t), \dots, \frac{1}{K} \sum_{k = 1}^{K} v_{k j}^{p b e s t} (t), \dots, \frac{1}{K} \sum_{n = 1}^{K} v_{k C}^{p b e s t} (t)) \end{array}

(8)

where

t

indicates the current iterative. The convergence behavior of particles, as outlined in [29], indicates that each particle is expected to converge towards its respective local attractor

V^{l a} = (v_{k 1}^{la}, {\dots, v}_{k j}^{la}, \dots, v_{k C}^{la})

. This local attractor is determined through the following:

V_{k}^{la} (t) = τ V_{k}^{pbest} (t) + (1 - τ) V_{k}^{g b e s t} (t), τ \sim U (0,1)

(9)

By employing the Monte Carlo method, the determination of the particle position is achieved through the following:

V_{k} (t + 1) = v_{k}^{l a} (t) \pm α |m b e s t (t) - V_{k} (t)| \cdot \ln (\frac{1}{u}), u \sim U (0,1)

(10)

where

α

is referred to as the contraction–expansion coefficient. Typically, it is computed as the following [30]:

α = w_{1} + (w_{2} - w_{1}) \frac{(t_{m a x} - t)}{t_{m a x}}

(11)

where

t_{m a x}

is the maximum iteration time. In general,

w_{1} = 0.5, w_{2} = 1

.

Ultimately, the optimized prototypes are produced and lead to a refined partition matrix, thereby enhancing the classification performance. Figure 2 illustrates the methodology of implementing the proposed scheme in detail. To summarize, the fundamental steps include the following:

(1): The employment of classical FCM to conduct fuzzy clustering on dataset X, resulting in an initial partition matrix and a set of prototypes.
(2): The transformation of prototypes into granular prototypes using the membership functions.
(3): The replacement of prototypes with granular prototypes to establish an optimized boundary for the objective function.
(4): The utilization of QPSO for prototype optimization, guided by classification errors for supervision.
(5): The completion of the optimization process, which yields an optimal prototype, refining the partition matrix to maximize classification accuracy, thus completing model training.

4. Experimental Studies

The ensuing analysis seeks to evaluate the efficacy of the devised scheme through a comparative study with the FCM and Gaussian kernel-based fuzzy clustering (GKFCM) [31] methods. The principal objective of this comprehensive array of experiments is to examine the classification performance of these clustering methodologies. A diverse range of experiments are conducted utilizing publicly available datasets (http://archive.ics.uci.edu/ml, https://www.kaggle.com/datasets, and http://lib.stat.cmu.edu/datasets/ (accessed on 1 May 2024)). The classification rate [32] serves as a primary index in these experiments, chosen for its widespread adoption as a standard measure for performance evaluation.

Throughout the experiments, we investigate different values within the interval [1.1, 3.1] for the fuzzification factor, adjusting it in increments of 0.2. For the GKFCM, we set the Gaussian kernel parameter σ² as 10 to 100 in increments of 10. The number of classes is set to range from the original number of classes, denoted as C₀ to C₀ + 5. In the implementation of the QPSO, the method is run for 500 iterations with 30 particles by considering the number of prototypes and the number of dataset samples. However, we also allow the algorithm to terminate when the objective function remains unchanged in 50 consecutive iterations. To evaluate the effectiveness of the proposed approach, we utilize five-fold cross-validation, a commonly employed technique for assessing and validating the classification performance and stability of fuzzy classification models.

We utilize 10 publicly available datasets, detailed descriptions of which can be found on http://archive.ics.uci.edu/ml, https://www.kaggle.com/datasets, and http://lib.stat.cmu.edu/datasets/ (accessed on 1 May 2024). Brief descriptions of these datasets are as follows: the Iris dataset is a classic machine learning dataset containing 150 samples of three different species of iris flowers (setosa, versicolor, and virginica), with four features for each sample (sepal length, sepal width, petal length, and petal width). The Connectionist Bench dataset comprises datasets commonly utilized in neural network research, featuring 208 samples. Each pattern consists of a set of 60 numbers ranging from 0.0 to 1.0. These numbers represent the energy within specific frequency bands, integrated over defined time periods. The Glass Identification dataset contains 214 samples of glass, each with nine attributes, including a refractive index and various chemical properties, aiming to classify the types of glass. The Pima Indian dataset contains 768 medical records of Pima Indian women, with eight physiological attributes (pregnancies, glucose, blood pressure, skin thickness, insulin, BMI, diabetes pedigree function, and age), aiming to predict the onset of diabetes. The Statlog heart dataset contains medical data of patients, including 13 attributes (such as age, sex, and physiological measurements), aiming to predict the presence or absence of heart disease. The User Knowledge dataset contains 403 samples with five features and is used to assess the performance of learning systems and contains data on the knowledge levels of students in different domains and their learning behavior. The FIFA dataset contains data on football matches, players, teams, and match results, and it is used for football data analysis and prediction. The Habermans dataset contains medical data from breast cancer patients, including their age, year of operation, and the number of positive axillary nodes detected, aiming to predict the survival status of patients after five years. The Irish dataset is used for the classification of Irish individuals and contains physiological and demographic attributes. The Sensory dataset contains sensory evaluation data for food and beverages, including taste, aroma, and other sensory attributes.

Figure 3, Figure 4, Figure 5, Figure 6, Figure 7, Figure 8, Figure 9, Figure 10, Figure 11 and Figure 12 plot the experimental results, represented as the means of experimental results for five-fold cross-validation, indicating the classification rates for each dataset. Notably, the application of the proposed method consistently enhances the classification quality across all datasets.

The proposed approach noticeably improves the classification performance across all datasets and demonstrates significant advantages over the FCM algorithm. The kernel-based clustering method also provides enhancements for most datasets. Interestingly, in most cases, the proposed method achieves higher classification rates than the kernel-based clustering algorithm. On average, the classification performance improvement is about 5%, ranging from a minimal improvement of 1% to a maximum of 10%.

In summary, this study presents a novel approach to classifier design utilizing fuzzy clustering with granular prototypes. Through the use of granular prototypes, we achieve a space determined by the membership function for prototype optimization, enabling adjustments to the partitioning matrix and facilitating optimal data partitioning tailored to user requirements.

5. Conclusions

In this study, we develop a granular prototype-based fuzzy clustering method to design an enhanced fuzzy classifier. During the design process, the membership function of the data is used to extend the prototypes into a group of granular prototypes to significantly characterize the structure of data. Consequently, the data are softly partitioned into core and extended segments. We employ quantum-behaved particle swarm optimization to optimize the prototypes within the granular prototypes and use the classification errors derived from the data labels to supervise the optimization process. This developed scheme facilitates the creation of an optimal partition matrix tailored to the dataset. The theoretical framework presented in this study is supported by a series of experimental investigations. The experimental results show that compared with similar methods, our algorithm exhibits excellent performance in designing fuzzy classifiers.

The proposed scheme is not only beneficial to the theoretical research of developing clustering algorithms based on fuzzy sets, but it also facilitates the translation of these technologies into practical applications.

Unfortunately, the proposed algorithm involves the QPSO optimization which results in additional computational overhead. Thus, the proposed models facilitate a specific means of enhancing the performance of fuzzy clustering in classification and also highlight a broader challenge based on reducing computational complexity. One of the future directions is to explore the deep relationship between the size of granular prototypes and the classification rate. Future research endeavors could also explore extending the proposed granular prototype scheme to various fuzzy clustering methods, in order to enhance performance according to user requirements.

Author Contributions

Study conception and design: K.X.; data collection: R.W.; analysis and experimental of results: R.W. and Y.W.; draft manuscript preparation: R.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by the National Natural Science Foundation of China (nos. 62101400, 72101075, 72171069, and 92367206), partly supported by the China Postdoctoral Science Foundation (no. 2023M732743), and partly supported by the Shaanxi Fundamental Science Research Project for Mathematics and Physics (no. 22JSQ032).

Data Availability Statement

Data will be made available on request.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Xu, K.J.; Pedrycz, W.; Li, Z.W.; Nie, W.K. High-accuracy signal subspace separation algorithm based on gaussian kernel. IEEE Trans. Ind. Electron. 2019, 66, 491–499. [Google Scholar] [CrossRef]
Zhou, S.; Li, D.; Zhang, Z.; Ping, R. A new membership scaling fuzzy c-means clustering algorithm. IEEE Trans. Fuzzy Syst. 2021, 29, 2810–2818. [Google Scholar] [CrossRef]
Borlea, I.-D.; Precup, R.-E.; Borlea, A.-B.; Iercan, D. A unified form of fuzzy C-means and K-means algorithms and its partitional implementation. Knowl. Based Syst. 2021, 214, 106731. [Google Scholar] [CrossRef]
Cui, Y.; Hanyu, E.; Pedrycz, W.; Li, Z. Augmentation of rule-based models with a granular quantification of results. Soft Comput. 2019, 23, 12745–12759. [Google Scholar] [CrossRef]
Cerqueti, R.; Mattera, R. Fuzzy clustering of time series with time-varying memory. Int. J. Approx. Reason. 2023, 153, 193–218. [Google Scholar] [CrossRef]
Hanyu, E.; Cui, Y.; Pedrycz, W.; Li, Z. Design of distributed rule-based models in the presence of large data. IEEE Trans. Fuzzy Syst. 2023, 31, 2479–2486. [Google Scholar]
Xu, K.J.; Pedrycz, W.; Li, Z.; Nie, W. Constructing a virtual space for enhancing the classification performance of fuzzy clustering. IEEE Trans. Fuzzy Syst. 2019, 27, 1779–1792. [Google Scholar] [CrossRef]
Hanyu, E.; Cui, Y.; Pedrycz, W.; Li, Z. Fuzzy relational matrix factorization and its granular characterization in data description. IEEE Trans. Fuzzy Syst. 2022, 30, 794–804. [Google Scholar]
Xu, K.J.; Hanyu, E.; Quan, Y.; Cui, Y.; Nie, W. From granulation-degranulation mechanisms to fuzzy rule-based models: Augmentation of granular-based models with a double fuzzy clustering. J. Intell. Fuzzy Syst. 2021, 40, 12243–12252. [Google Scholar] [CrossRef]
Cui, Y.; Pedrycz, W.; Li, Z. Design of fuzzy rule-based models with fuzzy relational factorization. Expert Syst. Appl. 2022, 206, 117904. [Google Scholar]
Xu, K.J.; Pedrycz, W.; Li, Z. Augmentation of the reconstruction performance of fuzzy c-means with an optimized fuzzification factor vector. Knowl. Based Syst. 2021, 222, 106951. [Google Scholar] [CrossRef]
Cui, Y.; Pedrycz, W.; Li, Z. Enhancements of rule-based models through refinements of Fuzzy C-Means. Knowl. Based Syst. 2019, 170, 43–60. [Google Scholar]
Xu, K.J.; Pedrycz, W.; Li, Z. Granular Computing: An augmented scheme of degranulation through a modified partition matrix. Fuzzy Sets Syst. 2022, 440, 131–148. [Google Scholar] [CrossRef]
Cui, Y.; Hanyu, E.; Pedrycz, W.; Li, Z. Designing distributed fuzzy rule-based models. IEEE Trans. Fuzzy Syst. 2021, 29, 2047–2053. [Google Scholar] [CrossRef]
Gao, Y.; Wang, D.; Pan, J.; Wang, Z.; Chen, B. A novel fuzzy c-means clustering algorithm using adaptive norm. Int. J. Fuzzy Syst. 2019, 21, 2632–2649. [Google Scholar] [CrossRef]
Hashemzadeh, M.; Oskouei, A.; Farajzadeh, N. New fuzzy C-means clustering method based on feature-weight and cluster-weight learning. Appl. Soft Comput. 2019, 78, 324–345. [Google Scholar] [CrossRef]
Qin, Y.; Yu, Z.; Wang, C. A novel clustering method based on hybrid k-nearest-neighbor graph. Pattern Recognit. 2018, 74, 1–14. [Google Scholar] [CrossRef]
Lina, R.; Yao, M. Adaptive weighted fuzzy c-means clustering algorithm based on density peaks. In Proceedings of the 2020 International Conference on Artificial Intelligence and Electromechanical Automation (AIEA), Tianjin, China, 26–28 June 2020; pp. 52–56. [Google Scholar]
Galaviz, O.F.R.; Pedrycz, W. Enhancement of the classification and re-construction performance of fuzzy c-means with refinements of prototypes. Fuzzy Sets Syst. 2017, 318, 80–99. [Google Scholar] [CrossRef]
Xu, K.J.; Pedrycz, W.; Li, Z.; Nie, W. Optimizing the prototypes with a novel data weighting algorithm for enhancing the classification performance of fuzzy clustering. Fuzzy Sets Syst. 2021, 413, 29–41. [Google Scholar] [CrossRef]
Du, B.; Wei, Q.; Liu, R. An Improved Quantum-Behaved Particle Swarm Optimization for Endmember Extraction. IEEE Trans. Geosci. Remote Sens. 2019, 57, 6003–6017. [Google Scholar] [CrossRef]
Yang, C.X.; Zhang, J.; Tong, M.S. A Hybrid Quantum-Behaved Particle Swarm Optimization Algorithm for Solving Inverse Scattering Problems. IEEE Trans. Antennas Propag. 2021, 69, 5861–5869. [Google Scholar] [CrossRef]
Luo, J.; Shao, Y.; Liao, X.; Liu, J.; Zhang, J. Complex Permittivity Estimation for Cloths Based on QPSO Method Over (40 to 50) GHz. IEEE Trans. Antennas Propag. 2021, 69, 600–605. [Google Scholar] [CrossRef]
Xu, K.J.; Hanyu, E.; Liu, J.; Xiao, G.; Tang, X.; Xing, M. Constructing perturbation matrices of prototypes for enhancing the performance of fuzzy decoding mechanism. Int. J. Intell. Syst. 2024, 2024, 5780186. [Google Scholar] [CrossRef]
Li, D.; Zhang, H.; Li, T.; Bouras, A.; Yu, X.; Wang, T. Hybrid Missing Value Imputation Algorithms Using Fuzzy C-Means and Vaguely Quantified Rough Set. IEEE Trans. Fuzzy Syst. 2022, 30, 1396–1408. [Google Scholar] [CrossRef]
Mao, W.; Xu, K. Enhancement of the classification performance of fuzzy C-Means through Uncertainty Reduction with Cloud Model Interpolation. Mathematics 2024, 12, 975. [Google Scholar] [CrossRef]
Xu, K.; Nie, W. Bi-Fuzzy Clustering Algorithm by Augmented Granulation-Degranulation Mechanisms. In Proceedings of the 2019 IEEE 4th International Conference on Signal and Image Processing (ICSIP), Wuxi, China, 19–21 July 2019; pp. 782–787. [Google Scholar]
Yang, H.; Li, X.; Chen, S.; Zhao, L. An accurate registration method based on global mixed structure similarity (GMSIM) for remote sensing images. IEEE Geosci. Remote Sens. Lett. 2022, 19, 8005305. [Google Scholar] [CrossRef]
Clerc, M. The swarm and the queen: Towards a deterministic and adaptive particle swarm optimization. In Proceedings of the 1999 Congress on Evolutionary Computation-CEC99 (Cat. No. 99TH8406), Washington, DC, USA, 6–9 July 1999; pp. 1951–1957. [Google Scholar]
Sun, J.; Fang, W.; Wu, X.; Palade, V.; Xu, W. Quantum-behaved particle swarm optimization: Analysis of individual particle behavior and parameter selection. Evol. Comput. 2012, 20, 349–393. [Google Scholar] [CrossRef] [PubMed]
Shang, C.; Gao, J.; Liu, H.; Liu, F. Short-Term Load Forecasting Based on PSO-KFCM Daily Load Curve Clustering and CNN-LSTM Model. IEEE Access 2021, 9, 50344–50357. [Google Scholar] [CrossRef]
Zhu, X.; Pedrycz, W.; Li, Z. Fuzzy clustering with nonlinearly transformed data. Appl. Soft Comput. 2017, 61, 364–376. [Google Scholar] [CrossRef]

Figure 1. Membership functions and granular prototypes.

Figure 2. An overall model: main functional processing phases.

Figure 3. Mean of experimental results for 5-fold cross-validation (Iris dataset).

Figure 4. Mean of experimental results for 5-fold cross-validation (Connectionist Bench dataset).

Figure 5. Mean of experimental results for 5-fold cross-validation (Glass Identification dataset).

Figure 6. Mean of experimental results for 5-fold cross-validation (Pima Indian dataset).

Figure 7. Mean of experimental results for 5-fold cross-validation (Statlog heart dataset).

Figure 8. Mean of experimental results for 5-fold cross-validation (User Knowledge dataset).

Figure 9. Mean of experimental results for 5-fold cross-validation (FIFA dataset).

Figure 10. Mean of experimental results for 5-fold cross-validation (Habermans dataset).

Figure 11. Mean of experimental results for 5-fold cross-validation (Irish dataset).

Figure 12. Mean of experimental results for 5-fold cross-validation (Sensory dataset).

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Wang, R.; Xu, K.; Wang, Y. Augmentation of Soft Partition with a Granular Prototype Based Fuzzy C-Means. Mathematics 2024, 12, 1639. https://doi.org/10.3390/math12111639

AMA Style

Wang R, Xu K, Wang Y. Augmentation of Soft Partition with a Granular Prototype Based Fuzzy C-Means. Mathematics. 2024; 12(11):1639. https://doi.org/10.3390/math12111639

Chicago/Turabian Style

Wang, Ruixin, Kaijie Xu, and Yixi Wang. 2024. "Augmentation of Soft Partition with a Granular Prototype Based Fuzzy C-Means" Mathematics 12, no. 11: 1639. https://doi.org/10.3390/math12111639

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Augmentation of Soft Partition with a Granular Prototype Based Fuzzy C-Means

Abstract

1. Introduction

2. A Brief Introduction of Fuzzy C-Means Clustering

3. Augmentation of Fuzzy C-Means with Granular Prototypes

4. Experimental Studies

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI