An Efficient Segmentation and Classification System in Medical Images Using Intuitionist Possibilistic Fuzzy C-Mean Clustering and Fuzzy SVM Algorithm

Chowdhary, Chiranji Lal; Mittal, Mohit; P., Kumaresan; Pattanaik, P. A.; Marszalek, Zbigniew

doi:10.3390/s20143903

Open AccessArticle

An Efficient Segmentation and Classification System in Medical Images Using Intuitionist Possibilistic Fuzzy C-Mean Clustering and Fuzzy SVM Algorithm

by

Chiranji Lal Chowdhary

¹

,

Mohit Mittal

^2,*

,

Kumaresan P.

¹

,

P. A. Pattanaik

³ and

Zbigniew Marszalek

^4,*

¹

Vellore Institute of Technology, Vellore 632014, India

²

Department of Information Science and Engineering, Kyoto Sangyo University, Kyoto 603-8555, Japan

³

Telecom SudParis, 9 rue Charles Fourier, CEDEX 91011 Evry, France

⁴

Faculty of Applied Mathematics, Silesian University of Technology, 44100 Gliwice, Poland

^*

Authors to whom correspondence should be addressed.

Sensors 2020, 20(14), 3903; https://doi.org/10.3390/s20143903

Submission received: 5 June 2020 / Revised: 6 July 2020 / Accepted: 11 July 2020 / Published: 13 July 2020

(This article belongs to the Special Issue Computational Intelligence and Sensoric Aparatures for Virus, Fungus, Bacteria, and Biological Threads Detection in Image and Data Samples from Multi Surface Environments)

Download

Browse Figures

Versions Notes

Abstract

:

The herpesvirus, polyomavirus, papillomavirus, and retrovirus families are associated with breast cancer. More effort is needed to assess the role of these viruses in the detection and diagnosis of breast cancer cases in women. The aim of this paper is to propose an efficient segmentation and classification system in the Mammography Image Analysis Society (MIAS) images of medical images. Segmentation became challenging for medical images because they are not illuminated in the correct way. The role of segmentation is essential in concern with detecting syndromes in human. This research work is on the segmentation of medical images based on intuitionistic possibilistic fuzzy c-mean (IPFCM) clustering. Intuitionist fuzzy c-mean (IFCM) and possibilistic fuzzy c-mean (PFCM) algorithms are hybridised to deal with problems of fuzzy c-mean. The introduced clustering methodology, in this article, retains the positive points of PFCM which helps to overcome the problem of the coincident clusters, thus the noise and less sensitivity to the outlier. The IPFCM improves the fundamentals of fuzzy c-mean by using intuitionist fuzzy sets. For the clustering of mammogram images for breast cancer detector of abnormal images, IPFCM technique has been applied. The proposed method has been compared with other available fuzzy clustering methods to prove the efficacy of the proposed approach. We compared support vector machine (SVM), decision tree (DT), rough set data analysis (RSDA) and Fuzzy-SVM classification algorithms for achieving an optimal classification result. The outcomes of the studies show that the proposed approach is highly effective with clustering and also with classification of breast cancer. The performance average segmentation accuracy for MIAS images with different noise level 5%, 7% and 9% of IPFCM is 91.25%, 87.50% and 85.30% accordingly. The average classification accuracy rates of the methods (Otsu, Fuzzy c-mean, IFCM, PFCM and IPFCM) for Fuzzy-SVM are 79.69%, 92.19%, 93.13%, 95.00%, and 98.85%, respectively.

Keywords:

virus; intuitionistic possibilistic fuzzy c-mean; support vector machine; segmentation; breast cancer; Mammography Image Analysis Society (MIAS) dataset; machine learning

1. Introduction

Over the estimated new cases of cancer in the USA for 2020, breast cancer is observed as the first leading cancer type in the female. In developing countries [1], there is a lack of early detection schemes, tolerable diagnosis, and cure facilities for breast cancer cases, so the survival rate is low compared with developed countries. Breast cancer needs to be detected at an early stage so that proper treatment can be given to reduce the rate of mortality. Small-sized calcium deposits are known as micro-calcification, and they are an indication of malignancy. Mammography is one of the best reliable diagnostics over other methods like Ultrasound, Positron emission tomography (PET), and Magnetic Resonance Imaging (MRI) [2]. Mammograms are unusual to determine the presence of benign or malignant disease with conviction. The radiologists recommend a patient to go for the next diagnosis in the cases of uncertainty. Opaque regions are typically noisy in digital mammogram images, and have poor contrast. For this reason, it is a challenging responsibility for the radiologist to detect and diagnose a cancerous region. The noise must also be eliminated before a mammogram is processed [3]. A significant amount of noise reduction algorithms have been developed in the last two decades. The Mammography Image Analysis Society (MIAS) Mini Mammographic Database is normally a database of mammograms used in such research work [4,5].

The literature shows that the detection of viruses in breast cancer is highly inconsistent [6]. Figure 1 presents basic breast cancer detection procedures. The breast cancer detection method can be divided into five main approaches: (a) traditional image acquisition techniques; (b) Image enhancement model, especially for noise removal; (c) Find cancer affected area by detecting the suspicious region-of-interest (ROI) on medical images by using suitable segmentation method; (d) feature extraction; and (e) classification of benign or malignant from ROI. Proper segmentation is required for better feature extraction and classification. References [7,8,9] addresses many algorithms for the early detection of breast cancer detection. The evaluation of segmentation based on detection rate and accuracy gave the result of breast cancer detection cases [10,11,12]. The segmentation leads to feature extraction—the calculation of features based on density, texture, morphology, shape, and size of regions [13,14,15]. In the case of large and complex feature space with redundant and excessive features, there is a possibility to take excess time with a tendency to reduce accuracy in classification. It requires redundancy removal for performance improvement.

There are some cases where cancer is erroneously diagnosed among several patients. Osmanovic et al. [11] suggested a diagnostic method to resolve these cases by distinguishing between patients with and without breast cancer by defining the characteristics of cell nuclei present in an exceptional needle aspiration picture. Khwairakpam et al. [16] used fuzzy rules to identify the noise of images and filtered them using fuzzy weighted mean. They used genetic algorithm (GA) to optimize the parameters for fuzzy membership function. To evaluate the proposed filter edge-preserving factor and peak signal-to-noise ratio were used. Dutta et al. [17] proposed an approach to predict breast cancer by data mining methodologies, inference systems, and fuzzy logic. A novel prediction was proposed comprising of a fuzzy inference system with images collected from local clinics. This method performed better than other approaches like Logitboost [18], Locally Weighted Regression(LWL) [19], REP Tree [20], and so forth. Using deep learning algorithms, Khan et al. [21] used transfer learning methods for the classification and detection of breast cancer images. The feature extraction was executed with convolutional neural network (CNN) approaches like Residual Networks (ResNet), Visual Geometry Group Network (VGGNet), and GoogLeNet by feeding into a fully connected layer where malignant and benign cells are classified. Some researchers applied their work on classification and control problems by developing a fuzzy brain emotional learning neural network. Robio et al. [22] worked on self-organizing fuzzy modified least-square network. In References [23,24], the authors used discrete wavelet transform to analyze sub-bands within the Electroencephalography (EEG) parameter for creating model for epilepsy diagnosis. In References [25,26], the authors worked on parametric uncertainties and noisy outputs. Later they tested it for breast tumor classification and the chaotic system synchronization [27,28].

Figure 2 shows a classification task related to the medical images as output. Classifiers methods like rough set data analysis, support vector machine (SVM), decision tree, neural network and linear discriminant analysis (LDA) were extensively used for medical image detection approaches [3,7,29,30].

The major contributions of the proposed method are the following:

In this paper, we considered the MIAS dataset for breast cancer detection.
For the detection of cancer in an image, we applied the existing segmentation techniques such as the Otsu algorithm, FCM (fuzzy c-mean) clustering, IFCM (ntuitionist fuzzy c-mean) clustering, and PFCM (possibilistic7fuzzy c-mean). After that, we propose a segmentation model, that is, IPFCM clustering. In addition to this, statistical Feature extraction techniques are also taken into account.
The simulation results are investigated with four classification models such as DT (decision tree), RSDA (rough set data analysis), FCM and Fuzzy SVM. Besides this, we considered IPFCM (intuitionistic possibilistic7fuzzy c-mean) clustering model and Fuzzy SVM classification model resulting in a more promising accuracy than state-of-the-art studies.
This paper presents the evaluation criteria of specificity, sensitivity, MCC (Matthew’s correlation coefficien), PPV ( positive predictive value), accuracy, and NPV (negative predictive value) for classification measurements.
Finally, simulation results of the proposed approach is fast and accurate recognition over existing results from the MIAS dataset.

The rest of paper is organized as follows. Section 2 discusses pre-processing (Section 2.1), segmentation (Section 2.2), clustering (Section 2.3) and feature extraction (Section 2.4). Section 3 highlights the overview of classifications methods and evaluation criterion. Section 4 describes the proposed Intutionistic possibilistic fuzzy clustering algorithm. Section 5 explains the setup used in the design of the experiment and the obtained results with respect to state-of-the-art methods. Finally, the paper is concluded in Section 6.

2. Materials and Methods

2.1. Basic Preprocessing: Noise Removal

Micro-calcification is the primary symptom of the malignant cells. In younger women who appear to indicate denser breast tissue, malignancy detection is particularly troublesome. Most mammography images are noisy and typically include regions of low contrast. Noise, like dust, hair, capture, and storage, are found in digital mammograms. Dense areas are typically noisy and have poor contrast in digital mammography images. For this reason, noise removal is required before breast cancer identification and treatment. Many noise removal algorithms have been proposed in the last two decades. Median filtering, max-min filter, midpoint filtering, adaptive median filtering, alpha-trimmed mean filtering, quantum noise filtering, impulse noise filtering, and wavelet thresholding are methods for mammogram noise removal.

2.2. Background of Segmentation

Segmentation is an essential technique for the analysis of image processing. Image Segmentation plays a crucial role in the diagnosis of disease. The segmentation of medical images aims to make digital images easier and more accessible to analyze. The image is partitioned in multiple non-overlapping, significant homogeneous areas during the segmentation process. Segmentation is based on the techniques of unsupervised clustering. It became challenging in the case of medical imaging due to poor contrast and the noise caused in the acquisition [31,32].

For better understanding and alleviation of the segmentation methods, many papers have been published in the last two decades. According to References [33,34,35], clustering has three main issues—(1) problem-solving, (2) decision-making, and (3) image segmentation. The threshold believed as the most straightforward method over other segmentation methods. The thresholding directive for segmentation is to use the Otsu algorithm [36] for medical images to maximize the class reparability for a class variance. The work in Reference [31] applied the fuzzy method in medical images to overcome the uncertainty issues of vagueness, boundaries, and variations in grey-level images. Clustering has the leading role in separating unlabelled data into discrete sets.

Several potential clustering methods are presented, including k-mean, fuzzy c-mean, artificial neural network, genetic algorithms, and many improved forms of such methods. However, it is challenging in designing an optimum solution from the thresholding technique for further improvement in disease detection methods by medical images; this was a call for constant effort from the research communities. The k-mean clustering method confines every data in a precise cluster which is not functional for all tenders. Meanwhile, the work in [31] employed Fuzzy c-mean for image segmentation which assigns each pixel to unlabeled fuzzy clusters so that each pixel was retained in all clusters with varying membership degrees. Motivated by this research effort, it is found that the membership of Fuzzy c-mean cannot reflect the degrees of data that belong to it. The potential of FCM clustering for diagnosing diseases like breast cancer has been proved to resolve the uncertainty and unknown noise in medical image segmentation.

Interestingly, fuzzy c-mean membership does not represent the degree of the belonging data. As one of the most effective solutions, the authors of References [37,38,39,40,41,42] used possibilistic c-means. Every element has value ranges of 0–1. For example, the possibilistic c-mean identifies outliers (noise points). Another solution in Reference [43] leveraged fuzzy-possibilistic c-mean algorithm to produce typicality values and membership values where clustering data are unlabeled. The authors of Reference [44] applied a possibilistic fuzzy c-mean algorithm that produces membership and possibility under useful point models or cluster centres. possibilistic fuzzy c-mean was useful in detecting fuzzy rule-structures. The work of Reference [45] implemented adaptive and non-adaptive fuzzy c-mean algorithms. The work in Reference [46] applied an adaptive approach to finding the weights of local spatial factors in local spatial continuity. The researchers applied MRIs and found substantial success with the proposed possibilistic fuzzy c-mean method. The possibilistic fuzzy c-mean method was more robust and efficient for many levels of noise. To overcome the noise condition drawback of fuzzy c-mean clustering, the authors of Reference [47] proposed an exponential fuzzy c-mean to enhance membership issues that results in a more meaningful membership degree over fuzzy c-mean.

The work of References [48,49,50] introduced the intuitionist fuzzy c-mean technique and used that approach in medical images. The key findings from this study of intuitionist fuzzy sets are—(1) the degrees of membership; (2) degrees of non-membership; and (3) degrees of hesitation. The goal of our proposed work is to hybridize two algorithms—(1) the possibilistic FCM algorithm, and (2) the intuitionist fuzzy c-mean algorithm. The traditional clustering methods cannot overcome various factors, like noisy data and outliers. Therefore, we used a possibilistic approach to solve those problems. In order to strengthen the possibilistic c-mean algorithm, we have hybridization with an intuitionist fuzzy c-mean algorithm. The Intuitionist fuzzy c-mean algorithm used to solve uncertainty issues by addressing degree of hesitation during the membership function [51,52,53].

2.3. Preliminaries of Clustering

The clustering methods were commonly used in the segmentation or classification of medical images. For many practical issues, clustering analyses were used to explore the data structure to understand the characteristics of data. Different clustering algorithms were proposed, including the Otsu algorithm [36], the k-means algorithm [54], the FCM algorithm [55], various improved FCM algorithms [43,44,45,46,47,48,49] and so on.

2.3.1. Fuzzy C-Mean (FCM)

The FCM method [47] was demarcated a set B in k clusters. This batch has N members from

B = b_{1}, b_{2}, b_{3}, . . ., b_{4}

. It was noticed that an uncertain state of the data

b_{1}

was entrusted into several clusters by various degrees of membership

u_{l m}

. The membership of a cluster data is determined by paralleling its distance or dissimilarity from the cluster centroid

v_{m}

to

d_{l m}

. Distance measurements are conducted with the aid of Euclidean formula defined in Equation (1):

F C M = \sum_{m = 1}^{k} \sum_{l}^{N} u_{l m}^{p} d_{l m}^{2},

(1)

where

p ε (1, \infty)

and

\sum_{m = 1}^{k} d_{l m} = 1

.

The task of the fuzzifier parameter p was to mechanize membership degree control over the objective function. The value of degree and centroid of membership was expressed in Equations (2) and (3), respectively.

u_{l m} = \frac{1}{\sum_{q = 1}^{k} (\frac{d_{l m}^{2}}{d_{l q}^{2}})}

(2)

v_{m} = \frac{\sum_{l = 1}^{N} u_{l m}^{p} x_{l}}{\sum_{l = 1}^{N} u_{l m}^{p}}

(3)

2.3.2. Possibilistic Fuzzy C-Mean (PFCM)

Over k-mean results, the FCM clustering results are good but more noise sensitive. One limitation is that all membership degrees for each data point are incorporated cluster-wise into one, which leads to the abnormal points being members of clusters. The limitations of FCM were overcome by hybridization of the possibilistic approach with fuzzy c-mean, and that approach was named possibilistic fuzzy clustering (PFCM). Equation (4) represents the possibilistic fuzzy c-mean approach as:

P F C M = \sum_{m = 1}^{k} \sum_{l}^{N} u_{i m}^{p} d_{l m}^{2} + \sum_{m = 1}^{k} λ_{m} (\sum_{l}^{N} 1 - u_{l m})

(4)

The role of membership degree and positive number were in Equations (5) and (6) as follows:

u_{l m} = \frac{1}{1 + (\frac{d_{l m}^{2}}{d_{l q}^{2}})}

(5)

Λ_{m} = W \frac{\sum_{l = 1}^{N} u_{l m}^{p} d_{l m}^{2}}{\sum_{l = 1}^{N} u_{l m}^{p}}

(6)

In Equation (6), W is an amendable weight which is typically set to one.

FCM (in Equation (3)) obtains the optimum solution for centroid revamping. In Equation (4), PFCM minimizes if all clusters are coincident clusters. The membership degree relies heavily on the gap between the data and the particular cluster without any consideration of other clusters.

2.3.3. Intuitionistic Fuzzy C-Mean (IFCM)

Another enhanced fuzzy clustering was based on an intuitionistic fuzzy clustering algorithm. The traditional method of fuzzy c-mean was updated using intuitionist fuzzy sets. The cluster centers were modified so that intuitionist properties can be integrated with the fuzzy c-mean method. Atanassov suggested intuitionistic fuzzy sets [48] and discussed the presence of degree of hesitation. It cannot always be valid, according to the author, that the summation of membership degree and degree of non-membership is 1.There was a possibility for degree of hesitation. The level of hesitation was specified as 1 minus the total of degrees of membership and non-membership. The hesitation degree is as follows (Equation (7)):

π_{A} = H e s i t a t i o n D e g r e e = 1 - (M e m b e r s h i p D e g r e e + n o n - M e m b e r s h i p D e g r e e)

(7)

Initially, the hesitation degree was determined using Equation (8) and intuitionistic fuzzy membership values were obtained as follows:

u_{l m}^{*} = u_{l m} + π_{l m},

(8)

where

u_{l m}^{*}

denoted the intuitionistic fuzzy membership of the

m^{t h}

data in

l^{t h}

class. After replacing Equation (8) by Equation (9), the adapted cluster center will be:

v_{m}^{*} = \frac{\sum_{l = 1}^{N} u_{l m}^{*, p} x_{l}}{\sum_{l = 1}^{N} u_{l m}^{*, p}}

(9)

In case of Equation (9), the cluster center was updated simultaneously with the membership matrix. The conventional k-mean clustering method has been used by many medical image segmentation systems proposed by different authors to classify tumors.

2.4. Feature Extraction

A medical image in the form of mammograms is segmented to extract the region of interest (ROI), followed by the feature extraction approach to identify significant features for deciding abnormal relentlessness to check whether tumor status is benign or malignant type. A common approach for the tumor detection of mammograms is using segmentation, followed by feature extraction and then classification to classify benign or malignant images [30]. The radiologists observe the results for identifying breast cancer from screening mammograms by extraction of categories of Breast Imaging Reporting and Data System (BI-RADS). The selection of significant features leads to proper classification. The vital features for extraction from mammograms are texture, shape, margin, and intensity. The segmentation for finding ROI isolates them in segmented area, foreground region of ROI area, and the background region of ROI.

Domineering formulas are encoded here for reckoning of features on the segmented areas. These feature are listed in Table 1.

Another feature of classifications of images is shown in Table 2. The disparity of the lesions, the intensity of the lesions, and their adjacent cells are dissimilar in the ROIs reaped from mammograms. Advanced lesions have considerably sophisticated gray values. In this exertion, the texture features of the mass and the back-ground regions are calculated through the gray level histogram measurements. These are mean, standard deviation, smoothness, skewness, uniformity, entropy, and kurtosis. The variables presumed for a convinced region

r_{r v i n}

are random variables to prompt the intensity,

h (r)

is the gray-level histogram, and L is the gray level.

3. Overview of Classifications Methods

Our proposed work is on noise removal in the preprocessing stage; Intuitionistic possibilistic Fuzzy c-mean clustering performed in the segmentation stage; significant features extraction by using statistical feature extraction methods; and final stage as Classification stage. Later classification results of SVM, decision tree, RSDA, and Fuzzy SVM are tested, optimized, and compared. These classification methods are as discussed below:

3.1. Decision Tree (DT)

A decision tree takes several correlations in factual lifespan, and it can be used in a range of machine learning applications [42,56], covering both classification and regression. The decision tree is aimed at visually and unambiguously representing decisions and decision making for decision analysis. A decision tree classification has three types of nodes—(1) root node, (2) splitting node, and (3) terminal node. Recently, the datasets are classified through the decision summary, well-defined via the tree in order. Then the respective class label is dispensed on the analysis with the terminal nodes, wherein the analysis cascades. A simple decision tree is shown in Table 3.

3.2. Rough Set Data Analysis (RSDA)

Rough set data analysis generates a set of rules from a system of decisions. A significant number of rules must be minimized. To get a minimum number of rules, it is essential to abstract the conditional attributes which are superfluous. Significant steps associated with rough set data analysis are core and reduct computation, finding the significance of attributes, constructing a decision table, producing rules, followed by classifying data [30]. To remove the more essential features, the core and the reduct are determined. Decision and core form the decision-making framework. A set of minimum rules can be created based on the decision system, and those rules are the basic building blocks of the classification model [27,38].

3.3. Support Vector Machine (SVM)

Vapnik proposed support vector machine concepts in Vapnik-Chervonenkis’ learning theory and structural risk minimization (SRM) inductive principle [8]. The SVM theory has attained abundant deliberation in earlier years.

Support Vector Machine provides a better performance in orthodox machine learning applications, pattern recognition for solving classification glitches. SVM is a valuable method for a nonlinear efficient approximation trick [39]. The support vector machine is primarily plotted to a high-dimensional feature space with the input data and leads to creating a spreadable hyperplane that exploits the margin in that space between two groups. The maximization of a margin between two groups can be assumed as a quadratic system designed to solve Lagrangian multipliers [40]. SVM uses the dot product functions to show the optimal hyperplane in the high-dimensional feature space known as kernels. For example, the optimal hyperplane elucidation is known as a combination of approximately input points, and they are called support vectors [41,57].

The inadequacy of the support vector machine is the sensitivity of the training procedure to the noises or outliers in the training datasets because of overfitting. Such uncertainty points are crucial to making decisions and creating an overfitting problem. The improved Fuzzy SVM is discussed in the subsequent sub-section.

3.4. Fuzzy SVM (FSVM)

Support vector machine classification has some drawbacks. Those drawbacks can be overcome by using fuzzy logic in SVM. FSVM [9] is a classification technique based on SVM exemplary for the classification of outliers or noise. The most challenging part of FSVM is acquiring the fuzzy membership of the training data. Lin and Wang projected a design process [9] for finding the fuzzy membership. The distance between the sample and its class center in the high-dimensional function space is used by a kernel extension development to measure a new fuzzy member.

Researchers suggested the

ϵ

-margin nonlinear classification prototype on the base of FCM clustering in the creative input space and the fuzzy

I f - T h e n

rules.

I f - T h e n

rule statements are used to formulate the conditional statements that comprise fuzzy logic. Another method proposed is a joint weight-based Fuzzy-SVM system [9], which reflects an identical training sample with various classes. The weight-based FSVM has difficulty in setting fuzzy membership values and diminishing computational complexity.

The basic theory is support vector machine [49] which is followed by a fuzzy support vector machine algorithm. Let S to be set of label

m = l

head training points for a binary classification delinquent is

(y_{m}, z_{m}, s_{m})

thru

m = l

inclines to. Their contribution data was obtainable by

y_{m} \in R^{n}

accordingly specified a binary class label as

z_{m} \in {- 1, 1}

and the fuzzy membership degree was

s_{m} \in [0, 1]

anywhere

y_{m}

belongs to

z_{m}

. The binary classification delinquent model for a fuzzy support vector machine algorithm is essentially a discrimination restriction based quadratic programming problem which is given in Equations (10) and (11):

subject to

z_{m} [ν^{T} δ (y_{m}) + t] \geq 1 - ϖ_{i}

(10)

ϖ_{i} \geq 0, m = 1, \dots, l .

(11)

This quadratic-optimization problem is solved by building Lagrangian description and transforming it into the corresponding dual problem (Equations (12) and (13)):

m a x_{β} \sum_{m = 1}^{l} β_{m} β_{n} z_{m} z_{n} J (y_{m}, y_{n}) s

(12)

subject to

\sum_{m = 1}^{l} β_{m} z_{m} = 0

(13)

Consider

β_{m}

as a Langrange multiplier through a value which is not equal to 0 when data point m is a support vector, and

J (y_{m}, y_{n})

is a kernel function.

During the use of Gaussian kernel function there is Equation (14)):

J (y_{m}, y_{n}) = e^{(- \frac{1}{2 σ^{2}} ∣ ∣ y_{m} - y_{n} ∣ ∣^{2})}

(14)

Now, the outcome of the solution by FSVM model for the class label of testing y can be expected as in Equation (15).

z (y) = [\sum_{n = 1}^{l} a β_{m} z_{m} J (y_{m}, y_{n}) + t]

(15)

4. Proposed Intuitionistic Possibilistic Fuzzy Clustering

Medical images are complicated to understand. Therefore, it is required to remove undesirable portions of medical images. The noise removal process enhances the quality of the image.

Algorithm 1:IPFCM Methodology.

Initialization
Calculate PFCM which as follows:

$P F C M = \sum_{m = 1}^{k} \sum_{l = 1}^{k} u_{l m}^{P} d_{l m}^{P} + \sum_{m = 1}^{k} λ_{m} (\sum_{l = 1}^{N} (1 - u_{l m}))$

(16)

where,

$u_{l m} = \frac{1}{1 + {(\frac{d_{l m}^{2}}{λ_{m}})}^{\frac{1}{P - 1}}}$

$λ_{m} = W \frac{\sum_{l = 1}^{N} u_{l m}^{P} d_{l m}^{2}}{\sum_{l = 1}^{N} u_{l m}^{P}}$
Hesitation degree is initially calculated with

$π_{A} = H e s i t a t i o n D e g r e e = 1 - (M e m b e r s h i p D e g r e e + n o n - M e m b e r s h i p D e g r e e) .$

(17)
Intuitionistic fuzzy membership value is attained by:

$u_{l m}^{*} = u_{l m} + π_{l m},$

(18)

where $u_{l m}^{*}$ signifies the intuitionistic fuzzy membership of the $m^{t h}$ data in $l^{t h}$ class.
Substitute Equation (18) to Equation (16) for finding IPFCM.

$I P F C M = \sum_{m = 1}^{k} \sum_{l = 1}^{k} u_{l m}^{*, p} d_{l m}^{P} + \sum_{m = 1}^{k} λ_{m} (\sum_{l = 1}^{N} (1 - u_{l m}^{*}))$

(19)

The improved cluster center will be:

$λ_{m} = W \frac{\sum_{l = 1}^{N} u_{l m}^{*, P} d_{l m}^{2}}{\sum_{l = 1}^{N} u_{l m}^{P}}$

(20)

and the cluster center was modernized and instantaneously the membership matrix was also rationalized.
Accomplish the conclusion of iteration. Patronize the convergence standard.
In case convergence was extended, break the iteration otherwise go back to Step 2.

Possibilistic clustering approaches attempt to decrease the membership degree of noisy data, whereas the Intuitionist Possibilistic fuzzy clustering (IPFCM) approach assigns membership and non-membership degrees with hesitation degree. In Algorithm 1, the Intuitionist Possibilistic fuzzy c-mean methodology is written to strengthen the breast cancer detection system. Medical images are subsequently enhanced using Intuitionist Possibilistic fuzzy c-mean algorithms to form a cluster of pixels [47,48]. In order to improve membership assignments, a possibilistic approach has been used to overcome the noise cases. We also proposed an integrated intuitionistic fuzzy c-mean system [48,49] to improve the possibilistic c-mean algorithm [47]. A medical image segmentation system referred as an intuitionistic possibilistic fuzzy c-mean (IPFCM) clustering system. The design of the proposed system is in four stages as an initiative for pre-processing, main segmentation, that is, clustering, statistical extraction and final classification (Figure 3). The driving idea behind our proposed work is based on segmentation part that combines the possibilistic fuzzy c-mean with intuitionistic fuzzy c-mean and reduces the number of iterations to help with minimizing execution time.

5. Results and Discussion

The segmentation algorithm and classification methods are performed using MATLAB R2018a. At the classification stage, a support vector machine, decision tree, rough set data analysis, and fuzzy support vector machines are also executed to compare the accuracy of results.

5.1. Data Collection

The MIAS dataset [58] has a total of 320 digital mammogram images. These images are categorized into three types such as malignant, benign, and normal. There are 51 images in the malignant group, while 63 are benign. The remaining 206 images are normal. The pathological images are quite well known to be malignant. The original MIAS database was digitized at 50 micron-pixel edge, but reduced to 200-micron pixel edge and clipped in such a way that each image has (1024 × 1024) pixels.

5.2. Segmentation for Medical Imaging

The selected input image is shown in Figure 4 left. Some noise removal algorithms are applied over input images are verified with few noise removal algorithms. We examined with median filtering, max-min filter, midpoint filter, adaptive filtering, adaptive-median filtering, alpha-trimmed-mean filter, quantum-noise filtering, impulse-noise filtering, and wavelet-thresholding methods for noise removal from mammogram input images. Figure 4 right presents a smoothened image per custom by Gaussian filter of diverse sizes

(5 \times 5)

and standard deviation value 2.

A

5 \times 5

window was preferred for calculating the average value of local gray-levels. The pixels are aimed at each point of the set. An average of the resemblance value to the reference images is kept on every 32 directions, and the points were devised the maximum spatial resemblance.

The interpolation made the segmentation of the MIAS image of the 32 contour points by using a polynomial interpolating method. The accuracy of the segmentation method is considered by superposing the contours perceived inevitably and manually to compute the transformation amid them.

5.3. Average Segmentation Accuracy

With consideration of the optimal parameters for comparison of Otsu, FCM, IFCM, PFCM, and proposed IPFCM method for segmentation on the simulated MIAS breast cancer images as shown graphically in Figure 5. We also evaluated with different noise levels. Table 4 demonstrates the average accuracy of breast cancer segmentation for MIAS images with noise levels of 5%, 7%, and 9%. It was noted that the proposed approach with the negation function of Possibilistic is computationally inefficient compared to the negation function of Intuitionistic. The performance of the proposed IPFCM method with the hybridization of the negative function of Intuitionistic and the negative function of Possibilistic is better than the conventional segmentation methods.

Table 5 and Figure 6 presents a comparison of the computation time it will take for various techniques for MIAS images.

5.4. Classification for Medical Imaging

We used the MIAS dataset for experimental classification. The efficiency of the SVM approach, decision tree, RSDA approach, and Fuzzy SVM approach is described in Table 6. The accuracy of the classification is more important for the diagnosis of breast cancer, then the consequences of an incorrect diagnosis that trigger unjustified surgery or even lead to death.

The average classification accuracy rates of the methods (Otsu, FCM, IFCM, PFCM and IPFCM) for Fuzzy-SVM are 79.69%, 92.19%, 93.13%, 95.00%, and 98.45%, respectively (Table 6). It seems one-sided by using some features for Fuzzy-SVM to differentiate between benign and malignant breast tumors; it cannot accurately provide classification accuracy for each segmented image. For segmentation, the classification accuracy has changed accordingly and later going to the highest classification accuracy for IPFCM based FSVM classification. The best achieved classification accuracy rate is 98.45%. This proposed IPFCM segmentation with Fuzzy SVM method attains the uppermost classification accuracy rate (Figure 7).

5.5. Performance Evaluation

The specificity, sensitivity, Matthew’s correlation coefficient (MCC), positive predictive value (PPV), accuracy, receiver operating characteristic (ROC), and negative predictive value (NPV) were evaluated for classification measurements (Table 7).

The sensitivity and specificity are two statistical measures of the performance of a binary classification test. The confusion matrix supports research taking place in the root of actual and predicted results for positive actual and negative actual aftermaths. Consider the concern of positive actual outcome and fine two chances of predicted outcomes as “True Positive (TP)” and “False Negative (FN)”. Another consideration is of a negative actual outcome devising two predicted outcomes as “False Positive (FP)” and “True Negative (TN)”.

Sensitivity = \frac{T P}{T P + F N}

(21)

Specificity = \frac{T N}{T N + F N}

(22)

Accuracy = \frac{T P}{T P + F N}

(23)

PPV = \frac{T P}{T P + F P}

(24)

NPV = \frac{T N}{T N + F N}

(25)

MCC = \frac{T P \times T N - F P \times F N}{\sqrt{(T P + F P) (T P + F N) (T N + F P) (T N + F N)}}

(26)

Additionally, evaluation criterion “ROC curve” resolves the measurement for predictive accuracy for the suggested model. The “True Positive Rate (TPR)” and “False Positive Rate (FPR)” are designated in such a criterion. “AUC (Area under ROC Curve)” is castoff to compare the classifiers in two-class concerns.

The higher the values for sensitivity and specificity for the better performance of the system. In many cases, a higher sensitivity value can always be at a lower specificity value. SVM performed the worst, and the RSDA-based process performed better than SVM, which is only consistent with the parameter optimization performance. The decision tree-based method over matches the above techniques, but is still inferior to the Fuzzy SVM. The proposed approach achieves sensitivity-0.99, specificity-0.25, accuracy-0.98, PPV-0.99, NPV-0.50 and MCC-0.34, and has performed much better for Fuzzy SVM (with IPFCM segmentation) than the other classifiers (Table 8) and as shown graphically in Figure 8.

6. Conclusions and Future Research

Viruses can act as direct transforming agents and as triggering co-factors. Our research is conceived to detect digital mammograms. This machine conducts multiple-phase screening of breast cancer images. Noise and outliers trigger the low accuracy of the cluster analysis. In Fuzzy clustering, one data point was allocated to all clusters. As in Fuzzy’s clustering, the abnormal points were used by moving to other locations, which affected the centroids. Hence conventional fuzzy clustering like Fuzzy C-Means (FCM) is not sufficient to separate noise and outliers from typical results. But noise and outliers are not eliminated by the clustering method; hence they are forced to belong in one cluster due to general probabilistic constraint the amount of the membership degree of data across all clusters to 1. By incorporating the Possibilistic method, it enables the identification of outliers by the algorithm. In this paper, Intuitionist Possibilistic Fuzzy c-mean (IPFCM) not only minimizes the effect of outliers during the clustering process but also cesses it. These are also detected and extracted for further outlier mining. The detailed experiments show that IPFCM achieves reliable outlier detection results while maintaining the consistency of the clustering. The performance average segmentation accuracy for MIAS images with different noise levels 5%, 7%, and 9% of IPFCM is 91.25%, 87.50%, and 85.30% accordingly. The average classification accuracy rates of the methods (Otsu, FCM, IFCM, PFCM and IPFCM) for Fuzzy-SVM are 79.69%, 92.69%, 93.13%, 95.00%, and 98.85%, respectively. We conducted a comparison to compare our results with the most relevant studies, and the results are summarized in Table 9.

In future work, the segmentation and classification of this method with deep learning applications will be studied. In this way, breast cancer detection results can be obtained quickly and the application of this research is advanced.

Author Contributions

This research specifies below the individual contributions: Conceptualization, C.L.C. and M.M.; Data curation, C.L.C. and K.P.; Formal analysis, M.M. and Z.M.; Funding acquisition, Z.M.; Investigation; M.M. and K.P.; Methodology, C.L.C. and P.A.P.; Project administration, C.L.C. and M.M.; Resources, K.P. and Z.M.; Software, C.L.C. and K.P.; Supervision, C.L.C. and M.M.; Validation, P.A.P. and Z.M.; Visualization, C.L.C. and Z.M.; Writing—Review and editing, C.L.C. and M.M. All authors have read and agreed to the published version of the manuscript.

Funding

This research obtained no funding.

Acknowledgments

The project is financed by the Rector proquality grant No. 09/010/RGJ19/0042 at the Silesian University of Technology, Poland and the National Agency for Academic Exchange of Poland (under the Academic International Partnerships program, grant agreement PPI/APM/2018/1/00004) for supporting training in the hosting university.

Conflicts of Interest

The authors declare no conflict of interest.

References

Zhu, M.; Lv, Q.; Huang, H.; Sun, C.; Pang, D.; Wu, J. Identification of a four-long non-coding RNA signature in predicting breast cancer survival. Oncol. Lett. 2020, 19, 221–228. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Siegel, R.L.; Miller, K.D.; Jemal, A. Cancer statistics. CA Cancer J. Clin. 2020, 70, 7–30. [Google Scholar] [CrossRef] [PubMed]
Coleman, M.P.; Quaresma, M.; Berrino, F.J.; Lutz, M.; De Angelis, R.; Capocaccia, R.; Baili, P.; Rachet, B.; Gatta, G.; Hakulinen, T.; et al. Noise reduction in synthetic aperture radar images using fuzzy logic and genetic algorithm. Microsyst. Technol. 2019, 25, 1743–1752. [Google Scholar]
Chowdhary, C.L.; Shynu, P.G.; Gurani, V.K. Exploring Breast Cancer Classification of Histopathology Images from Computer Vision and Image Processing Algorithms to Deep Learning. Int. J. Adv. Sci. Technol. 2020, 29, 43–48. [Google Scholar]
Mittal, M.; Saraswat, L.K.; Iwendi, C.; Anajemba, J.H. A Neuro-Fuzzy Approach for Intrusion Detection in Energy Efficient Sensor Routing. In Proceedings of the 4th International Conference on Internet of Things: Smart Innovation and Usages (IoT-SIU), Ghaziabad, India, 18–19 April 2019. [Google Scholar]
Gannon, O.M.; Antonsson, A.; Bennett, I.C.; Saunders, N.A. Viral infections and breast cancer–A current perspective. Cancer Lett. 2018, 420, 182–189. [Google Scholar] [CrossRef] [PubMed]
Ammar, A.; Elouedi, Z.; Lingras, P. Semantically segmented clustering based on possibilistic and rough set theories. Int. J. Intell. Syst. 2015, 30, 676–706. [Google Scholar] [CrossRef]
Cortes, C.; Vapnik, V. Support-vector networks. Mach. Lang. 1995, 20, 273–297. [Google Scholar] [CrossRef]
Hsu, C.W.; Lin, C.J. A comparison of methods for multiclass support vector machines. IEEE Trans. Neural Netw. 2002, 13, 415–425. [Google Scholar]
Alkhasawneh, M.S.; Tay, L.T. A hybrid intelligent system integrating the cascade forward neural network with elman neural network. Arab J. Sci. Eng. 2018, 43, 6737–6749. [Google Scholar] [CrossRef]
Osmanovic, A.; Halilovic, S.; Ilah, L.A.; Fojnica, A.; Gromilic, Z. Machine learning techniques for classification of breast cancer. In Proceedings of the World Congress on Medical Physics and Biomedical Engineering 2018, Prague, Czech Republic, 3–8 June 2018. [Google Scholar]
Rawat, J.; Singh, A.; Bhadauria, H.S.; Virmani, J.; Devgun, J.S. Leukocyte Classification using Adaptive Neuro-Fuzzy Inference System in Microscopic Blood Images. Arab J. Sci. Eng. 2017, 8, 1–18. [Google Scholar] [CrossRef]
Mehmood, Z.; Abbas, F.; Mahmood, T.; Javid, M.A.; Rehman, A.; Nawaz, T. Content-based image retrieval based on visual words fusion versus features fusion of local and global features. Arab J. Sci. Eng. 2018, 43, 7265–7284. [Google Scholar] [CrossRef]
Bick, U.; Engel, C.; Krug, B.; Heindel, W.; Fallenberg, E.M.; Rhiem, K.; Maintz, D.; Golatta, M.; Speiser, D.; Rjosk-Dendorfer, D.; et al. High-risk breast cancer surveillance with MRI: 10-year experience from the German consortium for hereditary breast and ovarian cancer. Breast Cancer Res. Treat 2019, 175, 217–228. [Google Scholar] [CrossRef] [PubMed]
Chowdhary, C.L.; Acharjya, D.P. Segmentation of Mammograms Using a Novel Intuitionistic Possibilistic Fuzzy C-Mean Clustering Algorithm. Nat. Inspired Comput. 2018, 75–82. [Google Scholar]
Khwairakpam, A.; Kandar, D.; Paul, B. Noise reduction in synthetic aperture radar images using fuzzy logic and genetic algorithm. Microsyst. Technol. 2019, 25, 1743–1752. [Google Scholar] [CrossRef]
Dutta, S.; Ghatak, S.; Sarkar, A.; Pal, R.; Pal, R.; Roy, R. Cancer Prediction Based on Fuzzy Inference System. In Proceedings of the 3rd International Conference on Smart Innovations in Communication and Computational Sciences (ICSICCS-2019), Ayodhya, India, 27–28 February 2020. [Google Scholar]
Zhou, Q.; Chao, F.; Lin, C.M. A functional-link-based fuzzy brain emotional learning network for breast tumor classification and chaotic system synchronization. Int. J. Fuzzy Syst. 2018, 20, 349–365. [Google Scholar] [CrossRef]
Khare, N.; Devan, P.; Chowdhary, C.L.; Bhattacharya, S.; Singh, G.; Singh, S.; Yoon, B. SMO-DNN: Spider Monkey Optimization and Deep Neural Network Hybrid Classifier Model for Intrusion Detection. Electronics 2020, 9, 692. [Google Scholar] [CrossRef]
Aquino, G.; Rubio, J.D.J.; Pacheco, J.; Gutierrez, G.J.; Ochoa, G.; Balcazar, R.; Cruz, D.R.; Garcia, E.; Novoa, J.F.; Zacarias, A. Novel nonlinear hypothesis for the delta parallel robot modeling. IEEE Access 2020, 8, 46324–46334. [Google Scholar] [CrossRef]
Khan, S.; Islam, N.; Jan, Z.; Din, I.U.; Rodrigues, J.J.C. A novel deep learning based framework for the detection and classification of breast cancer using transfer learning. Pattern Recognit. Lett. 2019, 125, 1–6. [Google Scholar] [CrossRef]
de Jesús Rubio, J. SOFMLS: Online self-organizing fuzzy modified least-squares network. IEEE Trans. Fuzzy Syst. 2009, 17, 1296–1309. [Google Scholar] [CrossRef]
Chiang, H.S.; Chen, M.Y.; Huang, Y.J. Wavelet-Based EEG Processing for Epilepsy Detection Using Fuzzy Entropy and Associative Petri Net. IEEE Access 2019, 7, 103255–103262. [Google Scholar] [CrossRef]
Elias, I.; Rubio, J.D.J.; Cruz, D.R.; Ochoa, G.; Novoa, J.F.; Martinez, D.I.; Muñiz, S.; Balcazar, R.; Garcia, E.; Juarez, C.F. Hessian with mini-batches for electrical demand prediction. Appl. Sci. 2020, 10, 2036. [Google Scholar] [CrossRef] [Green Version]
Meda-Campaña, J.A. On the estimation and control of nonlinear systems with parametric uncertainties and noisy outputs. IEEE Access 2018, 6, 31968–31973. [Google Scholar] [CrossRef]
Ashfahani, A.; Pratama, M.; Lughofer, E.; Ong, Y.S. DEVDAN: Deep evolving denoising autoencoder. Neurocomputing 2020, 390, 297–314. [Google Scholar] [CrossRef] [Green Version]
Chowdhary, C.L.; Sai, G.V.K.; Acharjya, D.P. Decrease in false assumption for detection using digital mammography. In Proceedings of the International Conference on Computational Intelligence in Data Mining (ICCIDM 2016), Bhubaneswar, India, 10–11 December 2016. [Google Scholar]
Reddy, T.; RM, S.P.; Parimala, M.; Chowdhary, C.L.; Hakak, S.; Khan, W.Z. A deep neural networks based model for uninterrupted marine environment monitoring. Comput. Commun. 2020, 157, 64–75. [Google Scholar]
MacQueen, J. Some methods for classification and analysis of multivariate observations. In Proceedings of the fifth Berkeley symposium on mathematical statistics and probability, Berkeley, CA, USA. 21 June–18 July 1965 and 27 December–7 January 1966.
Chowdhary, C.L. 3D object recognition system based on local shape descriptors and depth data analysis. Recent Patents on Computer Science 2019, 12, 18–24. [Google Scholar] [CrossRef]
Wang, B.; Wang, Y.; Cui, L. Fuzzy clustering recognition algorithm of medical image with multi-resolution feature. Concurr. Comp. Pract. E 2020, 32, e4886. [Google Scholar]
Xia, K.; Gu, X.; Zhang, Y. Oriented grouping-constrained spectral clustering for medical imaging segmentation. Multimed. Syst. 2020, 26, 27–36. [Google Scholar] [CrossRef]
Pan, Q.; Zhu, W.; Zhang, X.; Chang, J.; Cui, J. Research on a bifurcation location algorithm of a drainage tube based on 3D medical images. Vis. Comput. Ind. Biomed. Art 2020, 3, 1–11. [Google Scholar] [CrossRef] [PubMed]
Chowdhary, C.L. Linear feature extraction techniques for object recognition: Study of PCA and ICA. J. Serbian Soc. Comput. Mech. 2011, 5, 19–26. [Google Scholar]
Chowdhary, C.L.; Acharjya, D.P. Segmentation and Feature Extraction in Medical Imaging: A Systematic Review. Procedia Comput. Sci. 2020, 167, 26–36. [Google Scholar] [CrossRef]
Otsu, N. A threshold selection method from gray-level histograms. IEEE Trans. Syst. Man Cybern. 1979, 9, 62–66. [Google Scholar] [CrossRef] [Green Version]
Krishnapuram, R.; Keller, J.M. A possibilistic approach to clustering. IEEE Trans. Fuzzy Syst. 1993, 1, 98–110. [Google Scholar] [CrossRef]
Pattanaik, P.A.; Mittal, M.; Khan, M.Z. Unsupervised Deep Learning Cad Scheme For The Detection Of Malaria In Blood Smear Microscopic Images. IEEE Access 2020, 8, 94936–94946. [Google Scholar] [CrossRef]
Mittal, M.; Kumar, K. Network Lifetime Enhancement of Homogeneous Sensor Network Using ART1 Neural Network. In Proceedings of the Sixth International Conference on Computational Intelligence and Communication Networks (CICN), Bhopal, India, 14–16 November 2014. [Google Scholar]
Mittal, M.; Kumar, K. Quality of Services Provisioning in Wireless Sensor Networks using Artificial Neural Network: A Survey. Int. J. Comput. Appl. 2015, 117, 28–40. [Google Scholar] [CrossRef]
Mittal, M.; Saraswat, L.K. Energy Evaluation of Sensor Protocol based on AI Techniques using CRAWDAD Data. Int. J. Recent Technol. Eng. 2019, 8, 2812–2815. [Google Scholar]
Mittal, M.; Kumar, K. Data Clustering in Wireless Sensor Network Implemented On Self Organization Feature Map (SOFM) Neural Network. In Proceedings of the international conference on Computing, Communication and Automation(ICCCA), Greater Noida, India, 29–30 April 2016. [Google Scholar]
Pal, N.R.; Pal, K.; Bezdek, J.C. A mixed c-means clustering model. In Proceedings of the 6th international fuzzy systems conference, Barcelona, Spain, 5 July 1997. [Google Scholar]
Pal, N.R.; Pal, K.; Keller, J.M.; Bezdek, J.C. A possibilistic fuzzy c-means clustering algorithm. IEEE Trans. Fuzzy Syst. 2005, 13, 517–530. [Google Scholar] [CrossRef]
de Carvalho, F.D.A. Fuzzy c-means clustering methods for symbolic interval data. Pattern Recognit. Lett. 2007, 28, 423–437. [Google Scholar] [CrossRef]
Ji, Z.X.; Sun, Q.S.; Xia, D.S. A modified possibilistic fuzzy c-means clustering algorithm for bias field estimation and segmentation of brain MR image. Comput. Med. Imaging Graph. 2011, 35, 383–397. [Google Scholar] [CrossRef]
Treerattanapitak, K.; Jaruskulchai, C. Outlier detection with possibilistic exponential fuzzy clustering. In Proceedings of the 2011 Eighth International Conference on Fuzzy Systems and Knowledge Discovery (FSKD), Shanghai, China, 26–28 July 2011. [Google Scholar]
Atanassov, K.T. Intuitionistic fuzzy sets; Springer: Berlin/Heidelberg, Germany, 1999; pp. 1–137. [Google Scholar]
Chaira, T. A novel intuitionistic fuzzy C means clustering algorithm and its application to medical images. Appl. Soft Comput. 2011, 11, 1711–1717. [Google Scholar] [CrossRef]
Chowdhary, C.L.; Acharjya, D.P. Singular Value Decomposition–Principal Component Analysis-Based Object Recognition Approach. In Bio-Inspired Computing for Image and Video Processing; Chapman and Hall/CRC: Abingdon, UK, 2018. [Google Scholar]
Chowdhary, C.L. Application of Object Recognition With Shape-Index Identification and 2D Scale Invariant Feature Transform for Key-Point Detection. In Feature Dimension Reduction for Content-Based Image Identification; IGI Global: Hershey, PA, USA, 2018. [Google Scholar]
Reddy, G.T.; Bhattacharya, S.; Ramakrishnan, S.S.; Chowdhary, C.L.; Hakak, S.; Kaluri, R.; Reddy, M.P.K. An Ensemble based Machine Learning model for Diabetic Retinopathy Classification. In Proceedings of the 2020 International Conference on Emerging Trends in Information Technology and Engineering (ic-ETITE), Vellore, India, 24–25 February 2020. [Google Scholar]
Shynu, P.G.; Shayan, H.M.; Chowdhary, C.L. A Fuzzy based Data Perturbation Technique for Privacy Preserved Data Mining. In Proceedings of the 2020 International Conference on Emerging Trends in Information Technology and Engineering (ic-ETITE), Vellore, India, 24–25 February 2020. [Google Scholar]
Nithya, A.; Appathurai, A.; Venkatadri, N.; Ramji, D.R.; Palagan, C.A. Kidney disease detection and segmentation using artificial neural network and multi-kernel k-means clustering for ultrasound images. Measurement 2020, 149, 106952. [Google Scholar] [CrossRef]
Dhanachandra, N.; Chanu, Y.J. An image segmentation approach based on fuzzy c-means and dynamic particle swarm optimization algorithm. Multimed. Tools Appl. 2020, 79, 18839–18858. [Google Scholar] [CrossRef]
Ezhilraman, S.V.; Srinivasan, S.; Suseendran, G. Gaussian Light Gradient Boost Ensemble Decision Tree Classifier for Breast Cancer Detection. In Intelligent Computing and Innovation on Data Science; Springer: Singapore, 2020. [Google Scholar]
Ray, A.; Chen, M.; Gelogo, Y. Performance Comparison of Different Machine Learning Algorithms for Risk Prediction and Diagnosis of Breast Cancer. In Smart Technologies in Data Science and Communication; Springer: Singapore, 2020. [Google Scholar]
Suckling, J.P. The Mammographic Image Analysis Society Digital Mammogram Database Exerpta Medica. Digit. Investig. 1994, 1069, 375–378. [Google Scholar]
Parvathavarthini, S.; Karthikeyani Visalakshi, N.; Shanthi, S. Breast cancer detection using crow search optimization based Intuitionistic fuzzy clustering with neighborhood attraction. Asian Pac. J. Cancer Prev. 2019, 20, 157–165. [Google Scholar]
Chowdhary, C.L.; Acharjya, D.P. A hybrid scheme for breast cancer detection using intuitionistic fuzzy rough set technique. Int. J. Healthc. Inf. Syst. Inform. 2016, 11, 38–61. [Google Scholar] [CrossRef] [Green Version]
Shen, L.; Margolies, L.R.; Rothstein, J.H.; Fluder, E.; McBride, R.; Sieh, W. Deep learning to improve breast cancer detection on screening mammography. Sci. Rep. 2019, 9, 1–12. [Google Scholar] [CrossRef] [PubMed]
Reddy, A.; Soni, B.; Reddy, S. Breast cancer detection by leveraging Machine Learning. ICT Express 2020, in press. [Google Scholar]

Figure 1. A simple block diagram for breast cancer detection steps.

Figure 2. Classification Task.

Figure 3. Schematic of the proposed method.

Figure 4. (left) Selected Input Image (right) Smoothened Image.

Figure 5. Average segmentation accuracy for Mammography Image Analysis Society (MIAS) images with different noise level chart.

Figure 6. Average computation time for various techniques chart.

Figure 7. Comparative Classification average accuracy Chart.

Figure 8. Performances Assessment for IPFCM.

Table 1. Features for Segmentation.

Features	Explanation
Area $A_{r}$	Number of pixels in boundary area of ROI
Perimeter $P_{r}$	Number of pixels on boundary of ROI.
Circularity $C_{r}$	Number of pixels in boundary area of ROI. When there is a circular shape, circularity has a value of zero. By assuming $A_{r}$ for area and $P_{r}$ for the perimeter, circularity will be $C_{r} = 1 - \frac{4 π A_{r}}{P_{r}^{2}}$ .
Shape factor	Number of pixels on boundary of ROI. The count of burr around tumors will show the feature that is, region-of-interest as $S_{r} = \frac{P_{r}^{2}}{A_{r}}$ . Assume $A_{r}$ for area, $P_{r}$ for the perimeter and $S_{r}$ for Shape factor.
Normalization radial length	$n r l (r) = \frac{r l (r)]}{m a x (r l (r))}$ . Here, $r l$ is radial length value meaning it is Euclidean Distance. So $r l (r) = \sqrt{{(b_{i} - t)}^{2} {(b_{j} - u)}^{2}}$ . Here, $(b_{i}, b_{j})$ is centre position and $(t, u)$ is boundary pixel position.
Mean-value of normalization based radial length	$n r l_{m e a n} = \frac{1}{P_{r}} \sum n r l (r)$
. Standard deviation value	$s i g m a = \sqrt{\frac{1}{P_{r}} \sum_{r = 1}^{P} {(n r l (r) - n r l_{m e a n} (r))}^{2}}$
. Entropy value	$E_{r} = \sum_{r = 1}^{P_{r}} p_{k} log p_{k}$ . $p_{k}$ =probability of a certain $n r l$ to the number of whole radials and $P_{r}$ perimeter.
The normalization value of central position shift	$N C P S = \frac{\sqrt{{(b_{i} - c)}^{2} + {(b_{j} - d)}^{2}}}{A}$ . The pixels coordination position $(c, d)$ is denoted with a minimum gray value inside the ROI. The Euclidian distance is calculated from the ROI centre $(c_{i} - c_{j})$ at the position of the pixel with the lowest gray value, the ROI is divided.
Gradient	the gray value alteration among the boundary pixel and the 10^th pixel from this pixel with the radial direction $g_{r} = I (t, u) - I (i_{0}, j_{0})$ , Where $I (t, u)$ is the gray value of the boundary pixel and $I (i_{0}, j_{0})$ is the gray value of the 10^th radial pixel.

Table 2. Features for Classifications.

Features	Equations
mean of the intensity	$\sum_{r v i n}^{L - 1} r_{r v i n} h (r_{r v i n})$
Standard deviation	$\sqrt{\sum_{r v i n}^{L - 1} {(r_{r v i n} - m e a n)}^{2} h (r_{r v i n})}$
Smoothness	$1 - \frac{1}{1 + {(s i g m a)}^{2}}$
Skewness	$\sum_{r v i n}^{L - 1} {(r_{r v i n} - m e a n)}^{3} h (r_{r v i n})$
Uniformity	$\sum_{r v i n}^{L - 1} h^{2} (r_{r v i n})$
Entropy	$\sum_{r v i n}^{L - 1} h (r_{r v i n}) log h (r_{r v i n})$
Kurtosis	$\sum_{r v i n}^{L - 1} {(r_{r v i n} - m e a n)}^{4} h (r_{r v i n})$

Table 3. A Sample Decision Tree.

	Body pain	Cold	Vomiting	Fever
Image-1	High	Low	Yes	Yes
Image-2	Low	Low	No	Yes
Image-3	High	Low	Yes	No
Image-4	Low	Low	No	No

Table 4. Average segmentation accuracy with different noise level.

	Noise Level (in %)
Segmentation Methods ↓	5	7	9
Otsu	0.8375	0.8156	0.7969
FCM	0.8187	0.8531	0.8163
Intuitionistic FCM	0.8656	0.85	0.8125
Possibilistic FCM	0.8781	0.8625	0.8188
Proposed Intuitionistic Possibilistic FCM	0.9125	0.875	0.8531

Table 5. Average computation time for various techniques in seconds.

Otsu	FCM	IFCM	PFCM	IPFCM
0.72	0.65	1.25	1.40	2.26

Table 6. Classification accuracy (Average) for five segmentation methods.

	Classification (All Features)
Segmentation	SVM	Decision Tree	RSDA	Fuzzy SVM
Otsu	70.32	66.88	72.81	79.69
FCM	82.19	86.25	89.63	92.69
Intuitionistic FCM	87.19	81.25	96.13	93.13
Possibilistic FCM	86.25	82.19	92.5	95.00
Proposed Intuitionistic Possibilistic FCM	88.13	82.5	96.1	98.85

Table 7. Assessment Measures.

Evaluation Criterion	Definition
Sensitivity	The “Sensitivity” criteria is constructed on the positive circumstances of found results. The measurements are element of the perceived positive circumstances and the actual positive circumstances.
Specificity	The “Specificity” criteria is constructed on the negative circumstances of found results. The measurements are element of the perceived negative circumstances and the actual negative circumstances.
Accuracy	The “Accuracy” criteria is considered on the accuracy of found results. This criteria is the best common indicator which contributes the precision of forecast results.
PPV	“Positive Predictive Value” is approximately all the circumstances which calculate the decorously sensed positive circumstances concluded all sensed positive circumstances.
NPV	“Negative Predictive Value” is approximately totally the circumstances of conniving as the correctly noticed negative cases concluded totally detected negative circumstances.
MCC	One more operational accuracy evaluation display of machine learning methods is “Matthew’s Correlation Coefficient”. In the MCC, there is a comparison between the negative sample numbers and positive sample number led to finding unbalanced. The MCC compromises a virtuous evaluation ended the altogether accuracy.

Table 8. Performances Assessment for intuitionistic possibilistic fuzzy c-mean (IPFCM).

Evaluation Criterion	Classification (All Features)
Evaluation Criterion	Fuzzy SVM
Sensitivity	0.99
Specificity	0.25
Accuracy	0.98
PPV	0.99
NPV	0.50
MCC	0.34

Table 9. Comparative results between the proposed work and the other related work.

Methodology ↓	Sensitivity	Accuracy
Crow Search Optimization based Intuitionistic Fuzzy Clustering [59]	0.98	0.96
Intuitionistic Fuzzy Rough Hybrid Technique [60]	0.97	0.98
Convolutional Network Method for Classifying Screening Mammograms [61]	0.97	0.95
Deep Neural Network with Support Value (DNNS) [62]	0.97	0.97
Proposed (IPFCM and Fuzzy SVM)	0.99	0.98

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Chowdhary, C.L.; Mittal, M.; P., K.; Pattanaik, P.A.; Marszalek, Z. An Efficient Segmentation and Classification System in Medical Images Using Intuitionist Possibilistic Fuzzy C-Mean Clustering and Fuzzy SVM Algorithm. Sensors 2020, 20, 3903. https://doi.org/10.3390/s20143903

AMA Style

Chowdhary CL, Mittal M, P. K, Pattanaik PA, Marszalek Z. An Efficient Segmentation and Classification System in Medical Images Using Intuitionist Possibilistic Fuzzy C-Mean Clustering and Fuzzy SVM Algorithm. Sensors. 2020; 20(14):3903. https://doi.org/10.3390/s20143903

Chicago/Turabian Style

Chowdhary, Chiranji Lal, Mohit Mittal, Kumaresan P., P. A. Pattanaik, and Zbigniew Marszalek. 2020. "An Efficient Segmentation and Classification System in Medical Images Using Intuitionist Possibilistic Fuzzy C-Mean Clustering and Fuzzy SVM Algorithm" Sensors 20, no. 14: 3903. https://doi.org/10.3390/s20143903

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

An Efficient Segmentation and Classification System in Medical Images Using Intuitionist Possibilistic Fuzzy C-Mean Clustering and Fuzzy SVM Algorithm

Abstract

1. Introduction

2. Materials and Methods

2.1. Basic Preprocessing: Noise Removal

2.2. Background of Segmentation

2.3. Preliminaries of Clustering

2.3.1. Fuzzy C-Mean (FCM)

2.3.2. Possibilistic Fuzzy C-Mean (PFCM)

2.3.3. Intuitionistic Fuzzy C-Mean (IFCM)

2.4. Feature Extraction

3. Overview of Classifications Methods

3.1. Decision Tree (DT)

3.2. Rough Set Data Analysis (RSDA)

3.3. Support Vector Machine (SVM)

3.4. Fuzzy SVM (FSVM)

4. Proposed Intuitionistic Possibilistic Fuzzy Clustering

5. Results and Discussion

5.1. Data Collection

5.2. Segmentation for Medical Imaging

5.3. Average Segmentation Accuracy

5.4. Classification for Medical Imaging

5.5. Performance Evaluation

6. Conclusions and Future Research

Author Contributions

Funding

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI