Article

Fuzzy Color Aura Matrices for Texture Image Segmentation

1 Laboratoire Vision Artificielle et Automatique des Systèmes, Université Mouloud Mammeri, Tizi-Ouzou 15000, Algeria
2 Univ. Lille, CNRS, Centrale Lille, UMR 9189 CRIStAL, F-59000 Lille, France
* Author to whom correspondence should be addressed.
J. Imaging 2022, 8(9), 244; https://doi.org/10.3390/jimaging8090244
Submission received: 13 July 2022 / Revised: 23 August 2022 / Accepted: 2 September 2022 / Published: 8 September 2022
(This article belongs to the Special Issue Color Texture Classification)

Abstract

Fuzzy gray-level aura matrices have been developed from fuzzy set theory and the aura concept to characterize texture images. They have proven to be powerful descriptors for color texture classification. However, using them for color texture segmentation is difficult because of their high memory and computation requirements. To overcome this problem, we propose to extend fuzzy gray-level aura matrices to fuzzy color aura matrices, which would allow us to apply them to color texture image segmentation. Unlike the marginal approach that requires one fuzzy gray-level aura matrix for each color channel, a single fuzzy color aura matrix is required to locally characterize the interactions between colors of neighboring pixels. Furthermore, all works about fuzzy gray-level aura matrices consider the same neighborhood function for each site. Another contribution of this paper is to define an adaptive neighborhood function based on information about neighboring sites provided by a pre-segmentation method. For this purpose, we propose a modified simple linear iterative clustering algorithm that incorporates a regional feature in order to partition the image into superpixels. All in all, the proposed color texture image segmentation boils down to a superpixel classification using a simple supervised classifier, each superpixel being characterized by a fuzzy color aura matrix. Experimental results on the Prague texture segmentation benchmark show that our method outperforms the classical state-of-the-art supervised segmentation methods and is similar to recent methods based on deep learning.

1. Introduction

Color texture segmentation consists of partitioning an image into homogeneous regions with respect to color and texture properties. It is involved in various fields, such as medical image analysis [1], remote sensing [2], synthetic aperture radar [3], and fruit detection [4]. Although a wide variety of techniques have been developed, color texture segmentation remains an open and challenging problem due to the high variability of textures, combined with the great diversity of colors. Most approaches characterize each pixel by a set of texture and color features, and use them to assign one color texture class to each pixel thanks to classification algorithms [5,6]. Section 1.1 and Section 1.2 discuss the state of the art of color texture features and classification algorithms used for color texture segmentation.

1.1. Color Texture Features

The characterization of color textures is a fundamental problem in computer vision, where a texture is generally described by some visual cues represented by statistical structures, hereafter called texture features. Pixel colors are often represented by the values or by statistical measures of the color components (R, G, B), or by those derived from color spaces such as HSV, $L^*u^*v^*$, or $L^*a^*b^*$ [7]. Various texture features are designed to characterize texture appearance. Widely used ones are based on Gabor filters, the wavelet transform, Markov random field models, local binary patterns, and co-occurrence matrices [8,9,10].
Color texture classification/segmentation techniques can be broadly divided into two approaches according to how they combine texture and color features [5,8]. In the first approach, color and texture features are computed separately and are then combined by the clustering or classification process. For example, in [11], the $L^*$, $u^*$, and $v^*$ components were used as color features and the outputs of Gabor filters as texture features. In [12], the authors used the mean, standard deviation, and skewness of each color channel, H, S, and V, as color features, and the co-occurrence matrices computed on the intensity image were considered as texture features.
In the second approach, color and texture features are assumed to be mutually dependent. With this approach, marginal, opponent, and vector strategies have been developed to compute color texture features [5,8]. The marginal strategy assumes that texture can be separately described within each color channel, where pixels are characterized by only one color component. Texture features designed for gray-level images are then computed for each color channel and aggregated into a global feature representation. In [13], for instance, texture features were extracted from wavelet transforms and co-occurrence matrices for each color channel. In [14], Leung–Malik filter banks were applied to each color channel for color texture feature extraction. The opponent strategy extends texture feature extraction to color images thanks to both within- and between-channel analyses. The within-channel part consists of computing features from each color channel separately (as in the marginal strategy), whereas between-channel (opponent) features are obtained by processing pairs of color channels jointly. This strategy was adopted by Panjwani et al. [15], who used Markov random field models to characterize a texture in terms of spatial interaction within each color channel and interaction between different color channels. The vector strategy takes advantage of color vector information. It makes it possible to analyze relationships between colors of neighboring pixels, as in the case of gray-level images. Examples of this strategy can be found in [9,16], where local binary patterns were computed from a color image. It should be noted that the vector strategy is better suited to characterizing color texture because it is less memory-consuming and fully takes the correlation between colors of neighboring pixels into account.

1.2. Color Texture Image Segmentation by Pixel Classification

Color texture segmentation methods can also be divided into three categories, depending on whether the pixel classification is performed in a supervised, unsupervised, or semi-supervised manner [1,11,17]. In supervised segmentation, prior knowledge about the training samples and their class labels is needed to classify the input image pixels. Several classical supervised classifiers have been used for texture segmentation [1,4,17]. Among them are the K-nearest neighbor (KNN) [18] and Bayesian [19] classifiers, the support vector machine (SVM) [20], random forests [12], Markov random fields [11], and neural networks [14]. Supervised color texture segmentation based on deep learning has been developed in the last decade [21,22,23]. These efficient methods generally use convolutional neural networks (CNNs), such as U-Net [24], the deep visual model (DA) [25], the pyramid scene parsing network (PSP-Net) [26], the supervised fully convolutional network for texture (FCNT) [21], and the empirical wavelet transform-based fully convolutional network for texture (EWT-FCNT) [23].
Unsupervised segmentation does not require prior knowledge and discovers different classes by clustering pixels from their features only [5,6]. For example, the popular K-means clustering algorithm is mainly employed to perform the classification of pixels [6,27].
Other color texture segmentation methods rely on graph cut techniques [28], mean shift clustering [29], estimation–maximization [1], the random Markov model [11], dictionary learning and sparse representation-based classification (DLSRC) [13], and spectral clustering [3].
Semi-supervised segmentation is suitable where only partial prior knowledge about the training samples and their class labels is available. For instance, in [17], the constrained spectral clustering algorithm was applied for semi-supervised classification of pixels characterized by color texture features.
Supervised segmentation methods, especially deep learning-based methods, provide better results than unsupervised and semi-supervised approaches thanks to prior information [17]. It should also be noted that some of the above methods perform segmentation at the pixel level and others at the "superpixel" level; this is the case for the methods discussed in [1,3]. Recall that superpixels group neighboring pixels based on their spatial and color similarity, and are used as samples in order to speed up color texture segmentation.

1.3. Fuzzy Color Texture Features

All aforementioned methods assume that images are crisp and free from vagueness. However, in practice, color images carry more or less fuzziness because of the imprecision inherent to the discretization of both the spatial domain (sampling) and the color component levels (quantization). Therefore, boundaries separating the various image regions are not precisely defined, and pixel levels are imprecise measures of the reflectance of surfaces observed by the camera. Furthermore, the assumption that texture images are mainly represented by spatial repetitions of a pattern may not be valid any longer. Texture analysis techniques based on fuzzy concepts have then been proposed in order to take this imprecision into account.
For instance, fuzzy histograms [30], fuzzy local binary patterns [31], and local fuzzy patterns [32] are extracted from gray-level texture images. Fuzzy gray-level co-occurrence matrices (FGLCMs) have also been proposed to characterize spatial interactions between gray levels of neighboring pixels [33,34,35,36,37]. However, characterizing a color texture by at least three FGLCMs (one per channel) is memory expensive. To overcome this drawback, Ledoux et al. [38] proposed to extend FGLCMs to color images by defining fuzzy color sets. A color image is then represented by one single fuzzy color co-occurrence matrix (FCCM) that characterizes the local interactions between colors of neighboring pixels. Moreover, FGLCMs and FCCMs only consider spatially-invariant neighborhoods. Hammouche et al. [39] showed that adaptive neighborhoods are useful for texture analysis and provided an elegant formalism to deal with spatially-variant neighborhoods thanks to the aura concept.
In the framework based on the aura set concept, Elfadel and Picard [40] proposed a generalization of GLCMs called gray-level aura matrices (GLAMs). A GLAM quantifies the presence of a set of pixels with a specified level in the neighborhood of another set of pixels having another level. The number of neighboring pixels with the specified level is quantified by means of the aura measure. GLAMs are used for texture representation and synthesis [41,42], image retrieval [43,44], classification [45,46,47,48,49,50], and segmentation [51,52,53]. A generalization of GLAMs to the fuzzy framework has been proposed by Hammouche et al. [39]. Representing each color channel by a fuzzy GLAM (FGLAM) outperforms the FGLCM representation for texture classification. Recently, FGLAMs have been used to improve the accuracy of wood species classification [54]. However, as for FGLCMs, the computation of FGLAMs for each pixel is memory and time expensive, which makes their use for color image segmentation a challenge.
To circumvent these constraints, we adopt a vector strategy and propose to extend FGLAMs to fuzzy color aura matrices (FCAMs). An FCAM makes it possible to locally characterize the interactions between colors of neighboring pixels. While one FGLAM must be computed for each color channel, a single low-dimensional FCAM is required to describe the color texture.

1.4. FCAM for Image Segmentation by Superpixel Classification

In this study, we apply FCAMs to color texture image segmentation. As an FCAM is based on a locally-adaptive neighborhood, it can characterize the texture represented by small connected pixel subsets with different shapes, i.e., by superpixels. We use the simple linear iterative clustering (SLIC) scheme [55] to generate superpixels from a color image. These are then classified using a simple supervised classifier to segment the color texture image.
The remainder of this paper is organized as follows. In Section 2, we first give an overview of the basic SLIC algorithm; then we propose a modified version that incorporates regional information. Section 3 introduces some definitions of the fuzzy color aura concept and explains how to characterize the color texture of each superpixel by an FCAM. In Section 4, we first present the datasets used in the experiments; then we give details about the proposed color texture segmentation method based on FCAMs. Next, we assess the regional SLIC algorithm and discuss its parameter settings. Last, we compare the segmentation results achieved by our supervised segmentation approach with those obtained by fuzzy texture features and by several other state-of-the-art color texture segmentation methods. Concluding remarks about the contribution of this paper are given in Section 5.

2. Superpixel

The proposed color texture image segmentation is based on the classification of superpixels. A superpixel is a compact set of connected sites with similar properties. It is usually adopted to replace the pixel grid in an image in order to reduce the computational burden of subsequent processing. Superpixels are generated by over-segmentation algorithms using color, spatial, and/or texture information. Among superpixel generation algorithms, simple linear iterative clustering (SLIC) is widely used due to its simplicity, speed, and ability to adhere to image boundaries. In this section, we briefly give an overview of SLIC algorithm used to generate superpixels. Then, we propose a modified version that incorporates the regional information.

2.1. Basic SLIC

The SLIC algorithm [55] generates a desired number of regular superpixels, with a low computational overhead, by clustering sites based on their spatial and color features. Let $I$ be an RGB image defined on a lattice $\mathcal{S}$, such that each site $s \in \mathcal{S}$ is characterized by three color components: $I(s) = (I^R(s), I^G(s), I^B(s))$. The RGB color components are transformed into the $L^*a^*b^*$ color space, so that SLIC represents each site $s = (x_s, y_s)$ by a five-dimensional feature vector $(I^L(s), I^a(s), I^b(s), x_s, y_s)$. SLIC follows a k-means clustering strategy but searches the nearest cluster center according to the distance $D(s, s') = \sqrt{d_c^2(s, s') + m^2 \cdot d_s^2(s, s') / S^2}$ between two sites $s$ and $s'$, where $d_c^2(s, s') = (I^L(s) - I^L(s'))^2 + (I^a(s) - I^a(s'))^2 + (I^b(s) - I^b(s'))^2$ and $d_s^2(s, s') = (x_s - x_{s'})^2 + (y_s - y_{s'})^2$ measure the color and spatial proximity, respectively. The compactness parameter $m$ is set to its default value of 1, and the maximum spatial distance within a cluster is defined as the sampling step $S = \sqrt{|\mathcal{S}| / P}$, where $|\mathcal{S}|$ is the total number of sites in the image and $P$ is the number of superpixels. Figure 1b shows the results of the segmentation achieved by SLIC on the color image of Figure 1a. It should be emphasized that SLIC includes a post-processing step that enforces connectivity by merging small isolated superpixels with nearby larger ones. Therefore, the actual number of superpixels produced by SLIC can be slightly lower than the desired number of superpixels.
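As an illustration, a pre-segmentation such as the one of Figure 1b can be reproduced with the SLIC implementation of scikit-image; the sketch below uses 400 superpixels and a compactness of 1 as in this paper, but the file name is a placeholder and the snippet is ours, not the authors' code.
```python
from skimage import io
from skimage.segmentation import slic

# Load an RGB test image (the file name is a placeholder).
image = io.imread("texture_mosaic.png")[..., :3]

# Basic SLIC: scikit-image converts the RGB image to L*a*b* internally and
# clusters the sites on (L*, a*, b*, x, y); n_segments is the desired number
# of superpixels and compactness is the parameter m.
labels = slic(image, n_segments=400, compactness=1, start_label=0)

# The connectivity-enforcement step can slightly reduce the superpixel count.
print("actual number of superpixels:", labels.max() + 1)
```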

2.2. Regional SLIC

The basic SLIC algorithm achieves good pre-segmentation of color images, but it may fail to find the boundaries of textures. Indeed, it successfully detects homogeneous regions but somewhat fails to separate areas with different textures (see Figure 1a and the arrows in Figure 1b). To improve this pre-segmentation, we propose a modified version of the algorithm, referred to as regional SLIC, that takes color, spatial, and regional information into account. From the basic SLIC pre-segmentation result, we compute the regional feature $f_R$ of each superpixel $R$ as [56]:
$$f_R = \left[ p_R - p_R \cdot \log(p_R) \right] + \frac{1}{|N_R|} \sum_{R' \in N_R} \left[ p_{R'} - p_{R'} \cdot \log(p_{R'}) \right], \qquad (1)$$
where $p_R = |R| / |\mathcal{S}|$ is the area ratio of $R$ to the whole image, and $N_R$ is the set of superpixels that are adjacent to $R$. The regional feature $f_R$ is the sum of two terms: the first one reflects the size of the superpixel $R$ and monotonically increases with its size (because $p_R \in \left]0, 1\right[$), and the second one reflects the superpixel context by taking the influence of adjacent superpixels into account. A small superpixel surrounded by small superpixels yields a low value of $f_R$; conversely, if both $R$ and its adjacent superpixels are large, $f_R$ is high (see Figure 1c). At each site $s$ of superpixel $R$, we replace the luminance component $I^L(s)$ by the regional feature $f_R$ and apply the SLIC algorithm again with $(f_R, I^a(s), I^b(s), x_s, y_s)$ as the feature vector (see Algorithm 1).
Thanks to this regional SLIC method, the lattice $\mathcal{S}$ is partitioned into $P$ superpixels $\{\mathcal{P}_p\}_{p=1}^{P}$, i.e., $\bigcup_{p=1}^{P} \mathcal{P}_p = \mathcal{S}$ and $\mathcal{P}_p \cap \mathcal{P}_{p'} = \emptyset$ for $p \neq p'$. A superpixel $\mathcal{P}_p$ is defined as a set of connected sites, i.e., any two sites $s$ and $s'$ in $\mathcal{P}_p$ are connected by at least one path composed of sites in $\mathcal{P}_p$. Figure 1d shows the final superpixels extracted from the color texture image of Figure 1a. Note (see arrows) that areas with different textures are better delineated with the regional SLIC than with its basic version, even though the regional feature only describes the contextual information of superpixels in the pre-segmented image and cannot be considered as a texture feature.
Algorithm 1: Regional SLIC.
Input: RGB image $I$
1. Convert $I$ from RGB to $L^*a^*b^*$ color space;
2. Apply SLIC to $\mathcal{S}$, where each site $s$ is characterized by $(I^L(s), I^a(s), I^b(s), x_s, y_s)$;
3. Extract the regional feature $f_R$ of each superpixel $R$ of the SLIC map using Equation (1);
4. Apply SLIC to $\mathcal{S}$, where each site $s$ is characterized by $(f_R, I^a(s), I^b(s), x_s, y_s)$.
Output: Partition of $I$ into superpixels $\{\mathcal{P}_p\}_{p=1}^{P}$
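A possible transcription of Algorithm 1 in Python is sketched below. It computes Equation (1) literally, recovers the superpixel adjacency by comparing shifted label maps, and rescales $f_R$ to the $L^*$ range before the second SLIC pass; this rescaling and all function names are our own assumptions, not the authors' implementation.
```python
import numpy as np
from skimage.color import rgb2lab
from skimage.segmentation import slic

def regional_feature(labels):
    """Regional feature f_R of Equation (1) for every superpixel of a label map."""
    n_labels = labels.max() + 1
    p = np.bincount(labels.ravel(), minlength=n_labels) / labels.size  # area ratios p_R
    term = p - p * np.log(p)                                           # p_R - p_R*log(p_R)
    # Adjacency N_R: pairs of different labels across 4-connected pixel edges.
    edges = np.concatenate([
        np.stack([labels[:, :-1].ravel(), labels[:, 1:].ravel()], axis=1),
        np.stack([labels[:-1, :].ravel(), labels[1:, :].ravel()], axis=1)])
    edges = np.unique(np.sort(edges[edges[:, 0] != edges[:, 1]], axis=1), axis=0)
    neighbors = [set() for _ in range(n_labels)]
    for u, v in edges:
        neighbors[u].add(v)
        neighbors[v].add(u)
    # f_R = own term + mean of the terms of the adjacent superpixels.
    return np.array([term[r] + (np.mean([term[q] for q in neighbors[r]])
                                if neighbors[r] else 0.0)
                     for r in range(n_labels)])

def regional_slic(image_rgb, n_segments=400, compactness=1):
    """Algorithm 1: basic SLIC, then SLIC on (f_R, a*, b*, x, y)."""
    labels = slic(image_rgb, n_segments=n_segments, compactness=compactness,
                  start_label=0)
    lab = rgb2lab(image_rgb)
    f_map = regional_feature(labels)[labels]        # f_R at every site
    features = lab.copy()
    # Replace the luminance channel by f_R, rescaled here to the L* range [0, 100]
    # (this rescaling is our own choice; the paper does not specify one).
    features[..., 0] = 100 * (f_map - f_map.min()) / (np.ptp(f_map) + 1e-12)
    return slic(features, n_segments=n_segments, compactness=compactness,
                convert2lab=False, start_label=0)
```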

3. Fuzzy Color Aura

In this section, we extend the aura concept introduced by Elfadel and Picard for gray-level images [40] to color images. We explain how it is used to characterize a superpixel thanks to a matrix of aura cardinals. As this matrix is huge when all RGB colors are considered, we propose to extend the color aura concept to fuzzy colors whose number is reduced.

3.1. Color Aura Set

Let $I$ be an RGB image defined on a lattice $\mathcal{S}$, such that each site $s \in \mathcal{S}$ is characterized by its color $I(s)$. For a given color $x \in \mathrm{RGB}$, we define the set of sites with this color as $\mathcal{S}_x = \{ s \in \mathcal{S} \mid I(s) = x \}$. Then, $\{\mathcal{S}_x\}_{x \in \mathrm{RGB}}$ is a partition of $\mathcal{S}$, i.e., $\bigcup_{x \in \mathrm{RGB}} \mathcal{S}_x = \mathcal{S}$ and $\mathcal{S}_x \cap \mathcal{S}_{x'} = \emptyset$ for $x \neq x'$.
Given two colors $(x, x') \in \mathrm{RGB}^2$, we define the color aura set $A_{\mathcal{S}_x}(\mathcal{S}_{x'})$ of $\mathcal{S}_{x'}$ with respect to $\mathcal{S}_x$ as:
$$A_{\mathcal{S}_x}(\mathcal{S}_{x'}) = \bigcup_{s \in \mathcal{S}} \left( N^{s}_{\mathcal{S}_{x'}} \cap \mathcal{S}_x \right), \qquad (2)$$
where $N^{s}_{\mathcal{S}_{x'}}$ is the neighboring site set of each site $s \in \mathcal{S}_{x'}$:
$$N^{s}_{\mathcal{S}_{x'}} = \{ r \in \mathcal{S},\ s \in \mathcal{S}_{x'} \mid r \in N_s \}. \qquad (3)$$
$N_s \triangleq \{ r \in \mathcal{S},\ r \neq s \mid \| r - s \|_\infty \leq d \}$ is the neighborhood of a site $s$. In this paper, $d = 1$, so that $N_s$ is only composed of the eight closest sites to $s$.
$A_{\mathcal{S}_x}(\mathcal{S}_{x'})$ is the subset of $\mathcal{S}_x$ composed of the sites that are present in the neighborhood of those of $\mathcal{S}_{x'}$. It provides an interpretation of the presence of $\mathcal{S}_x$ in the neighborhood of $\mathcal{S}_{x'}$. The color aura set is a generalization of the gray-level aura set introduced by Elfadel and Picard [40] to the color case.
Figure 2a shows a color image, defined on a lattice $\mathcal{S}$ of $7 \times 7$ pixels, and three color sets, $\mathcal{S}_R$, $\mathcal{S}_G$, and $\mathcal{S}_B$. For the neighborhood $N_s$, the aura set $A_{\mathcal{S}_G}(\mathcal{S}_B)$ of $\mathcal{S}_B$ with respect to $\mathcal{S}_G$ is composed of the green sites marked as circles. Note that it differs from the aura set $A_{\mathcal{S}_B}(\mathcal{S}_G)$ of $\mathcal{S}_G$ with respect to $\mathcal{S}_B$, composed of the blue sites marked as diamonds.

3.2. Color Aura Set in a Superpixel

Rather than building an aura set from all image sites, we focus on a superpixel $\mathcal{P}_p$ and define the color aura set $A^{p}_{\mathcal{S}_x}(\mathcal{S}_{x'})$ of $\mathcal{S}_{x'}$ with respect to $\mathcal{S}_x$ for the superpixel $\mathcal{P}_p$ as:
$$A^{p}_{\mathcal{S}_x}(\mathcal{S}_{x'}) = \bigcup_{s \in \mathcal{S}} \left( N^{p,s}_{\mathcal{S}_{x'}} \cap \mathcal{S}_x \right), \qquad (4)$$
where $N^{p,s}_{\mathcal{S}_{x'}}$ is the neighboring site set of each site $s \in \mathcal{S}_{x'}$ within the superpixel $\mathcal{P}_p$:
$$N^{p,s}_{\mathcal{S}_{x'}} = \{ r \in \mathcal{S},\ s \in \mathcal{S}_{x'} \cap \mathcal{P}_p \mid r \in N_s \cap \mathcal{P}_p \}. \qquad (5)$$
The image of Figure 2a contains two textures, one represented by red and green vertical stripes on the left three columns, and another represented by red and blue horizontal stripes on the right three columns. These two textures are separated by the fourth column, composed of red, green, and blue sites. Figure 2b shows the partitioning of this image into two superpixels delimited by a vertical black line: $\mathcal{P}_1$ covers the four left columns and $\mathcal{P}_2$ the three right ones. The aura sets $A^{1}_{\mathcal{S}_G}(\mathcal{S}_B)$ and $A^{1}_{\mathcal{S}_B}(\mathcal{S}_G)$ are not empty, since $\mathcal{P}_1$ contains neighboring green and blue sites, but $A^{2}_{\mathcal{S}_G}(\mathcal{S}_B)$ and $A^{2}_{\mathcal{S}_B}(\mathcal{S}_G)$ are empty because $\mathcal{P}_2$ contains no green site. Figure 2c shows another partition where $\mathcal{P}_1$ covers the three left columns and $\mathcal{P}_2$ the four right ones. In that case, $A^{1}_{\mathcal{S}_G}(\mathcal{S}_B)$ and $A^{1}_{\mathcal{S}_B}(\mathcal{S}_G)$ are empty because $\mathcal{P}_1$ contains no blue site. This example illustrates that color aura sets depend on superpixel edges. Indeed, along the fourth column that separates the two textures, only one green site (on the first row) and one blue site (on the second row) belong to the aura sets of superpixels in both Figure 2b,c.

3.3. Color Aura Cardinal

The aura measure was introduced by Elfadel and Picard [40] to characterize an aura set by a number that expresses the amount of mixing between neighboring site sets. We use here a simpler measure [39] and quantify the color aura set of a color site set $\mathcal{S}_{x'}$ with respect to another color site set $\mathcal{S}_x$ within the superpixel $\mathcal{P}_p$ thanks to its cardinal, defined as:
$$m_p(x, x') = \left| A^{p}_{\mathcal{S}_x}(\mathcal{S}_{x'}) \right|. \qquad (6)$$
The aura cardinals measured for all the possible pairs of color sets $(\mathcal{S}_x, \mathcal{S}_{x'})$, $(x, x') \in \mathrm{RGB}^2$, are gathered in a matrix. This color aura cardinal matrix $\mathbf{m}_p = \left[ m_p(x, x') \right]$ can then be considered as the texture feature of the superpixel $\mathcal{P}_p$. However, when the color components of the image are defined on 256 levels, as is classically the case, the number of possible colors reaches $256^3$ and $\mathbf{m}_p$ would be of size $256^3 \times 256^3$. Such a huge memory requirement is not tractable in practice, so we propose to decrease the number of analyzed colors by introducing fuzzy colors.
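To make the definition concrete, the following sketch counts the crisp aura cardinals of Equation (6) for one superpixel of an image whose colors have already been quantized to a small palette (full RGB would be intractable, as noted above); the 8-neighborhood corresponds to $d = 1$, and the function and variable names are ours.
```python
import numpy as np

# 8-neighborhood offsets (d = 1).
OFFSETS = [(di, dj) for di in (-1, 0, 1) for dj in (-1, 0, 1) if (di, dj) != (0, 0)]

def aura_cardinal_matrix(color_idx, superpixel_mask, n_colors):
    """Crisp color aura cardinal matrix of one superpixel.

    color_idx       : 2-D array of palette indices (pre-quantized colors)
    superpixel_mask : boolean 2-D array, True inside the superpixel P_p
    Returns M such that M[x, xp] = m_p(x, xp): the number of sites of color x
    inside P_p having at least one neighbor of color xp inside P_p.
    """
    h, w = color_idx.shape
    M = np.zeros((n_colors, n_colors), dtype=int)
    for i, j in zip(*np.nonzero(superpixel_mask)):
        neighbor_colors = set()
        for di, dj in OFFSETS:
            ni, nj = i + di, j + dj
            if 0 <= ni < h and 0 <= nj < w and superpixel_mask[ni, nj]:
                neighbor_colors.add(color_idx[ni, nj])
        # Site (i, j), of color x, belongs to the aura set of S_xp with respect
        # to S_x for each distinct color xp found among its neighbors.
        for xp in neighbor_colors:
            M[color_idx[i, j], xp] += 1
    return M
```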

3.4. Fuzzy Color

To form a small subset $\mathcal{C}$ of $C$ colors among the $256^3$ possible ones, we use the uniform quantization technique for its simplicity of implementation. For each color component $k \in \{R, G, B\}$, the full level range $[\![0, 255]\!]$ is divided into $C_k$ disjoint intervals $[\![0, L_k - 1]\!]$, $[\![L_k, 2L_k - 1]\!]$, $\ldots$, $[\![(C_k - 1) L_k, 255]\!]$, of respective width $L_k \triangleq 256 / C_k$. The $C_k$ centers $\{ (L_k - 1)/2, (3L_k - 1)/2, \ldots, 256 - (L_k + 1)/2 \}$ of the intervals define the $k$-th component of the colors in $\mathcal{C}$. The numbers $C_R$, $C_G$, and $C_B$ are chosen so that the number of colors $C = C_R \cdot C_G \cdot C_B$ is much lower than $256^3$.
In the fuzzy framework [38], a fuzzy color $\tilde{c}$ is characterized by its membership function $\mu_{\tilde{c}} : \mathrm{RGB} \rightarrow [0, 1]$. The membership degree $\mu_{\tilde{c}}(x)$ of any color $x \in \mathrm{RGB}$ is defined using its infinity-norm or Euclidean distance to the crisp counterpart $c \in \mathcal{C}$ of $\tilde{c}$, thanks to either:
  • the crisp membership function:
    $$\mu_{\tilde{c}}(x) = \begin{cases} 1 & \text{if } \| x - c \|_\infty \leq \frac{L_k}{2}, \\ 0 & \text{otherwise}, \end{cases} \qquad (7)$$
  • the symmetrical Gaussian function:
    $$\mu_{\tilde{c}}(x) = \exp\left( - \frac{\| x - c \|_2^2}{2 \alpha^2} \right), \qquad (8)$$
  • the triangular function:
    $$\mu_{\tilde{c}}(x) = \max\left( 1 - \frac{\| x - c \|_2}{\beta},\ 0 \right), \qquad (9)$$
  • or the fuzzy C-means (FCM) membership function:
    $$\mu_{\tilde{c}}(x) = \left[ \sum_{c' \in \mathcal{C}} \left( \frac{\| x - c \|_2}{\| x - c' \|_2} \right)^{\frac{2}{\zeta - 1}} \right]^{-1}. \qquad (10)$$
Here, $\alpha$ and $\beta$ are real positive constants used to control the span of the fuzzy color, and $\zeta$ is any real number greater than 1. In this paper, we set $\beta = (L_R, L_G, L_B)$, $\alpha = \beta / (2\sqrt{2 \ln 2})$, and $\zeta = 2$. The parameters $\alpha$ and $\beta$ are chosen to ensure that $\mu_{\tilde{c}}(x) = 0.5$ at the bounds of each color domain of width $(L_R, L_G, L_B)$ centered at $c$. Figure 3 shows the shapes of the four membership functions computed at $c = (192, 64, 64)$, for the $(R, G)$ plane.
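The sketch below illustrates the palette construction and the four membership functions with the parameter values stated above; for simplicity it assumes the same number of levels per channel, so that $\alpha$ and $\beta$ are scalars, which is our own simplification of the vector-valued setting used in this paper.
```python
import numpy as np

# Simplifying assumption for this sketch: C_R = C_G = C_B, so the interval
# width L and hence alpha and beta are scalars (not the authors' code).
C_PER_CHANNEL = 4
L = 256 / C_PER_CHANNEL                           # interval width L_k
BETA = L                                          # beta
ALPHA = BETA / (2 * np.sqrt(2 * np.log(2)))       # mu = 0.5 at the domain bounds
ZETA = 2.0

# Interval centers (L-1)/2, (3L-1)/2, ... for each channel, combined into C colors.
axis = (L - 1) / 2 + L * np.arange(C_PER_CHANNEL)
CENTERS = np.array(np.meshgrid(axis, axis, axis, indexing="ij")).reshape(3, -1).T

def memberships(x):
    """Degrees of membership of an RGB color x to every fuzzy color of the palette."""
    diff = np.asarray(x, dtype=float) - CENTERS
    d = np.linalg.norm(diff, axis=1)              # Euclidean distance to each center
    crisp = (np.abs(diff).max(axis=1) <= L / 2).astype(float)
    gaussian = np.exp(-d**2 / (2 * ALPHA**2))
    triangular = np.maximum(1 - d / BETA, 0)
    w = d**(-2 / (ZETA - 1))                      # FCM weights (x must not be a center)
    fcm = w / w.sum()
    return crisp, gaussian, triangular, fcm

print(np.round(memberships((192, 64, 64))[1], 3))  # Gaussian membership degrees
```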

3.5. Fuzzy Color Aura Set in a Superpixel

A fuzzy color site set $\mathcal{S}_c$, $c \in \mathcal{C}$, is defined by its membership function $\mu_{\mathcal{S}_c}$. The membership degree $\mu_{\mathcal{S}_c}(s)$ of each site $s \in \mathcal{S}$ to $\mathcal{S}_c$ is the membership degree $\mu_{\tilde{c}}(I(s))$ of its color $I(s) \in \mathrm{RGB}$ to the fuzzy color $\tilde{c}$ [38]:
$$\mu_{\mathcal{S}_c}(s) = \mu_{\tilde{c}}(I(s)). \qquad (11)$$
From there, we define the fuzzy color aura set $A^{p}_{\mathcal{S}_c}(\mathcal{S}_{c'})$ of $\mathcal{S}_{c'}$ with respect to $\mathcal{S}_c$, for any color pair $(c, c') \in \mathcal{C}^2$, as the fuzzy site set with the following membership degree at each site $r \in \mathcal{S}$:
$$\mu_{A^{p}_{\mathcal{S}_c}(\mathcal{S}_{c'})}(r) = \min\left( \sup_{s \in \mathcal{S}} \min\left( \mu_{\mathcal{S}_{c'}}(s),\ n_p(r, s) \right),\ \mu_{\mathcal{S}_c}(r) \right). \qquad (12)$$
The neighborhood function $n_p(r, s)$ expresses the membership degree of $r$ to the neighborhood of $s$ within the superpixel $\mathcal{P}_p$, $p \in [\![1, P]\!]$. This function may have any support size and shape, and may take any real value between 0 and 1. In the simplest binary case, we design it as:
$$n_p(r, s) = \begin{cases} 1 & \text{if } (s \in \mathcal{P}_p) \wedge (r \in N_s \cap \mathcal{P}_p), \\ 0 & \text{otherwise}. \end{cases} \qquad (13)$$
As a justification of Equation (12), we consider the fuzzy color aura set by analogy with the crisp case [39]. The crisp set union operator $\bigcup$ of Equation (4) is transcribed by the fuzzy operator $\sup$ to get the membership degree of the fuzzy color aura set of $\mathcal{S}_{c'}$ with respect to $\mathcal{S}_c$ at each site $r \in \mathcal{S}$:
$$\mu_{A^{p}_{\mathcal{S}_c}(\mathcal{S}_{c'})}(r) = \sup_{s \in \mathcal{S}} \mu_{N^{p,s}_{\mathcal{S}_{c'}} \cap \mathcal{S}_c}(r). \qquad (14)$$
The crisp set intersection operator $\cap$ is transcribed by the fuzzy operator $\min$, such that the fuzzy counterpart of the color site set $N^{p,s}_{\mathcal{S}_{x'}} \cap \mathcal{S}_x$ of Equation (4) is defined by its membership function given by:
$$\mu_{N^{p,s}_{\mathcal{S}_{c'}} \cap \mathcal{S}_c}(r) = \min\left( \mu_{N^{p,s}_{\mathcal{S}_{c'}}}(r),\ \mu_{\mathcal{S}_c}(r) \right). \qquad (15)$$
The fuzzy counterpart of $N^{p,s}_{\mathcal{S}_{x'}}$ (see Equation (5)) is defined for the fuzzy color site set $\mathcal{S}_{c'}$, $c' \in \mathcal{C}$, by the following membership degree at each site $r \in \mathcal{S}$:
$$\mu_{N^{p,s}_{\mathcal{S}_{c'}}}(r) = \min\left( \mu_{\mathcal{S}_{c'}}(s),\ n_p(r, s) \right). \qquad (16)$$
Plugging Equation (16) into (15) and the result into (14) provides the definition (12) of the fuzzy aura set after swapping the first two operators.

3.6. Fuzzy Color Aura Cardinal

The fuzzy color aura cardinal of $\mathcal{S}_{c'}$ with respect to $\mathcal{S}_c$ within $\mathcal{P}_p$ is defined exactly as in the crisp case (see Equation (6)) and directly follows from Equation (12):
$$\tilde{m}_p(c, c') = \sum_{r \in \mathcal{P}_p} \mu_{A^{p}_{\mathcal{S}_c}(\mathcal{S}_{c'})}(r) = \sum_{r \in \mathcal{P}_p} \min\left( \sup_{s \in \mathcal{P}_p} \min\left( \mu_{\mathcal{S}_{c'}}(s),\ n_p(r, s) \right),\ \mu_{\mathcal{S}_c}(r) \right). \qquad (17)$$
Following the crisp scheme, we define a fuzzy color aura matrix (FCAM) as the collection of all fuzzy color aura cardinals of a superpixel for the $C^2$ color pairs $(c, c') \in \mathcal{C}^2$. Note that to define the FCAM, we only consider a few of the $256^3$ colors on which the RGB image is defined, namely, the set $\mathcal{C}$ of $C$ colors such that $C \ll 256^3$. This provides a compact FCAM of small size $C \times C$ that is a suitable color texture descriptor of a superpixel for image segmentation purposes.
To make superpixels of different sizes comparable, their FCAMs are normalized element-wise so that their elements sum up to one:
$$\bar{m}_p(c, c') = \frac{\tilde{m}_p(c, c')}{\sum_{(y, y') \in \mathcal{C}^2} \tilde{m}_p(y, y')}. \qquad (18)$$
Finally, each superpixel $\mathcal{P}_p$ of an image $I$ is characterized by $C^2$ features that are the elements of its normalized FCAM $\bar{\mathbf{m}}_p$.
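A direct (unoptimized) transcription of Equations (17) and (18) with the binary neighborhood function of Equation (13) might look as follows; the membership map is assumed to have been computed beforehand with one of the functions of Section 3.4 (Equation (11)), and the code is our own sketch rather than the authors' implementation.
```python
import numpy as np

# 8-neighborhood offsets (d = 1), as in Section 3.1.
OFFSETS = [(di, dj) for di in (-1, 0, 1) for dj in (-1, 0, 1) if (di, dj) != (0, 0)]

def fcam(mu, superpixel_mask):
    """Normalized FCAM of one superpixel (Equations (17) and (18)).

    mu              : (H, W, C) array; mu[r, c] is the membership degree of
                      site r to the fuzzy color site set S_c (Equation (11))
    superpixel_mask : boolean (H, W) array, True inside P_p
    """
    h, w, n_colors = mu.shape
    m = np.zeros((n_colors, n_colors))
    for i, j in zip(*np.nonzero(superpixel_mask)):
        # sup over the neighbors s of r within P_p of mu_{S_c'}(s), for every c'.
        sup_neigh = np.zeros(n_colors)
        for di, dj in OFFSETS:
            ni, nj = i + di, j + dj
            if 0 <= ni < h and 0 <= nj < w and superpixel_mask[ni, nj]:
                sup_neigh = np.maximum(sup_neigh, mu[ni, nj])
        # m[c, c'] += min(sup_neigh[c'], mu_{S_c}(r)) for every pair (c, c').
        m += np.minimum(sup_neigh[np.newaxis, :], mu[i, j][:, np.newaxis])
    total = m.sum()
    return m / total if total > 0 else m          # Equation (18)

# Example use for superpixel p of a label map: m_bar = fcam(mu, labels == p)
```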

4. Experiments

In this section, we first present the experimental dataset and explain how FCAM features are used to segment its color texture images. To evaluate the performance of the proposed approach, we then assess the regional SLIC algorithm, discuss parameter settings, and study the relevance of FCAM features. We finally compare the segmentation results achieved by our supervised segmentation approach with those obtained by several state-of-the-art color texture segmentation methods.

4.1. Experimental Setup

4.1.1. Dataset

As the experimental dataset, we used the challenging Prague texture segmentation benchmark [57]. It contains 20 color texture mosaics to be segmented (input test images), some of which are shown in Figure 4 (top) with their corresponding ground truth segmentation images. Each of these images represents from 3 to 12 classes and has been synthetically generated from the same number of original color texture images. The original Prague dataset is composed of 89 images (one image per texture class) grouped into 10 categories of natural and artificial textures. Figure 4 (bottom) shows some of these images, which form the training dataset. All images of the Prague database are of size 512 × 512 pixels.

4.1.2. Color Texture Image Segmentation Based on FCAMs

Our method of color texture image segmentation based on FCAMs is a supervised superpixel classification procedure. Each test image $I$ represents $K_I$ classes whose pixels should be retrieved by the segmentation. To this end, we used a simple artificial neural network known as the extreme learning machine (ELM), which comes with a very fast learning algorithm [58]. It consists of three fully-connected neuron layers: an input layer of $C^2$ neurons that receives the FCAM elements, a single hidden layer, and an output layer with $K_I$ neurons. The number of neurons in the hidden layer was empirically set to $100 \cdot K_I$. The initial weights of the hidden neurons were set to random values, and the output weights were determined according to a least-squares solution [58].
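Since an ELM reduces to a fixed random projection followed by a linear least-squares fit, its principle can be sketched in a few lines, as below. Only the layer sizes ($C^2 = 256$ inputs, $100 \cdot K_I$ hidden neurons, $K_I$ outputs) follow the text; the sigmoid activation and the random data are our own illustrative choices.
```python
import numpy as np

class ELM:
    """Single-hidden-layer extreme learning machine: random hidden weights,
    least-squares output weights (Huang et al. [58])."""

    def __init__(self, n_hidden, seed=0):
        self.n_hidden = n_hidden
        self.rng = np.random.default_rng(seed)

    def _hidden(self, X):
        # Sigmoid activation of the (fixed) random hidden layer.
        return 1.0 / (1.0 + np.exp(-(X @ self.W + self.b)))

    def fit(self, X, y, n_classes):
        self.W = self.rng.standard_normal((X.shape[1], self.n_hidden))
        self.b = self.rng.standard_normal(self.n_hidden)
        targets = np.eye(n_classes)[y]                 # one-hot class targets
        H = self._hidden(X)
        self.beta = np.linalg.pinv(H) @ targets        # least-squares solution
        return self

    def predict(self, X):
        return np.argmax(self._hidden(X) @ self.beta, axis=1)

# Illustrative sizes from this section: 16 x 16 FCAMs flattened to C^2 = 256
# features, K_I classes, 100 * K_I hidden neurons (the data here is random).
K_I, T = 6, 1000
X_train = np.random.rand(K_I * T, 256)                 # normalized FCAMs of prototypes
y_train = np.repeat(np.arange(K_I), T)                 # class labels of prototypes
elm = ELM(n_hidden=100 * K_I).fit(X_train, y_train, n_classes=K_I)
superpixel_classes = elm.predict(np.random.rand(400, 256))
```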
The proposed color texture segmentation of any test image $I$ based on FCAMs follows the two successive stages outlined in Figure 5. The ELM is first trained with a set of $K_I \cdot T$ training samples, where $T$ is the number of prototype sites per class that are randomly selected on each of the $K_I$ training images. Each training sample is the FCAM feature computed over a square patch $\mathcal{W}_t$, $t \in \{1, \ldots, T\}$, centered at a prototype site and of size $(2W+1)^2$. The trained ELM is then used to segment $I$ as follows. First, $I$ is segmented into superpixels using the regional SLIC method. Then, the FCAM of each superpixel is fed as input into the trained ELM, whose output provides the estimated texture class. All sites in the superpixel are finally assigned to this class. A refinement procedure can be applied to further improve segmentation accuracy. It consists of reassigning superpixels smaller than 0.5% of the image size to the label of the largest adjacent superpixel [21]. All steps of the proposed color texture image segmentation are summarized in Algorithm 2.
Algorithm 2: Color texture image segmentation.
Input: Test image $I$, $K_I$ training images
Parameters: Number $T$ of prototypes per class, number $P$ of superpixels, number $C$ of fuzzy colors, patch half-width $W$, membership function $\mu_{\tilde{c}}$.
Step 1: Training stage
1. In each training image, randomly select $T$ prototype sites.
2. At each prototype site, compute the normalized FCAM $\bar{\mathbf{m}}_t$ over a square patch $\mathcal{W}_t$ of size $(2W+1)^2$ using Equation (18).
3. Train the ELM classifier with the $K_I \cdot T$ normalized FCAMs of the prototypes.
Step 2: Segmentation stage
1. Run regional SLIC (Algorithm 1) on $I$ to provide $P$ superpixels $\{\mathcal{P}_p\}_{p=1}^{P}$.
2. Compute the normalized FCAM $\bar{\mathbf{m}}_p$ of each superpixel $\mathcal{P}_p$ using Equation (18).
3. Feed $\bar{\mathbf{m}}_p$ into the trained ELM and assign each pixel in $\mathcal{P}_p$ to the ELM output class.
Step 3: Refinement (optional)
Output: Segmented image

4.1.3. Regional SLIC—Preliminary Assessment

To demonstrate the relevance of the proposed regional SLIC algorithm, we compare its results with those achieved by the basic SLIC algorithm using four standard metrics. The achievable segmentation accuracy (ASA) quantifies the segmentation performance achievable by assigning each superpixel to the ground truth region with the highest overlap [55,59]. The boundary recall (BR) assesses the boundary adherence with respect to the ground truth boundaries [60]. The under-segmentation error (UE) evaluates the segmentation boundary accuracy as the overlap between superpixels and ground truth regions [55,59]. The compactness (COM) measures the compactness of superpixels [60]. Higher BR, ASA, and COM, and a smaller UE, indicate better pre-segmentation.
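As an example of how such metrics can be computed, a possible transcription of the ASA is given below; it assumes 0-based integer label maps and is our own sketch, not the benchmark code.
```python
import numpy as np

def achievable_segmentation_accuracy(superpixels, ground_truth):
    """ASA: fraction of pixels correctly labeled when every superpixel is
    assigned to the ground-truth region it overlaps most (0-based label maps)."""
    n_sp = superpixels.max() + 1
    n_gt = ground_truth.max() + 1
    # Contingency table: overlap[p, g] = pixels shared by superpixel p and region g.
    overlap = np.zeros((n_sp, n_gt), dtype=np.int64)
    np.add.at(overlap, (superpixels.ravel(), ground_truth.ravel()), 1)
    return overlap.max(axis=1).sum() / superpixels.size
```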
Both the basic and regional SLIC algorithms require one to set the number of superpixels. This number must be small enough for the superpixels to be large enough to contain homogeneous textures, and large enough for the superpixels to finely fit the boundaries between textures. We empirically found that $P = 400$ is a good trade-off for the Prague dataset. Table 1 shows the average and standard deviation of the four metrics obtained by the basic and modified SLIC algorithms over the 20 images of the Prague dataset. From this table, we can see that the results achieved by the regional SLIC are better than those obtained by the basic SLIC according to all metrics.

4.1.4. Parameter Settings

The proposed color texture segmentation requires one to set several parameters. For the training stage, the number of prototypes was set to $T = 1000$, since CNN-based methods use 1000 training texture mosaic images for each image to segment [23]. The size $(2W+1)^2$ of the patch centered at each prototype site and used for FCAM computation during the training stage was set according to the image size $N$ and the number $P$ of superpixels such that $(2W+1)^2 \approx N/P$. As $N = 512 \times 512$ and $P = 400$, we have $N/P \approx 655$ and $\sqrt{655} \approx 25.6$, so the patch size was set to $27 \times 27$ pixels (i.e., $W = 13$).
To characterize a site by an FCAM in the segmentation stage, we have to set both the number $C$ of fuzzy colors and the membership function $\mu_{\tilde{c}}$ that defines the membership degree of each site $s$ to any fuzzy color $\tilde{c}$ according to its color $I(s)$. We consider very few colors in order to evaluate how far the memory cost of FCAMs can be reduced while preserving their relevance as texture features. Specifically, the number of fuzzy colors took one of the following values: $C = 2 \times 2 \times 1$, $2 \times 2 \times 2$, $2 \times 3 \times 2$, $2 \times 4 \times 2$, $3 \times 3 \times 2$, $3 \times 4 \times 2$, or $4 \times 4 \times 2$, so that the FCAM size was $4 \times 4$, $8 \times 8$, $12 \times 12$, $16 \times 16$, $18 \times 18$, $24 \times 24$, or $32 \times 32$, respectively. Note that we privileged the G color component over R and B because it is similar to luminance; other combinations provide close classification accuracy results. Regarding the membership function, we had to choose one among the four functions (crisp, Gaussian, triangular, or FCM) presented in Section 3.4.
We retained a given number of fuzzy colors and a membership function on the grounds of segmentation accuracy. To make this choice independent of the classification algorithm, we used the nearest neighbor classifier (1NN) instead of the ELM, with no refinement step. Figure 6 shows the average accuracy obtained with each membership function according to the number $C$ of fuzzy colors over the 20 images. From this figure, we can see that the fuzzy membership functions (Gaussian, triangular, and FCM) largely outperform the crisp one, especially when the number of fuzzy colors is low. However, when $C \geq 12$, all fuzzy membership functions lead to similar segmentation accuracies, even if the Gaussian membership function seems to perform slightly better. Moreover, accuracies do not vary significantly beyond $C = 16$. In the following, all experiments were therefore performed with $C = 16$ fuzzy colors and the Gaussian membership function.

4.2. Comparison with Other Fuzzy Texture Features

We here compare the relevance of FCAMs with respect to other fuzzy texture features, namely, the fuzzy gray-level co-occurrence matrix (FGLCM) [33], the fuzzy color co-occurrence matrix (FCCM) [38], and the fuzzy gray-level aura matrix (FGLAM) [39]. Note that FGLCM and FCCM are similar to a fuzzy color aura matrix computed with a fuzzy aura local measure, as shown in [39]. FGLCM and FGLAM features are formed by the concatenation of the marginal features of the three R, G, and B components, whereas a single FCCM captures the interactions among neighboring pixel colors. For a fair comparison, all these matrices were computed with the same parameters as the FCAM (Gaussian membership function, $C_R = C_G = C_B = 16$ for FGLCM and FGLAM, and $C = 16$ for FCCM).
The comparison is based on the segmentation accuracy as the main performance measure and on the computational time required by each feature extraction method. Experiments were carried out using the 1NN classifier instead of the ELM, on a computer with an Intel Core i7 3.60 GHz CPU and 8 GB of RAM. The results over the 20 images of the Prague dataset are summarized in Table 2. They clearly show that FCAM is more relevant, three times faster to compute, and requires less memory than marginal features (FGLCM and FGLAM). FCAM is also more efficient and slightly faster to compute than FCCM.

4.3. Comparison with State-of-the-Art Supervised Segmentation Methods

In this section, we compare the results obtained by the proposed segmentation method with the results of several state-of-the-art segmentation methods, on the Prague dataset. The assessment of segmentation performance was based on the conventional measures provided on the Prague texture segmentation website [57], which include: (1) region-based criteria: correct segmentation (CS), over-segmentation (OS), under-segmentation (US), missed error (ME), and noise error (NE); (2) pixel-wise based criteria: omission error (O), commission error (C), class accuracy (CA), recall (CO), precision (CC), type I error (I.), type II error (II.), mean class accuracy estimate (EA), mapping score (MS), root mean square proportion estimation error (RM), and comparison index (CI); (3) consistency-error criteria: global consistency error (GCE) and local consistency error (LCE).
The methods involved in the comparison were: (1) the MRF algorithm based on a Markov random field pixel classification model [11], (2) the COF algorithm that uses the co-occurrence features and the 1NN classifier [57], (3) the Con-Col algorithm [57], (4) the supervised fully convolutional network for texture (FCNT) algorithm [23] without refinement or (5) with refinement, (6) the empirical-wavelet-transform-based fully convolutional network for texture (EWT-FCNT) [23] that combines the empirical wavelet transform with FCNT, (7) U-Net [24], (8) the deep visual model (DA) [25], (9) the pyramid scene parsing network (PSP-Net) [26], and (10) our proposed method without refinement (FCAM nr) or (11) with refinement (FCAM wr).
The segmentation results of EWT-FCNT, FCNT, MRF, COF, and Con-Col were taken from the Prague benchmark website [57]; those of U-Net, DA, and PSP-Net were taken from [23]. All these methods except MRF, COF, and Con-Col, involve a refinement step to improve performance. Regarding the number of prototypes, no information is available for MRF, COF, and Con-Col. In contrast, the CNN-based methods used a training set of 1000 texture mosaic images of size 512 × 512 pixels, specifically created from the original Prague dataset, for each image to segment.
From Table 3, we can see that our proposed method, with and even without refinement, outperforms the classical supervised methods, namely, MRF, COF, and Con-Col. In contrast, EWT-FCNT and the other CNN-based methods provided better segmentation results than our method. However, by carefully analyzing these results, we found that, except for the EWT-FCNT method, which provided exceptional results, our method provided results similar to those of U-Net, FCNT, DA, and PSP-Net. For example, the gaps between its accuracy (CO = 95.20% with refinement) and those of U-Net, FCNT, DA, and PSP-Net were lower than 1.7%. We can also see that our FCAM-based method did not suffer from either under-segmentation or over-segmentation (the US and OS measures are equal to 0), unlike the deep learning methods. It is also noticeable that our method without refinement outperformed FCNT without refinement according to most of the criteria.
To analyze the segmentation results more thoroughly, Table 4 presents the accuracy obtained on each individual test image by our proposed FCAM method; by the handcrafted supervised methods MRF, COF, and Con-Col; and by the two CNN-based methods, FCNT (with refinement) and EWT-FCNT. Among the compared methods, per-image segmentation results are only available on the Prague benchmark website [57] for EWT-FCNT, FCNT, MRF, COF, and Con-Col. Table 4 shows that the accuracy obtained by our FCAM method was higher than 96% for half of the 20 test images and that it outperformed MRF, COF, and Con-Col for almost all images. For 6 of the 20 test images, our method also outperformed FCNT and ranked second behind EWT-FCNT. Even better, it provided the best accuracy for image 11. For the other images, the accuracy obtained by our method is close to that of FCNT, though for images 7, 8, 10, and 13, the obtained accuracy is poor (below 91%). This explains why the average accuracy dropped to 95.20%.
Figure 7 shows a visual comparison of two handcrafted methods (COF and Con-Col) and two deep learning-based ones (FCNT wr and EWT-FCNT) with the proposed method. The results of these methods are publicly available online [57]. Overall, FCAM provided satisfactory visual segmentation results that are close to the ground truth. Unlike other methods, FCAM is not prone to over-segmentation, and classification errors almost only occurred at region boundaries. This is mainly due to our pre-segmentation based on regional SLIC that, despite its superiority over the basic SLIC, lacks the accuracy to correctly determine the boundaries between two or more different texture regions.
However, it is important to underline that deep learning-based methods (including EWT-FCNT) need to create a large training set (1000 images for each image to segment) from the original training database. Moreover, they require an expensive learning step to extract features and classify pixels. In order to get an overview on the computational costs of our approach compared to CNN-based methods, the following section gives the computational times measured during both training and segmentation phases.

4.4. Processing Time

In Section 4.2, we estimated the FCAM computational time only at superpixels. In order to provide an overview of the computational requirements of the entire FCAM-based color texture segmentation method, we estimated its overall runtime over the Prague dataset on a 3.60 GHz Intel Core i7 computer with 8 GB RAM. Table 5 displays the average computing time of our proposed method and those of the CNN-based methods (EWT-FCNT, FCNT, U-Net, DA, and PSP-Net). These processing times are separated into two parts: the training time and the segmentation time. The segmentation times for the CNNs were taken from [23]. They were measured on a laptop computer with a 2.5 GHz quad-core Intel Core i7 processor and 16 GB of memory, equipped with an external GTX 1080Ti GPU with 11 GB of memory. The training times of these CNNs were unfortunately not provided. From this table, we can see that the processing time required by our method during the segmentation stage is the highest, at about 41.14 s for a 512 × 512 image. FCAM computation for the superpixels consumes most of this time (33.12 s). The remaining time is mainly shared between the regional SLIC computation (8 s) and the classification of superpixels by the ELM (0.02 s). It should be noted that the reported computing time for our method was obtained with MATLAB code without any GPU acceleration; with such an enhancement, our method could therefore be considered for real-time applications.
In contrast, the total duration of the training stage for our method remained lower than 5 min and can be detailed as 260 s to compute the FCAM at the prototypes, and 0.9 s to train the ELM. The overall training time ( 260.9 s) was much lower than with CNNs, which typically require several hours on powerful computers equipped with large-memory GPUs.

5. Conclusions

In this paper, we introduced fuzzy color aura matrices (FCAMs) to locally characterize colors and textures, and applied them to color texture image segmentation. The FCAM feature makes it possible to locally characterize the interactions between colors of neighboring sites. A single low-dimensional FCAM is required to describe the color texture at each site, unlike in the marginal approach, where a fuzzy gray-level aura matrix (FGLAM) must be computed for each color channel.
The proposed color texture image segmentation is based on the classification of superpixels, generated by a modified version of the SLIC algorithm that incorporates regional information. An FCAM is then computed for each superpixel thanks to a locally-adaptive neighborhood function. The superpixels are finally classified using a simple supervised ELM classifier. Experiments on the Prague texture segmentation benchmark showed that the proposed color texture segmentation based on FCAMs outperforms the classical state-of-the-art segmentation methods and is competitive with recent methods based on deep learning. However, unlike CNN-based approaches, which require an expensive learning procedure and a large training set of segmented texture images whose construction is time-consuming, our method is applied straightforwardly from a much smaller database using ELM-based classification.
Despite its respectable performance, our segmentation method remains sensitive to pre-segmentation results. Although the regional SLIC improves segmentation results in comparison with the basic SLIC, the detection of color texture boundaries is still not accurate enough. In future work, we intend to use a texture-aware superpixel procedure that takes into account the properties of the available color spaces. The proposed segmentation method was developed in a supervised context; we also plan to adapt it to unsupervised color texture segmentation.

Author Contributions

Conceptualization, Z.H., K.H., O.L. and L.M.; methodology, Z.H., K.H., O.L. and L.M.; software, Z.H.; writing—review and editing, Z.H., K.H., O.L. and L.M. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Borovec, J.; Švihlík, J.; Kybic, J.; Habart, D. Supervised and unsupervised segmentation using superpixels, model estimation, and graph cut. J. Electron. Imaging 2017, 26, 061610. [Google Scholar] [CrossRef]
  2. Mikeš, S.; Haindl, M.; Scarpa, G.; Gaetano, R. Benchmarking of remote sensing segmentation methods. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2015, 8, 2240–2248. [Google Scholar] [CrossRef]
  3. Akbarizadeh, G.; Rahmani, M. Efficient combination of texture and color features in a new spectral clustering method for PolSAR image segmentation. Natl. Acad. Sci. Lett. 2017, 40, 117–120. [Google Scholar] [CrossRef]
  4. Zhang, C.; Zou, K.; Pan, Y. A method of apple image segmentation based on color-texture fusion feature and machine learning. Agronomy 2020, 10, 972. [Google Scholar] [CrossRef]
  5. Ilea, D.E.; Whelan, P.F. Image segmentation based on the integration of colour–texture descriptors–A review. Pattern Recognit. 2011, 44, 2479–2501. [Google Scholar] [CrossRef]
  6. Haindl, M.; Mikeš, S. A competition in unsupervised color image segmentation. Pattern Recognit. 2016, 57, 136–151. [Google Scholar] [CrossRef]
  7. Garcia-Lamont, F.; Cervantes, J.; López, A.; Rodriguez, L. Segmentation of images by color features: A survey. Neurocomputing 2018, 292, 1–27. [Google Scholar] [CrossRef]
  8. Bianconi, F.; Harvey, R.W.; Southam, P.; Fernández, A. Theoretical and experimental comparison of different approaches for color texture classification. J. Electron. Imaging 2011, 20, 043006. [Google Scholar] [CrossRef]
  9. Ledoux, A.; Losson, O.; Macaire, L. Color local binary patterns: Compact descriptors for texture classification. J. Electron. Imaging 2016, 25, 061404. [Google Scholar] [CrossRef]
  10. Liu, L.; Chen, J.; Fieguth, P.; Zhao, G.; Chellappa, R.; Pietikäinen, M. From BoW to CNN: Two decades of texture representation for texture classification. Int. J. Comput. Vis. 2019, 127, 74–109. [Google Scholar] [CrossRef]
  11. Kato, Z.; Pong, T.C.; Lee, J.C.M. Color image segmentation and parameter estimation in a Markovian framework. Pattern Recognit. Lett. 2001, 22, 309–321. [Google Scholar] [CrossRef]
  12. Nakyoung, O.; Choi, J.; Kim, D.; Kim, C. Supervised classification and segmentation of textured scene images. In Proceedings of the International Conference on Consumer Electronics (ICCE 2015), Las Vegas, NV, USA, 9–12 January 2015; IEEE: Piscataway, NJ, USA, 2015; pp. 473–476. [Google Scholar] [CrossRef]
  13. Yang, S.; Lv, Y.; Ren, Y.; Yang, L.; Jiao, L. Unsupervised images segmentation via incremental dictionary learning based sparse representation. Inf. Sci. 2014, 269, 48–59. [Google Scholar] [CrossRef]
  14. Laboreiro, V.R.S.; de Araujo, T.P.; Bessa Maia, J.E. A texture analysis approach to supervised face segmentation. In Proceedings of the IEEE Symposium on Computers and Communications (ISCC 2014), Funchal, Portugal, 23–26 June 2014; IEEE: Piscataway, NJ, USA, 2014; pp. 1–6. [Google Scholar] [CrossRef]
  15. Panjwani, D.; Healey, G. Markov random field models for unsupervised segmentation of textured color images. IEEE Trans. Pattern Anal. Mach. Intell. 1995, 17, 939–954. [Google Scholar] [CrossRef]
  16. Chen, K.M.; Chen, S.Y. Color texture segmentation using feature distributions. Pattern Recognit. Lett. 2002, 23, 755–771. [Google Scholar] [CrossRef]
  17. Salmi, A.; Hammouche, K.; Macaire, L. Constrained feature selection for semisupervised color-texture image segmentation using spectral clustering. J. Electron. Imaging 2021, 30, 013014. [Google Scholar] [CrossRef]
  18. Jenicka, S. Supervised Texture-Based Segmentation Using Basic Texture Models. In Land Cover Classification of Remotely Sensed Images; Springer: Berlin/Heidelberg, Germany, 2021; pp. 73–88. [Google Scholar]
  19. Al-Kadi, O.S. Supervised texture segmentation: A comparative study. In Proceedings of the IEEE Jordan Conference on Applied Electrical Engineering and Computing Technologies 2011 (AEECT 2011), Amman, Jordan, 6–8 December 2011; IEEE: Piscataway, NJ, USA, 2011; pp. 1–5. [Google Scholar] [CrossRef]
  20. Yang, H.Y.; Wang, X.Y.; Wang, Q.Y.; Zhang, X.J. LS-SVM based image segmentation using color and texture information. J. Vis. Commun. Image Represent. 2012, 23, 1095–1112. [Google Scholar] [CrossRef]
  21. Andrearczyk, V.; Whelan, P.F. Texture segmentation with fully convolutional networks. arXiv 2017, arXiv:1703.05230. [Google Scholar]
  22. Garcia-Garcia, A.; Orts-Escolano, S.; Oprea, S.; Villena-Martinez, V.; Martinez-Gonzalez, P.; Garcia-Rodriguez, J. A survey on deep learning techniques for image and video semantic segmentation. Appl. Soft Comput. 2018, 70, 41–65. [Google Scholar] [CrossRef]
  23. Huang, Y.; Zhou, F.; Gilles, J. Empirical curvelet based fully convolutional network for supervised texture image segmentation. Neurocomputing 2019, 349, 31–43. [Google Scholar] [CrossRef]
  24. Ronneberger, O.; Fischer, P.; Brox, T. U-Net: Convolutional networks for biomedical image segmentation. In Proceedings of the International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI 2015), Munich, Germany, 5–9 October 2015; Springer: Cham, Switzerland, 2015; pp. 234–241. [Google Scholar] [CrossRef]
  25. Wang, W.; Shen, J. Deep visual attention prediction. IEEE Trans. Image Process. 2017, 27, 2368–2378. [Google Scholar] [CrossRef]
  26. Zhao, H.; Shi, J.; Qi, X.; Wang, X.; Jia, J. Pyramid Scene Parsing Network. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2017), Honolulu, HI, USA, 21–26 July 2017; IEEE: Piscataway, NJ, USA, 2017; pp. 6230–6239. [Google Scholar] [CrossRef]
  27. Yuan, J.; Wang, D.; Cheriyadat, A.M. Factorization-based texture segmentation. IEEE Trans. Image Process. 2015, 24, 3488–3497. [Google Scholar] [CrossRef] [PubMed]
  28. Yang, Y.; Han, S.; Wang, T.; Tao, W.; Tai, X.C. Multilayer graph cuts based unsupervised color–texture image segmentation using multivariate mixed Student’s t-distribution and regional credibility merging. Pattern Recognit. 2013, 46, 1101–1124. [Google Scholar] [CrossRef]
  29. Akbulut, Y.; Guo, Y.; Şengür, A.; Aslan, M. An effective color texture image segmentation algorithm based on hermite transform. Appl. Soft Comput. 2018, 67, 494–504. [Google Scholar] [CrossRef]
  30. Barcelo, A.; Montseny, E.; Sobrevilla, P. Fuzzy texture unit and fuzzy texture spectrum for texture characterization. Fuzzy Sets Syst. 2007, 158, 239–252. [Google Scholar] [CrossRef]
  31. Keramidas, E.; Iakovidis, D.; Maroulis, D. Fuzzy binary patterns for uncertainty-aware texture representation. Electron. Lett. Comput. Vis. Image Anal. 2011, 10, 63–78. [Google Scholar] [CrossRef]
Figure 1. SLIC pre-segmentation results ($P = 400$ superpixels). $f_R$ in (c) is rescaled to $[\![0, 255]\!]$. SLIC: simple linear iterative clustering.
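To give a concrete idea of how such a pre-segmentation can be produced, the snippet below runs basic SLIC with the scikit-image implementation; the file names and parameter values (400 superpixels, default compactness) are illustrative assumptions and do not reproduce the regional SLIC variant proposed in the paper.

```python
# Minimal sketch of a basic SLIC pre-segmentation (not the regional variant
# introduced in the paper). File names and parameter values are assumptions.
import numpy as np
from skimage import io
from skimage.segmentation import slic, mark_boundaries

image = io.imread("texture.png")               # hypothetical input image
labels = slic(image, n_segments=400,           # P = 400 superpixels, as in Figure 1
              compactness=10, start_label=0)   # compactness is an assumed value

# Overlay the superpixel boundaries for visual inspection.
overlay = mark_boundaries(image, labels)
io.imsave("slic_overlay.png", (overlay * 255).astype(np.uint8))
```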
Figure 2. Examples of two aura sets in a color image. (a) Original image composed of 3 color site sets, $S_R$, $S_G$, and $S_B$, and aura sets $A_{S_G}(S_B)$ (circles) and $A_{S_B}(S_G)$ (diamonds). (b) Superpixels $P_1$ (left) and $P_2$ (right), and aura sets $A^1_{S_G}(S_B)$ and $A^1_{S_B}(S_G)$, each identified by its own marker in the figure; $A^2_{S_G}(S_B)$ and $A^2_{S_B}(S_G)$ are empty. (c) Superpixels $P_1$ (left) and $P_2$ (right), and aura sets $A^2_{S_G}(S_B)$ and $A^2_{S_B}(S_G)$, each identified by its own marker in the figure; $A^1_{S_G}(S_B)$ and $A^1_{S_B}(S_G)$ are empty.
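To make the aura sets illustrated above more tangible, the sketch below computes the crisp (non-fuzzy) aura matrix of a small label image, following the classical aura-measure definition $m(S_i, S_j) = \sum_{s \in S_i} |N(s) \cap S_j|$; the fixed 4-neighborhood and the toy image are assumptions for illustration, whereas the paper additionally uses fuzzy color memberships and an adaptive neighborhood restricted to superpixels.

```python
# Crisp aura matrix sketch: entry (i, j) counts, over all sites of color i,
# how many of their neighbors have color j. The paper's FCAM weights these
# contributions by fuzzy color memberships instead of hard counts.
import numpy as np

def aura_matrix(labels: np.ndarray, n_labels: int) -> np.ndarray:
    """Aura matrix of a 2-D label image with a fixed 4-neighborhood."""
    A = np.zeros((n_labels, n_labels), dtype=np.int64)
    h, w = labels.shape
    for y in range(h):
        for x in range(w):
            i = labels[y, x]
            for dy, dx in ((-1, 0), (1, 0), (0, -1), (0, 1)):
                ny, nx = y + dy, x + dx
                if 0 <= ny < h and 0 <= nx < w:
                    A[i, labels[ny, nx]] += 1
    return A

# Toy example with 3 "color" site sets (labels 0, 1, 2).
toy = np.array([[0, 1, 1],
                [2, 2, 1],
                [0, 2, 1]])
print(aura_matrix(toy, 3))
```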
Figure 3. Membership functions $\mu_{\tilde{c}}$ at $c = (192, 64, 64)$ with $C_R = C_G = C_B = 2$.
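The membership functions themselves are defined in the body of the paper; purely as an illustration of how one color component can be shared between $C$ fuzzy levels, the sketch below uses triangular membership functions, which is an assumption about their shape rather than the paper's exact definition.

```python
# Illustrative sketch of per-channel fuzzy memberships (assumed triangular).
# With C = 2 fuzzy levels per channel, level k is centered at centers[k] and
# a component value contributes to at most two adjacent levels.
import numpy as np

def triangular_memberships(value: float, n_levels: int = 2,
                           v_max: float = 255.0) -> np.ndarray:
    centers = np.linspace(0.0, v_max, n_levels)
    width = v_max / (n_levels - 1)
    mu = np.clip(1.0 - np.abs(value - centers) / width, 0.0, 1.0)
    return mu  # memberships sum to 1 for values inside [0, v_max]

# Example: the red component of c = (192, 64, 64) with C_R = 2 levels.
print(triangular_memberships(192.0))   # approximately [0.25, 0.75]
```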
Figure 4. Examples of test images (top two rows) and training images (from the “flowers” and “man-made” categories, bottom two rows) from the Prague dataset. Note that the framed test image represents $K_I = 6$ classes from the sole “flowers” category.
Figure 5. Flowchart of the proposed color texture image segmentation (SLIC: simple linear iterative clustering, FCAM: fuzzy color aura matrix, ELM: extreme learning machine).
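The last stage of the flowchart classifies each superpixel with an extreme learning machine (ELM). As a minimal sketch of a standard single-hidden-layer ELM (random input weights, closed-form least-squares output weights), not necessarily the exact configuration used in the experiments, consider:

```python
# Minimal extreme learning machine (ELM) sketch: random hidden layer followed
# by a least-squares solution for the output weights. The hidden-layer size
# and sigmoid activation are illustrative assumptions.
import numpy as np

class ELM:
    def __init__(self, n_hidden: int = 500, seed: int = 0):
        self.n_hidden = n_hidden
        self.rng = np.random.default_rng(seed)

    def _hidden(self, X):
        return 1.0 / (1.0 + np.exp(-(X @ self.W + self.b)))  # sigmoid units

    def fit(self, X, y):
        n_classes = int(y.max()) + 1
        self.W = self.rng.normal(size=(X.shape[1], self.n_hidden))
        self.b = self.rng.normal(size=self.n_hidden)
        T = np.eye(n_classes)[y]              # one-hot targets
        H = self._hidden(X)
        self.beta = np.linalg.pinv(H) @ T     # closed-form output weights
        return self

    def predict(self, X):
        return np.argmax(self._hidden(X) @ self.beta, axis=1)

# Hypothetical usage: X holds one FCAM-based feature vector per superpixel
# (e.g. 256-D) and y the texture class of each training superpixel.
# model = ELM().fit(X_train, y_train); labels = model.predict(X_test)
```

Because the output weights are obtained in closed form, training such a classifier remains fast even when every superpixel of the training images is used as a sample.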
Figure 6. Average accuracy (%) vs. number of colors ($C$) for different membership functions.
Figure 7. Sample segmentation results on the Prague dataset, from left to right: image 01, image 03, image 06, image 15, and image 19.
Table 1. Basic and regional SLIC performances on the Prague dataset. Arrows denote the metric direction (the higher ↑ or the lower ↓, the better), and best results are in bold.

Metric          ↑BR            ↓UE            ↑ASA           ↑COM
Basic SLIC      0.46 ± 0.15    0.36 ± 0.08    0.79 ± 0.05    0.56 ± 0.05
Regional SLIC   0.52 ± 0.13    0.32 ± 0.08    0.82 ± 0.05    0.63 ± 0.04
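As a reminder of how one of these superpixel quality metrics is obtained, the sketch below computes the achievable segmentation accuracy (ASA) from a superpixel map and a ground-truth segmentation, following its usual definition (each superpixel is assigned to the ground-truth region it overlaps most); it is given only as an illustration, not as the benchmark's reference implementation.

```python
# ASA sketch: assign each superpixel to the ground-truth region it overlaps
# most; ASA is the resulting fraction of correctly covered pixels, i.e. the
# upper bound of any labeling that is constant over each superpixel.
import numpy as np

def asa(superpixels: np.ndarray, ground_truth: np.ndarray) -> float:
    """Both inputs are 2-D arrays of non-negative integer labels."""
    correct = 0
    for sp in np.unique(superpixels):
        gt_under_sp = ground_truth[superpixels == sp]
        # Pixel count of the dominant ground-truth region inside this superpixel.
        correct += np.bincount(gt_under_sp).max()
    return correct / superpixels.size
```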
Table 2. Average accuracy (%) and computation time (s) for different fuzzy features on the Prague dataset. “nr” means no segmentation refinement, and “wr” means with segmentation refinement.

Feature   Size             Accuracy nr (%)   Accuracy wr (%)   Comp. time (s)
FGLCM     3 × 16² = 768    86.69             90.97             123.53
FCCM      16² = 256        86.52             90.77             37.62
FGLAM     3 × 16² = 768    87.24             91.28             112.32
FCAM      16² = 256        87.82             92.00             33.12
Table 3. Results of supervised methods on the Prague dataset (20 test images). Arrows ↑, ↓ denote the required criterion direction; “nr” means no segmentation refinement, and “wr” means with segmentation refinement. Best results are in bold, and the second best ones in italic.

Criteria   MRF     COF     Con-Col   FCNT nr   FCNT wr   EWT-FCNT   U-Net   DA      PSP-Net   FCAM nr   FCAM wr
↑ CS       46.11   52.48   84.57     87.52     96.01     98.45      96.71   94.18   96.45     84.24     91.27
↓ OS       0.81    0.00    0.00      0.00      1.56      0.00       1.71    0.00    0.17      0.00      0.00
↓ US       4.18    1.94    1.70      0.00      1.20      0.00       0.00    1.18    0.41      0.00      0.00
↓ ME       44.82   41.55   9.50      6.70      0.78      0.37       0.68    3.42    1.23      11.45     5.93
↓ NE       45.29   40.97   10.22     6.90      0.89      0.46       0.48    3.24    1.12      11.39     5.36
↓ O        14.52   20.74   7.00      7.46      2.72      0.93       0.72    3.13    2.75      4.91      2.96
↓ C        16.77   22.10   5.34      6.16      2.29      1.04       0.70    1.32    2.39      5.89      2.72
↑ CA       65.42   67.01   86.21     87.08     93.95     97.67      95.86   94.53   93.89     87.28     91.54
↑ CO       76.19   77.86   92.02     92.61     96.73     98.78      96.91   96.23   96.06     92.60     95.20
↑ CC       80.30   78.34   92.68     93.26     97.02     98.81      97.38   97.01   96.41     93.65     95.96
↓ I.       23.81   22.14   7.98      7.39      3.27      1.22       3.09    3.77    3.94      7.40      4.80
↓ II.      4.82    4.40    1.70      1.49      0.68      0.25       0.41    0.58    0.69      1.32      0.87
↑ EA       75.40   76.21   91.72     92.68     96.68     98.77      97.01   96.24   96.08     92.58     95.13
↑ MS       64.29   66.79   88.03     88.92     95.10     98.17      95.37   94.35   94.08     88.90     92.80
↓ RM       6.43    4.47    2.08      1.38      0.86      0.24       0.61    1.07    0.70      1.64      1.24
↑ CI       76.69   77.05   92.02     92.81     96.77     98.78      97.08   96.41   96.15     92.84     95.35
↓ GCE      25.79   23.94   11.76     12.54     5.55      2.33       2.13    3.50    4.67      11.60     7.45
↓ LCE      20.68   19.69   8.61      9.94      3.75      1.68       1.46    2.47    3.52      8.76      5.31
Table 4. Accuracy (CO) for each test image of the Prague dataset. “nr” means no segmentation refinement, and “wr” means with segmentation refinement. Best results are in bold, and the second best ones in italic.

Image     MRF     COF     Con-Col   FCNT wr   EWT-FCNT   FCAM nr   FCAM wr
01        99.79   96.19   96.92     99.12     99.91      99.54     99.72
02        77.36   93.02   92.70     97.71     99.51      92.66     96.79
03        95.60   96.56   92.33     97.47     99.21      98.66     99.03
04        73.20   68.63   93.73     98.41     98.77      96.38     98.58
05        89.72   89.67   91.64     96.64     98.95      92.95     93.83
06        84.00   57.59   95.78     97.30     99.52      96.97     97.64
07        67.74   58.08   89.71     96.09     96.71      79.65     84.70
08        58.12   71.27   90.10     95.67     98.80      84.16     90.21
09        72.95   61.29   93.23     96.70     99.35      94.90     95.47
10        71.52   58.97   85.72     92.52     96.69      86.36     89.51
11        61.28   80.88   96.82     96.72     95.32      94.44     97.12
12        81.55   63.01   87.20     96.03     99.51      92.74     95.84
13        74.35   84.32   77.02     96.10     98.48      87.99     90.11
14        91.29   90.32   96.51     97.75     99.17      94.25     96.39
15        57.77   79.61   96.26     97.70     99.56      95.31     98.61
16        61.33   60.63   91.19     94.31     99.46      86.23     93.92
17        62.74   72.31   91.92     96.96     99.38      91.21     95.10
18        77.81   92.96   96.16     98.24     99.58      97.18     98.54
19        76.07   94.98   97.02     98.55     99.78      97.64     99.39
20        89.68   86.87   88.48     94.66     97.91      92.82     93.44
Average   76.19   77.86   92.02     96.73     98.78      92.60     95.20
Table 5. Average computation time per image of FCNT, EWT-FCNT, U-Net, DA, PSP-Net, and FCAM on the Prague dataset.

Stage          FCNT      EWT-FCNT   U-Net     DA        PSP-Net    FCAM
Segmentation   3.61 ms   1.830 s    4.98 ms   3.80 ms   14.39 ms   41.14 s
Training       –         –          –         –         –          260.9 s