The Application of Tsallis Entropy Based Self-Adaptive Algorithm for Multi-Threshold Image Segmentation

Zhang, Kailong; He, Mingyue; Dong, Lijie; Ou, Congjie

doi:10.3390/e26090777

Open AccessArticle

The Application of Tsallis Entropy Based Self-Adaptive Algorithm for Multi-Threshold Image Segmentation

by

Kailong Zhang

,

Mingyue He

,

Lijie Dong

and

Congjie Ou

^*

College of Information Science and Engineering, Huaqiao University, Xiamen 361021, China

^*

Author to whom correspondence should be addressed.

Entropy 2024, 26(9), 777; https://doi.org/10.3390/e26090777

Submission received: 5 August 2024 / Revised: 7 September 2024 / Accepted: 8 September 2024 / Published: 10 September 2024

(This article belongs to the Special Issue Entropy and Information Theory in Machine Learning: Theoretical Insights and Applications)

Download

Browse Figures

Versions Notes

Abstract

:

Tsallis entropy has been widely used in image thresholding because of its non-extensive properties. The non-extensive parameter q contained in this entropy plays an important role in various adaptive algorithms and has been successfully applied in bi-level image thresholding. In this paper, the relationships between parameter q and pixels’ long-range correlations have been further studied within multi-threshold image segmentation. It is found that the pixels’ correlations are remarkable and stable for images generated by a known physical principle, such as infrared images, medical CT images, and color satellite remote sensing images. The corresponding non-extensive parameter q can be evaluated by using the self-adaptive Tsallis entropy algorithm. The results of this algorithm are compared with those of the Shannon entropy algorithm and the original Tsallis entropy algorithm in terms of quantitative image quality evaluation metrics PSNR (Peak Signal-to-Noise Ratio) and SSIM (Structural Similarity). Furthermore, we observed that for image series with the same background, the q values determined by the adaptive algorithm are consistently kept in a narrow range. Therefore, similar or identical scenes during imaging would produce similar strength of long-range correlations, which provides potential applications for unsupervised image processing.

Keywords:

tsallis entropy; long-range correlations; self-adaptive algorithm; multi-level thresholding; robustness

1. Introduction

In recent years, with the increase in digital imaging devices, image data have grown rapidly. Therefore, image processing has become more and more crucial in machine vision. During image processing, image segmentation is a fundamental step that divides an image into different regions by means of intensity, color, contour and so on. It has been successfully used in various fields [1,2,3,4] and the achievements are still growing. Technically, image segmentation mainly contains threshold-based segmentation, edge-based segmentation [5], clustering-based segmentation [6], and region-growing segmentation [7]. The threshold-based image segmentation has become the most frequently used method due to its simplicity, efficiency, and stability.

In 1980, Pun [8] first applied information entropy to image thresholding and it was improved by Kapur [9], who proposed the Maximum Shannon Entropy Thresholding algorithm. Kapur’s main idea is to treat the digital gray-level image as a matrix that contains pixels with different gray-level values. The gray-level histogram of the pixels can be considered as a kind of probability distribution. The entropy of the gray-level distribution can be maximized by a proper threshold, which is similar to maximizing the contrast between the object and the background of the image. If two or more objects exist in the same background, resulting in a multimodal gray-level distribution, the above-mentioned algorithm can be naturally extended to multi-threshold segmentation. It is worth noting that with the increasing number of thresholds, the computational complexity grows exponentially. In order to overcome this complexity and yield the optimal multi-threshold solution, swarm intelligence optimization algorithms are frequently used to solve such problems [10,11,12,13,14,15].

The concept of entropy was first proposed in thermo-statistical physics to deal with extensive systems [16]. Shannon entropy [17] inherits the extensivity and has been widely used in information theory. However, there are a lot of complex systems that present long-range interactions and the extensivities of them are broken. Extensive entropies are unsuitable to describe such systems anymore. Tsallis indicates a generalized entropic form [18] for those systems and their abnormal behaviors are well-fitted by the non-extensive parameter q. Over the years, Tsallis entropy has been applied not only in physics [19], but also in financial markets [20], seismology [21], bioinformatics [22], fractal networks [23], and so on. Regarding image segmentation, Tsallis entropy shows high adaptability for different types of target recognition [11,24,25] since the non-extensive parameter q is related to the strength of long-range correlations among image pixels. However, determining an appropriate value of q for a given image is still an open question in practice, since different types of images may present different patterns of correlations among pixels. Generally, the estimation of q values is performed empirically [26,27] in Tsallis entropy-based image segmentation, and the relationships between q values and the pixels’ correlations need to be further discussed.

In 2009, Rodrigues et al. [28] proposed a method to yield the optimal q values of images by maximizing the difference between q-dependent entropy and the upper limit of the histogram entropy, which sheds light on the patterns recognition of the pixels’ long-range correlations. In 2016, Ramírez-Reyes et al. [29] provided another method to obtain the non-extensive q values. It is based on the concept of redundancy in information theory and the entropy maximization principle, and has been successfully applied to the bi-level image thresholding [30]. While extending the bi-level segmentation to multi-level cases, with the increasing number of objects at different gray levels, the patterns of the pixels’ long-range correlation may also increase. In order to avoid the perturbations from uncertain interactions among pixels, the images generated by the unified imaging process should be adopted to illustrate the relationships between the q value and the long-range correlations of pixels. It is worth mentioning that in 2022, Mousavirad et al. [31] proposed a novel self-adaptive method to yield the optimal parameter r of Masi entropy, without relying on prior knowledge of histogram distribution or image type. This method demonstrated excellent performance in the multi-level segmentation of images with randomly natural scenes. Nevertheless, the histogram distribution may represent the correlations among pixels of an image. Therefore, it is of interest to study the relationships between the optimal entropic parameter q and the image histogram, which is different from the work of Mousavirad et al. [31] and can be applied to images generated by some known physical principles.

The rest of this paper is organized as follows. Section 2 briefly reviews the q-redundancy maximization method and introduces its application in multi-level image segmentation. In Section 3, according to the known physical principles, six image datasets are adopted for testing, and the quantitative image quality evaluation metrics such as PSNR and SSIM are introduced within multi-level image segmentation. In Section 4, the statistical results of different image datasets are illustrated so that the relationships between q-values and pixels’ long-range correlations are further discussed. In Section 5, the conclusions are presented.

2. Methods for Calculating Tsallis Entropy Index q and Image Segmentation

Assuming a given image size is

M \times N

, representing the total number of pixels in it. The range of gray-level of the image is defined as

i = 0, 1, 2, \dots, L - 1

, where L represents the maximum gray-level of the image, such as 256. Thus, the gray-level probability distribution of the image is defined as:

p_{i} = \frac{h_{i}}{M \times N},

(1)

where

h_{i}

is the number of pixels that the gray-level value is equal to i, and

p_{i} \geq 0, \sum_{i = 0}^{L - 1} p_{i} = 1

hold. Obviously,

\{p_{i}\}

represents the gray-level histogram distribution of the image. And Tsallis entropy is written as [18,19,24,32]:

S_{T} = \frac{1 - \sum_{i} p_{i}^{q}}{1 - q},

(2)

where q is the non-extensive index.

2.1. Entropy Index q and the Long-Range Correlation

Ramírez-Reyes et al. suggest that each complex system has its own entropy index, and it should not be determined arbitrarily. In practice, an image can be considered a non-extensive pixel system so that the long-range correlations among them can be quantified by q. Therefore, two fundamental concepts, redundancy and the maximum entropy principle, play important roles in evaluating the non-extensive parameter q. According to the non-extensive properties of Tsallis entropy, the q-redundancy of an image’s histogram can be written as [29]:

R_{T} = 1 - \frac{S_{T}}{S_{T max}},

(3)

where

S_{T max} = \frac{1 - L^{1 - q}}{q - 1}

. It means that the entropy reaches its maximal at the equiprobability case, i.e.,

p_{i} = p_{j} = 1 / L (\forall i, j)

. For a given image with a known gray-level histogram, the corresponding q-redundancy can be adjusted by parameter q. On the other hand, the histogram may exhibit long-range correlations among the pixels of the image. Therefore, maximizing the q-redundancy is a hopeful way to recognize the pattern of long-range correlations and yields a suitable non-extensive parameter q, i.e.,

q^{*} = \arg \max (R_{T}) .

(4)

With the help of

q^{*}

, the gray-level histogram of the image is re-normalized to deviate from equal probabilities as much as possible. This can result in a clearer representation of different clusters within the image, aiding in improving the quality of the image segmentation.

2.2. Multi-Level Thresholding Using Tsallis Entropy

Assuming that the gray-level histogram of an image is divided into

m + 1

parts by a set of thresholds

\vec{t} = (t_{1}, t_{2}, \dots, t_{m})

, denoted as

\vec{C} = (C_{0}, C_{1}, \dots, C_{m})

, after normalization, the probability distribution of each class is defined as:

\begin{matrix} C_{0} : \frac{p_{0}}{P_{0}}, \frac{p_{1}}{P_{0}}, \dots, \frac{p_{t_{1}}}{P_{0}} \\ ⋮ \\ C_{j} : \frac{p_{t_{j} + 1}}{P_{j}}, \frac{p_{t_{j} + 2}}{P_{j}}, \dots, \frac{p_{t_{j}}}{P_{j}}, \\ ⋮ \\ C_{m} : \frac{p_{t_{m} + 1}}{P_{m}}, \frac{p_{t_{m} + 2}}{P_{m}}, \dots, \frac{p_{L - 1}}{P_{m}} \end{matrix}

(5)

where the cumulative probabilities of

m + 1

categories are defined as:

\begin{matrix} P_{0} = \sum_{i = 0}^{t_{1}} p_{i} \\ ⋮ \\ P_{j} = \sum_{i = t_{j} + 1}^{t_{j + 1}} p_{i}, \\ ⋮ \\ P_{m} = \sum_{i = t_{m} + 1}^{L - 1} p_{i} \end{matrix}

(6)

Tsallis entropy of each region of

\vec{C} = (C_{0}, C_{1}, \dots, C_{m})

is obtained by the following definition:

\{\begin{matrix} \begin{matrix} S_{q}^{0} = \{1 - \sum_{i = 0}^{t_{1}} {(\frac{p_{i}}{P_{0}})}^{q}\} / (q - 1) \\ ⋮ \end{matrix} \\ S_{q}^{j} = \{1 - \sum_{i = t_{j} + 1}^{t_{j + 1}} {(\frac{p_{i}}{P_{j}})}^{q}\} / (q - 1) . \\ \begin{matrix} ⋮ \\ S_{q}^{m} = \{1 - \sum_{i = t_{m} + 1}^{L - 1} {(\frac{p_{i}}{P_{m}})}^{q}\} / (q - 1) \end{matrix} \end{matrix}

(7)

According to the pseudo-additivity property of Tsallis entropy, its multi-threshold objective function is defined as follows:

S_{q} (t_{1}, t_{2}, \dots, t_{m}) = \sum_{i} S_{q}^{i} + (1 - q) \sum_{j \neq k} S_{q}^{j} S_{q}^{k} + {(1 - q)}^{2} \sum_{u \neq v \neq w} S_{q}^{u} S_{q}^{v} S_{q}^{w} + \dots + {(1 - q)}^{m} \prod_{r = 0}^{m} S_{q}^{r} .

(8)

Maximizing the objective function

S_{q} (t_{1}, t_{2}, \dots, t_{m})

yields an optimal set of thresholds as follows:

{(\vec{t})}^{*} = arg max \{S_{q} (t_{1}, t_{2}, \dots, t_{m})\},

(9)

this algorithm is highly favored for its simplicity, intuitiveness, versatility, and excellent performance in image segmentation [33,34,35].

3. Image Test Sets and Quality Evaluation Parameters

There is a lot of evidence showing that parameter q has deep relevance with the long-range interaction in bi-level image segmentation [24,29,30,36]. However, extending bi-level thresholding to multi-level thresholding and drawing the conclusions seems not so straight. In fact, it is found that if the backgrounds of the images are of random natural scenes, the above algorithm does not exhibit significant advantages in comparison with the traditional Shannon algorithm and the original Tsallis algorithm [12]. In order to further show the relevance between pixels’ long-range correlations and nonextensivity during the imaging process, several different types of images are employed for comparison.

BSDS0500 is an image dataset consisting of randomly natural scenes. This dataset contains 500 images taken from real-world natural scenes, covering a variety of views and objects, including but not limited to modern urban landscapes, natural landscapes, animals and plants, human activities, and so on. These images provide diverse scenes and various visual information. Here, are a few examples from this dataset (Figure 1).

INFRAIMGS1, INFRAIMGS2, INFRAIMGS3, and INFRAIMGS4 are series of image datasets containing lots of infrared images captured by fixed infrared cameras at different moments. These datasets record specific activities and movements of objects in different scenes.

INFRAIMGS1: These images capture the activities of pedestrians and vehicles on two fixed outdoor road scenes. The dataset consists of 464 images extracted from frames, with a resolution of $550 \times 365$ .
INFRAIMGS2: This dataset depicts student activities at a fixed intersection near a teaching building. It comprises 264 images extracted from frames, with a resolution of $320 \times 240$ .
INFRAIMGS3: Presenting scenes fixed inside a cabin, focusing on the movement of individuals in the area. This dataset contains 253 images extracted from frames, capturing scenes of interaction and movement between individuals, with a resolution of $320 \times 240$ .
INFRAIMGS4: Showcasing scenes fixed in squares or similar open spaces, capturing pedestrians engaged in activities such as running, walking, or other leisure activities. The dataset comprises 118 images extracted from frames, with a resolution of $360 \times 240$ .

CTIMGS is a collection of medical chest CT images covering scans of chests from different patients. These images have a fixed black background, and the dataset comprises a total of 600 images, with a resolution of

224 \times 224

. Below are examples of images from these datasets.

In the same dataset of Figure 2, those images are taken from the same background and generated by the same imaging principle, i.e., infrared imaging for INFRAIMGS1-4 and X-ray imaging for CTIMGS. These specified types of images can help us to further understand the pixels’ long-range correlations in the imaging stage.

The images shown in Figure 3 are generated by satellite remote sensing that captures changes in the Yellowstone and Padma regions over many years, with a resolution of

720 \times 480

.

In order to evaluate the effectiveness of this self-adaptive multi-level segmentation algorithm, PSNR (Peak Signal-to-Noise Ratio) and SSIM (Structural Similarity Index) are adopted as quality indices. PSNR [37] represents the ratio of the peak signal to the noise. In image multiple-thresholding due to the gray-level compression, the output image is generally different from the original one. PSNR can precisely measure this difference and it is defined as:

P S N R = 10 {log}_{10} (\frac{255^{2}}{M S E}),

(10)

MSE in Equation (10) is the mean squared error between the output image and the input image, and 255 is the maximum gray-level value in the image in general. MSE can be written as:

M S E = \frac{1}{M \times N} \sum_{i = 1}^{M} \sum_{j = 1}^{N} {[I (i, j) - K (i, j)]}^{2},

(11)

where

I (i, j)

and

K (i, j)

represent the original image and the image after segmentation, respectively.

The typical PSNR values in image segmentation range from 10 dB to 50 dB [38]. A higher PSNR value indicates a smaller distortion in the output image and a higher quality of segmentation. PSNR closing to 50 dB indicates that the segmented image has very minor errors. If PSNR is greater than 30 dB, it is difficult for the human eyes to perceive differences between the segmented image and the original one. For PSNR ranging from 20 dB to 30 dB, the differences become noticeable to the human eyes. In the range of 10 dB to 20 dB, the differences become larger. Nevertheless, the human eyes can still recognize the main structures in the output image. If PSNR is below 10 dB, it becomes challenging for humans to determine if there are any correlations between the input and output images. PSNR is currently the most frequently used objective measure for evaluating image quality. However, many experimental results have shown that PSNR scores may not fully coincide with the visual quality perceived by the human eyes. It is possible for images with higher PSNR to appear worse in visual quality than those with lower PSNR scores since the human visual system’s sensitivity to errors is affected by a lot of factors that are more complicated than Equation (10).

SSIM [39] is another quality metric that measures the similarity between two digital images. The recognition criteria of the human visual system, such as luminance, contrast, and structural information, are taken into account [40,41] to yield the expression of SSIM as:

S S I M (x, y) = \frac{(2 μ_{x} μ_{y} + C_{1}) (2 σ_{x y} + C_{2})}{(μ_{x}^{2} + μ_{y}^{2} + C_{1}) (σ_{x}^{2} + σ_{y}^{2} + C_{2})},

(12)

where x and y represent the images before and after segmentation,

μ_{x}

and

μ_{y}

denote the mean intensity of the corresponding images,

σ_{x}^{2}

and

σ_{y}^{2}

represent the standard deviations respectively,

σ_{x y}

denotes the covariance of the images before and after segmentation,

C_{1}

and

C_{2}

are two constants to avoid zeros appearing in the denominator. Equation (12) shows that SSIM is a dimensionless value between 0 and 1, where smaller differences between the original and segmented images yield closer value to 1. Due to its simplicity and effectiveness, SSIM has been widely used in various applications related to image and video processing in recent years, such as image compression [42], image watermarking [43], wireless video streaming [44], and magnetic resonance imaging [45].

In practice, PSNR is more sensitive to additive Gaussian noise, while it exhibits lower sensitivity to JPEG compression. Conversely, SSIM is more sensitive to JPEG compression but relatively less responsive to additive Gaussian noise [37]. Therefore, we employ both PSNR and SSIM to assess the quality of the self-adaptive multi-level segmentation.

4. Experimental Results and Discussion

In order to show the detailed relevance between pixels’ long-range correlations and non-extensive entropy index q within a multi-level segmentation case, Shannon entropy and traditional Tsallis entropy are adopted as benchmarks to show the performance of the proposed self-adaptive algorithm. As mentioned above, Shannon entropy neglects the long-range correlations among image pixels and shows the extensive property. Tsallis entropy generalized the application scope of Shannon entropy by linking the strength of long-range correlations to non-extensive index q.

Images from the six datasets mentioned in Section 3 are processed by using three different multi-level segmentation algorithms, i.e., Shannon, and Tsallis (q = 0.8), and proposed, to yield the optimal results, respectively.

Figure 4 shows the four-level segmentation results of sample images from eight datasets. For the image from BSDS0500, the result of the Shannon algorithm looks closest to the original image, which indicates that the long-range correlations among image pixels can be neglected. Since the images in BSDS0500 are generated from random scenes, it is inadequate to say that the pixels of different images should always exhibit a long-range correlation. However, for images from the other seven datasets, the segmentation results of the proposed algorithm consistently show the superiority to the other two algorithms. Images in INFRAIMGS1-4 are generated by infrared cameras so infrared radiation plays an important role during the imaging process. It is well known that infrared radiation depends closely on the temperature of the objects. Therefore, the patterns of pixels’ long-range correlations actually reflect the temperature distributions of different objects in infrared images. This kind of long-range correlation can be successfully captured by the self-adaptive multi-level segmentation algorithm, which is more flexible than the traditional Tsallis entropy with a fixed q index. The evidence can also be found in the dataset of medical CT images, in which the pixels’ gray-level directly depend on the absorption of X-rays by different organs inside human body. Therefore, the gray-level values of pixels belonging to the same organ should have strong correlations and it is suitable to describe this kind of correlation by adaptive q index rather than fixed q. Moreover, the proposed algorithm is also effective to color images since color image can be considered as a combination of three colors (red, green, and blue), and the strength distribution of each color is similar to that of gray-level image. In fact, the satellite remote sensing images record the geography information of the earth’s surface so that the pixels belong to the same landform are correlated.

In order to show the segmentation results quantitatively, the output images of the three algorithms are compared with the corresponding original images in terms of PSNR and SSIM. Table 1 shows part of the PSNR results of BSDS0500 images by using Shannon, Tsallis (q = 0.8), and the proposed muti-level thresholding algorithms, in which the number of thresholds is 4. In each line of Table 1, the maximum PSNR value (with bold font) indicates that the corresponding algorithm is the most suitable one for the image named at the beginning of the line.

Therefore, for all 500 images of BSDS0500, one can statistically obtain the most suitable rates of three algorithms. They are 32% for the Shannon algorithm, 24.6% for the Tsallis algorithm with q = 0.8, and 55% for the proposed self-adaptive algorithm. It is worth mentioning that the sum of the above three most suitable rates slightly exceeds 100% because there are a few images, such as BSDS00109 in Table 1, that happen to obtain the same best result by using different algorithms. Nevertheless, the probability of such a case is small so that the images in the other five datasets can be processed in the same way. The statistical results are shown in Table 2.

Interestingly, unlike the distribution of the most suitable rates in BSDS0500, all the other five datasets show a notable tendency (the corresponding rates are far larger than 65%) to the proposed algorithm. Especially for INFRAIMGS2, all of the 264 images in it recognize the self-adaptive q as the most suitable values to present the pixels’ long-range correlations. The experimental results also show that the distribution of 264 q values ranges from 0.490 to 0.513, a very small interval. In fact, all images in INFRAIMGS2 have the same background and the ratios of the foreground (moving objects) to the full image size are small. It is reasonable to say that the strength of long-range correlations in each image of this dataset should be quite similar, but cannot be empirically determined by a fixed value. Other datasets that have the same characteristics with INFRAIMGS2, all exhibit the consistency in the range of q, such as

0.512 \leq q \leq 0.602

for INFRAIMGS1,

0.509 \leq q \leq 0.580

for INFRAIMGS3,

0.381 \leq q \leq 0.512

for INFRAIMGS4. These behaviors coincide with the imaging principles mentioned above.

The validity of self-adaptive q can be further confirmed by SSIM. Table 3 shows part of the SSIM results for the same images adopted in Table 1 by using three different multi-level thresholding algorithms, in which the number of thresholds is still 4. Since the definition of SSIM is totally different from that of PSNR, their responses to the same output image may not always be consistent with each other. Such as BSDS00116, according to the results of PSNR, Tsallis algorithm with q = 0.8 is suggested as the most suitable one, while Shannon algorithm yields the highest SSIM score. Nevertheless, the statistical results of the most suitable rates suggested by SSIM can also be obtained in the same way as Table 2, and they are shown in Table 4.

The sum of the most suitable rate for each dataset also slightly exceeds 100%, and the reason is similar to that in Table 2. It is found that all of the infra-image datasets show their preferences for the adaptive q as a measure of the strength of pixels’ long-range correlations under the criterion of SSIM. The statistical result of INFRAIMGS2 is the most notable one. All of the images in it adopt the proposed algorithm to achieve the highest scores defined by not only PSNR but also SSIM. Besides Table 2 and Table 4, the results of the most suitable rates over six datasets can be extended to the cases of a larger number of thresholds, as shown in Table 5, Table 6, Table 7 and Table 8.

Table 5 and Table 6 list the most suitable rates for the five-level segmentation of different datasets, where PSNR and SSIM are adopted as the criteria, respectively. And increasing the number of thresholds from 5 to 6, the results are listed in Table 7 and Table 8. Impressively, images of INFRAIMGS2 show their robust preferences for the proposed algorithm in spite of the increasing number of thresholds. This kind of robustness can also be found in other datasets, such as INFRAIMGS1, INFRAIMGS4, CTIMGS. Therefore, it is suitable to adopt the self-adaptive q value to measure the strength of long-range correlations within images generated by known physical principles. In other words, the physical properties of objects shown in the images can be connected to the non-extensive parameter q by maximizing the redundancy of the histogram distribution. It is worth mentioning that for INFRAIMGS3, the most suitable rate of proposed algorithm yields by PSNR keep decreasing when the number of thresholds grows. Since the gray-level gradations of images in INFRAIMGS3 are not plentiful, the increasing number of thresholds may lead to over-segmentation and the results evaluated by PSNR and SSIM become unstable. Nevertheless, in most cases, the proposed algorithm shows effectiveness (with the most suitable rate higher than 65%) and robustness (keeps fixed when the number of thresholds increases) in automatically detecting the long-range correlations among pixels of infrared images and medical images.

In Table 9, we compare the PSNR and SSIM results of multi-level segmentation for images of Figure 3 by using the same algorithms of Table 1, Table 2, Table 3, Table 4, Table 5, Table 6, Table 7 and Table 8. The results clearly show that the proposed self-adaptive algorithm consistently performs the best in most cases. Therefore, using the self-adaptive algorithm can more accurately capture the long-range correlation strength of surface features in satellite remote sensing images. This indicates that the proposed self-adaptive algorithm is suitable not only for grayscale images but also for color images that are generated by known physical principles.

In order to evaluate the robustness of three algorithms in different threshold levels, we randomly adopted an image to yield the optimal fitness values for comparison. For a given number of thresholds, each algorithm is independently applied to the image for 100 runs. Since the idea of swarm intelligence is included in the algorithm, the corresponding 100 results are stochastic to some extent. Nevertheless, the results can be sorted in ascending order and plotted with the rearranged sequence, as they are shown in Figure 5. Clearly, for different algorithms with different numbers of thresholds, all of the plotted results have a flat and long tail. It means that the highest fitness value can be reproduced a lot of times over 100 stochastic runs. Therefore, the above-mentioned algorithms are robust enough to yield reliable results.

5. Conclusions

In image segmentation, determining the non-extensive parameter q of Tsallis entropy is an intriguing task. Since the value of q represents the strength of long-range interactions among pixels of the images that are generated by some known physical principles, it cannot be determined empirically. In the present paper, with the help of maximizing q-redundancy, we further study the connections between the physical properties of objects shown in the images and the self-adaptive value of q in multi-threshold image segmentation. In comparison with the Shannon entropy algorithm and the traditional Tsallis entropy algorithm with q = 0.8, it is found that the self-adaptive algorithm shows high effectiveness and robustness to infrared images, medical CT images, and color satellite remote sensing images. The superiority and consistency of the present algorithm are qualitatively illustrated by means of PSNR and SSIM when the number of thresholds is set as 4, 5, and 6, respectively. In addition, for a series of images generated by the same process and sharing the same background, the long-range correlation pattern among pixels should be quite similar. The self-adaptive q values of those images are also quite similar, as expected. All of these advantages will be helpful for the further applications of Tsallis entropy in multi-level image segmentation.

Author Contributions

Conceptualization, C.O. and K.Z.; methodology, C.O.; software, K.Z.; validation, K.Z. and M.H.; formal analysis, C.O. and K.Z.; investigation, K.Z. and L.D.; resources, L.D.; data curation, K.Z.; writting—original draft preparation, K.Z.; writing—review and editing, C.O.; visualization, K.Z.; supervision, C.O. All authors have read and agreed to the published version of the manuscript.

Funding

The authors would like to thank the support by the National Natural Science Foundation of China (No. 11775084), the Program for prominent Talents in Fujian Province, and Scientific Research Foundation for the Returned Overseas Chinese Scholars.

Institutional Review Board Statement

Not applicable.

Data Availability Statement

The original contributions presented in the study are included in the article, further inquiries can be directed to the corresponding author.

Acknowledgments

The authors would like to thank https://www2.eecs.berkeley.edu (accessed on 17 December 2023), http://vcipl-okstate.org/pbvs/bench/ (accessed on 17 December 2023), https://wiki.cancerimagingarchive.net/display/Public/CT+Images+in+COVID-19 (accessed on 17 December 2023) and https://earthobservatory.nasa.gov/features (accessed on 25 January 2024) for providing source images.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Hosny, K.M.; Khalid, A.M.; Hamza, H.M.; Mirjalili, S. Multilevel thresholding satellite image segmentation using chaotic coronavirus optimization algorithm with hybrid fitness function. Neural Comput. Appl. 2023, 35, 855–886. [Google Scholar] [CrossRef]
Abualigah, L.; Habash, M.; Hanandeh, E.S.; Hussein, A.M.; Shinwan, M.A.; Zitar, R.A.; Jia, H. Improved reptile search algorithm by salp swarm algorithm for medical image segmentation. J. Bionic Eng. 2023, 20, 1766–1790. [Google Scholar] [CrossRef]
Xing, Z.; He, Y. Many-objective multilevel thresholding image segmentation for infrared images of power equipment with boost marine predators algorithm. Appl. Soft Comput. 2021, 113, 107905. [Google Scholar] [CrossRef]
Khudov, H.; Makoveichuk, O.; Butko, I.; Gyrenko, I.; Stryhun, V.; Bilous, O.; Shamrai, N.; Kovalenko, A.; Khizhnyak, I.; Khudov, R. Devising a method for segmenting camouflaged military equipment on images from space surveillance systems using a genetic algorithm. East.-Eur. J. Enterp. Technol. 2022, 117, 9. [Google Scholar] [CrossRef]
Iannizzotto, G.; Vita, L. Fast and accurate edge-based segmentation with no contour smoothing in 2-D real images. IEEE Trans. Image Process. 2000, 9, 1232–1237. [Google Scholar] [CrossRef]
Wang, L.; Yu, B.; Chen, F.; Li, C.; Li, B.; Wang, N. A cluster-based partition method of remote sensing data for efficient distributed image processing. Remote Sens. 2022, 14, 4964. [Google Scholar] [CrossRef]
Tang, J. A color image segmentation algorithm based on region growing. In Proceedings of the 2010 2nd International Conference on Computer Engineering and Technology, Bali Island, Indonesia, 19–21 March 2010; IEEE: Piscataway, NJ, USA, 2010; Volume 6, p. V6-634. [Google Scholar] [CrossRef]
Pun, T. A new method for grey-level picture thresholding using the entropy of the histogram. Signal Process. 1980, 2, 223–237. [Google Scholar] [CrossRef]
Kapur, J.N.; Sahoo, P.K.; Wong, A.K.C. A new method for gray-level picture thresholding using the entropy of the histogram. Comput. Vis. Graph. Image Process. 1985, 29, 273–285. [Google Scholar] [CrossRef]
Agrawal, S.; Panda, R.; Bhuyan, S.; Panigrahi, B.K. Tsallis entropy based optimal multilevel thresholding using cuckoo search algorithm. Swarm Evol. Comput. 2013, 11, 16–30. [Google Scholar] [CrossRef]
Bhari, A.K.; Kumar, A.; Singh, G.K. Tsallis entropy based multilevel thresholding for colored satellite image segmentation using evolutionary algorithms. Expert Syst. Appl. 2015, 42, 8707–8730. [Google Scholar] [CrossRef]
Sharma, A.; Chaturvedi, R.; Kumar, S.; Dwivedi, U.K. Multi-level image thresholding based on Kapur and Tsallis entropy using firefly algorithm. J. Interdiscip. Math. 2020, 23, 563–571. [Google Scholar] [CrossRef]
Zhao, D.; Liu, L.; Yu, F.; Heidari, A.A.; Wang, M.; Oliva, D.; Muhammad, K.; Chen, H. Ant colony optimization with horizontal and vertical crossover search: Fundamental visions for multi-threshold image segmentation. Expert Syst. Appl. 2021, 167, 114122. [Google Scholar] [CrossRef]
Abdel-Basset, M.; Mohamed, R.; AbdelAziz, N.M.; Abouhawwash, M. HWOA: A hybrid whale optimization algorithm with a novel local minima avoidance method for multi-level thresholding color image segmentation. Expert Syst. Appl. 2022, 190, 116145. [Google Scholar] [CrossRef]
Wang, S.; Fan, J. Simplified expression and recursive algorithm of multi-threshold Tsallis entropy. Expert Syst. Appl. 2024, 237, 121690. [Google Scholar] [CrossRef]
Pathria, R.K. Statistical Mechanics, 2nd ed.; Elsevier (Singapore) Pte Ltd.: Singapore, 2001; pp. 9–28. ISBN 981-4095-20-6. [Google Scholar]
Shannon, C.E. A mathematical theory of communication. Bell Syst. Tech. J. 1948, 27, 379–423. [Google Scholar] [CrossRef]
Tsallis, C. Possible generalization of Boltzmann-Gibbs statistics. J. Stat. Phys. 1988, 52, 479–487. [Google Scholar] [CrossRef]
Tsallis, C.I. Nonextensive statistical mechanics and thermodynamics: Historical background and present status. In Nonextensive Statistical Mechanics and Its Applications; Springer: Berlin/Heidelberg, Germany, 2001; pp. 3–98. [Google Scholar] [CrossRef]
Tsallis, C. Inter-occurrence times and universal laws in finance, earthquakes and genomes. Chaos Solitons Fractals 2016, 88, 254–266. [Google Scholar] [CrossRef]
Sigalotti, L.D.G.; Ramírez-Rojas, A.; Vargas, C.A. Tsallis q-Statistics in Seismology. Entropy 2023, 25, 408. [Google Scholar] [CrossRef]
Pavlos, G.P.; Karakatsanis, L.P.; Iliopoulos, A.C.; Pavlos, E.G.; Xenakis, M.N.; Clark, P.; Duke, J.; Monos, D.S. Measuring complexity, nonextensivity and chaos in the DNA sequence of the Major Histocompatibility Complex. Phys. A Stat. Mech. Its Appl. 2015, 438, 188–209. [Google Scholar] [CrossRef]
Zhang, Q.; Luo, C.; Li, M.; Deng, Y.; Mahadevan, S. Tsallis information dimension of complex networks. Phys. A Stat. Mech. Its Appl. 2015, 419, 707–717. [Google Scholar] [CrossRef]
De Albuquerque, M.P.; Esquef, I.A.; Mello, A.R.G. Image thresholding using Tsallis entropy. Pattern Recognit. Lett. 2004, 25, 1059–1065. [Google Scholar] [CrossRef]
Raja, N.S.M.; Fernandes, S.L.; Dey, N.; Satapathy, S.C.; Rajinikanth, V. Contrast enhanced medical MRI evaluation using Tsallis entropy and region growing segmentation. J. Ambient. Intell. Humaniz. Comput. 2024, 1–12. [Google Scholar] [CrossRef]
Tsallis, C. Nonextensive Statistical Mechanics: Construction and Physical Interpretation. In TNonextensive Entropy: Interdisciplinary Applications; Gell-Mann, M., Tsallis, C., Eds.; Oxford University Press: New York, NY, USA, 2004; pp. 2–55. ISBN 0-19-515976-4. [Google Scholar]
Tsallis, C. Nonadditive entropy and nonextensive statistical mechanics—An overview after 20 years. Braz. J. Phys. 2009, 39, 337–356. [Google Scholar] [CrossRef]
Rodrigues, P.S.; Giraldi, G.A. Computing the q-index for Tsallis nonextensive image segmentation. In Proceedings of the 2009 XXII Brazilian Symposium on Computer Graphics and Image Processing, Rio de Janeiro, Brazil, 11–15 October 2009; IEEE: Piscataway, NJ, USA, 2009; pp. 232–237. [Google Scholar] [CrossRef]
Ramírez-Reyes, A.; Hernández-Montoya, A.R.; Herrera-Corral, G.; Domínguez-Jiménez, I. Determining the entropic index q of Tsallis entropy in images through redundancy. Entropy 2016, 18, 299. [Google Scholar] [CrossRef]
Deng, Q.; Shi, Z.; Ou, C. Self-adaptive image thresholding within nonextensive entropy and the variance of the gray-level distribution. Entropy 2022, 24, 319. [Google Scholar] [CrossRef] [PubMed]
Mousavirad, S.J.; Oliva, D.; Chakrabortty, R.K.; Zabihzadeh, D.; Hinojosa, S. Population-based self-adaptive Generalised Masi Entropy for image segmentation: A novel representation. Knowl.-Based Syst. 2022, 245, 108610. [Google Scholar] [CrossRef]
Dos Santos, R.J.V. Generalization of Shannon’s theorem for Tsallis entropy. J. Math. Phys. 1997, 38, 4104–4107. [Google Scholar] [CrossRef]
Naidu, M.S.R.; Kumar, P.R.; Chiranjeevi, K. Shannon and fuzzy entropy based evolutionary image thresholding for image segmentation. Alex. Eng. J. 2018, 57, 1643–1655. [Google Scholar] [CrossRef]
Zou, Y.; Zhang, J.; Upadhyay, M.; Sun, S.; Jiang, T. Automatic image thresholding based on Shannon entropy difference and dynamic synergic entropy. IEEE Access 2020, 8, 171218–171239. [Google Scholar] [CrossRef]
Ifan Roy Thanaraj, R.; Anand, B.; Allen Rahul, J.; Rajinikanth, V. Appraisal of breast ultrasound image using Shannon’s thresholding and level-set segmentation. In Progress in Computing, Analytics and Networking, Proceedings of the ICCAN 2019, Bhubaneswar, India, 14–15 December 2019; Springer: Berlin/Heidelberg, Germany, 2020; pp. 621–630. [Google Scholar] [CrossRef]
Lin, Q.; Ou, C. Tsallis entropy and the long-range correlation in image thresholding. Signal Process. 2012, 92, 2931–2939. [Google Scholar] [CrossRef]
Hore, A.; Ziou, D. Image quality metrics: PSNR vs. SSIM. In Proceedings of the 2010 20th International Conference on Pattern Recognition, Istanbul, Turkey, 23–26 August 2010; IEEE: Piscataway, NJ, USA, 2010; pp. 2366–2369. [Google Scholar] [CrossRef]
Sheikh, H.R.; Bovik, A.C. Image information and visual quality. IEEE Trans. Image Process. 2006, 15, 430–444. [Google Scholar] [CrossRef]
Wang, Z.; Bovik, A.C.; Sheikh, H.R.; Simoncelli, E.P. Image quality assessment: From error visibility to structural similarity. IEEE Trans. Image Process. 2004, 13, 600–612. [Google Scholar] [CrossRef] [PubMed]
Wang, Z.; Bovik, A.C. Mean squared error: Love it or leave it? A new look at signal fidelity measures. IEEE Signal Process. Mag. 2009, 26, 98–117. [Google Scholar] [CrossRef]
Sheikh, H.R.; Sabir, M.F.; Bovik, A.C. A statistical evaluation of recent full reference image quality assessment algorithms. IEEE Trans. Image Process. 2006, 15, 3440–3451. [Google Scholar] [CrossRef]
Richter, T.; Kim, K.J. A MS-SSIM optimal JPEG 2000 encoder. In Proceedings of the 2009 Data Compression Conference, Snowbird, UT, USA, 16–18 March 2009; IEEE: Piscataway, NJ, USA, 2009; pp. 401–410. [Google Scholar] [CrossRef]
Alattar, A.M.; Lin, E.T.; Celik, M.U. Digital watermarking of low bit-rate advanced simple profile MPEG-4 compressed video. IEEE Trans. Circuits Syst. Video Technol. 2003, 13, 787–800. [Google Scholar] [CrossRef]
Vukadinovic, V.; Karlsson, G. Trade-offs in bit-rate allocation for wireless video streaming. IEEE Trans. Multimed. 2009, 11, 1105–1113. [Google Scholar] [CrossRef]
Reinsberg, S.A.; Doran, S.J.; Charles-Edwards, E.M.; Leach, M.O. A complete distortion correction for MR images: II. Rectification of static-field inhomogeneities by similarity-based profile mapping. Phys. Med. Biol. 2005, 50, 2651. [Google Scholar] [CrossRef]

Figure 1. Example images from the BSDS0500 image dataset.

Figure 2. (a,b) are images in the image set INFRAIMGS1. (c) are images in the image set INFRAIMGS2. (d) are images in the image set INFRAIMGS3. (e) are images in the image set INFRAIMGS4. (f) are images in the image set CTIMGS.

Figure 3. Example satellite images of Yellowstone and Padma. In the first row, from left to right, they are Yellowstone 1993, 1997, 2002, 2009, and 2017. In the second row, from left to right, they are Padma 1992, 1996, 2004, 2014, and 2016.

Figure 4. The four-level segmentation results of the typical images from eight datasets by using three different algorithms.

Figure 5. Sorted fitness values for INFRAIMGS2 from Figure 4, based on 100 independent runs by different algorithms with different number of thresholds.

Table 1. Part of PSNR results for images in BSDS0500 with different four-level thresholding algorithms.

	Shannon	Tsallis q = 0.8	Proposed
BSDS00065	27.7979	27.8888	28.0085
BSDS00109	28.0865	28.0865	27.4921
BSDS00116	26.9156	26.9787	26.9286
BSDS00203	29.1844	29.0696	29.1492
BSDS00474	29.5335	29.3923	23.0752

Bold font refer to the best results.

Table 2. The most suitable rates of three algorithms suggested by PSNR for six datasets when the number of thresholds is 4.

	Shannon	Tsallis q = 0.8	Proposed
BSDS0500	32%	24.6%	55%
INFRAIMGS1	7.9%	13.1%	84.3%
INFRAIMGS2	0%	0%	100%
INFRAIMGS3	7.9%	13.8%	87.3%
INFRAIMGS4	14.1%	16.3%	73.7%
CTIMGS	14.1%	15.6%	75.8%