1. Introduction
Matter, energy, and information are the three fundamental elements constituting the objective world. Information technology plays a key role in humanity’s perception of the objective world, and infrared imaging technology is one of its vital components. Infrared imaging technology encompasses types such as line-array detector scanning imaging and area-array detector staring imaging [
1]. The area-array detector captures the entire field of view at once and is commonly used in applications such as night vision devices, video surveillance, and others. In contrast, the line-array detector captures panoramic images by rotating 360 degrees, making it ideal for large-field applications like airborne small-target detection and border security surveillance. As infrared imaging technology advances toward higher resolution, faster frame rates, and greater pixel bit depth, high-rate data transmission and storage face corresponding challenges. For example, a 14-bit line-scan image with 3072 pixels per column and approximately 60,000 columns per panoramic image has an imaging cycle of about 25 microseconds per column, leading to an imaging rate of 240 MB/s and a frame size of 350 MB. Lossless data compression, which can reduce data redundancy, is essential for improving data transmission and storage efficiency.
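As a quick check of the quoted figures, the following sketch reproduces the arithmetic under our own assumption that each 14-bit sample is padded into a 16-bit (2-byte) word; the rounding to "240 MB/s" and "350 MB" depends on the MB convention used.

```python
# Sketch of the data-rate arithmetic, assuming 14-bit samples stored in 16-bit words.
PIXELS_PER_COLUMN = 3072
COLUMNS_PER_FRAME = 60_000
BYTES_PER_PIXEL = 2            # 14-bit sample padded to 16 bits
COLUMN_PERIOD_S = 25e-6        # one column imaged roughly every 25 microseconds

rate = PIXELS_PER_COLUMN * BYTES_PER_PIXEL / COLUMN_PERIOD_S   # bytes per second
frame = PIXELS_PER_COLUMN * COLUMNS_PER_FRAME * BYTES_PER_PIXEL

print(f"imaging rate ~ {rate / 2**20:.0f} MiB/s")   # ~234 MiB/s (~246 MB/s decimal)
print(f"frame size  ~ {frame / 2**20:.0f} MiB")     # ~352 MiB (~369 MB decimal)
```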
Image compression methods exploit the natural compressibility of images by utilizing correlations between adjacent pixels to reduce redundancy and concentrate information; entropy coding is then applied to remove coding redundancy and achieve further compression. Image transformation and prediction are widely used techniques for reducing image redundancy. To complement these approaches, dictionary-based compression methods, such as Lempel–Ziv–Welch (LZW), Lempel–Ziv 77 (LZ77), and Lempel–Ziv 78 (LZ78), are also employed [
2,
3]. These methods reduce redundancy by identifying and encoding repeated patterns or sequences in the image data, which can be especially effective for certain types of image content. Transform-based image compression converts images into a domain with less redundancy, where information is concentrated and easier to encode. The image is usually transformed into the frequency or spatial domain and then encoded using the properties of the transform coefficients, such as image compression based on discrete wavelet transform (DWT) [
4,
5,
6], discrete cosine transform (DCT) [
7], and integer discrete Tchebichef transform [
8]. Prediction-based image compression exploits the correlation between pixels. The residual image, obtained by subtracting the predicted image from the original image, contains less redundant information. The residual image typically has a narrow range of pixel values, making it more effectively compressed through entropy coding techniques such as Huffman coding [
9] or arithmetic coding [
10]. Typical prediction-based compression includes DPCM [
11] and LOCO-I [
12]. Predictive techniques are typically integrated with transformations, such as utilizing DPCM for DC coefficients in the DCT within the JPEG, or applying DPCM to wavelet coefficients in JPEG 2000. This amalgamation allows for further compression by exploiting the inter-coefficient correlation. The intra prediction coding in video coding standards such as H.264/AVC and High Efficiency Video Coding (HEVC) also utilizes DPCM [
13,
14]. Dictionary-based compression methods dynamically build or reference predefined dictionaries of patterns, replacing repeated sequences with shorter symbols to achieve efficient compression without relying on transformation-specific knowledge. However, the data encoded by the dictionary may still contain redundancy, such as certain symbols or indices appearing more frequently. To address this, entropy coding is applied to further compress the data by assigning shorter codes to more frequently occurring symbols, such as in the Lempel–Ziv–Markov chain Algorithm (LZMA) and Deflate [
15,
16]. Dictionary-based lossless image compression has inherent limitations: these methods rely on repeated patterns within the data and therefore cannot fully exploit, or eliminate, the spatial redundancy in images. Furthermore, dictionary encoding lacks sufficient capability to handle fine detail. These limitations become particularly evident in 14-bit line-scan panoramic images, which feature a large number of source symbols, a wide symbol range, abundant detail, complex structures, and rapidly changing scenes.
Traditional image compression methods, such as JPEG, JPEG-LS, and JPEG 2000 [
17,
18,
19,
20,
21], are designed to minimize perceived quality loss by the human visual system (HVS) while reducing data transmission rates to improve the efficiency of image transmission and storage. JPEG (Joint Photographic Experts Group) is a widely used lossy image compression standard that reduces file sizes by discarding information that is less noticeable to the human eye, often resulting in some loss of image quality. JPEG-LS (JPEG Lossless and Near-Lossless Compression), on the other hand, is a standard for lossless or near-lossless compression, providing higher compression ratios while maintaining the original image quality. In recent years, more advanced image compression schemes have been continuously proposed and developed. For instance, ref. [
22] proposed a compressive sensing-based image compression system. Nevertheless, the advantages of such schemes over JPEG have not been significant enough to justify the creation of new standards.
Image compression includes both lossy and lossless methods. JPEG is a lossy image compression method designed for lower bit-depth images, specifically for 8-bit images. Ref. [
22] is essentially a lossy compression method. However, lossy compression may not be suitable for many imaging applications that require high precision, such as hyperspectral imaging and infrared weak target detection. Since lossless compression methods can fully recover the original data, they are more favored in these applications. JPEG 2000 offers lossless compression capabilities, achieving efficient compression at the cost of increased computational and memory resource consumption. Its complexity arises from the implementation of the 5/3 lifting wavelet transform, bit-plane coding, and MQ coding, which refers to a context-based arithmetic coding technique used to efficiently encode the quantized wavelet coefficients in the image, improving compression performance by exploiting the statistical dependencies between the coefficients. JPEG-LS achieves a low-complexity lossless compression that is easy to implement in hardware, using simple prediction, context modeling, and Golomb coding. This approach sacrifices compression efficiency in favor of speed improvement. The performance of JPEG-LS improves with simpler image scenes, whereas in more complex scenes, the prediction and context updating become more intricate, leading to a decrease in compression speed. JPEG-LS is only applicable to 8-bit and 12-bit images and is not suitable for images with higher bit depths. The complexity of image redundancy removal and entropy coding, along with limitations in pixel bit depth, restricts the application of JPEG-based algorithms in line-scan imaging with high data rates and high pixel bit depths.
This study focuses on 14-bit line-scan infrared panoramic images. Unlike traditional area-array images, in these images each row is generated by the same photosensitive element, leading to stronger inter-column correlation. Conventional JPEG-series compression methods do not take the characteristics of line-scan images into account. Based on the characteristics of 14-bit line-scan infrared panoramic images, this paper analyzes the feasibility of removing spatial redundancy through inter-column differencing. Inter-column differencing DPCM prediction is used in place of the complex wavelet transform in JPEG 2000 to remove spatial redundancy from the image. This paper also designs an improved Huffman coding scheme to replace the complex entropy coding in JPEG 2000; the improved Huffman coding simplifies the compression process by using a code table. Additionally, a method for generating the code table is proposed, which avoids the pixel statistics step of entropy coding. Based on the proposed methods, a low-complexity, code-table-based lossless compression algorithm is ultimately implemented using a simple lookup. The structure of this paper is as follows:
Section 2 introduces related work on Huffman coding,
Section 3 analyzes methods for redundancy removal in line-scan infrared panoramic images,
Section 4 presents the code-table-based lossless compression method proposed in this paper,
Section 5 provides the experimental results, and
Section 6 concludes the paper.
2. Related Works
The speed and efficiency of image compression are common objectives in both academia and industry. There are two main approaches to improving the encoder speed: hardware acceleration and designing low-complexity algorithms. The speed of the encoder is dependent on the processor’s performance, and the slowdown in processor performance improvements has driven the development of parallel architecture processors. As a result, image encoding algorithms have transitioned from single-threaded to multi-threaded algorithms [
23,
24]. However, on embedded platforms with limited hardware resources, it is crucial to design lossless compression algorithms that support higher bit depths at low computational complexity.
The core of the method proposed in this article is Huffman coding, which we study and improve. Lossless data compression is grounded in information theory, with its theoretical limit being the entropy [
25]. Huffman coding needs to count the probability of occurrence of source symbols, assign shorter codes to the symbols with high probability and longer codes to those with low probability, thereby achieving entropy coding, which approximates the theoretical entropy value. Huffman coding requires two passes over the source data to build the frequency table and generate the code table. Vitter [
26] proposed a dynamic Huffman algorithm to scan the data only once. However, with the constant modification of the Huffman tree as new symbols appear, the dynamic Huffman algorithm leads to a rapid increase in computational effort. This algorithm is not suitable for compressing large datasets with multiple source symbols. Schwartz’s [
27] canonical Huffman coding requires minimal data storage to reconstruct the Huffman tree. Unfortunately, both Huffman and canonical Huffman coding need to count the probability of symbol occurrence before compression, which seriously affects the compression speed. Reinhardt [
28] improved compression speed by pre-allocating a Huffman code table. Nevertheless, this method requires offline data analysis to define the code table and is limited to 256 symbols, restricting its range of applications. Yunge’s [
29] dynamic code table algorithm enhances adaptability by increasing the number of code tables. However, it requires evaluating the variance of symbol changes to reselect the code table each time, leading to high complexity and reduced compression speed. In general, code-table methods require longer codes when compressing large datasets with many source symbols; although the symbols corresponding to these longer codes typically have low occurrence probabilities, constructing such long codes adversely affects both compression efficiency and processing speed. Reinhardt [
30] proposed a truncated Huffman tree algorithm that preserves only the codes of high-frequency symbols, while low-frequency symbols have no designated code. To distinguish coded from uncoded bit streams, an extra flag bit (0 or 1) is required, which lengthens every encoded symbol by one bit and significantly degrades compression performance. Xu’s [
31] modified adaptive Huffman coding algorithm adds unknown symbol nodes to achieve uniform coding without requiring additional 1-bit identifiers, but it suffers from high complexity.
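For concreteness, the following minimal Python sketch (ours, not taken from the cited works) shows the two standard steps behind canonical Huffman coding: derive code lengths from symbol frequencies with an ordinary Huffman construction, then assign canonical codes so that only the code lengths need to be stored to rebuild the table.

```python
import heapq
from collections import Counter

def huffman_code_lengths(freqs):
    """Code length for each symbol, computed from a frequency table via a Huffman tree."""
    heap = [(f, i, {s: 0}) for i, (s, f) in enumerate(freqs.items())]
    if len(heap) == 1:                      # degenerate single-symbol source
        return {next(iter(freqs)): 1}
    heapq.heapify(heap)
    tie = len(heap)
    while len(heap) > 1:
        f1, _, d1 = heapq.heappop(heap)
        f2, _, d2 = heapq.heappop(heap)
        merged = {s: depth + 1 for s, depth in {**d1, **d2}.items()}
        heapq.heappush(heap, (f1 + f2, tie, merged))
        tie += 1
    return heap[0][2]

def canonical_codes(lengths):
    """Canonical assignment: symbols sorted by (length, symbol) receive consecutive codes."""
    table, code, prev_len = {}, 0, 0
    for sym, length in sorted(lengths.items(), key=lambda kv: (kv[1], kv[0])):
        code <<= length - prev_len
        table[sym] = format(code, f"0{length}b")
        code += 1
        prev_len = length
    return table

lengths = huffman_code_lengths(Counter(b"abracadabra"))
print(canonical_codes(lengths))     # e.g. {97: '0', 98: '100', ...}
```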
To reduce the algorithmic complexity, this paper proposes a novel method for constructing a Huffman code table that bypasses the process of calculating pixel occurrence probabilities in entropy coding. Additionally, an improved Huffman coding scheme is introduced to handle the longer codes required for 14-bit images by truncating longer codes with low complexity and minimal compression ratio loss. This ultimately achieves a low-complexity lossless compression method based on a code table for infrared images.
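One common way to realize such truncation, shown purely as an illustration (the authors' exact scheme is presented in Section 4), is to keep table entries only for the frequent residual values and emit an escape code followed by the raw fixed-length value for everything else:

```python
RAW_BITS = 15    # a signed 14-bit difference needs at most 15 raw bits

def encode_with_escape(symbols, table, escape_code):
    """Table lookup for frequent symbols; escape code + fixed-length raw bits otherwise."""
    bits = []
    for s in symbols:
        if s in table:
            bits.append(table[s])
        else:
            bits.append(escape_code + format(s & (2**RAW_BITS - 1), f"0{RAW_BITS}b"))
    return "".join(bits)
```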
3. Redundancy Analysis of Line-Scan Panoramic Infrared Images
From information theory, information can be measured by self-information. Suppose the set of source symbols is $S = \{s_1, s_2, \ldots, s_n\}$ and the probability of occurrence of each symbol is $p(s_i)$. The self-information $I(s_i)$ of $s_i$ is defined by the equation
$$I(s_i) = -\log_2 p(s_i).$$
The smaller the probability of symbol $s_i$, the more information it conveys. The information entropy $H(S)$ is the mathematical expectation of the self-information $I(s_i)$; it indicates the minimum average number of bits needed to represent each source symbol in a binary computer, and is defined as
$$H(S) = -\sum_{i=1}^{n} p(s_i)\,\log_2 p(s_i).$$
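The first-order entropy figures used throughout this paper can be estimated directly from the pixel histogram; a minimal numpy sketch (ours):

```python
import numpy as np

def first_order_entropy(img):
    """Entropy in bits per pixel, estimated from the histogram of pixel values."""
    _, counts = np.unique(img, return_counts=True)
    p = counts / counts.sum()
    return float(-np.sum(p * np.log2(p)))
```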
A two-dimensional (2D) image is a kind of information that humans can intuitively perceive. The continuous tonal distribution in nature leads to significant spatial redundancy in visible images, while the continuity of scene infrared radiation results in similar redundancy in infrared images. Spatial redundancy manifests as large numbers of neighboring pixels with little or no change, resulting in a high correlation between image pixels. Owing to this spatial redundancy, infrared images exhibit a low actual information entropy, indicating high compression potential. However, the exact source entropy is difficult to obtain and can only be approximated. In digital imaging, image differencing represents the changes between adjacent pixels; the correlation between difference pixels is weak, effectively reducing spatial redundancy. Digital image differencing can be performed between columns or between rows. Inter-column differencing is defined as
$$d(i,j) = x(i,j) - x(i,j-1),$$
where $x(i,j)$ is the current column pixel, $x(i,j-1)$ is the previous column pixel, and $d(i,j)$ is the differential pixel. The inter-row differencing can be derived analogously.
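A minimal numpy sketch of inter-column differencing and its exact inverse (illustrative only; the definition above is all the algorithm requires):

```python
import numpy as np

def column_diff(img):
    """Keep the first column; replace each remaining column by its difference from the previous one."""
    x = img.astype(np.int32)
    d = x.copy()
    d[:, 1:] = x[:, 1:] - x[:, :-1]     # d(i, j) = x(i, j) - x(i, j-1)
    return d

def column_restore(d):
    """Exact inverse: a cumulative sum along each row recovers the original image."""
    return np.cumsum(d, axis=1, dtype=np.int64)

img = np.random.randint(0, 2**14, size=(3072, 16), dtype=np.uint16)
assert np.array_equal(column_restore(column_diff(img)), img)   # lossless round trip
```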
Infrared line-scan imaging, unlike area-array imaging, captures panoramic images through a 360-degree rotating scan. In line-scan images, each row is acquired by the same photosensitive element, leading to stronger inter-column correlations than in traditional images. We select two 640 × 512 14-bit infrared images, image A and image B, each a portion of a line-scan panoramic infrared image, and analyze the correlation of the original and differential images. We calculate the inter-column correlation coefficients and the pixel occurrence probabilities. The experimental results are shown in Figure 1 and Table 1.
The inter-column correlation coefficient between columns $j$ and $j+1$ is defined as
$$r_j = \frac{\sum_{i=1}^{M}\bigl(x(i,j)-\bar{x}_j\bigr)\bigl(x(i,j+1)-\bar{x}_{j+1}\bigr)}{\sqrt{\sum_{i=1}^{M}\bigl(x(i,j)-\bar{x}_j\bigr)^2}\,\sqrt{\sum_{i=1}^{M}\bigl(x(i,j+1)-\bar{x}_{j+1}\bigr)^2}},$$
where $\bar{x}_j$ is the mean value of the $j$th column and $M$ is the number of rows. The inter-row correlation coefficient can be derived analogously.
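A corresponding numpy sketch (assuming the Pearson form given above) for the mean adjacent-column correlation:

```python
import numpy as np

def mean_adjacent_column_correlation(img):
    """Average Pearson correlation coefficient between each pair of adjacent columns."""
    x = img.astype(np.float64)
    r = [np.corrcoef(x[:, j], x[:, j + 1])[0, 1] for j in range(x.shape[1] - 1)]
    return float(np.mean(r))
```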
The experimental results indicate that the correlation coefficients of the original images all exceed 0.99, and that inter-column differencing significantly reduces the correlation of the image. The entropy of the original image A is 10.3854 bit/symbol, while the entropy of the inter-column differential image A, at 4.9182 bit/symbol, is lower than that of the inter-row differential image A, which is 6.3071 bit/symbol. Further analysis of images from 51 diverse scenarios consistently shows that the inter-column differential images have the lowest entropy. These findings are summarized in
Table 2. The 51 images are 14-bit infrared images captured by a line-scan infrared detector in different scenarios. The image sizes vary and include 640 × 512, 1000 × 2000, 2000 × 4000, 1000 × 4000, and 2000 × 8000.
Additionally, the infrared panoramic images processed in this study consist of 3072 pixels per column, with an image width of approximately 60,000 columns. When employing inter-column or inter-row differencing, the first column or first row must be preserved so that the subsequent columns or rows can be restored. Because the first column is generated at the very start of the detector’s operation, inter-column differencing only requires 3072 original pixels to be stored at once, whereas inter-row differencing would require buffering approximately 60,000 first-row pixels that arrive at different times. To maximize the compression speed, we therefore use only inter-column differencing to eliminate image redundancy.
After the image redundancy is removed, the subsequent entropy coding requires the probability distribution of the pixels. For a 14-bit infrared image, the dynamic range of pixel values is [0, 16383], but the dynamic range of the differential image is doubled to [−16383, 16383]. The experimental results also show that differential images contain many zero-valued pixels and approximately follow a Laplace distribution. Therefore, through extensive experimentation, a general probability distribution model can be found that predicts other differential images, and such a model can generate a general Huffman code table applicable to generic images. The higher the prediction accuracy of the probability distribution model, the better the compression.
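As an illustration of this idea (a sketch under our own assumptions about the model parameters, not the paper's fitted model), idealized code lengths can be derived from a Laplacian-like two-sided model centered at zero; a real code table would feed these probabilities into a Huffman or canonical Huffman construction.

```python
import numpy as np

def model_code_lengths(scale=8.0, max_abs=255):
    """Idealized code lengths ceil(-log2 p) from p(d) ∝ exp(-|d|/scale) over the
    central differential values [-max_abs, max_abs]; values outside this window
    would be handled separately (e.g., by an escape code)."""
    d = np.arange(-max_abs, max_abs + 1)
    p = np.exp(-np.abs(d) / scale)
    p /= p.sum()
    return dict(zip(d.tolist(), np.ceil(-np.log2(p)).astype(int).tolist()))

lengths = model_code_lengths()
print(lengths[0], lengths[10], lengths[-255])   # short codes near zero, long codes in the tails
```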
5. Results
The mean square error (MSE) between the original and reconstructed images is used to verify that the compression is lossless. MSE is defined as
$$\mathrm{MSE} = \frac{1}{MN}\sum_{i=1}^{M}\sum_{j=1}^{N}\bigl[\hat{x}(i,j) - x(i,j)\bigr]^2,$$
where $\hat{x}(i,j)$ is the pixel at row $i$ and column $j$ of the reconstructed image, and $x(i,j)$ is the pixel at row $i$ and column $j$ of the original image. MSE describes the reconstruction error between the reconstructed image and the original image. The MSEs of the 53 scenes calculated in our experiments are all 0, indicating that the proposed algorithm achieves lossless compression.
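The losslessness check itself is a one-liner in numpy (sketch):

```python
import numpy as np

def mse(original, reconstructed):
    diff = original.astype(np.float64) - reconstructed.astype(np.float64)
    return float(np.mean(diff ** 2))   # must be exactly 0.0 for a lossless round trip
```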
Additionally, the Structural Similarity Index (SSIM) is used to measure the similarity between two images, defined as
$$\mathrm{SSIM}(x,\hat{x}) = \frac{\bigl(2\mu_x \mu_{\hat{x}} + C_1\bigr)\bigl(2\sigma_{x\hat{x}} + C_2\bigr)}{\bigl(\mu_x^2 + \mu_{\hat{x}}^2 + C_1\bigr)\bigl(\sigma_x^2 + \sigma_{\hat{x}}^2 + C_2\bigr)},$$
where $\mu_x$ and $\mu_{\hat{x}}$ are the mean values of the original and reconstructed images, respectively; $\sigma_x^2$ and $\sigma_{\hat{x}}^2$ denote their variances; $\sigma_{x\hat{x}}$ represents the covariance between the two images; and $C_1$ and $C_2$ are small constants that stabilize the computation and prevent the denominator from approaching zero. All 53 images have an SSIM of 1, further confirming that the compression is lossless and the reconstructed images are structurally identical to the originals, with no loss in image quality.
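For reference, the single-window form of the SSIM defined above can be computed as follows (a sketch; published SSIM implementations usually apply the formula over sliding windows, and the constants $C_1=(0.01L)^2$, $C_2=(0.03L)^2$ with $L$ the dynamic range are the conventional choice):

```python
import numpy as np

def global_ssim(x, y, data_range=2**14 - 1):
    """SSIM computed once over the whole image, per the formula above."""
    x = x.astype(np.float64)
    y = y.astype(np.float64)
    c1 = (0.01 * data_range) ** 2
    c2 = (0.03 * data_range) ** 2
    mx, my = x.mean(), y.mean()
    vx, vy = x.var(), y.var()
    cxy = ((x - mx) * (y - my)).mean()
    return float(((2 * mx * my + c1) * (2 * cxy + c2))
                 / ((mx**2 + my**2 + c1) * (vx + vy + c2)))
# Identical original and reconstructed images give exactly 1.0.
```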
The code table is calculated from the 53 scenes, so additional scenes are needed to verify the generality of the algorithm. We capture another 37 scenes for validation, some of which are displayed in Figure 7.
The compression ratio is defined as
$$\mathrm{CR} = \frac{\text{size of the original data}}{\text{size of the compressed data}}.$$
We use a 16-bit word to store each 14-bit pixel, so the theoretical limit of the compression ratio is
$$\mathrm{CR}_{\max} = \frac{16}{H_d},$$
where $H_d$ is the entropy (in bits per pixel) of the inter-column differential image.
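In code, using the entropy of the inter-column differential image A from Section 3 as an example (the theoretical values plotted in Figure 8a are computed per image):

```python
def compression_ratio(original_bytes, compressed_bytes):
    return original_bytes / compressed_bytes

def theoretical_limit(diff_entropy_bpp, stored_bits_per_pixel=16):
    """Upper bound implied by the differential-image entropy, with 14-bit pixels in 16-bit words."""
    return stored_bits_per_pixel / diff_entropy_bpp

print(round(theoretical_limit(4.9182), 2))   # ~3.25 for differential image A
```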
5.1. Proposed Method Compared with JPEG Series Algorithms
On an experimental platform with a 12th Gen Intel(R) Core(TM) i7-12700H CPU (20 logical processors, 2.30 GHz) and 16 GB of RAM, we test JPEG 2000, JPEG XL, JPEG XT, and the method proposed in this paper; the results are shown in
Figure 8 and
Table 8.
Figure 8a shows the compression ratio test results of the proposed method and JPEG series methods on 37 images, along with the theoretical compression ratio calculated based on the entropy of the difference image.
Figure 8b presents the speed test results.
Table 8 records the average values of the compression ratio and speed from
Figure 8. It also includes the percentage change in the compression ratio of the proposed algorithm compared to both the JPEG series methods and the theoretical value.
The method proposed in this paper achieves an average compression ratio of 3.3 for infrared line-scan images, which is 5% below the theoretical value. Compared to JPEG 2000, the proposed method incurs a 10% loss in compression efficiency but provides a 20-fold speed improvement, reaching an average of 210 MB/s. It outperforms JPEG XT in both compression ratio and speed, and it is approximately 8 times faster than JPEG XL.
5.2. Proposed Method Compared with TIFF
TIFF (Tagged Image File Format) is an image storage format; the term “Tagged” refers to its tag-based file structure. TIFF allows the flexible use of compression methods while maintaining image integrity and clarity. Supported lossless compression methods include LZW, Deflate, LZMA, and PackBits [
32,
33]. The dictionary-based table lookup encoding in TIFF, such as LZW, completely avoids frequency statistics, providing a very high compression speed. However, dictionary-based lossless image compression has limitations, as it cannot effectively remove spatial redundancy in images. The Deflate method can achieve higher compression efficiency by adjusting the dictionary size. However, using an excessively large dictionary increases memory and time consumption, which may degrade the compression speed.
In the experiment, we test the performance of LZW, Deflate, LZMA, and Packbits. The results are shown in
Figure 9 and
Table 9.
The Deflate method combines LZ77 and Huffman coding. Under the condition of maximum compression efficiency, its speed is 8 times faster than JPEG 2000. However, it experiences a significant loss in compression ratio, approximately 46%. In contrast, the proposed method outperforms Deflate in both speed and efficiency.
In TIFF, the compression efficiency of the LZW method is not adjustable. Although it achieves a significant speed improvement, approximately 31 times faster than JPEG 2000, the compression ratio loss is substantial, reaching 63%. The proposed method, compared to LZW, is better at achieving high-speed image compression while maintaining high compression efficiency.
The LZMA method achieves a high compression ratio, but at the expense of compression speed: at its highest compression setting its speed is comparable to that of JPEG 2000, and it still incurs a 17% loss in compression ratio. The proposed method outperforms LZMA in both speed and efficiency.
The PackBits method is a simple variant of Run-Length Encoding (RLE). The RLE pattern is as follows: repeated symbol + count of repetitions. While the PackBits method provides high compression speed, in images with few repeating patterns, the overhead of recording the repetition count causes the compressed files to be twice the size of the original, resulting in no effective compression.
It is worth noting that the method in this paper was tested only in single-threaded, single-core mode, without any SIMD (Single Instruction, Multiple Data) optimizations. Based on the above experiments, we conclude that, compared to dictionary-based methods, the proposed method ensures high compression efficiency while achieving fast compression for line-scan panoramic infrared images.
6. Conclusions
This paper presents a new low-complexity lossless compression algorithm for 14-bit line-scan infrared images. It proposes a method for constructing a Huffman code table that replaces the pixel probability statistics step of entropy coding, thereby improving the compression speed; for images with higher bit depths, the same approach can be used to fit a new probability model and compute a corresponding code table. Additionally, an improved Huffman coding scheme is designed to handle the longer codes of 14-bit images, truncating long codes with low complexity and minimal compression ratio loss, ultimately realizing a low-complexity lossless image compression algorithm. The proposed method achieves an average compression ratio of 3.3 for infrared line-scan images, which is 5% below the theoretical value. Compared to JPEG 2000, it incurs a 10% loss in compression efficiency but provides a 20-fold speed improvement, reaching an average of 210 MB/s. It outperforms JPEG XT in both compression ratio and speed. Compared to dictionary-based lossless compression methods, the proposed method achieves high-speed compression while maintaining high compression efficiency.
The method proposed in this paper can be extended to other general image domains that require high-speed compression and can tolerate some loss in compression ratio. Additionally, if the code table is concealed, this method could also facilitate encrypted image transmission, making it applicable to secure communication systems. However, there are certain limitations to the proposed method. For instance, while the method significantly speeds up compression, the 10% loss in compression efficiency compared to JPEG 2000 suggests that further optimizations could be made to balance both the speed and compression ratio. Additionally, the proposed algorithm might face challenges when applied to more complex image types or images with larger bit depths, as the probability model used may need to be adapted to these cases.
Future research can explore several directions. First, improving the Huffman coding scheme to achieve better compression efficiency without compromising speed is a potential area of investigation. Second, adapting the proposed method to work with images of higher bit depths or more complex data types may enhance its applicability in other fields, such as medical imaging or remote sensing. This could be achieved by incorporating more widely used image prediction methods to reduce redundancy, which would allow the approach to better handle more complex image types. Finally, investigating hybrid methods that combine the strengths of both dictionary-based and statistical compression techniques could lead to even more efficient algorithms for infrared image compression.