Next Article in Journal
Results for a Control Problem for a SIS Epidemic Reaction–Diffusion Model
Previous Article in Journal
Novel Analysis between Two-Unit Hot and Cold Standby Redundant Systems with Varied Demand
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Reversible Data Hiding in Encrypted Images Based on Hybrid Prediction and Huffman Coding

1
School of Computer Science and Engineering, Xi’an University of Technology, Xi’an 710048, China
2
Shaanxi Key Laboratory for Network Computing and Security Technology, Xi’an 710048, China
3
Shaanxi Province Key Lab of Thin Film Technology and Optical Test, Xi’an Technological University, Xi’an 710048, China
*
Author to whom correspondence should be addressed.
Symmetry 2023, 15(6), 1222; https://doi.org/10.3390/sym15061222
Submission received: 9 May 2023 / Revised: 2 June 2023 / Accepted: 5 June 2023 / Published: 8 June 2023

Abstract

:
As an interesting technique that allows data extraction and image recovery without any loss, reversible data hiding in encrypted images is an area of great concern in the field of information security. In this paper, a new reversible data hiding method with high embedding capacity is proposed based on hybrid prediction and Huffman coding. The combination of two embedding mechanisms is innovatively designed to improve the embedding capacity according to different parts of the original image, i.e., the most significant bit-plane and the remaining seven bit-planes. In the first part of this method, the prediction value of each pixel is obtained by calculating the average value of its two neighboring pixels, and all of the most significant bits can be vacated to accommodate additional data. In the second part, the prediction value of each pixel is calculated using the median edge detector predictor, on which the tag map is built. Then, Huffman coding is used to compress the tag map so that a large amount of vacant space is obtained. Finally, the secret data can be embedded into the vacated space by directly using bit substitution. Compared with some recently reported methods, experimental results and analysis have demonstrated that an original image with high visual symmetry/quality can be recovered. Also, larger embedding capacity can be achieved, such as 3.3894 bpp and 3.2824 bpp, for BOSSBase and BOWS2 databases, respectively.

1. Introduction

In the past few years, as researchers have paid more attention to privacy protection [1], the reversible data hiding (RDH) [2,3,4,5] technique has greatly developed, in which secret information can be embedded into the multimedia medium without any obvious distortion. Meanwhile, the extracted embedded data is lossless, and even the restored host medium is exactly the same as the original. Because of these characteristics, RDH has a significant influence in various fields, such as medical applications [6] and multimedia archive management [7]. Many investigations have been extensively devoted to RDH methods. The embedding mechanism can be classified into one of three categories: histogram shifting [8,9,10,11], difference expansion [12,13,14], and lossless compression [15,16]. Specifically, the embedding methods based on histogram shifting exploit the peak and zero point of the image histogram to embed the watermark bits, while the embedding, based on difference expansion, in which secret bits are inserted into the difference between adjacent pixels. In addition, the researchers also have carried out reversible embedding of dual cover images [17,18]. Since the RDH methods mentioned above all involve plaintext images, which are easily tampered or stolen by an unauthorized user during the transmission process, reversible data hiding in an encrypted image (RDH-EI) [19,20,21] is presented to further improve the security of the image.
Generally speaking, RDH-EI methods can be divided into two categories according to whether there is embeddable space before the image encryption operation: vacating room after encryption (VRAE) [22,23,24,25,26,27,28,29,30] and reserving room before encryption (RRBE) [31,32,33,34,35,36]. The first RDH-EI method was proposed by Puech et al. [22], where the local standard deviation of the marked encrypted image is analyzed and the embedded information is extracted during the process of image decryption. In the method of Zhang [23], the encrypted image is divided several nonoverlapping blocks, where the secret data is embedded by modifying some encrypted pixels in the block. The disadvantage of this method is that if the divided blocks are quite small, there will be errors in the extracted data. In another method proposed by Zhang [24], the spare space for additional data accommodation is provided by compressing the least significant bits of the encrypted image, thereby creating reversible data hiding. Hong et al. [25] utilized the side match approach to reduce the error rate of extracted data. Moreover, a method based on absolute mean difference of multiple neighboring pixels was created by Liao et al. [26], in which the complexity of an image block is evaluated by using multiple adjacent pixels. In the Qin et al.’s method [27], the encrypted image is divided into non-overlapping blocks that are processed by a run-length coding algorithm, and then Huffman coding is applied to encode each block as an embeddable block that can contain secret data. Yu et al. [28] implemented secret data embedding with the help of histogram shifting, based on two-layer adjacent encrypted pixel errors. Fu et al. [29] adopted block rearrangement and stream cipher encryption to encrypt the original image, and then adaptively compress the most significant bits of the embeddable block, to provide embedding space for additional data. Wu et al. [30] employed a parameter binary tree to mark encrypted pixels, which makes full use of the spatial correlation of the entire image and reserves room for the embedding of additional data.
Since image encryption can destroy the spatial correlation of the pixels and cause the entropy of the image to reach its maximum, it is troublesome to vacate spare room for additional data accommodation from the encrypted image. Therefore, in order to increase the image embedding capacity, Ma et al. [31] were the first to consider a way of reserving room before encryption. On this basis, Cao et al. [32] put forward a patch-based RDHEI method to take advantage of the correlation of adjacent pixels, in which the patch is regarded as whole and represented by a limited number of coefficients, thereby producing high room capacity. Additionally, Yi and Zhou [33] exploited a binary-block embedding (BBE) method to embed the binary values of less significant bit-planes of the host image into more significant bit-planes, vacating space for the embedding of additional data. In Wu et al.’s method [34], before encryption, a way of predicting the most significant bit is applied and the binary bits in the least significant bit-plane are reversibly saved into other bit-planes. Arun et al. [35] used parametric binary tree to mark pixels, with which a pixel is identified as regular or irregular pixel. Then, regular pixels are used to accommodate secret bits. Mittal et al. [36] adopted two schemes to embed secret bits based on the checkerboard pattern divided from the cover image, and obtained more rooms for embedding.
In many existing RDH-EI methods, the embedding of secret data is achieved by substituting the least significant bit of the pixel, which limits the embedding capacity of the image and is difficult for the recovery of the image. Based on the above reasons, the researcher’s goal has gradually shifted from the least significant bit (LSB) to the most significant bit (MSB). In 2018, Puteaux and Puech [37] introduced the RDH-EI algorithm based on MSB prediction, in which the most significant bits of the original image are predicted and then the most significant bits of pixels without prediction errors can be replaced by bits of secret data. In another method proposed by Puteaux and Puech [38], the other bit-planes of the original image are predicted and analyzed recursively from MSB to LSB to improve embedding capacity as much as possible. In Yin et al.’s method [39], the median edge detector predictor is utilized to predict the multi-MSB of the image to generate a label map that contains all the pixels’ labels, and is encoded by using Huffman coding to reduce the space occupied by itself into auxiliary information. Then, according to the label map, the multi-MSB can be substituted by the secret bits.
As it is pointed out in [3], the embedding capacity and visual symmetry/quality are complementary, which yields a very poor quality recovered image. Also, as can be seen from the above discussion, only one main mechanism is applied to create spare space for embedding secret data in most methods, which seriously limits the improvement of embedding capacity. To make full use of redundant space of an original image, this paper proposes a hybrid prediction strategy to implement reversible data hiding based on Huffman coding. In the first part of the proposed method, the original image is decomposed into two parts, i.e., the most significant bit-plane and the remaining bit-planes, where two different embedding mechanisms are applied. For each pixel, its predicted value is obtained by calculating the average value of its two neighboring pixels, from which, any errors are identified. After modifying pixels with prediction errors, which are indicated in the error location map, all of the most significant included in the first part can be vacated to record secret data. In the second part of the process, a new image can be considered, comprised of the seven least significant bits of all pixels. Using the correlation between adjacent pixels, vacant bits can be obtained from higher bit-planes of this new image. Each pixel in this image is assigned a tag by comparing its actual value to its predicted value, where its predicted value is calculated using the median edge detector predictor. Finally, a tag map for this image is built comprised of all tag values, and then Huffman coding is used for compression, to obtain a large amount of vacant space. It should be noted that, through compression of the error location map and reference pixels in the first part, the number of embeddable bits can be increased further. Extensive experiments are performed to demonstrate the performance of the proposed method in terms of several characteristics such as security, embedding capacity, and robustness against potential attacks.
The rest of this paper is organized as follows. In Section 2, the proposed method including detailed descriptions of the generation of the tag and error location maps, encryption of the original image, secret data embedding, and data extraction and image recovery. In Section 3, a series of experiments and analyses are presented that demonstrate the performance of the proposed method. Finally, a brief conclusion is given in Section 4.

2. Proposed Method

In this section, a new reversible data hiding method based on hybrid prediction and Huffman coding is described in detail, with which, not only higher embedding capacity, but also error-free secret data extraction and image recovery can be achieved. As shown in Figure 1, there are three main processes included in the corresponding schematic diagram, i.e., encryption of the original image, embedding of the secret data, and extraction of the secret data as well as recovery of the original image. In the encryption process, the most significant bits of the pixels are predicted to reserve redundant space to hide the data. The pixels with prediction errors are marked to generate the error location map, and the original image is modified based on this. Then, the pixels composed of the seven least significant bit-planes of the modified image are predicted to obtain the pixel tag value. A part of the auxiliary data is embedded into the most significant bits of encrypted pixels with the help of the error location map. To save embeddable bits, the error location map is compressed using the run-length coding algorithm. In the embedding process, applying bit substitution, the remaining of the auxiliary data and the secret data are adaptively accommodated into bits of other high bit-planes by referencing the tag map. Finally, secret data extraction and image recovery can be implemented separately, according to different secret keys, such as the image encryption key and the data hiding key, where the retrieved secret data is error-free and the restored image is the same as the original image.

2.1. Auxiliary Data Generation

2.1.1. Error Location Map Generation

Since a part of the calculated auxiliary data is embedded into most significant bits of encrypted pixels, their original values are modified after secret data hiding. To recover the original image without errors in the future, it is important to be able to predict them accurately. For this purpose, a binary error location map is built, which consists of basic and additional parts. Except for pixels in the first row and column, which are used as reference pixels, each pixel is analyzed to identify any prediction errors. If there is an error, the corresponding bit in the error location map is set to 1, otherwise it is set to 0. Arranging the error location map column by column, a sequence can be obtained as the basic part as shown in Figure 2a.
Suppose that pixel values are in the range [ 0 ,   255 ] and the size of the original image I is M × N pixels, an initial binary error location map denoted as L is created, where all bits are set to 0. Scanning all pixels in the original image, the basic part of the error location map is generated as follows:
(1)
For the current pixel I ( i , j ) , where 2 i M and 2 j N , its inverse value I i n v ( i , j ) is calculated as
I i n v ( i , j ) = ( I ( i , j ) + 128 )   mod   256 ,
where I i n v ( i , j ) denotes the inverse value.
(2)
To predict the current pixel more accurately, especially when there is a large difference between the pixel and its neighboring values, the average of its top pixel I ( i 1 , j ) and left pixel I ( i , j 1 ) is considered as its predicted value and calculated as
I p r e d ( i , j ) = I ( i 1 , j ) + I ( i , j 1 ) 2 ,
where I p r e d ( i , j ) denotes the prediction value.
(3)
Two absolute difference values between I ( i , j ) and I p r e d ( i , j ) , and between I i n v ( i , j ) and I p r e d ( i , j ) are calculated and denoted as:
{ Φ ( i , j ) = | I p r e d ( i , j ) I ( i , j ) | Φ i n v ( i , j ) = | I p r e d ( i , j ) I i n v ( i , j ) | ,
where Φ ( i , j ) and Φ i n v ( i , j ) represent two absolute difference values, respectively.
(4)
If the original value is closer to its prediction value than the inverse value, it is assumed that there is no prediction error according to the current pixel, and the bit L ( i , j ) in the error location map remains unchanged. Otherwise, there is an error and the bit L ( i , j ) is set to 1. The details can be described as
L ( i , j ) = { 0   if   Φ ( i , j ) < Φ i n v ( i , j ) 1   if   Φ ( i , j ) Φ i n v ( i , j ) .
(5)
Finally, concatenating all bits in the column in order of priority, i.e., from top to bottom and from left to right, a binary sequence is engendered as the basic part of the error location map.
Generally, most significant bits in pixels with prediction errors cannot be vacated for secret data embedding. For example, in the high-capacity reversible data hiding approach proposed by Puteaux and Puech [37], if at least one prediction error is determined in a block of eight pixels, information will not be embedded in the 24 most significant bits included in this block, nor its previous and following blocks. Therefore, the maximum embedding capacity of this method are less than 1 bit per pixel (bpp). To solve this issue, in the proposed method, a new image is constructed by modifying original values of pixels with prediction errors, and then, all information, including auxiliary data and secret data, are embedded into it. Meanwhile, the additional part of the error location map, as shown in Figure 2b, is designed to identify what kind of modification is performed for the pixels with prediction errors.

2.1.2. Original Image Modification

In order to make all most significant bits of the original image embeddable, it is necessary to modify the problematic pixels before embedding, so that a new image without any prediction errors can be obtained. In this proposed method, the new image denoted as I is formulated by taking account two aspects, i.e., the two absolute difference values calculated using Equation (3) and the distribution of pixel values. For each pixel in the original image, whether it is modified depends on the following provisions:
I ( i , j ) = { I ( i , j ) if   Φ ( i , j ) < Φ i n v ( i , j ) I ( i , j ) + 128   mod 256 if   Φ ( i , j ) > Φ i n v ( i , j )   I ( i , j ) + 1 if   Φ ( i , j ) = Φ i n v ( i , j )   and   0 I ( i , j ) < 128 I ( i , j ) 1 if   Φ ( i , j ) = Φ i n v ( i , j )   and   I ( i , j ) 128 .
From Equation (5), it can be seen that the original pixel value remains unchanged in the new image if there is no prediction error, otherwise it should be modified. When the absolute difference value between I ( i , j ) and I p r e d ( i , j ) is larger than that between I i n v ( i , j ) and I p r e d ( i , j ) , the original pixel value is directly substituted with its inverse value. When two absolute difference values are equal, the original pixel value in the range [ 0 ,   128 ) is added to 1, while those in the range [ 128 ,   255 ] are subtracted by 1. Thus, the problematic pixels with Φ ( i , j ) = Φ i n v ( i , j ) are transformed to pixels without prediction errors. For example, if a pixel I ( i , j ) = 1 , satisfying the condition Φ ( i , j ) = Φ i n v ( i , j ) is problematic; it is known that its inverse value I i n v ( i , j ) is 129, as calculated using Equation (1), and its prediction value I p r e d ( i , j ) should be 65. Adding the original pixel value to 1, according to Equation (5), its inverse value will become 130. At the same time, two absolute difference values Φ ( i , j ) and Φ i n v ( i , j ) will become 63 and 65, respectively, as calculated in Equation (3), which satisfies Φ ( i , j ) < Φ i n v ( i , j ) . This means that the corresponding pixel without prediction error is obtained.
Correspondingly, the additional part of the error location map as shown in Figure 2b is used to identify what kind of modification took place for pixels with prediction errors. Because there are three kinds of pixel value modifications in the new image, as expressed in Equation (5), two bits are required to identify them. As shown in Figure 2b, suppose that there are K pixels with prediction errors in the basic part. Scanning all pixels in the basic part individually, if a pixel is marked with 1 and the condition Φ > Φ i n v is satisfied, two bits “00” are appended in the second part of the error location map. For example, for the i t h pixel in the basic part, which is the first pixel with prediction error, the first two bits in the second part are set to “00”. If a pixel is marked with 1 and its original value is not larger than 128, two bits “01” are appended in the second part, otherwise two bits “10” are added. For example, for the l t h pixel in the basic part, which is the last pixel with a prediction error, the last two bits in the second part of the error location map are set to “10”. Thus, a total of 2 × K bits denoted as L a are used to record the modification of the pixel values.
Obviously, because the basic part of the error location map is employed to identify all pixels of the original image as described in Equation (4), a total of M × N bits are needed. If it is directly embedded as a part of auxiliary data, a larger number of embeddable bits will be consumed. To solve this issue, it should be further compressed. As an important data compression algorithm, run-length coding can achieve lossless compression of the data, in which the same value occurs in many consecutive data elements and a sequence of the same values, i.e., a run, can be replaced by a code to indicate the value and the number of times it occurs. Because the number of pixels with prediction error is very low, there will be many long runs of “0” in the basic part of the error location map. Therefore, there is no doubt that it can be efficiently compressed using the run-length coding. Obviously, the value of the first run of the sequence corresponding to the error location map generated by any image is always 0. The reason for this is that when the average value of adjacent pixels is used to predict the most significant bit of the current pixel, the pixels of the first row and the first column are taken as reference pixels without processing, so the values of elements in the first row and column of the corresponding error location map are 0. As shown in Figure 3, there are four parts of data in the compressed basic part. Among them are l L , b bits to record the length of the compressed basic part, l L , 0 and l L , 1 bits to record the length of the maximum run of “0” and “1”, respectively. The remaining bits, denoted as L b are the binary sequence of the compressed basic part.
Since most significant bits of reference pixels are utilized to embed information in the proposed method, their original values also should be compressed and recorded as a part of the auxiliary data. Concatenating all of the most significant bits of pixels in the first row and column, the obtained sequence is compressed using the run-length coding. Similar to the structure of the compressed basic part, as shown in Figure 3, the compressed result denoted as Γ includes five parts of data, as shown in Figure 4. In this, 1 bit, denoted as f Γ , is used to identify whether the initial run is a run of “0” or “1”, l Γ bits are used to record the length of the compressed result, l Γ , 0 and l Γ , 1 bits to record the length of the maximum run of “0” and “1”, respectively, and the remaining bits denoted as Γ c are the resultant sequence after compression.

2.1.3. Tag Map Generation

As described above, most significant bits of the modified image I can only accommodate a part of the auxiliary data. In order to record the remaining part and the secret data, embeddable bits should be reserved from other high bit-planes of each pixel. To achieve this goal, a new image denoted as I L S B is constituted with seven least significant bit-planes of the modified image in the beginning. Obviously, all pixel values are in the range [ 0 ,   127 ] in this new image. Then, multiple bits in high bit-planes of each pixel in I L S B are predicted adaptively and assigned to the corresponding Huffman code.
Except for the first pixel I L S B ( 1 ,   1 ) in the new image, the prediction value of each pixel in I L S B should be calculated at first. If the current pixel is in the first row, the value of its neighboring left pixel is regarded as its prediction value. Similarly, if the current pixel is in the first column, the value of its neighboring top pixel is regarded as its prediction value. For other pixels, the corresponding prediction values can be evaluated using the median edge detector (MED) predictor. For the current pixel I L S B ( i , j ) in the new image, its prediction value can be calculated as
I ^ L S B ( i , j ) = { min ( b , c ) , a max ( b , c ) max ( b , c ) , a min ( b , c ) b + c a ,   otherwise ,
where a = I L S B ( i 1 , j 1 ) , b = I L S B ( i , j 1 ) , c = I L S B ( i 1 , j ) and I ^ L S B ( i , j ) is the corresponding prediction value. Next, the original value and the prediction value are converted into two 7-bit binary sequences, respectively, using the following formulas as
p k ( i , j ) = I L S B ( i , j ) mod 2 8 k 2 7 k , k = 1 , 2 , , 7 p ^ k ( i , j ) = I ^ L S B ( i , j ) mod 2 8 k 2 7 k , k = 1 , 2 , , 7 ,
where p k ( i , j ) and p ^ k ( i , j ) represent the k t h bit of two obtained binary sequences, respectively. Compare these two sequences from the most significant bit to the least significant bit until a pair of bits are different. Then, a tag denoted as ε is used to mark the current pixel, which indicates the number of bits in high bit-planes for which their original and prediction values are equal. This is calculated as
argmax ε   I L S B ( i , j ) ε M S B = I ^ L S B ( i , j ) ε M S B , ε = 0 , 1 , , 7 subject   to { I L S B ( i , j ) ε M S B = I L S B ( i , j ) ( I L S B ( i , j ) mod 2 7 ε ) I ^ L S B ( i , j ) ε M S B = I ^ L S B ( i , j ) ( I ^ L S B ( i , j ) mod 2 7 ε ) ,
where I L S B ( i , j ) ε M S B and I ^ L S B ( i , j ) ε M S B are the corresponding ε most significant bits, respectively. Because two sequences have 7 bits, the tag ε is an integer that is no larger than 7, which means that there are 8 kinds of tags at most. In the embedding process, there are ε + 1 embeddable bits in high bit-planes can be reserved to save information. For example, if a pixel value is 108 and its prediction value is 116, two values can be converted into “1101100” and “1110100”, respectively. Comparing them in turn from high to low bit-plane, it can be found that there are 2 equal bits until a pair of bits are different. Therefore, the tag of this pixel is set to 2 and bits in three high bit-planes can be vacated to embed information. Notably, the pixel with the tag ε = 7 only affords 7 embeddable bits, because the binary sequence of the pixel contains only 7 bits.
Except for the first pixel, scanning all other pixels using the aforementioned procedure, a tag map of the modified image I can be obtained. Meanwhile, a large number of embeddable bits are created in high bit-planes of pixels in I L S B to accommodate information. Note that the pixel I L S B ( 1 ,   1 ) does not provide spare bits and remains unchanged. It can be directly used as the prediction value of some pixels, such as I L S B ( 1 ,   2 ) and I L S B ( 2 ,   1 ) , not only in the embedding process but also in the image recovery process.

2.2. Huffman Coding

To ensure that the original image can be recovered without any error, the tag map generated in I L S B also should be considered as a part of the auxiliary data and can be embedded after the modified image I is encrypted. It is obvious that the number of each tag in I L S B is different due to region characteristics of the original image. Calculating the occurrence frequency for each kind of tag, the tag map is recorded through Huffman coding, which will be very helpful to decrease the amount in this part of the auxiliary data, thereby improving the embedding capacity.
As described in Section 2.1.3, there are 8 kinds of tags at most, where ε is an integer in the range [ 0 ,   7 ] . If a pixel in I L S B is assigned with the tag 0, the bit in the 7 t h bit-plane can be vacated to record information. In the recovery process, the original bit can be obtained by reversing the value of the corresponding position of its prediction value. As the tag is an integer in the range [ 1 ,   6 ] , a total of ε + 1 bits from the 7 t h bit-plane to the ( 7 ε ) t h bit-plane are reserved. The first ε original bits can be recovered by retrieving the ε most significant bits of its prediction value, and the last bit is obtained using the method similar to that with the tag as 0. If the tag is equal to 7, the bits can be directly recovered from 7 bits occupied by its prediction value. Correspondingly, a group of 8 Huffman codes is defined to represent 8 kinds of tags, which are “00”, “01”, “100”, “101”, “1100”, “1101”, “1110”, and “1111”.
In the Huffman coding process, the numbers of different kinds of tags are calculated initially in accordance with the generated image I L S B of an original image, and then all kinds of tags are sorted. According to their occurrence frequency from high to low, 8 kinds of Huffman codes are assigned to them in turn. In other word, the tag with the maximum frequency is allocated with the shortest Huffman code such as “00” or “01”, while the tag with the minimum frequency assigned with the longest Huffman code such as “1100”, “1101”, “1110”, and “1111”. In this way, it is obvious that the amount of the tag map generated in I L S B can be compressed greatly. Take the test image Baboon, as an example, where the image size is 512 × 512 pixels, the numbers of all kinds of tags and the corresponding Huffman code are listed in Table 1. It can be intuitively observed that the number of pixels with the tag 0 is the largest, so the corresponding pixels are represented by the Huffman code “00”, while the numbers of pixels with tag 6 and 7 are smaller than others; therefore, the corresponding pixels are marked with “1111” and “1110”, respectively. It is notable that the Huffman coding rule composed of 8 codes should be considered as a part of auxiliary data to be embedded.
After the process of Huffman coding is implemented, the compressed tag map occupying ψ c bits can be obtained, where its length can be calculated by the formula as
L e n g t h ψ = t = 0 7 ( n t × c t ) , t = 0 , 1 , , 7 ,
where n t represents the number of pixels with the tag t and c t denotes the length of the corresponding Huffman code. At the same time, l ψ bits are applied to record the length of the compressed tag map.

2.3. Image Encryption

To enhance the security level, each pixel of the modified image I generated in Section 2.1.2 is encrypted by the aid of the encryption key K e in the proposed method. Initially, a pseudo-random matrix S with the same size as I is generated using the encryption key. Then, the bitwise exclusive-OR (XOR) calculation is applied between the current pixel I ( i , j ) and the corresponding element S ( i , j ) in the pseudo-random matrix. In other word, each pixel in I ( i , j ) can be processed as
I e ( i , j ) = I ( i , j ) S ( i , j ) ,
where I e ( i , j ) is the encrypted result of the current pixel and denotes the bitwise exclusive-OR operator. After all pixels in the modified image I are processed using Equation (10), the encrypted image denoted as I e is obtained. However, and importantly, the encrypted image is very noisy and no valid information can be observed. In addition, if it is needed to improve the security level further, the pseudo-random matrix used in Equation (10) can be generated with chaotic functions, such as in the Chen system. This can guarantee a random matrix with non-periodic and non-convergent properties that is more suitable for encrypting the image.

2.4. Auxiliary Data Embedding

As discussed in previous subsections, there are four kinds of auxiliary data in the proposed method which are used to describe the compressed error location map, the compressed binary sequence of most significant bits of reference pixels, the Huffman coding rule, and the compressed tag map. To implement extraction of the secret data and recovery of the original image successfully, all auxiliary data are rearranged as shown in Figure 5. In doing this, a long binary sequence is formed, and its previous part should be directly embedded in most significant bits of the encrypted image I e using bit substitution.
According to the structure as shown in Figure 5, it should be pointed out that the number of bits occupied by the Huffman coding rule is fixed, that is, 26 bits. At the same time, 8 Huffman codes are arranged in the order from 0 to 8, in accordance with tags they represent. Most importantly, because all of most significant bits of the encrypted image I e are used to accommodate information, a large number of partial auxiliary data can be stored into them by bit substitution. Therefore, when recovering the original image, the necessary information, such as the Huffman coding rule, the error location map, and the original most significant bits of reference pixels, as shown in Figure 5, can be easily retrieved at first. Based on the extracted Huffman coding rule, the tag map can be gradually retrieved from the 7 least significant bits of pixels. After the auxiliary data are completely obtained, the original image also can be easily recovered in accordance with the error location map and the original most significant bits of reference pixels.
Finally, scanning I e from the left to right and from top to bottom, the remaining part of the auxiliary data is embedded into a part of the pixels by replacing multiple bits of high bit-planes, which can be described as
I e ( i , j ) = { I e ( i , j ) mod 2 6 ε + s = 0 ε ( b s × 2 6 s ) , 0 ε 5   s = 1 7 ( b s × 2 7 s ) , 6 ε 7   ,
where I e ( i , j ) represents the current pixel containing the auxiliary information, ε is its tag and b s is a bit of the remaining auxiliary information to be embedded. From Equation (11), it can be seen that vacant bits for data accommodation only are determined from the 7 least significant bits of each pixel. After all the auxiliary information is embedded, the encrypted image containing auxiliary data, denoted as I e , is obtained, in which enough vacant bits will be reserved from the unprocessed pixels to embed the secret data.

2.5. Secret Data Hiding

Before embedding the secret data, the Huffman coding rule and the tag of the pixel containing all of the auxiliary data should be recovered from the encrypted image I e . In the beginning, partial auxiliary information is retrieved from all of the most significant bits of I e , and the necessary information, such as the Huffman coding rule and the partial compressed tag map as shown in Figure 5, can be easily obtained. Then, a part of the pixels in I e are scanned from left to right and from top to bottom. For the current pixel, its Huffman code is retrieved from the previously restored compressed tag map, from which its tag ε is obtained by referencing the Huffman coding rule. Only considering its 7 least significant bits, the ε + 1 bits of auxiliary information can be extracted from high bit-planes. In this way, after all the auxiliary data are retrieved, the tag map is restored. Finally, according to their tags, bits in high bit-planes of the remaining pixels can be vacated to accommodate the secret data. Using Equation (11), the marked encrypted image I e w containing the secret data is obtained. It should be noted that the secret data is encrypted before embedding them in order to enhance the security of the embedded data. Similar to the image encryption using Equation (10), the binary bits are engendered with the help of the data hiding key K d at first, and then the secret bits are encrypted using the XOR calculation between them and the generated binary bits.

2.6. Data Extraction and Image Recovery

To extract the secret data and recover the original image successfully, the entire auxiliary data should be extracted from the marked encrypted image I e w first. Since all of most significant bits are applied to save the previous part of the auxiliary data, information, such as the compressed error location map, the compressed result of most significant bits of reference pixels, the Huffman coding rule, and a part of the compressed tag map as shown in Figure 5, can be extracted directly. The remaining of the compressed tag map are recovered by scanning the 7 least-significant bits of pixels from left to right and from top to bottom. For a pixel, its Huffman code is retrieved from the previously obtained compressed tag map, and its tag is obtained by referencing the Huffman coding rule. Thus, several corresponding bits containing the information of the remaining compressed tag map can be extracted from its high bit-planes. Until the length of the compressed tag map is satisfied, the tag map can be completely restored from all processed pixels. Obviously, the entire auxiliary data are recovered without involving any secret key, with which different results will be obtained depending on the different key.
If the data hiding key K d is accessible, with the aid of the restored tag map, the embedded secret information can be extracted from the 7 least significant bits of the remaining unprocessed pixels. For each unprocessed pixel, according to its tag map, several spare bits containing a part of secret information are extracted from its high bit-planes. After all the remaining pixels are processed, the entire secret information is extracted and then decrypted with the help of the binary bits engendered with the data hiding key. The decryption process is implemented using the XOR calculation between the secret bits and the generated binary bits. Notably, the information of the original image is not revealed due to the absence of the encryption key.
If the encryption key K e is available, the original image can be recovered without any error with the help of the entire auxiliary data. First, using the reverse process described in Equation (10), where the pseudo-random matrix is the same and generated with the encryption key, the decrypted image I d is directly obtained from the marked encrypted image. After Γ c is decompressed, the obtained most significant bits are put back to the first row and column. Next, an image I d , L S B is reconstituted from the seven least significant bit-planes of the decrypted image and scanned from left to right and from top to bottom to recover original bits for the 7 least significant bit-planes. For the current pixel, its tag ε can be retrieved from the restored tag map. With the aid of the MED predictor, its prediction value I ^ d , L S B ( i , j ) is calculated employing its neighboring pixels that are confirmed in the process of recovering its previous pixel. Thus, its original value is recovered by means of its tag and its prediction value, which can be described as
I d , L S B ( i , j ) = { s = 0 ε I ^ d , L S B ( i , j ) s M S B × 2 7 s + ( 1 I ^ d , L S B ( i , j ) ( ε + 1 ) M S B ) × 2 6 ε + I d , L S B ( i , j ) mod 2 6 ε , 0 ε 6 I ^ d , L S B ( i , j ) , ε = 7 ,
where I ^ d , L S B ( i , j ) ε M S B is calculated by using Equation (8). Based on this equation, it can be seen that ε bits from the 7 t h bit-plane to the ( 7 ε + 1 ) t h bit-plane of I d , L S B ( i , j ) are directly recovered from the corresponding bits of I ^ d , L S B ( i , j ) while the bit in the ( 7 ε ) t h bit-plane is obtained by negating the corresponding bit. Meanwhile, if the tag is equal to 7, the original value is equal to its prediction value. In this way, two aspects of the original information are recovered without any error. One aspect is the most significant bits of pixels in the first row and column, and another is the 7 least significant bits of all pixels.
Next, the most significant bits of other pixels are recovered according to the compressed error location map. After L b is decompressed, the basic part of the error location map can be obtained, while the additional part is directly acquired from L a as shown in Figure 5. The obtained image is denoted as I d , for the sake of simplicity. Except for pixels in the first row and column, all pixels are scanned from the left to right and from the top to bottom. For the current pixel I d ( i , j ) , its most significant bit is recovered as follows:
(1)
The average value of its left and top pixels, which are previously confirmed, is obtained as its prediction value I ^ d ( i , j ) similar to the calculation using Equation (2).
(2)
Two possible values denoted as I d ( i , j ) | M S B = 0 and I d ( i , j ) | M S B = 1 are considered. Keeping the 7 least significant bits of the current pixel unchanged, I d ( i , j ) | M S B = 0 and I d ( i , j ) | M S B = 1 are obtained by making its most significant bit equal to 0 and 1, respectively.
(3)
The absolute difference values between I d ( i , j ) | M S B = 0 and I ^ d ( i , j ) and between I d ( i , j ) | M S B = 1 and I ^ d ( i , j ) are calculated as
{ Δ 0 ( i , j ) = | I d ( i , j ) | M S B = 0 I ^ d ( i , j ) | Δ 1 ( i , j ) = | I d ( i , j ) | M S B = 1 I ^ d ( i , j ) | .
(4)
Finally, its most significant bit is determined by comparing the two absolute differences described above. If the condition Δ 0 ( i , j ) < Δ 1 ( i , j ) is satisfied, the most significant bit is set to 0, otherwise it is set to 1.
Repeat the above steps for each pixel, and the decrypted modified image, denoted as I d , can be obtained, where most significant bits of some pixels are modified as described with Equation (5). Using the decompressed basic and additional part of the error location map, these most significant bits can be corrected. First, the basic part of the error location map is arranged to the two-dimensional binary matrix L as described with Equation (4), and all pixels in I d are scanned from left to right and from top to bottom. For a pixel, when L ( i , j ) = 1 is satisfied, two corresponding bits are retrieved from the second part of the error location map as shown in Figure 2. Then, if two bits are “00”, its value is corrected as
I ( i , j ) = ( I d ( i , j ) + 128 ) mod 256 .
When the retrieved two bits are “01” or “10”, its value is corrected by subtracting 1 or adding 1, respectively. Obviously, the correction is the reverse process described with Equation (5). Consequently, the original image is obtained without any loss.

3. Experimental Results and Analysis

To demonstrate the performance of the proposed method, experimental results are analyzed in terms of security and embedding rate, and compared against state-of-the-art methods. In the experiments, six commonly used test images with size of 512 × 512 pixels, including smooth and textural images, as shown in Figure 6 are taken as examples. A randomly generated bit stream is used as the secret data. The effectiveness of the proposed method is measured using images from different image databases, such as BOSSBase and BOWS2. In addition, for the error location map as shown in Figure 3, l L , 0 = 18 and l L , 1 = 18 bits are utilized to save the length of the maximum run of “0” and “1”, respectively, and l L , b = 18 bits is used to save the length of the compressed basic part. For the compressed results, as shown in Figure 4, f Γ occupies 1 bit, l Γ , 0 = 10 and l Γ , 1 = 10 bits are applied to save he length of the maximum run of “0” and “1”, respectively, and l Γ = 10 bits is used to record the length of the compressed most significant bits of reference pixels. For the compressed tag map, l ψ = 20 bits are used to record its occupied bits.

3.1. Security Analysis

Before embedding the secret data, the original image is encrypted by using the XOR calculation between it and a pseudo-random matrix generated by the encryption key K e as described with Equation (10). In order to verify the security of the proposed method, experimental results are analyzed from three parameters: the histogram, information entropy and correlation coefficient.
Using the image Lena as an example, the experimental results: the encrypted image, the encrypted image containing auxiliary data, the marked encrypted image, and the recovered image are shown in Figure 7. It can be seen that all encrypted images, especially the marked encrypted image, are noisy and no valid information can be observed from them. The histograms corresponding to the original image, the encrypted image, and the marked encrypted image are shown in Figure 8. Clearly, statistical features can be easily observed from the histogram of the original image’s data, while no valid information can be retrieved from other histograms due to their uniform distribution. Furthermore, after embedding the auxiliary information and secret data, the overall pixel distribution has somewhat changed, as shown in Figure 8c; however, it is still difficult to recover the original image content. Similar results were obtained for other test images.
As a quantitative measure of the randomness of a signal source, information entropy is used to evaluate the randomness of the marked encrypted image, which is mathematically defined as
H ( x ) = i = 0 255 P ( x i ) log 2 P ( x i ) ,
where x i is the gray value and P ( x i ) is its probability. In general, for an encrypted image, its ideal entropy is 8. The more uniform a frequency distribution is, the higher is its information entropy. The information entropy values of six test images as shown in Figure 6 are listed in Table 2, and they are all very close to 8. Therefore, the proposed method can efficiently withstand this kind of statistical attack. Furthermore, in order to measure the degree of correlation between the original image and the marked encrypted image, the corresponding correlation coefficient is calculated as
ρ ( I , I e w ) = C o v ( I , I e w ) σ ( I ) σ ( I e w )
where C o v ( I , I e w ) is the cross covariance between the data representing the original image and the marked encrypted image, and σ ( I ) and σ ( I e w ) are their standard deviations, respectively. The correlation coefficient values of six test images are listed in Table 3, from which it can be seen that all values are close to zero. This means that each marked encrypted image has a very low correlation with its corresponding original image. In addition, the pseudo-random matrix generated with the key K e , used to encrypt the original image as described with Equation (10), can be considered as a binary sequence with the M × N × 8 bits, for which there are 2 8 M N possibilities. Due to the very large number of possibilities, it is almost impossible to solve this pseudo-random matrix. This then guarantees that the proposed method can resist a brute-force security threat.

3.2. Performance Analysis

As described in Section 2, in order to make full use of pixel redundancy to embed information, the original image is divided into two parts in the proposed method. The first part consists of all the most significant bits. After identifying and modifying pixels with prediction errors, all the most significant bits can be vacated to accommodate information. The second part is a new image constituted with the 7 least significant bits of pixels. Utilizing the correlation between neighbor pixels, vacant bits can be obtained from higher bit-planes of this new image. In the above two processes, different kinds of auxiliary data are generated, as shown in Figure 5. Among them, the tag map obtained from the second part occupies a large number of bits. Using Huffman coding, a concentrated tag map can be generated to reduce the volume of these data greatly. At the same time, through compression of the error location map and the most significant bits of reference pixels, as noted in the first part of the process, the number of embeddable bits is further increased. Consequently, the embedding capacity of the proposed method is efficiently improved.
After all the auxiliary data are generated in the two parts of the proposed method, the number of bits they occupied is known, and the total embedding capacity of the original image can be calculated. Subtracting the number of bits consumed by auxiliary data from the total capacity, the net embedding capacity is obtained, i.e., the maximum length of secret data. Using the test image Tiffany as an example, the total embedding capacity generated in the first part and the number of bits occupied by the corresponding auxiliary data are listed in Table 4. It can be seen that many embeddable bits are consumed, in which the compressed basic part of the error location map is saved. Additionally, a fixed number of extra bits are used to record the length of the compressed basic part of the error location map, the length of the compressed bits of reference pixels, and the maximum length of each run, and so on. In this example, the net payload of the first part is 256,195 bits. For the second part, excluding the 26 bits used to record the Huffman coding rule, the tag distribution and encoding of pixels are listed in Table 5. The first column lists 8 types of tags, and the second column lists the corresponding Huffman codes. The third and fourth columns list the numbers of tags and the length of each tag type, respectively. The total number of bits occupied by the compressed tag map is 779,926 bits, using Equation (9). The fifth column lists the numbers of bits that can be embedded in each pixel, and the total embedding capacity of the second part is 1,271,386 bits. The last column shows the net payload that each pixel can provide. In addition, a fixed number of bits are used to record the length of the compressed tag map, which is 20 bits. Thus, the total net payload of the second part is 491,414 bits after subtracting 26 bits used for the Huffman coding rule from the total embedding capacity. Finally, adding the net payloads of the two parts of the process, the total net embedding capacity is 747,609 bits, and the corresponding embedding rate reaches 2.8519 bpp.
The net payloads for each of the six test images as shown in Figure 6 was calculated and their corresponding maximum embedding rates are listed in Table 6. From this table, it can be seen that high embedding rate can be obtained when the original image’s histogram is smoothly distributed. For example, a total of 1,637,384 embeddable bits are vacated in the smooth image Splash. Although 762,626 bits are used to record the entire auxiliary data, there are still 874,758 bits available to record secret data, which yields a high embedding rate, 3.3369 bpp. For a textural image, with little visual uniformity, the total number of embeddable bits is relatively low, and most of them are consumed to save auxiliary data. Using the image Baboon as an example, a total of 1,093,027 bits are vacated from the image, which is obviously less than that of the image Splash. At the same time, only 315,512 bits are used to save the secret data, so that a relatively low embedding rate 1.2036 bpp is obtained. Nevertheless, the original image can be recovered without any error, regardless of whether it is smooth or textural. For example, as shown in Figure 7d, although a total of 693,570 secret bits are embedded in the marked encrypted image and the net embedding rate achieves 2.6458 bpp, the test image Lena can be completely reconstructed as long as the encryption key is accessible.
To demonstrate the performance of the proposed method in means of maximum embedding rate, extensive experiments were performed on independent databases, BOSSBase [40] and BOWS2 [41]. In the BOSSBase and BOWS2 databases, there are 10,000 standard test images with the size of 512 × 512 pixels. As shown in Table 7, for images in BOSSBase and BOWS2, the best embedding rates are 5.9212 bpp and 5.6459 bpp, respectively, while the worst embedding rates are 0.6886 bpp and 0.5348 bpp, respectively. Meanwhile, the average embedding rates are 3.3894 bpp and 3.2824 bpp, respectively. Most importantly, regardless of which image in these databases is tested, it can be reconstructed without errors, i.e., the peak signal-to-noise ratio (PSNR) between the original image and its recovered image tends to infinity, and the maximum structural similarity (SSIM) between these images equals 1.

3.3. Comparison with State-of-the-Art Methods

For the purpose of evaluating the proposed method relative to others, the embedding rates of this method and five state-of-the-art methods [29,30,33,37,39], using the six test images shown in Figure 6, are presented in Figure 9. In Fu et al.’s method [29], the block size that affects the embedding capacity was set to 4 × 4 pixels to achieve the maximum embedding rate. Meanwhile, the number of higher significant bit-layers of the image was set to 5, and the relevant threshold was set to 4. Similarly, the block partition in an original image was utilized and the parameters were set as α = 5 and β = 8 in Wu et al.’s method [30]. It can be seen from Figure 9 that in the two methods described above, excluding the image Baboon, the embedding rate of Wu et al.’s method is higher than that of Fu et al.’s. Using the images Lena and Splash to illustrate, the embedding rates of the method described in [30] are 2.6447 bpp and 2.6515 bpp, respectively, while the embedding rates are 2.4797 bpp and 2.5908 bpp using the method in [29]. For the method of Yi and Zhou [33], the block size was also set to 4 × 4 pixels, but the embedding rates of this method on the six test images shown in Figure 6 are lower than that of method [29]. This is because the number of embeddable blocks is too low, such that no more vacant bits were created for embedding [33]. In Puteaux and Puech’s method [37], the secret data is mainly embedded using MSB substitution. Pixels with prediction errors should be marked and some of the most significant bits are consumed. Therefore, the maximum embedding rates of the six test images of this method are less than 1 bpp. However, in Yin et al.’s method [39], the secret data is embedded using adaptive multi-MSB substitution, and better embedding performance can be obtained. For example, for smooth images such as Splash and Airplane, the highest embedding rate can be achieved, i.e., 3.3369 bpp and 3.0671 bpp, respectively. For the textural image Baboon, no matter which method was used, its embedding rate is the lowest among all test images. Nevertheless, it is obvious from Figure 9 that regardless of which image is tested, the proposed method can provide higher embedding capacity than other methods.
In Table 8, results from experiments performed using the proposed method and other methods on all images in the two comparison databases are presented. These results further demonstrate that, among the tested methods, the proposed method can achieve the highest embedding capacity. In detail, the average embedding rates of Fu et al.’s method [29] for the two databases are 2.1733 bpp and 2.0454 bpp, respectively, while Wu et al.’s [29] method’s performance was higher by 0.388 bpp and 0.474 bpp, respectively. In Yi and Zhou’s scheme [33], the average embedding rate using images in BOSSBase is slightly higher than that using the images in BOWS2. In addition, the average embedding rates of Puteaux and Puech’s method [37] are less than 1 bpp. Yin et al.’ s method [39] takes full advantage of the local correlation in an original image to achieve better performance, where the embedding rates for the two databases are 3.3613 bpp, and 3.2455 bpp, respectively. Compared with that method, more redundant space and fewer bits occupied by auxiliary information resulted with the proposed method. The average embedding rates of the proposed method are 3.3894 bpp and 3.2824 bpp, respectively; thus, the performance of the proposed method was stronger than others.

4. Conclusions

In this paper, a new high-capacity reversible data hiding method for encrypted images is proposed based on hybrid prediction and Huffman coding. In order to obtain high embedding capacity to accommodate secret data, the proposed method can make full use of image redundancy to create larger vacant spaces. This goal has been achieved using the hybrid prediction scheme according to the most significant bit-plane and the remaining bit-planes of an original image. For each pixel, considering the average value of two neighboring pixels as its prediction, errors can be identified. After modifying pixels with prediction errors, all the most significant bits can be vacated to record additional information. For an image comprised of the remaining seven bit-planes, the tag value of each pixel is obtained based on its prediction value, calculated using the median edge detector predictor. Then, compressing the entire tag map with Huffman coding, more spaces can be vacated to embed secret data. The experimental results show that higher embedding performance can achieved using the proposed method compared to other published methods, with embedding rates of 3.3894 bpp and 3.2824 bpp, using BOSSBase and BOWS-2 databases, which are 0.0281 bpp and 0.0369 bpp higher than a method using multiple most significant bit prediction alone.

Author Contributions

Methodology, L.S. and H.L.; Software, J.L. and H.L.; Validation, Z.X. and A.T.; Writing—original draft, H.L. All authors have read and agreed to the published version of the manuscript.

Funding

National Natural Science Foundation of China (NSFC): 62031023.

Data Availability Statement

Data will be made available on request.

Conflicts of Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

References

  1. Wang, Y.; Plataniotis, K.N. An analysis of random projection for changeable and privacy-preserving biometric verification. IEEE Trans. Syst. Man Cybern. B Cybern. 2010, 40, 1280–1293. [Google Scholar] [CrossRef] [PubMed]
  2. Li, X.; Zhang, W.; Gui, X.; Yang, B. A novel reversible data hiding scheme based on two-dimensional difference-histogram modification. IEEE Trans. Inf. Forensics Secur. 2013, 8, 1091–1100. [Google Scholar]
  3. Hu, Y.C.; Lin, Y.H.; Lo, C.C.; Wu, C.M. Implementation of block-based hierarchical prediction for developing an error-propagation-free reversible data hiding scheme. Symmetry 2019, 11, 1146. [Google Scholar] [CrossRef] [Green Version]
  4. Lee, C.F.; Shen, J.J.; Wu, Y.J.; Agrawal, S. PVO-based reversible data hiding exploiting two-layer embedding for enhancing image fidelity. Symmetry 2020, 12, 1164. [Google Scholar] [CrossRef]
  5. Pang, Z.; Li, H.; Xiao, Z.; Sui, L. Reversible data hiding in encrypted images based on an adaptive recognition strategy for blocks. Symmetry 2023, 15, 524. [Google Scholar] [CrossRef]
  6. Coatrieux, G.; Guillou, C.L.; Cauvin, J.M.; Roux, C. Reversible watermarking for knowledge digest embedding and reliability control in medical images. IEEE Trans. Inform. Technol. Biomed. 2009, 13, 158–165. [Google Scholar] [CrossRef]
  7. Lee, S.; Yoo, C.D.; Kalker, T. Reversible image watermarking based on integer-to-integer wavelet transform. IEEE Trans. Inf. Forensics Secur. 2007, 2, 321–330. [Google Scholar] [CrossRef]
  8. Li, X.; Li, B.; Yang, B.; Zeng, T.Y. General framework to histogram-shifting-based reversible data hiding. IEEE Trans. Image Process. 2013, 22, 2181–2191. [Google Scholar] [CrossRef]
  9. Li, X.; Zhang, W.M.; Zhao, X.F.; Yu, N.H.; Li, F.H. Efficient reversible data hiding based on multiple histograms modification. IEEE Trans. Inf. Forensics Secur. 2015, 10, 2016–2027. [Google Scholar]
  10. Ni, Z.; Shi, Y.Q.; Ansari, N.; Wei, S. Reversible data hiding. IEEE Trans. Circuits Syst. Video Technol. 2006, 16, 354–362. [Google Scholar]
  11. Qin, C.; Chang, C.C.; Huang, Y.H.; Liao, L.T. An inpainting-assisted reversible steganographic scheme using a histogram shifting mechanism. IEEE Trans. Circuits Syst. Video Technol. 2013, 23, 1109–1118. [Google Scholar] [CrossRef]
  12. Ou, B.; Li, X.; Zhao, Y.; Ni, R.; Shi, Y. Pairwise prediction-error expansion for efficient reversible data hiding. IEEE Trans. Image Process. 2013, 22, 5010–5021. [Google Scholar] [CrossRef] [PubMed]
  13. Tian, J. Reversible data embedding using a difference expansion. IEEE Trans. Circuits Syst. Video Technol. 2003, 13, 890–896. [Google Scholar] [CrossRef] [Green Version]
  14. Thodi, D.M.; Rodriguez, J.J. Expansion embedding techniques for reversible watermarking. IEEE Trans. Image Process. 2007, 16, 721–730. [Google Scholar] [CrossRef] [PubMed]
  15. Celik, M.; Sharma, G.; Tekalp, A.; Saber, E. Lossless generalized-LSB data embedding. IEEE Trans. Image Process. 2005, 14, 253–266. [Google Scholar] [CrossRef]
  16. Fridrich, J.; Golian, M. Lossless data embedding for all image formats. In Proceedings of the IS&T/SPIE Electronic Imaging, San Jose, CA, USA, 30 May 2002; pp. 572–583. [Google Scholar]
  17. Lu, T.; Chi, L.P.; Wu, C.H.; Chang, H.P. Reversible data hiding in dual stego-images using frequency-based encoding strategy. Multimed. Tools Appl. 2017, 76, 23903–23929. [Google Scholar] [CrossRef]
  18. Qin, C.; Chang, C.C.; Hsu, T.J. Reversible data hiding scheme based on exploiting modification direction with two steganographic images. Multimed. Tools Appl. 2015, 74, 5861–5872. [Google Scholar] [CrossRef]
  19. Zhou, J.; Sun, W.W.; Dong, L.; Liu, X.M.; Au, O.C.; Tang, Y.Y. Secure reversible image data hiding over encrypted domain via key modulation. IEEE Trans. Circuits Syst. Video Technol. 2016, 26, 441–452. [Google Scholar] [CrossRef]
  20. Mohammadi, A.; Nakhkash, M.; Akhaee, M.A. A high-capacity reversible data hiding in encrypted images employing local difference predictor. IEEE Trans. Circuits Syst. Video Technol. 2020, 30, 2366–2376. [Google Scholar] [CrossRef]
  21. Zhang, H.; Li, L.; Li, Q.Y. Reversible data hiding in encrypted images based on block-wise multi-predictor. IEEE Access 2021, 9, 61943–61954. [Google Scholar] [CrossRef]
  22. Puech, W.; Chaumont, M.; Strauss, O. A reversible data hiding method for encrypted images. In Proceedings of the SPIE: Security, Forensics, Steganography, and Watermarking of Multimedia Contents X, Bellingham, WA, USA, 18 March 2008; Volume 6819. [Google Scholar]
  23. Zhang, X.P. Reversible data hiding in encrypted image. IEEE Signal Process. Lett. 2011, 18, 255–258. [Google Scholar] [CrossRef]
  24. Zhang, X.P. Separable reversible data hiding in encrypted image. IEEE Trans. Inf. Forensics Secur. 2012, 7, 826–832. [Google Scholar] [CrossRef]
  25. Hong, W.; Chen, T.S.; Wu, H.Y. An improved reversible data hiding in encrypted images using side match. IEEE Signal Process. Lett. 2012, 19, 199–202. [Google Scholar] [CrossRef]
  26. Liao, X.; Shu, C.W. Reversible data hiding in encrypted images based on absolute mean difference of multiple neighboring pixels. J. Vis. Commun. Image Represent. 2015, 28, 21–27. [Google Scholar] [CrossRef]
  27. Qin, C.; He, Z.H.; Luo, X.Y.; Dong, J. Reversible data hiding in encrypted image with separable capability and high embedding capacity. Inf. Sci. 2018, 465, 285–304. [Google Scholar] [CrossRef]
  28. Yu, C.; Zhang, X.Q.; Tang, Z.J.; Xie, X.J. Separable and error-free reversible data hiding in encrypted image based on two-layer pixel errors. IEEE Access 2018, 6, 76956–76969. [Google Scholar] [CrossRef]
  29. Fu, Y.J.; Kong, P.; Yao, H.; Tang, Z.J.; Qin, C. Effective reversible data hiding in encrypted image with adaptive encoding strategy. Inf. Sci. 2019, 494, 21–36. [Google Scholar] [CrossRef]
  30. Wu, Y.Q.; Xiang, Y.Z.; Guo, Y.T.; Tang, J.; Yin, Z.X. An improved reversible data hiding in encrypted images using parametric binary tree labeling. IEEE Trans. Multimed. 2020, 22, 1929–1938. [Google Scholar] [CrossRef] [Green Version]
  31. Ma, K.D.; Zhang, W.M.; Zhao, X.F.; Yu, N.H.; Li, F.H. Reversible data hiding in encrypted images by reserving room before encryption. IEEE Trans. Inf. Forensics Secur. 2013, 8, 553–562. [Google Scholar] [CrossRef]
  32. Cao, X.C.; Du, L.; Wei, X.X.; Meng, D.; Guo, X.J. High capacity reversible data hiding in encrypted images by patch-level sparse representation. IEEE Trans. Cybern. 2017, 46, 1132–1143. [Google Scholar] [CrossRef]
  33. Yi, S.; Zhou, Y.C. Binary-block embedding for reversible data hiding in encrypted images. Signal Process. 2017, 133, 40–51. [Google Scholar] [CrossRef]
  34. Wu, H.T.; Yang, Z.Y.; Cheung, Y.M.; Xu, L.L.; Tang, S.H. High-capacity reversible data hiding in encrypted images by bit plane partition and MSB prediction. IEEE Access 2019, 7, 62361–62371. [Google Scholar] [CrossRef]
  35. Arun, K.R.; Neeraj, K.; Rajeev, K.; Hari, O.; Satish, C.; Ki, H.J. Intra-block correlation based reversible data hiding in encrypted images using parametric binary tree labeling. Symmetry 2021, 13, 1072. [Google Scholar] [CrossRef]
  36. Mittal, S.; Goyal, S.; Aggarwal, S.; Kumar, R. Interpolative AMBTC based reversible data hiding in encrypted images using rhombus mean. In Proceedings of the 2023 International Conference on Device Intelligence, Computing and Communication Technologies, Dehradun, India, 17–18 March 2023; pp. 451–456. [Google Scholar]
  37. Puteaux, P.; Puech, W. An efficient MSB prediction-based method for high-capacity reversible data hiding in encrypted images. IEEE Trans. Inf. Forensics Secur. 2018, 13, 1670–1681. [Google Scholar] [CrossRef] [Green Version]
  38. Puteaux, P.; Puech, W. EPE-based huge-capacity reversible data hiding in encrypted images. In Proceedings of the IEEE International Workshop on Information Forensics and Security, Hong Kong, China, 11–13 December 2018; pp. 1–7. [Google Scholar]
  39. Yin, Z.; Xiang, Y.Z.; Zhang, X.P. Reversible data hiding in encrypted images based on multi-MSB prediction and Huffman coding. IEEE Trans. Multimed. 2020, 22, 874–884. [Google Scholar] [CrossRef]
  40. Bas, P.; Filler, T.; Pevny, T. Break our steganographic system-The ins and outs of organizing BOSS. In Proceedings of the 13th International Conference, IH 2011, Prague, Czech Republic, 18–20 May 2011; pp. 59–70. [Google Scholar]
  41. Bas, P.; Furon, T. Image Database of BOWS-2. Available online: http://bows2.ec-lille.fr/ (accessed on 20 June 2017).
Figure 1. The schematic diagram of the proposed method.
Figure 1. The schematic diagram of the proposed method.
Symmetry 15 01222 g001
Figure 2. The structure of the error location map: (a) basic part, (b) additional part.
Figure 2. The structure of the error location map: (a) basic part, (b) additional part.
Symmetry 15 01222 g002
Figure 3. The structure of the compressed basic part.
Figure 3. The structure of the compressed basic part.
Symmetry 15 01222 g003
Figure 4. The structure of the compressed result of most significant bits of reference pixels.
Figure 4. The structure of the compressed result of most significant bits of reference pixels.
Symmetry 15 01222 g004
Figure 5. The structure of the entire auxiliary data.
Figure 5. The structure of the entire auxiliary data.
Symmetry 15 01222 g005
Figure 6. Test image: (a) Baboon; (b) Lena; (c) Tiffany; (d) Airplane; (e) Splash; and (f) Car.
Figure 6. Test image: (a) Baboon; (b) Lena; (c) Tiffany; (d) Airplane; (e) Splash; and (f) Car.
Symmetry 15 01222 g006
Figure 7. Test image Lena: (a) The encrypted image, (b) the encrypted image containing the auxiliary data, (c) the marked encrypted image, and (d) the recovered image.
Figure 7. Test image Lena: (a) The encrypted image, (b) the encrypted image containing the auxiliary data, (c) the marked encrypted image, and (d) the recovered image.
Symmetry 15 01222 g007
Figure 8. The histogram of (a) the original image, (b) the encrypted image, and (c) the marked encrypted image.
Figure 8. The histogram of (a) the original image, (b) the encrypted image, and (c) the marked encrypted image.
Symmetry 15 01222 g008
Figure 9. Comparison of embedding rate among different methods [29,30,33,37,39] on six test images.
Figure 9. Comparison of embedding rate among different methods [29,30,33,37,39] on six test images.
Symmetry 15 01222 g009
Table 1. The numbers of all kinds of tags and their Huffman codes for the image Baboon.
Table 1. The numbers of all kinds of tags and their Huffman codes for the image Baboon.
Tag01234567
The number of pixels65,31643,41847,84542,95929,23216,41184008562
Huffman code00100011011100110111111110
Table 2. The information entropy values of six marked encrypted images.
Table 2. The information entropy values of six marked encrypted images.
ImageBaboonLenaTiffanyAirplaneSplashCar
Entropy7.95057.99287.99027.98187.98907.9826
Table 3. The correlation coefficient (CC) values of six test images.
Table 3. The correlation coefficient (CC) values of six test images.
ImageBaboonLenaTiffanyAirplaneSplashCar
CC0.0186−0.00220.00350.0123−0.01470.0153
Table 4. The total embedding capacity and bits occupied by the auxiliary data in the first part using the image Tiffany.
Table 4. The total embedding capacity and bits occupied by the auxiliary data in the first part using the image Tiffany.
Total Capacity (Bits)Compressed Result of Reference Values (Bits)Compressed Basic Part of Error Location Map (Bits)Additional Part of
Error Location Map (Bits)
262,144365260568
Table 5. The total embedding capacity and bits occupied by the compressed tag map of the image Tiffany.
Table 5. The total embedding capacity and bits occupied by the compressed tag map of the image Tiffany.
TagCodeNumber of TagCode Length (Bits)Capacity (Bits)Net Payload
(Bits)
0111110,37041−3
1111018,39742−2
210135,157330
310039,949341
40052,165253
50144,605264
6110128,092473
7110033,408473
Total--779,9261,271,386491,460
Table 6. The total embedding capacity (EC) and the auxiliary information of six test images.
Table 6. The total embedding capacity (EC) and the auxiliary information of six test images.
Test ImageThe First Part
(Bits)
The Second Part
(Bits)
Embedding Rate (bpp)
Total EC Auxiliary DataTotal EC Auxiliary Data
Baboon262,14441,511830,883735,8731.2036
Lena262,14438841,214,630779,1892.6458
Tiffany262,14458641,271,386779,9262.8519
Airplane262,14441031,329,135783,0213.0671
Splash262,1448671,375,240761,6283.3369
Car262,14485761,527,323798,5572.7468
Table 7. The performance analysis on two image databases.
Table 7. The performance analysis on two image databases.
DatabaseIndicatorsBest CaseWorst CaseAverage
BOSSBaseER (bpp)5.92120.68863.3894
PSNR + + +
SSIM111
BOWS-2ER (bpp)5.64590.53483.2824
PSNR + + +
SSIM111
Table 8. Average embedding rate comparison for two image databases.
Table 8. Average embedding rate comparison for two image databases.
DatabaseAverage Embedding Rates for Different Databases (bpp)
[29][30][33][37][39]Proposed
BOSSBase2.17332.56132.29570.96593.36133.3894
BOWS22.04542.51942.17480.96813.24553.2824
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Sui, L.; Li, H.; Liu, J.; Xiao, Z.; Tian, A. Reversible Data Hiding in Encrypted Images Based on Hybrid Prediction and Huffman Coding. Symmetry 2023, 15, 1222. https://doi.org/10.3390/sym15061222

AMA Style

Sui L, Li H, Liu J, Xiao Z, Tian A. Reversible Data Hiding in Encrypted Images Based on Hybrid Prediction and Huffman Coding. Symmetry. 2023; 15(6):1222. https://doi.org/10.3390/sym15061222

Chicago/Turabian Style

Sui, Liansheng, Han Li, Jie Liu, Zhaolin Xiao, and Ailing Tian. 2023. "Reversible Data Hiding in Encrypted Images Based on Hybrid Prediction and Huffman Coding" Symmetry 15, no. 6: 1222. https://doi.org/10.3390/sym15061222

APA Style

Sui, L., Li, H., Liu, J., Xiao, Z., & Tian, A. (2023). Reversible Data Hiding in Encrypted Images Based on Hybrid Prediction and Huffman Coding. Symmetry, 15(6), 1222. https://doi.org/10.3390/sym15061222

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop