LVGG-IE: A Novel Lightweight VGG-Based Image Encryption Scheme

Sun, Mingliang; Yuan, Jie; Li, Xiaoyong; Liu, Dongxiao; Wei, Xinghai

doi:10.3390/e26121013

Open AccessArticle

LVGG-IE: A Novel Lightweight VGG-Based Image Encryption Scheme

by

Mingliang Sun

,

Jie Yuan

^*,

Xiaoyong Li

,

Dongxiao Liu

and

Xinghai Wei

Key Laboratory of Trustworthy Distributed Computing and Service (BUPT), Ministry of Education, Beijing University of Posts and Telecommunications, Beijing 100876, China

^*

Author to whom correspondence should be addressed.

Entropy 2024, 26(12), 1013; https://doi.org/10.3390/e26121013

Submission received: 23 September 2024 / Revised: 15 November 2024 / Accepted: 21 November 2024 / Published: 23 November 2024

(This article belongs to the Section Multidisciplinary Applications)

Download

Browse Figures

Versions Notes

Abstract

:

Image security faces increasing challenges with the widespread application of computer science and artificial intelligence. Although chaotic systems are employed to encrypt images and prevent unauthorized access or tampering, the degradation that occurs during the binarization process in chaotic systems reduces security. The chaos- and DNA-based image encryption schemes increases its complexity, while the integration of deep learning with image encryption is still in its infancy and has several shortcomings. An image encryption scheme with high security and efficiency is required for the protection of the image. To address these problems, we propose a novel image encryption scheme based on the lightweight VGG (LVGG), referred to as LVGG-IE. In this work, we design an LVGG network with fewer layers while maintaining a high capacity for feature capture. This network is used to generate a key seed, which is then employed to transform the plaintext image into part of the initial value of a chaotic system, ensuring that the chaos-based key generator correlates with the plaintext image. A dynamic substitution box (S-box) is also designed and used to scramble the randomly shuffled plaintext image. Additionally, a single-connected (SC) layer is combined with a convolution layer from VGG to encrypt the image, where the SC layer is dynamically constructed by the secret key and the convolution kernel is set to

1 \times 2

. The encryption efficiency is simulated, and the security is analyzed. The results show that the correlation coefficient between adjacent pixels in the proposed scheme achieves

10^{- 4}

. The NPCR exceeds 0.9958, and the UACI falls within the theoretical value with a significance level of 0.05. The encryption quality, the security of the dynamic S-box and the SC layer, and the efficiency are tested. The result shows that the proposed image encryption scheme demonstrates high security, efficiency, and robustness, making it effective for image security in various applications.

Keywords:

VGG; image encrypt; S-box; single-connected layer; chaos

1. Introduction

The importance of image security has grown due to its wide range of applications. Traditional cryptosystems often require significant time to encrypt large color images. As a result, cryptosystems with faster speeds and lower costs are being researched and developed. Recently, chaos-based image cryptosystems have garnered increasing attention for their lightweight nature and high efficiency [1,2]. The initial value sensitivity, ergodicity, and periodic point density of chaotic systems ensure that they are locally random but globally bounded, making their output difficult to predict and meeting the confusion and diffusion requirements of a good cryptosystem. Much research focuses on chaotic systems, attractors, and chaotic sequences to achieve high security for image encryption [3,4,5]. Chaos-based image encryption algorithms can be categorized as either 1D chaos-based or higher-dimensional chaos-based. For instance, 1D chaotic systems with two seed maps are proposed in [6] to create a novel image encryption scheme, transforming the original image into different encrypted images with the same key. Following a similar approach, two different 1D chaotic maps are used in [7] to output sequences for encryption. A cosine-transform-based chaotic system is presented in [8] to scramble and diffuse the image. Although 1D chaotic systems are efficient, they have a small key space and lack complexity, making their orbits predictable, which can lead to security vulnerabilities [9,10,11]. High-dimensional chaotic systems, with more complex behavior and larger key spaces, provide greater security. In [12], a color image is converted into a two-dimensional matrix, which is scrambled using a combined DNA coding operation with a three-dimensional chaotic system and Fisher–Yates scrambling. Mansouri and Wang [13] improved the Arnold system by combining it with a shuffle operation to scramble and diffuse the image. The Lorenz system, a classical chaotic system, has been modified and widely used in chaos-based image encryption and communications [14,15,16]. In [17], the nonlinear term of the general Lorenz system was replaced by the sum of an exponential function and the square of a single variable. This new Lorenz system is then used to generate keys for scrambling image pixels, effectively resisting chosen plaintext attacks. In [18], a coupled chaotic system with complex dynamic behavior is proposed for image encryption, achieving higher security and speed.

DNA coding has been introduced in chaos-based color image encryption [19,20,21,22] to enhance security. These schemes divide the image into three channels and transform them into matrices using DNA coding, then use the chaotic system-generated keys to diffuse the image. Ravichandran et al. proposed a two-level image encryption scheme based on the chaotic map and deoxyribonucleic acid (DNA) [23,24]. In [25], a 4D cat map and elliptic curve ElGamal are used to encrypt color images, resulting in high resistance to known attacks. Numerous other chaos-based color image encryption schemes have also been proposed [26,27], all demonstrating high security and robustness. However, chaos-based encryption schemes often require multiple pixel scans and sorting operations, leading to high computational complexity [28,29,30]. Furthermore, the binarization of chaotic systems introduces degradation, reducing security.

Deep learning is being used for image encryption due to its nonlinear structure, though it is still in its early stages. Ding and Zheng et al. proposed image encryption schemes based on GAN, cycle-GAN, and their variants [31,32]. In GAN-based encryption, a set of encrypted images is used as hidden factors to train the network, which transforms plain images into cipher images. In [32], cycle-GAN is used to disguise plain images with cover images. In [33], plain images are diffused before being passed through the encryption model, where GAN is used as the encryption component. Wang et al. generated cipher images directly, without training a neural network, using scrambled DCT coefficients instead. Another method uses deep learning to generate secret keys [34,35], achieving a high key space. However, convolution operations in deep learning without normalization cause pixel values to exceed 255, and normalization results in float values that cannot be displayed as images. Consequently, deep learning is typically used to generate control parameters for image encryption. In [36], the facial image of a person is employed to extract features using a convolutional neural network (CNN). These features are then used to control the sine logistic modulation map, which generates chaotic matrices for the encryption of CT images. Zhou et al. proposed an image encryption scheme based on a conditional generative adversarial network (CGAN) [37]. The primary image is encoded into two noise-like images, which are then used to generate a speckle pattern and trained with the primary image by the CGAN. Upon receiving the ciphertext of the two noise-like images, they are first decrypted and recombined into the speckle pattern. This speckle pattern is then input into the CGAN to output the corresponding original image. Panwar et al. summarized the latest deep learning-based image encryption methods, analyzing their advantages and possible vulnerabilities to attacks [38]. These studies show that deep learning-based image encryption schemes may be vulnerable to attacks common to deep learning models [38,39], such as hidden factor leakage and network architecture exposure. Additionally, since the secret keys do not correlate with the plaintext image, they may be compromised by chosen plaintext attacks [40,41].

From the above, we can conclude that chaos-based image encryption schemes, when combined with DNA coding or other nonlinear components, enhance security while increasing complexity. Currently, no image encryption scheme employs the VGG network. Deep learning-based image encryption schemes, such as CNN-based encryption and CGAN-based encryption, may be vulnerable to attacks common to deep learning models. To address these security issues, we designed a lightweight VGG (LVGG) neural network based on VGG-16 [42], which offers high efficiency. We then proposed a LVGG-based image encryption scheme that combines the nonlinearity of deep learning models with the randomness of chaotic systems. The LVGG has fewer layers than the classical VGG. Since the VGG network achieves the same receptive field with smaller convolutional kernels and uses fewer parameters than other CNNs, the proposed LVGG can achieve high efficiency in image encryption.

Our contributions are as follows: (i) We propose an LVGG-based key seed generator that takes a plain image as input, where the LVGG with only 7 layers improves the efficiency of key seed generation. We design a novel 4D chaotic system with complex dynamic behavior, based on the Lorenz system, to generate the key seed. This key seed is used as part of the initial values for generating the secret key, correlating the plain image with the encryption process and enhancing resistance to chosen plaintext attacks. (ii) We design a dynamic substitution box (S-box) to scramble the pixels of the image, improving the encryption’s resistance to statistical attacks. (iii) A dynamic SC layer, along with a convolutional addition and modular operation, is dynamically generated for image encryption. The convolutional addition is applied to the image using a convolution kernel of size 1 × 2, followed by modulo 256 calculations, achieving high efficiency in confusion. Finally, the security and robustness of the proposed scheme are analyzed through simulation.

The remainder of this article is organized as follows. In Section 2, we introduce the VGG network and the Lorenz chaotic system. In Section 3, we present the design of the LVGG-based image encryption scheme. This includes the LVGG network, a novel 4D Lorenz-based chaotic system, and the LVGG-chaos-based pseudorandom generator. The dynamic S-box and SC layer, constructed by the secret key, are used to scramble and diffuse the image. Finally, a convolution kernel is designed for further encryption. In Section 4, we discuss the simulation and security analysis of the proposed scheme. Conclusions are drawn in Section 5.

2. Preliminaries

In this section, we introduce the notations for the VGG network, the Lorenz system, and the S-box.

2.1. The VGG Network

The VGG network shows that the performance of a neural network can be affected by the depth of the network to a certain extent. There are two classical structures of VGG, referred to as VGG16 and VGG19. The only difference between them is the depth of the network. The network structure of VGG networks in [42] is shown in Table 1:

Here, conv1-64 denotes a convolutional layer with a kernel size of 1 × 1, and 64 refers to the number of channels. The other convolutional layers follow the same convention. In Table 1, six different VGG networks are introduced, denoted as A, A-LRN, B, C, D, and E. The differences between these VGG variants lie in the number of channels used in the convolutional layers and the normalization functions applied. The max-pooling layer uses a filter of size

2 \times 2

with a stride of 2. “

F C - n

” denotes a fully connected layer with

n

nodes. By using the same size of convolution kernel (

3 \times 3

) and and max-pooling (

2 \times 2

), the network structure is simplified, allowing deeper networks to enhance performance. The total number of layers is defined as the sum of the convolutional and fully connected layers.

2.2. The Lorenz Chaotic System

The Lorenz system, proposed in 1963 to model weather patterns [43], is defined as follows:

\{\begin{array}{l} {\dot{x}}_{1} = a (x_{2} - x_{1}) \\ {\dot{x}}_{2} = b x_{1} - x_{2} - x_{1} x_{3} \\ {\dot{x}}_{3} = x_{1} x_{2} - c x_{3} \end{array}

(1)

Here,

x_{1}

,

x_{2}

,

x_{3}

are the state variables. When

a = 10

,

b = 28

,

c = \frac{8}{3}

, the system is chaotic.

2.3. Property of the S-Box

The S-box is a nonlinear component in block ciphers that directly determines the security of the encryption algorithm. Generally, the S-box serves as a substitution table for a given input; by looking up the table, one can obtain the corresponding output. It is crucial for resisting linear and differential attacks on block ciphers. The definition of the S-box was first introduced in [44]: an

n \times m

S-box

S

is a mapping

S

:

{0,1}^{n} \to {0,1}^{m}

.

S

can be represented as

2^{n}

m

-bit numbers, denoted

{r_{0}, r_{1}, \dots, r_{2^{n} - 1}}

, where

S (x) = r_{x}

for

0 \leq x < 2^{n}

, and the

r_{i}

values are the rows of the S-box.

3. The Proposed Image Encryption Scheme LVGG-IE

In this section, we design an LVGG neural network to generate a plaintext image-correlated secret key seed, which has higher efficiency than the classical VGG network. This secret key seed is then used as part of the initial values of the proposed chaotic system, with the other two initial values chosen randomly. The pseudorandom sequence generated by the chaotic system serves as the secret key for constructing the substitution box and for image encryption. Additionally, we design an SC layer using the secret key to further confuse the image. Finally, a convolutional layer with a kernel size of

1 \times 2

is applied to the image’s pixel matrix. The details are as follows.

3.1. The LVGG-Based Key Seed Generator

To enhance image encryption against chosen plaintext attacks, the plaintext should be fed into the encryption process. We designed an LVGG-based key seed generator that takes a plain image as input, while the LVGG improves the efficiency of key seed generation. The neural network structure of the proposed LVGG is shown below.

In Figure 1, the proposed LVGG neural network contains 7 layers and an input of the

256 \times 256 \times 3

image. The parameter

N o n e

denotes the batch size, and the output is a vector with size two. The LVGG can classify the image with lower resource consumption and high speed. We trained the network model using a training set of

10^{4}

images and a test set of

10^{3}

. Both the training set and the test set contain 50% human images and 50% non-human images. All images in these sets were normalized and then resized to

256 \times 256

. The number of epochs was set to 10, and the batch size was set to 32. The learning rate was optimized using the RMSprop algorithm, with a value of

10^{- 4}

. The network was trained for 10 epochs, achieving an accuracy of 0.788. The proposed LVGG is compared with the VGG 16 and VGG 19 of [42] in Table 2.

From Table 2, we can see that the proposed LVGG requires only 20 s to train the model, which is one-fifteenth of the time needed by VGG 16 and VGG 19. This makes LVGG more efficient for image encryption, especially when the neural network needs to be retrained.

Since the LVGG network uses the softmax function for classification, any image can be input into the LVGG neural network to obtain a vector with two floating point values,

{k s}_{1}

and

{k s}_{2}

, where

{k s}_{1} \in (0,1)

and

{k s}_{2} \in (0,1)

are used as two key seeds. When

{k s}_{1} > {k s}_{2}

, it outputs classification 1; otherwise, it outputs classification 0. Thus, the probability that

{k s}_{1} > {k s}_{2}

is 78.8% if the input image belongs to a specified type. Although an attacker could correctly guess the type of input image with a probability of 78.8%, they cannot obtain the specific values of

{k s}_{1}

and

{k s}_{2}

. In other words, the proposed LVGG network can not only utilize certain features of the input image but also resist attacks targeting the neural network. However, the key seed needs to be transmitted to the receiver via a secure channel or public cryptosystem for each image, which improves the complexity of key management.

3.2. Pseudorandom Generator Based on LVGG and Chaos

To mitigate the degradation caused by the binarization of the chaotic system, we constructed a four-dimensional (4D) system by adding a new controller to the Lorenz system, which exhibits more complex dynamic behavior. The new 4D system is shown in Formula (2):

\{\begin{array}{l} \dot{x} = a (y - x) + e y z \\ \dot{y} = c x + d y - x z - w \\ \dot{z} = x y - b z \\ \dot{w} = h x^{2} + r w \end{array}

(2)

When

a = 1, b = 4, c = - 1.5, \cdot d \in {[0.95,1.26] \cup [1.35,3]}

,

e = 3

,

h = 1

and

r = - 5

, the system becomes chaotic. Its phase diagram is shown in Figure 2:

The Lyapunov exponents are

{L E}_{1} = 0.4222

,

{L E}_{2} = 0.0698

,

{L E}_{3} = - 4.0025

, and

{L E}_{4} = - 5.3752

for

d = 1.1

. The Lyapunov dimension, which represents the complexity of the attractor, can be calculated using Formula (3).

D_{L i} = j + \frac{1}{|L_{j + 1}|} \sum_{i = 1}^{j} L_{j}

(3)

where

L_{j}

is the

j

-th Lyapunov exponent, and

j

is the largest index that makes

\sum_{i = 1}^{j} L_{j} > 0

. The Lyapunov dimension of the proposed scheme is 2.1229. From the above, it can be seen that the proposed 4D chaotic system has a larger chaotic range and more complex dynamic behavior than the Lorenz system, which effectively reduces the degradation of the quantization of the chaotic sequence.

Additionally, we designed a pseudorandom generator based on the LVGG and chaos, ensuring that the secret key is correlated with the plain image. In the pseudorandom sequence generation process, we use self-perturbation to minimize degeneration. The details are shown in Figure 3.

The pseudorandom sequence generation process is as follows:

Step 1: Input the initial value (

x_{0}

,

y_{0}

,

z_{0}

,

w_{0}

) into the chaotic system, where the output of LVGG

{k s}_{1}

and

{k s}_{2}

are added to

x_{0}

and

y_{0}

, respectively. While

z_{0}

and

w_{0}

are chosen randomly. The outputs of the chaotic system are denoted by

x_{i}

,

y_{i}

,

z_{i}

, and

w_{i}

. Discard the values of the first 200 iterations in each 10,000 iterations.

Step 2: Obtain the fractional part of the values

x_{i}

,

y_{i}

,

z_{i}

, and

w_{i}

by

x_{i} = x_{i} - f l o o r (x_{i})

. Here,

f l o o r (x)

denotes the largest integer less than or equal to

x

.

Step 3: Multiply

x_{i}

,

y_{i}

,

z_{i}

, and

w_{i}

by

2^{10}

and apply modulo 256 using Formula (4).

{k e y}_{x_{i}} = m o d (f l o o r (x_{i} \times 10^{10}), 256)

(4)

The binary sequence can be obtained by cascading

{k e y}_{x_{i}}

,

{k e y}_{y_{i}}

,

{k e y}_{z_{i}}

, and

{k e y}_{w_{i}}

, denoted by

k e y = {k e y}_{x_{i}} | | {k e y}_{y_{i}} | | {k e y}_{z_{i}} | | {k e y}_{w_{i}}

.

Step 4: If the iteration is a multiple of 10,000, compute the fractional part of

w_{i}

, denoted as

ε

. Then, use it to disturb the chaotic system according to Formula (5). Otherwise, continue the iteration.

\{\begin{matrix} x_{i} = x_{i} + ε, i f z_{i} > 0 \\ y_{i} = y_{i} + ε, i f z_{i} < 0 \end{matrix}

(5)

Since the length of the key generated in each iteration is 32 bits, the length of each of

{k e y}_{x_{i}}

,

{k e y}_{y_{i}}

,

{k e y}_{z_{i}}

, and

{k e y}_{w_{i}}

is 8 bits. The key generation process will stop after

({l e n}_{k e y} / 32) + 200

itertions when

({l e n}_{k e y} / 32) < 10,000

, where

{l e n}_{k e y}

denotes the required key length. It will stop after

({l e n}_{k e y} / 32) + 200 ({l e n}_{k e y} % 320,000)

itertions when

({l e n}_{k e y} / 32) \geq 10,000

, where

a % b

is the remainder when

a

is divided by

b

, with

a \in Z^{+}

and

b \in Z^{+}

.

3.3. Design of the Dynamic S-Box

To achieve high efficiency, we design a dynamic substitution box (S-box). A sequence

s

with length 256 is selected from the secret key. This sequence is sorted in ascending order to obtain a new sequence

S

. The index of the element

S_{i}

in

s

is reshaped into a

16 \times 16

matrix, which serves as the dynamic S-box. An example is shown in Figure 4.

To transform the image pixels into cipher image pixels using the S-box, each pixel value of the plain image is divided into the left 4 bits and the right 4 bits. The decimal values derived from these bits represent the row and column numbers of the S-box, respectively.

3.4. LVGG-IE-Based Image Encryption Scheme

In this section, we present the proposed image encryption scheme, LVGG-IE, which consists of substitution via the S-box, permutation using the SC layer, convolutional addition, and modular operations. The encryption model is shown in Figure 5.

In this encryption model, the sub-keys are separated from the secret key by

k e y = [k e y 1, k e y 2, k e y 3, k e y 4, k e y 5, k e y 6, k e y 7, k e y 8, k e y 9]

. For an image of size

r o w \times c o l

, the length of these sub-keys are

r o w \times c o l

, 256,

r o w \times c o l

,

c o l

, 2,

c o l

,

r o w

,

r o w + 1

, and 2, respectively. These sub-keys are generated by the pseudorandom generator designed in Section 3.3. Consequently, the secret key for the proposed image encryption scheme consists of the key seed and the other four initial inputs for the proposed chaotic system. Thus, the secret key comprises 6 real numbers, requiring only 384 bits. The details of the image encryption are as below:

Step 1: First, permute the pixels of the plain image using

k e y 1

. Then, construct the dynamic S-box as described in Figure 4 using

k e y 2

. Divide each pixel of the image into left 4 bits and right 4 bits, where the decimal values derived from these bits represent the row and column numbers of the S-box, respectively. Next, substitute all the pixels of the image with the corresponding values from the S-box.

Step 2: Perform modular addition for each bit-shifted pixel of the resulting image and the key matrix generated by

k e y 3

. Generate the SC layer by

k e y 4

, and permutate each column of the image using the SC layer, as shown in Figure 6.

Step 3: Generate the convolution kernel

C o n v

by Formula (6):

\{\begin{matrix} C o n v = [1,1], i f k e y 5 (1) i s o d d \\ C o n v = [- 1,1], i f k e y 5 (1) i s e v e n \end{matrix}

(6)

Then, transform the

n \times n

image matrix

P

into one-dimension vectors

P_{v}

in column order. Perform the convolutional addition operation using the convolution kernel

C o n v

. The pixel value is obtained by applying modulo 256. The details of the convolutional addition are shown in Figure 7.

The different dotted boxes in Figure 7 denote the different convolutional units. The cipher pixel can be calculated by Formula (7):

P_{v}^{'} (i) = \{\begin{matrix} m o d (P_{v} (i) + (2 * m o d (k e y 5 (1), 2) - 1) * k e y 5 (2), 256), i f i = 1 \\ m o d (P_{v} (i) + (2 * m o d (k e y 5 (1), 2) - 1) * P_{v} (i - 1), 256), i f i > 1 \end{matrix}

(7)

Step 4: Transform the one-dimension vectors

P_{v}

into the

n \times n

image matrix

P_{c}

. Apply modular addition to each row of

P_{c}

using Formula (8):

\{\begin{matrix} P_{c} (i, 1 : c o l) = m o d (P_{v}^{'} (i, 1 : c o l) + k e y 6 (1 : c o l), 256), i f i = 1 \\ P_{c} (i, 1 : c o l) = m o d (P_{v}^{'} (i, 1 : c o l) + P_{v}^{'} (i - 1,1 : c o l), 256), i f i > 1 \end{matrix}

(8)

Step 5: Apply modular addition to each column of

P_{c}

using Formula (9).

\{\begin{array}{l} P_{c} (1 : r o w, i) = m o d (P_{c} (1 : r o w, i) + k e y 7 (c o l + 1 : c o l + r o w), 256), i f i = 1 \\ P_{c} (1 : r o w, i) = m o d (P_{c} (1 : r o w, i) + P_{c} (1 : r o w, i - 1), 256), i f i > 1 \end{array}

(9)

The process is shown in Figure 8, in which the different dotted boxes denote the different modular addition units:

Step 6: Generate the SC layer by

k e y 8

and permutate each row of the image by the SC layer. The operation is similar to that in Figure 6, except that the input is now each row of the image.

Step 7: Generate the convolution kernel

C o n v

by

k e y 9 (1)

and encryption the pixels of the image by convolutional addition and modulo operation, as shown in Figure 7.

The encryption process is detailed in Algorithm 1.

Algorithm 1. The image encryption algorithm
Input: $P, k e y 1, k e y 2, k e y 3, k e y 4, k e y 5$ , $k e y 6$ , $k e y 7$ , $k e y 8$ , $k e y 9$ Output: $C$
1:	$I 2 \leftarrow s o r t (k e y 1)$ //Obtain the index of the sorted sequence of $k e y 1$
2:	$I \leftarrow s o r t (k e y 2)$ //Obtain the index of the sorted sequence of $k e y 2$
3:	$S \leftarrow r e s h a p e (I, 16,16)$ //Generate the S-box by the index value
4:	$C 00 \leftarrow r e s h a p e (P, 1, r o w * c o l)$ //Reshape the plain image to one dimension vector
5:	for $i = 1,2, \dots, r o w * c o l$ do
6:	$C 01 (I 2 (i)) \leftarrow C 00 (i)$ //Shuffle the plain image by the index $I 2$
7:	$C 02 (I 2 (i)) \leftarrow S u b s t i t u t e (C 01 (I 2 (i)), S)$ //Substitute the value by S-box
8:	end
9:	$C 03 \leftarrow r e s h a p e (C 02, r o w, c o l)$
10:	$C 1 \leftarrow b i t s h i f t (C 03,3) \oplus k e y 3$
11	$f o r i = 1,2, \dots, c o l$ do //Permute the image by SC layer
12:	$t m p 1 \leftarrow C 1 (1 : k e y 4 (i), i)$ , $t m p 2 \leftarrow C 1 (k e y 4 (i) + 1 : r o w, i)$
13:	$C 2 \leftarrow [t m p 2; t m p 1]$
14:	end
15:	$C 3 \leftarrow r e s h a p e (C 2,1, r o w * c o l)$ //Transform the image matrix to one-dimension vector
16:	$C 4 (1) \leftarrow m o d (C 3 (1)$ + 2( $m o d (k e y 5 (1), 2) - 1$ )* $k e y 5 (2)$ ,256)//Encrypt image by convolutional addition and modulo
17:	for $i = 2, \dots, c o l * r o w$ do
18:	$C 4 (i) \leftarrow m o d (C 3 (i)$ + 2( $m o d (k e y 5 (1), 2) - 1$ )* $C 3 (i - 1)$ ,256)
19:	end
20:	$C 4 \leftarrow r e s h a p e (C 4, r o w, c o l)$ //Transform the one-dimension vector to image matrix
21:	$C 5 (1,1 : c o l) \leftarrow m o d (C 4 (1,1 : c o l) + k e y 6 (1 : c o l), 256)$ //modular addition on row
22:	for $i = 2, \dots, r o w$ do
23:	$C 5 (i, 1 : c o l) \leftarrow m o d (C 4 (i, 1 : c o l)$ + $C 4 (i - 1,1 : c o l)$ ,256)
24:	end
25:	$C 6 (1 : r o w, 1) \leftarrow m o d (C 5 (1 : r o w, 1) + k e y 7 (c o l + 1 : c o l + r o w), 256)$ //modular addition on column
26:	for $i = 2, \dots, c o l$ do
27:	$C 7 (1 : r o w, i) \leftarrow m o d (C 6 (1 : r o w, i)$ + $C 6 (1 : r o w, i - 1)$ ,256)
28:	end
29:	$f o r i = 1,2, \dots, r o w$ do //Permute the image by SC layer
30:	$t m p 1 \leftarrow C 7 (i, 1 : k e y 8 (i))$ , $t m p 2 \leftarrow C 7 (i, k e y 8 (i) + 1 : c o l)$
31:	$C 8 \leftarrow [t m p 2 t m p 1]$
32:	end
33:	$C 8 \leftarrow r e s h a p e (C 8,1, r o w * c o l)$ //Transform the image matrix to one-dimension vector
34:	$C 9 (1) \leftarrow m o d (C 8 (1)$ + 2( $m o d (k e y 9 (1), 2) - 1$ )* $k e y 9 (2)$ ,256)//Encrypt image by convolutional addition and modular
35:	for $i = 2, \dots, c o l * r o w$ do
36:	$C (i) \leftarrow m o d (C 9 (i)$ + 2( $m o d (k e y 9 (1), 2) - 1$ )* $C 9 (i - 1)$ ,256)
37:	end
38:	$C \leftarrow r e s h a p e (C, r o w, c o l)$ //Transform the one-dimension vector to image matrix
39:	return $C$

The decryption is the inverse of the encryption. The differences are as follows:

First, the inverse of convolutional addition and modular operations simply requires changing the addition operation to subtraction. The inverse of the SC layer follows the same steps as the decryption process shown in Figure 5. Second, the bit shift XOR operation is modified to

C_{v} (i) = b i t s h i f t (C_{v} (i), - 3) \oplus k e y

. The modular addition of rows or columns is changed to modular subtraction

C (i, 1 : c o l) = m o d (C (i, 1 : c o l) - C (i - 1,1 : c o l), 256)

. Third, the inverse of the S-box is performed by searching for the value of each pixel in the cipher image within the S-box, obtaining its row and column values, and converting them into two 4-bit sequences. These are combined and then converted to a decimal value, which is the decrypted value from the inverse S-box.

4. Simulation and Security Analysis

The security of the proposed color image encryption scheme is analyzed in terms of key randomness, key sensitivity, the histogram of the cipher image, the correlation coefficient of adjacent pixels, and information entropy. The scheme’s ability to resist differential attacks, data loss attacks, and noise attacks is also simulated. Since the image “Lena” is not recommended by many journals, we replaced it with the image “Peppers,” which shares similar feature space characteristics [45].

4.1. Randomness of the Key

The randomness of the key generated by the LVGG and chaos-based pseudorandom sequence generator in Section 3.2 is tested by NIST SP 800. The initial value for the image “Peppers” is

(0.3433926, 0.6566074, 1.33, 0.28)

. The result is shown in Table 3.

From Table 3, we can see that the key generated by the proposed pseudorandom sequence generator passes all tests, with 10 items passing at a proportion of 100%. This indicates good randomness for encryption. When the same method is applied to the Lorenz system, however, the pseudorandom sequence based on Lorenz fails the SP 800 test. A comparison of the two sequences is shown in Table 3, where most p-values for the Lorenz sequence are less than 0.05, which does not meet the SP 800 test requirements. Therefore, the Lorenz-based pseudorandom sequence is insecure for image encryption.

4.2. Security and Efficiency Analysis of the Dynamic S-Box and SC Layer

To validate the security of the dynamic S-box, we test its nonlinearity and strict avalanche criterion (SAC) in this section. We also analyze the security of the dynamic SC layer.

4.2.1. Nonlinearity Test

We generate 10,000 S-boxes using the method described in Section 3.3 and calculate their nonlinearity using Formula (10):

N_{f} = 2^{n - 1} (1 - 2^{- n} \max_{ω \in GF (2^{n})} | S_{(f)} (ω) |)

(10)

Here, the cyclic spectrum of function

f (x)

is denoted as

S_{(f)} (ω) = \sum_{ω \in G F (2^{n})} {(- 1)}^{f (x) \oplus x \cdot ω}

, where

x \cdot ω

is the dot product of

x

and

ω

. The nonlinearity values are shown in Figure 9.

In Figure 9, the number of S-boxes with a nonlinearity greater than 110 exceeds half of the total. Since the nonlinearity of the S-box in AES is 110, the proposed method for generating dynamic S-boxes is secure.

4.2.2. SAC Test

The SAC of the S-box is another significant index for evaluating its security. It is defined such that the S-box is considered secure if its output flips with a probability of 50% when any single input bit is changed. We altered each bit of the input and calculated the flipping probability of each output for 10,000 different S-boxes. The results are shown in Table 4.

In Table 4, we observe that the probability of the S-box output is approximately 0.5 for different inputs. The average deviation from 0.5 is 0.00048, meeting the theoretical SAC deviation requirements.

4.2.3. Security of the Dynamic S-Box and Dynamic SC Layer

The static S-box can be brute-forced using chosen plaintext attacks, where the adversary constructs specific plaintext and obtains the substituted ciphertext through the S-box. In contrast, the dynamic S-box offers higher security since it changes with each encryption. Furthermore, the dynamic S-box proposed in this scheme exhibits high nonlinearity, with the average nonlinearity approaching that of the AES algorithm. It also shows a small deviation from the ideal SAC value. Therefore, the proposed generation method for the S-box ensures high security.

The dynamic SC layer is designed as a shift operation for image encryption in this work. A static SC layer may lead to a statistical attack, in which an adversary could generate different shift results for the plaintext image and deduce the entire SC layer. Based on this guessed SC layer, the adversary could reconstruct the plaintext after the SC shift, rendering subsequent operations ineffective. For example, an attacker might design a plaintext image where the first pixel encrypted by the SC layer is 0, so that the output of the subsequent convolutional addition and modular operation equals the key used in that step. By using a dynamic SC layer, the shift changes with each encryption, making it impossible to guess the SC layer and thereby enhancing security.

4.2.4. Efficiency of Dynamic Generation for S-Box and SC Layer

The dynamic generation of the S-box takes minimal time compared to the entire image encryption scheme. We tested the implementation time on an 11th Gen Intel(R) Core(TM) i7-1165G7 @ 2.80 GHz platform. The results indicate that dynamically generating the S-box requires only 0.0002 s. The dynamic generation of the SC layer requires storage for only

c o l

or

r o w

bits, as it is merely a rule for bit shifting. Thus, the dynamic method achieves high efficiency.

4.3. Encryption Results

In this section, the correctness of the proposed color image encryption scheme is validated. The images “Peppers” (

256 \times 256

), Lake (

512 \times 512

), and Female (

256 \times 256

) are encrypted and decrypted as shown in Figure 10.

The results in Figure 10 show that the cipher images differ from the plaintext and resemble noise. The decrypted images are indistinguishable from the plaintext images, confirming that the proposed scheme can correctly encrypt and decrypt color images.

4.4. Key Space and Key Sensitivity

The strength of the image encryption algorithm relies heavily on the robustness of the key. An image encryption algorithm with a key space larger than

2^{128}

is capable of resisting brute-force attacks. The proposed scheme uses the plaintext to generate the key seed and obtain the key sequence through the proposed 4D chaotic system. The key seed is composed of two floating point numbers with a bit length of 64, giving it a key space of

2^{128}

. The control variables in Figure 2 are

x \in [- 9, - 1]

,

y \in [- 3,4.5]

,

z \in [- 4,2]

, and

w \in [0,12]

when

d \in {[0.95,1.26] \cup [1.35,3]}

. The key space of the chaotic map is approximately

8 \times 7.5 \times 6 \times 12 \times 2.96 \times 10^{14 \times 5} \approx 10^{74}

when the key precision is set to 14. Therefore, the total key space is about

10^{112}

. The key space of the proposed scheme is compared with others in Table 5:

In Table 5, the key space of the proposed scheme is larger than that of [20,22], and only slightly smaller than that of [23,25]. This key space is sufficient to resist brute-force attacks.

Chaotic systems are sensitive to initial inputs, but quantization may reduce this sensitivity. To evaluate the proposed pseudorandom sequence generator’s capacity, we tested the key sensitivity in the encryption and decryption of color images. We generated a key

k_{0}

using the initial value

X_{0} = (x_{0}, y_{0}, z_{0}, w_{0}) = (0.34339260000002, 0.6566074, 1.33, 0.28)

. Then, four different keys

k_{1}

,

k_{2}

,

k_{3}

and

k_{4}

are generated by changing

x_{0}, y_{0}, z_{0}

, and

w_{0}

by

10^{- 14}

, respectively. The sensitivity of these keys to tiny changes is tested. For clarity, the changes are presented in decimal form in Figure 11.

Figure 11 demonstrates that a tiny change of 0.00000000000001 in the initial value results in a complete alteration of the key sequence. This confirms that the key is highly sensitive to changes in the initial value. Additionally, we decrypted the cipher image of “Peppers” (

256 \times 256

), which had been encrypted using

k_{0}

. The result is shown in Figure 12.

The rate of change between the decrypted images and the plaintext image is shown in Table 6.

From the data, it is evident that the difference between the decrypted image using a slightly altered key and the plaintext image exceeds 99%. This confirms that the proposed scheme exhibits high key sensitivity.

4.5. Histogram Analysis

The histogram describes the distribution of pixel values in an image. A uniform pixel value distribution indicates stronger resistance to statistical attacks. The histograms of the plaintext image “Peppers” (

256 \times 256

) and its encryption version are shown in Figure 13.

The uniformity of the histogram can be estimated by the variance, which is calculated by Formula (11).

v a r (X) = \frac{1}{n^{2}} \sum_{i = 1}^{n} \sum_{j = 1}^{n} \frac{{(x_{i} - x_{j})}^{2}}{2}

(11)

Here,

x_{i}

and

x_{j}

represent the number of pixels in the histogram

X

with gray values

i

and

j

, respectively. The parameter

n

refers to the gray level. Since the variances of the histograms of image in [21] is the smallest, we compare the variances of the histograms of different images for different schemes in Table 7.

Figure 13 shows that the histogram of the cipher image exhibits a uniform distribution, contrasting with the plaintext image’s histogram. From Table 7, we observe that the variances of the histograms of the cipher images encrypted using Algorithm 1 are smaller than those in [21], indicating that the proposed scheme can resist statistical attacks.

4.6. Encryption Quality Analysis

In this section, the accuracy of the proposed color image encryption scheme is validated. The quality of encryption is analyzed by examining the closeness of the obtained image to an ideally encrypted image [46]. An ideal encrypted image has a uniform pixel distribution across all intensity levels, which can be assessed using metrics such as deviation from ideality, maximum deviation, and irregular deviation.

4.6.1. Deviation from Ideality

The histogram of an encrypted image generated by a robust encryption scheme should be uniformly distributed. The histogram of the ideal encrypted image

C_{i}

can be measured by Formula (12), where a small deviation indicates high security:

H (C_{i}) = \{\begin{array}{l} \frac{M \times N}{256}, 0 \leq C_{i} \leq 255 \\ 0, O t h e r w i s e \end{array}

(12)

where

M

is the number of rows, and

N

is the number of columns in the image. The deviation from the ideal encrypted image can be calculated using Formula (13):

D_{H} = \frac{1}{M N} \sum_{C_{i} = 0}^{255} |H_{C_{i}} - H_{c}|

(13)

where

H_{c}

is the histogram of the encypted image.

4.6.2. Maximum Deviation

The maximum deviation (MD) evaluates the difference between the histograms of the cipher and plain images, as calculated by Formula (14). A larger MD represents higher security:

M D = \frac{D_{0} + D_{N - 1}}{2} + \sum_{i = 1}^{N - 2} D_{i}

(14)

where

N

is the total number of pixel values (for an 8-bit image,

N = 2^{8} = 256

), and

D_{i}

is the difference between the

i

-th histogram of the original and encrypted images.

4.6.3. Irregular Deviation

Since maximum deviation alone may yield inaccurate results in some cases, it cannot be solely relied upon to assess encryption quality. Irregular deviation (ID) measures how close the statistical distribution deviation between the plain and cipher images is to a uniform distribution, as calculated by Formula (15):

\{\begin{matrix} I D = \sum_{i = 0}^{N - 1} {H D}_{i} \\ {H D}_{i} = |d_{i} - μ_{H}| \end{matrix}

(15)

where

d_{i}

is the difference between the histogram values of the plain and cipher images, and

μ_{H}

is the mean of the histogram values. A higher ID indicates a more uniform pixel distribution.

We calculate the deviation from ideality, maximum deviation, and irregular deviation for the proposed scheme. Additionally, we compare the encryption quality of the proposed scheme using the key seed generated by the LVGG model with that using a random key seed. The results are shown in Table 8.

In Table 8, “Peppers” represents the cipher image encrypted with a key generated from a random key seed, while “Peppers*” represents the cipher image encrypted with a key generated using the key seed from the LVGG network. The results show that the deviation from ideality of the proposed scheme decreases to 0.55, indicating high security. The MD and ID values of the proposed scheme are sufficiently large. Furthermore, the encryption quality of the cipher image using the key seed generated by the LVGG network is better than that using the random key seed. Thus, the proposed image encryption scheme demonstrates high encryption quality.

4.7. Correlation Coefficient Between Adjacent Pixels Analysis

A good image encryption scheme should yield a low correlation coefficient between adjacent pixels in any direction, making it more resistant to statistical attacks. The correlation coefficient between adjacent pixels is calculated using Formula (16).

\{\begin{array}{l} \bar{x} = \frac{1}{N} \sum_{i = 1}^{N} x_{i} \\ D (x) = \frac{1}{N} \sum_{i = 1}^{N} {(x_{i} - \bar{x})}^{2} \\ C o n v (x, y) = \frac{1}{N} \sum_{i = 1}^{N} (x_{i} - \bar{x}) (y_{i} - \bar{y}) \\ γ_{x y} = \frac{C o n v (x, y)}{\sqrt{D (x)} \sqrt{D (y)}} \end{array}

(16)

where

N

represents the number of chosen adjacent pixels in any direction of the image. In this test, 5000 adjacent pixels were selected to calculate the correlation coefficient between adjacent pixels of both the plaintext and cipher images for “Peppers” (

256 \times 256

). The result is shown in Figure 14.

The correlation coefficients are summarized in Table 9 and Table 10.

Furthermore, we compare the correlation coefficient between the adjacent pixel of the image Peppers (

256 \times 256

) with the cipher image Lena (

256 \times 256

) of other schemes. The lowest correlation coefficient of the three components is chosen to compare with the other schemes. The result is shown in Table 10.

The results show that the correlation coefficients of the encrypted images using our scheme are the lowest, with the minimum coefficient being about one-tenth of that of other schemes. The correlation coefficients of images encrypted using Lorenz-based keys are the highest, indicating that the Lorenz system is inadequate for image encryption due to its poor resistance to differential analysis.

4.8. Differential Attack

The number of pixels change rate (NPCR) and unified average changing intensity (UACI) are commonly used to evaluate the ability of an image encryption scheme to withstand differential attacks. If NPCR is close to 100% and UACI is near 33%, the encryption scheme is considered secure against such attacks. Let

{P I}_{1}

and

{P I}_{2}

be two plaintext images that differ by only one pixel. There exist that

{P I}_{1} (i, j) \in {P I}_{1}

and

{P I}_{2} (i, j) \in {P I}_{2}

.

{C I}_{1}

and

{C I}_{2}

are generated by encrypting

{P I}_{1}

and

{P I}_{2}

with the same key.

{C I}_{1} (i, j)

and

{C I}_{2} (i, j)

are any pixel of

{C I}_{1}

and

{C I}_{2}

correspondingly, we set

D (i, j) = 1

if

C_{1} (i, j) \neq C_{2} (i, j)

. Then, the NPCR and UACI can be calculated by the Formula (17):

\{\begin{matrix} N P C R = \frac{1}{N \times M} \sum_{i = 1}^{N} \sum_{j = 1}^{M} D (i, j) \times 100 % \\ U A C I = \frac{1}{N \times M} \sum_{i = 1}^{N} \sum_{j = 1}^{M} \frac{|C_{1} (i, j) - C_{2} (i, j)|}{255} \times 100 % \end{matrix}

(17)

where

N

and

M

are the numbers of rows and columns in the image matrix. Recently, the theoretical marginal values for NPCR and UACI have been defined in Formula (18).

\{\begin{matrix} N_{α}^{*} = \frac{F - Φ^{- 1} (α) \sqrt{\frac{F}{M N}}}{F + 1} \\ U_{α}^{* +} = \frac{F + 2}{3 F + 3} + Φ^{- 1} (\frac{α}{2}) \sqrt{\frac{(F + 2) (F^{2} + 2 F + 3)}{18 {(F + 1)}^{2} M N F}} \\ U_{α}^{* -} = \frac{F + 2}{3 F + 3} - Φ^{- 1} (\frac{α}{2}) \sqrt{\frac{(F + 2) (F^{2} + 2 F + 3)}{18 {(F + 1)}^{2} M N F}} \end{matrix}

(18)

where

F

represents the largest pixel value supported by the ciphertext image format, and

Φ^{- 1} (\cdot)

is the inverse cumulative density function of the stand normal distribution

N (0,1)

. The

α

is the significance level. NPCR results for different images are shown in Table 11.

The UACI of different images is tested in Table 12.

The comparison of NPCR and UACI values for “Peppers” (

256 \times 256

) and “Lena” (

256 \times 256

) is presented in Table 13.

The results show that NPCR and UACI values for the proposed scheme fall within the critical value ranges. Additionally, the UACI of the proposed scheme is closer to the critical values compared to other schemes, except for [25]. Thus, the proposed scheme efficiently resists differential attacks. In contrast, the Lorenz-based encryption scheme fails the NPCR test, and its UACI only reaches the

U_{0.001}^{* +}

critical value, indicating its vulnerability to differential attacks.

Additionally, we use the LVGG network to generate the key seed from the plaintext image, which enhances resistance against chosen plaintext attacks. Attackers could potentially construct specific images, such as an image with only one pixel set to “1” and all other pixels set to “0,” denoted as

{P I}_{1}

. By changing the position of the “1” within the pixel grid or its location in the image, attackers could create multiple plaintext images

{P I}_{i}

. After encrypting all these plaintext images

{P I}_{i}

to obtain ciphertext image

{C I}_{i}

, attackers could build a map containing

{P I}_{i}

,

{C I}_{i}

and

k e y

. They could then attempt to guess the key by performing subtractions on pairs of

{C I}_{i}

and analyzing the differences between them. For instance, when

C 8 (1) = 0

in Algoruthm 1 for

{C I}_{i}

and

{C I}_{j}

, the subtraction

({C I}_{i} - {C I}_{j})

can cancel out the convolutional addition and modulo operations from lines 34 to 37 in Algorithm 1. Additionally,

k e y 1

could be compromised by collecting more than

r o w \times c o l

pairs of tuples

({P I}_{i}, {C I}_{i})

, since each

{P I}_{i}

contains only one “1” within the

r o w * c o l

pixels. The same approach could potentially be applied to obtain

k e y 2

, and other sub-keys might also be vulnerable to brute-force attacks using this method. On the contrary, when the key is generated from a key seed produced by the LVGG network, each tuple pair

({P I}_{i}, {C I}_{i})

corresponds to a different

{k e y}_{i}

. These vulnerabilities are mitigated because the attacker cannot establish a direct mapping between the tuple pairs and the key.

4.9. Information Entropy Analysis

Information entropy reflects the degree of image confusion, which is defined by Formula (19).

H (m) = \sum_{i = 1}^{L} p (m_{i}) \log_{2} (\frac{1}{p (m_{i})})

(19)

where

m

is the image information, and

p (m_{i})

is the probability of the gray value

m_{i}

.

L

is the number of the gray values in image

m

. The theoretical value of a random image with 256 gary levels is 8. The information entropy of the plain images and the cipher images generated by the proposed scheme is shown in Table 14.

The information entropy of the cipher image “Peppers” (

256 \times 256

) generated by our scheme is compared with that of the cipher image “Lena” (

256 \times 256

) from other schemes in Table 15.

Table 14 and Table 15 show that the information entropy of all cipher images generated by our scheme is very close to 8 and is higher than in other schemes, indicating that the proposed scheme is more resistant to statistical attacks. The entropy of the cipher image generated by the Lorenz system is lower, demonstrating less efficiency in resisting statistical attacks.

4.10. Robustness Analysis

Since encrypted images may suffer from noise interference or cropping attacks during transmission, robustness against such attacks is essential for an efficient image encryption scheme. We tested the robustness of the image “Peppers” against Gaussian noise (GN), Salt & Pepper noise (SPN), and Speckle noise (SN), as shown in Figure 15.

To evaluate robustness against cropping attacks, different areas of the cipher image of “Peppers” were cropped and then decrypted, as shown in Figure 16.

Figure 13 shows that even cropped cipher images can be partly decrypted. Though some pixels are lost, the image content remains recognizable. Additionally, we used the signal-to-noise ratio (PSNR) to measure resilience to noisy images. A higher PSNR indicates better resilience, and the PSNR is defined in Formula (20).

\{\begin{array}{l} M S E = \frac{1}{N \times M} \sum_{i = 1}^{N} \sum_{j = 1}^{M} {|P^{'} (i, j) - P (i, j)|}^{2} \\ P S N R = 10 \times \log_{10} \frac{255 \times 255}{M S E} \end{array}

(20)

where

N

and

M

represent the image size,

P

is the original image, and the noisy cipher image of

P

is decrypted to

P^{'}

. The PSNR for the noisy cipher images is shown in Table 16.

The PSNR values in Table 16 are all above 17 dB for all noise attacks. Even when 25% of the cipher image information is lost, the recovered image remains recognizable, with a PSNR value of around 11 dB. Therefore, the proposed scheme is robust against noise and cropping attacks.

4.11. Visual Quality Analysis

The visual quality analysis can be evaluated using MSE and PSNR. The MSE and PSNR values of the plaintext image and the ciphertext image for the proposed scheme are shown in Table 17.

Table 17 shows that the MSE between the plaintext image and the ciphertext image for the proposed scheme is high, and the PSNR is below 10. Therefore, the proposed image encryption scheme demonstrates high security.

4.12. Performance Analysis

To validate the efficiency of the proposed scheme, we tested the implementation time and compared it with other schemes, as shown in Table 18.

Table 18 shows that the encryption speed of the proposed image encryption scheme is higher than that of AES and other chaos-based image encryption schemes, making it suitable for image encryption.

From the above, it is clear that the proposed scheme has a high efficiency and capacity to resist statistical, noise, and cropping attacks, making it more secure than other schemes.

5. Conclusions

In this work, we proposed a lightweight VGG-based image encryption scheme. By reducing the number of layers and convolution kernel size in VGG, we achieved high efficiency while maintaining a specified input size. The plain image is input into the LVGG network to generate a plaintext-correlated key seed, which serves as part of the input for the proposed novel 4D chaotic system for secret key generation. The dynamic S-box is designed to confuse the pixels of the plain image, while a single-connected layer diffuses the bit-shifted pixels. Finally, the VGG convolution layer is improved to perform convolutional addition and modular operations on the image pixels. In this process, we leverage the advantages of both deep learning and chaotic systems. The proposed scheme’s ability to resist differential, statistical, noise, and cropping attacks is simulated and analyzed. The results show that our image encryption scheme is more secure and robust than state-of-the-art methods.

Although the LVGG-based key seed generation improves resistance against chosen plaintext attacks, it requires an additional secure channel to transmit the key seed for each encryption, which increases the complexity of key management. Future work should focus on improving this aspect to avoid transmitting the key seed for each image. Additionally, the nonlinear properties of deep learning could be further explored to enhance the security and efficiency of image encryption systems.

Author Contributions

The authors confirm contribution to the paper as follows: Study conception and design: M.S.; data collection and draft manuscript preparation: J.Y.; picture drawing and analysis of results: X.L., D.L. and X.W. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Key Research and Development Program of China under Grant 2023YFB3107605 and the Key Laboratory of Trusted Distributed Computing and Services, Ministry of Education (Beijing University of Posts and Telecommunications).

Institutional Review Board Statement

Not applicable.

Data Availability Statement

Data are contained within the article.

Acknowledgments

The authors would like to express their gratitude to the editors and reviewers for their thorough review and valuable recommendations.

Conflicts of Interest

The authors declare that they have no conflicts of interest to report regarding the present study.

References

Chai, X.; Bi, J.; Gan, Z.; Liu, X.; Zhang, Y.; Chen, Y. Color image compression and encryption scheme based on compressive sensing and double random encryption strategy. Signal Process. 2020, 176, 107684. [Google Scholar] [CrossRef]
Dai, J.; Hao, X.; Yan, X.; Li, Z. Adaptive false-target recognition for the proximity sensor based on joint-feature extraction and chaotic encryption. IEEE Sens. J. 2022, 22, 10828–10840. [Google Scholar] [CrossRef]
Sayed, W.S.; Roshdy, M.; Said, L.A.; Radwan, A.G. Design and FPGA verification of custom-shaped chaotic attractors using rotation, offset boosting and amplitude control. IEEE Trans. Circuits Syst. II Express Briefs 2021, 68, 3466–3470. [Google Scholar] [CrossRef]
Preishuber, M.; Hütter, T.; Katzenbeisser, S.; Uhl, A. Depreciating motivation and empirical security analysis of chaos-based image and video encryption. IEEE Trans. Inf. Forensics Secur. 2018, 13, 2137–2150. [Google Scholar] [CrossRef]
Lin, C.M.; Pham, D.H.; Huynh, T.T. Encryption and decryption of audio signal and image secure communications using chaotic system synchronization control by TSK fuzzy brain emotional learning controllers. IEEE Trans. Cybern. 2021, 52, 13684–13698. [Google Scholar] [CrossRef] [PubMed]
Zhou, Y.; Bao, L.; Chen, C.L.P. A new 1D chaotic system for image encryption. Signal Process. 2014, 97, 172–182. [Google Scholar] [CrossRef]
Pak, C.; Huang, L. A new color image encryption using combination of the 1D chaotic map. Signal Process. 2017, 138, 129–137. [Google Scholar] [CrossRef]
Hua, Z.; Zhou, Y.; Huang, H. Cosine-transform-based chaotic system for image encryption. Inf. Sci. 2019, 480, 403–419. [Google Scholar] [CrossRef]
Liu, W.; Sun, K.; Zhu, C. A fast image encryption algorithm based on chaotic map. Opt. Lasers Eng. 2016, 84, 26–36. [Google Scholar] [CrossRef]
Cao, C.; Sun, K.; Liu, W. A novel bit-level image encryption algorithm based on 2D-LICM hyperchaotic map. Signal Process. 2018, 143, 122–133. [Google Scholar] [CrossRef]
Hoang, T.M. A novel design of multiple image encryption using perturbed chaotic map. Multimed. Tools Appl. 2022, 81, 26535–26589. [Google Scholar] [CrossRef]
Wang, X.; Su, Y.; Liu, L.; Zhang, H.; Di, S. Color image encryption algorithm based on Fisher-Yates scrambling and DNA subsequence operation. Vis. Comput. 2023, 39, 43–58. [Google Scholar] [CrossRef]
Mansouri, A.; Wang, X. Image encryption using shuffled Arnold map and multiple values manipulations. Vis. Comput. 2021, 37, 189–200. [Google Scholar] [CrossRef]
Kaur, M.; Kumar, V. Efficient image encryption method based on improved Lorenz chaotic system. Electron. Lett. 2018, 54, 562–564. [Google Scholar] [CrossRef]
Alexan, W.; ElBeltagy, M.; Aboshousha, A. Lightweight image encryption: Cellular automata and the lorenz system. In Proceedings of the International Conference on Microelectronics (ICM), Cairo, Egypt, 19–22 December 2021; pp. 34–39. [Google Scholar]
Moon, S.; Baik, J.J.; Seo, J.M. Chaos synchronization in generalized Lorenz systems and an application to image encryption. Commun. Nonlinear Sci. Numer. Simul. 2021, 96, 105708. [Google Scholar] [CrossRef]
Li, T.; Yan, W.; Chi, Z. A new image encryption algorithm based on optimized Lorenz chaotic system. Concurr. Comput. Pract. Exp. 2022, 34, e5902. [Google Scholar] [CrossRef]
Zhang, N.; Liu, J.; Tong, X.; Jiao, W.; Gan, H. A novel lorenz-sine coupling chaotic system and its application on color image encryption. Phys. Scr. 2023, 98, 095217. [Google Scholar] [CrossRef]
Wu, X.; Wang, K.; Wang, X.; Kan, H.; Kurths, J. Color image DNA encryption using NCA map-based CML and one-time keys. Signal Process. 2018, 148, 272–287. [Google Scholar] [CrossRef]
Ur Rehman, A.; Liao, X.; Ashraf, R.; Ullah, S.; Wang, H. A color image encryption technique using exclusive-OR with DNA complementary rules based on chaos theory and SHA-2. Optik 2018, 159, 348–367. [Google Scholar] [CrossRef]
Chai, X.; Fu, X.; Gan, Z.; Lu, Y.; Chen, Y. A color image cryptosystem based on dynamic DNA encryption and chaos. Signal Process. 2019, 155, 44–62. [Google Scholar] [CrossRef]
Wu, X.; Kan, H.; Kurths, J. A new color image encryption scheme based on DNA sequences and multiple improved 1D chaotic maps. Appl. Soft Comput. 2015, 37, 24–39. [Google Scholar] [CrossRef]
Ravichandran, D.; Padmaa, M.; Rajagopal, N.; Thanikaiselvan, V.; Amirtharajan, R. Chaos and DNA blended hybrid encryption algorithm for secure image transmission over dct pre-coded ofdm. Wirel. Pers. Commun. 2023, 129, 703–727. [Google Scholar] [CrossRef]
Ravichandran, D.; Banu, S.A.; Murthy, B.K.; Balasubramanian, V.; Fathima, S.; Amirtharajan, R. An efficient medical image encryption using hybrid DNA computing and chaos in transform domain. Med. Biol. Eng. Comput. 2021, 59, 589–605. [Google Scholar] [CrossRef] [PubMed]
Wu, J.; Liao, X.; Yang, B. Color image encryption based on chaotic systems and elliptic curve ElGamal scheme. Signal Process. 2017, 141, 109–124. [Google Scholar] [CrossRef]
Zhang, Y.; Xiao, D. Self-adaptive permutation and combined global diffusion for chaotic color image encryption. Int. J. Electron. Commun. 2014, 68, 361–368. [Google Scholar] [CrossRef]
Es-Sabry, M.; El Akkad, N.; Merras, M.; Saaidi, A.; Satori, K. A new color image encryption algorithm using multiple chaotic maps with the intersecting planes method. Sci. Afr. 2022, 16, e01217. [Google Scholar] [CrossRef]
Bezerra, J.I.M.; de Almeida Camargo, V.V.; Molter, A. A new efficient permutation-diffusion encryption algorithm based on a chaotic map. Chaos Solitons Fractals 2021, 151, 111235. [Google Scholar] [CrossRef]
Zhang, Y.; Chen, A.; Tang, Y.; Dang, J.; Wang, G. Plaintext-related image encryption algorithm based on perceptron-like network. Inf. Sci. 2020, 526, 180–202. [Google Scholar] [CrossRef]
Talhaoui, M.Z.; Wang, X. A new fractional one dimensional chaotic map and its application in high-speed image encryption. Inf. Sci. 2021, 550, 13–26. [Google Scholar] [CrossRef]
Ding, Y.; Wu, G.; Chen, D.; Zhang, N.; Gong, L.; Cao, M.; Qin, Z. DeepEDN: A deep-learning-based image encryption and decryption network for internet of medical things. IEEE Internet Things J. 2020, 8, 1504–1518. [Google Scholar] [CrossRef]
Zheng, Z.; Liu, H.; Yu, Z.; Zheng, H.; Wu, Y.; Yang, Y.; Shi, J. Encryptgan: Image steganography with domain transform. arXiv 2019, arXiv:1905.11582. [Google Scholar]
Bao, Z.; Xue, R. Research on the avalanche effect of image encryption based on the Cycle-GAN. Appl. Opt. 2021, 60, 5320–5334. [Google Scholar] [CrossRef]
Wang, C.; Zhang, Y. A novel image encryption algorithm with deep neural network. Signal Process. 2022, 196, 108536. [Google Scholar] [CrossRef]
Ding, Y.; Tan, F.; Qin, Z.; Cao, M.; Choo, K.K.R.; Qin, Z. DeepKeyGen: A deep learning-based stream cipher generator for medical image encryption and decryption. IEEE Trans. Neural Netw. Learn. Syst. 2021, 33, 4915–4929. [Google Scholar] [CrossRef] [PubMed]
Abdellatef, E.; Naeem, E.A.; El-Samie, F.E.A. DeepEnc: Deep learning-based CT image encryption approach. Multimed. Tools Appl. 2024, 83, 11147–11167. [Google Scholar] [CrossRef]
Zhou, Q.; Wang, X.; Jin, M.; Zhang, L.; Xu, B. Optical image encryption based on two-channel detection and deep learning. Opt. Lasers Eng. 2023, 162, 107415. [Google Scholar] [CrossRef]
Panwar, K.; Kukreja, S.; Singh, A.; Singh, K.K. Towards deep learning for efficient image encryption. Procedia Comput. Sci. 2023, 218, 644–650. [Google Scholar] [CrossRef]
Maniyath, S.R.; Thanikaiselvan, V. An efficient image encryption using deep neural network and chaotic map. Microprocess. Microsyst. 2020, 77, 103134. [Google Scholar] [CrossRef]
Wu, Z.; Pan, P.; Sun, C.; Zhao, B. Plaintext-related dynamic key chaotic image encryption algorithm. Entropy 2021, 23, 1159. [Google Scholar] [CrossRef]
Munir, N.; Khan, M.; Hazzazi, M.M.; Aljaedi, A.; Alharbi, A.R.; Hussain, I. Cryptanalysis of internet of health things encryption scheme based on chaotic maps. IEEE Access 2021, 9, 105678–105685. [Google Scholar] [CrossRef]
Simonyan, K.; Zisserman, A. Very deep convolutional networks for large-scale image recognition. arXiv 2014, arXiv:1409.1556. [Google Scholar]
Lorenz, E.N. Deterministic nonperiodic flow. J. Atmos. Sci. 1963, 20, 130–141. [Google Scholar] [CrossRef]
Mister, S.; Adams, C. Practical S-box design. In Proceedings of the Workshop on Selected Areas in Cryptography, SAC, Ottawa, ON, Canada, 15–16 August 1996; pp. 61–76. [Google Scholar]
Khanzadi, H.; Eshghi, M.; Borujeni, S.E. Image encryption using random bit sequence based on chaotic maps. Arab. J. Sci. Eng. 2014, 39, 1039–1047. [Google Scholar] [CrossRef]
Aashiq, B.S.; Amirtharajan, R. Bio-inspired cryptosystem on reciprocal domain: DNA strands mutate to secure health data. Front. Inform. Technol. Electron. Eng. 2021, 22, 940–956. [Google Scholar] [CrossRef]

Figure 1. The structure of the proposed LVGG.

Figure 2. The phase diagram of the proposed chaotic system. (a) Phase diagram of the

y - z - w

plane; (b) Phase diagram of the

x - w

plane; (c) Phase diagram of the

y - w

plane; (d) Phase diagram of the

z - w

plane.

Figure 2. The phase diagram of the proposed chaotic system. (a) Phase diagram of the

y - z - w

plane; (b) Phase diagram of the

x - w

plane; (c) Phase diagram of the

y - w

plane; (d) Phase diagram of the

z - w

plane.

Figure 3. Chaos-based pseudorandom sequence generator.

Figure 4. Generation of the dynamic S-box.

Figure 5. The encryption model.

Figure 6. The SC layer-based permutation.

Figure 7. The convolutional addition operation.

Figure 8. The modular addition on row and column of image.

Figure 9. The distribution of the nonlinearity of the S-boxes.

Figure 10. The encryption and decryption results: (a) Plaintext image of Peppers; (b) Cipher image of Peppers; (c) Decrypted Peppers; (d) Plaintext image of Female; (e) Cipher image of Female; (f) Decrypted Female; (g) Plaintext image of Lake; (h) Cipher image of Lake; (i) Decrypted Lake.

Figure 11. Bit changes in keys with tiny differences in initial value: (a) Bit change between

k_{0}

and

k_{1}

; (b) Bit change between

k_{0}

and

k_{2}

; (c) Bit change between

k_{0}

and

k_{3}

; (d) Bit change between

k_{0}

and

k_{4}

.

Figure 11. Bit changes in keys with tiny differences in initial value: (a) Bit change between

k_{0}

and

k_{1}

; (b) Bit change between

k_{0}

and

k_{2}

; (c) Bit change between

k_{0}

and

k_{3}

; (d) Bit change between

k_{0}

and

k_{4}

.

Figure 12. The decryption of the cipher image of Peppers encrypted by

k_{0}

: (a) Plaintext image of Peppers; (b) Encrypted by

k_{0}

; (c) Decrypted by

k_{1}

; (d) Decrypted by

k_{2}

; (e) Decrypted by

k_{3}

; (f) Decrypted by

k_{4}

.

Figure 12. The decryption of the cipher image of Peppers encrypted by

k_{0}

: (a) Plaintext image of Peppers; (b) Encrypted by

k_{0}

; (c) Decrypted by

k_{1}

; (d) Decrypted by

k_{2}

; (e) Decrypted by

k_{3}

; (f) Decrypted by

k_{4}

.

Figure 13. Histogram of the plaintext image Peppers and the cipher image: (a) red component of Peppers; (b) green component of Peppers; (c) blue component of Peppers; (d) red component of encrypted Peppers; (e) green component of encrypted Peppers; (f) blue component of encrypted Peppers.

Figure 14. Distribution of adjacent pixels of the blue channel of image Peppers: (a) horizontal direction of the plaintext image; (b) vertical direction of the plaintext image; (c) diagonal direction of the plaintext image; (d) horizontal direction of the cipher image; (e) vertical direction of the cipher image; (f) diagonal direction of the cipher image.

Figure 15. The noised cipher images and their decryptions: (a) Noisy cipher image by SPN with density 0.002; (b) decryption of (a); (c) Noisy cipher image by SPN with density 0.005; (d) decryption of (c); (e) Noisy cipher image by SN with variance 0.000002; (f) decryption of (e); (g) Noisy cipher image by GN with variance 0.000001; (h) decryption of (g).

Figure 16. The cropped cipher images and their decryptions: (a) cipher image with 1/8 data cropped at the lower-left corner; (b) cipher image with 1/4 data cropped at the top-left corner; (c) cipher image with 1/4 data cropped at the bottom-right corner; (d) decryption of (a); (e) decryption of (b); (f) decryption of (c).

Table 1. The structure of different VGG networks.

A (11 Weight Layers)	A-LRN (11 Weight Layers)	B (13 Weight Layers)	C (16 Weight Layers)	D (16 Weight Layers)	E (19 Weight Layers)
Input (224 × 224 RGB image)
conv3-64	conv3-64 LRN	conv3-64 conv3-64	conv3-64 conv3-64	conv3-64 conv3-64	conv3-64 conv3-64
maxpool
conv3-128	conv3-128	conv3-128 conv3-128	conv3-128 conv3-128	conv3-128 conv3-128	conv3-128 conv3-128
maxpool
conv3-256 conv3-256	conv3-256 conv3-256	conv3-256 conv3-256	conv3-256 conv3-256 conv1-256	conv3-256 conv3-256 conv3-256	conv3-256 conv3-256 conv3-256 conv3-256
maxpool
conv3-512 conv3-512	conv3-512 conv3-512	conv3-512 conv3-512	conv3-512 conv3-512 conv1-512	conv3-512 conv3-512 conv3-512	conv3-512 conv3-512 conv3-512 conv3-512
maxpool
conv3-512 conv3-512	conv3-512 conv3-512	conv3-512 conv3-512	conv3-512 conv3-512 conv1-512	conv3-512 conv3-512 conv3-512	conv3-512 conv3-512 conv3-512 conv3-512
maxpool
FC-4096
FC-4096
FC-1000
soft-max

Table 2. Comparison of the different neural networks.

Networks	Number of Parameters	Time (s)
Networks	Number of Parameters	Training	Key Seed Generation
Proposed LVGG	3,453,121	200	0.5
VGG 16 [42]	138,357,544	2890	24.1
VGG 19 [42]	143,667,240	3180	28.9

Table 3. The NIST SP 800 test.

Test Item	Proposed Scheme		The Pseudorandom Sequence Based on Lorenz System
Test Item	p-Value	Proportion	p-Value	Proportion
Frequency	0.23681	0.99	0.006196	0.99
BlockFrequency	0.289667	1	0.028817	0.99
CumulativeSums	0.514124	0.99	0.021999	0.99
Runs	0.304126	1	0.000513	0.99
LongestRun	0.011791	0.98	0.023545	1
Rank	0.637119	1	0.003996	0.99
FFT	0.719747	1	0.162606	0.98
NonOverlappingTemplate	0.999438	1	0.00004	1
OverlappingTemplate	0.401199	0.98	0.030806	1
Universal	0.798139	0.97	0.419021	0.99
ApproximateEntropy	0.494392	1	0.028817	1
RandomExcursions	0.941144	1	0.000691	1
RandomExcursionsVariant	0.848588	1	0.000411	0.9841
Serial	0.383827	1	0.000003	0.98
LinearComplexity	0.23681	1	0.0007	0.99

Table 4. SAC of the S-box in the proposed scheme.

Input	$f_{0}$	$f_{1}$	$f_{2}$	$f_{3}$	$f_{4}$	$f_{5}$	$f_{6}$	$f_{7}$
00000001	0.4531	0.4844	0.5000	0.4219	0.5312	0.5469	0.5156	0.5156
00000010	0.4531	0.5781	0.5156	0.5469	0.5312	0.5156	0.4531	0.5469
00000100	0.5312	0.5781	0.5469	0.5312	0.5156	0.5312	0.5000	0.4531
00001000	0.5156	0.5156	0.4375	0.4688	0.4844	0.5156	0.4844	0.5625
00010000	0.4062	0.5469	0.5781	0.4375	0.4844	0.5000	0.5000	0.4844
00100000	0.4375	0.4375	0.4219	0.5312	0.4844	0.6094	0.4531	0.4531
01000000	0.4062	0.5156	0.5781	0.4219	0.5000	0.5312	0.4844	0.5156
10000000	0.5312	0.5156	0.5625	0.4844	0.4531	0.5156	0.4531	0.5156

Table 5. Comparison of the key space for different schemes.

Image Encryption Scheme	Key Space
Ravichandran et al. [23]	$10^{126}$
Wu et al. [25]	$10^{117}$
Wu et al. [22]	$10^{90}$
Rehman [20]	$10^{94}$
Ours	$10^{112}$

Table 6. Change ratio of the pixel.

Figure	Pixel Change Ratio of Decryption
Figure	Red	Green	Blue
Peppers	-	-	-
Decrypted by $k_{1}$	0.9962	0.9964	0.9962
Decrypted by $k_{2}$	0.9962	0.9960	0.9964
Decrypted by $k_{3}$	0.9959	0.9960	0.9961
Decrypted by $k_{4}$	0.9959	0.9961	0.9962

Table 7. The variances of the histograms of different images.

Image	Scheme	Plaintext Image			Cipher Image
Image	Scheme	Red	Green	Blue	Red	Green	Blue
Peppers	ours	$5.45 \times 10^{4}$	$6.43 \times 10^{4}$	$10.69 \times 10^{4}$	240.18	265.42	248.01
Lena	Ref [21]	$6.24 \times 10^{4}$	$2.64 \times 10^{4}$	$8.5 \times 10^{4}$	247.78	279.62	265.71
Couple	ours	$2.1 \times 10^{5}$	$3.37 \times 10^{5}$	$2.89 \times 10^{5}$	244.54	293.94	263.28
Couple	Ref [21]	$2.1 \times 10^{5}$	$3.37 \times 10^{5}$	$2.89 \times 10^{5}$	284.35	247.37	260.76
Female	ours	$7.9 \times 10^{5}$	$8.61 \times 10^{5}$	$6.2 \times 10^{5}$	310.32	269.23	240.76
Female	Ref [21]	$7.9 \times 10^{5}$	$8.61 \times 10^{5}$	$6.2 \times 10^{5}$	280.64	280.46	230.42
Tree	ours	$8.13 \times 10^{4}$	$5.7 \times 10^{4}$	$1.29 \times 10^{5}$	244.64	232.09	248.51
Tree	Ref [21]	$8.13 \times 10^{4}$	$5.7 \times 10^{4}$	$1.29 \times 10^{5}$	282.81	254.87	225.79

Table 8. Comparison of the encryption quality.

Image	Deviation from Ideality			Maximum Deviation			Irregular Deviation
Image	Red	Green	Blue	Red	Green	Blue	Red	Green	Blue
Peppers	0.5464	0.5311	0.5242	52,260	36,568	58,576	23,134	35,346	32,500
Peppers*	0.5329	0.5288	0.5204	53,098	37,236	58,819	23,702	35,880	33,414
Couple	0.5537	0.5517	0.5530	86,285	89,183	90,493	30,850	39,334	42,006
Couple*	0.5319	0.5339	0.5290	86,622	89,355	90,698	31,089	39,966	42,152
Female	0.5388	0.5582	0.5555	52,457	51,494	73,061	33,330	30,138	26,816
Female*	0.5116	0.5403	0.5489	52,659	51,763	73,356	33,483	30,538	27,051
Tree	0.5371	0.5584	0.5320	48,239	46,532	62,784	41,678	35,658	39,050
Tree*	0.5328	0.5420	0.5103	48,944	46,932	63,598	41,974	36,013	39,352

Table 9. Correlation coefficient between the adjacent pixels.

Figure	Direction	Correlation Coefficients
		Plaintext Image			Cipher Image
		Red	Green	Blue	Red	Green	Blue
Peppers	Horizontal	0.94928	0.95959	0.94074	0.00012	0.00328	−0.00355
	Vertical	0.95416	0.96558	0.94979	0.00092	−0.00003	−0.00277
	Diagonal	0.91445	0.9298	0.90273	−0.00286	−0.00118	−0.00332
Couple	Horizontal	0.9556	0.9337	0.92265	0.00122	−0.00553	−0.00040
	Vertical	0.95614	0.95278	0.94711	0.00093	−0.00714	−0.00330
	Diagonal	0.91617	0.9088	0.88896	−0.00169	−0.00453	0.00245
Female	Horizontal	0.97937	0.96848	0.95122	−0.00500	−0.00140	−0.00599
	Vertical	0.98724	0.98146	0.97751	0.00142	0.00379	0.00407
	Diagonal	0.96795	0.95132	0.92774	0.00012	−0.00587	0.00605
Tree	Horizontal	0.95896	0.96869	0.96211	−0.00119	−0.00270	0.00446
	Vertical	0.93627	0.94623	0.9405	−0.00456	−0.00029	−0.00422
	Diagonal	0.91022	0.93092	0.92458	0.00364	−0.00218	−0.00326

Table 10. Comparison of the correlation coefficients.

Scheme	Correlation Coefficients				Ranked by Average Value
Scheme	Horizontal	Vertical	Diagonal	Average	Ranked by Average Value
Plaintext image (Lena)	0.86939	0.90917	0.85649	0.8787	-
Plaintext image (Peppers)	0.94928	0.95959	0.94074	0.9499	-
Ours (Peppers)	0.00012	0.00092	−0.00286	0.0013	1
Ref. [20]	−0.0073	0.0010	−0.0013	0.0032	4
Ref. [21]	0.0027	0.0033	−0.0035	0.0031	3
Ref. [22]	−0.0084	0.0004	−0.0015	0.0034	5
Ref. [25]	−0.0001	0.0089	0.0091	0.0060	6
Ref. [26]	0.0028	0.0018	0.0036	0.0027	2
Encrypted by Lorenz	0.0019	−0.012	−0.021	0.0116	7

Table 11. The NPCR of different images.

Image	NPCR			NPCR Critical Values
Image	Red	Green	Blue	$N_{0.05}^{*}$ = 0.995693	$N_{0.01}^{*} = 0.995527$	$N_{0.001}^{*} = 0.995341$
Peppers	0.995941	0.995895	0.996841	Pass	Pass	Pass
Couple	0.995941	0.995895	0.996841	Pass	Pass	Pass
Female	0.995941	0.996198	0.996487	Pass	Pass	Pass
Tree	0.996216	0.995102	0.996155	Pass	Pass	Pass

Table 12. The UACI of different images.

Image	UACI			UACI Critical Values
	Red	Green	Blue	$U_{0.05}^{* -}$ = 0.332824	$U_{0.01}^{* -}$ = 0.332255	$U_{0.001}^{* -}$ = 0.331594
	Red	Green	Blue	$U_{0.05}^{* +}$ = 0.336447	$U_{0.01}^{* +}$ = 0.337016	$U_{0.001}^{* +}$ = 0.337677
Peppers	0.336394	0.335153	0.334672	Pass	Pass	Pass
Couple	0.333639	0.334626	0.335963	Pass	Pass	Pass
Female	0.335716	0.335668	0.336635	Pass	Pass	Pass
Tree	0.333633	0.335124	0.333045	Pass	Pass	Pass

Table 13. Comparison of NPCR and UACI.

Scheme	NPCR			UACI
Scheme	Red	Green	Blue	Red	Green	Blue
Ours	0.9959	0.9959	0.9968	0.3364	0.3352	0.3347
Ref. [20]	0.9961	0.9961	0.9961	0.3343	0.3343	0.3342
Ref. [21]	0.9960	0.9961	0.9961	0.3356	0.3345	0.3349
Ref. [22]	0.9961	0.9961	0.9961	0.3346	0.3350	0.3348
Ref. [25]	1	1	1	0.3355	0.3342	0.3344
Ref. [26]	0.9968	0.9966	0.9966	0.3342	0.3342	0.3343
Encrypted by Lorenz	0.986938	0.994246	0.992371	0.332312	0.332861	0.332167

Table 14. Information entropy of the plaintext images and the cipher images.

Image	Plaintext Image			Cipher Image
Image	Red	Green	Blue	Red	Green	Blue
Peppers	7.3449	7.5607	7.1003	7.9971	7.9973	7.9972
Couple	6.2499	5.9642	5.9309	7.9974	7.9973	7.9974
Female	7.2549	7.2704	6.7825	7.9974	7.9967	7.9970
Tree	6.9207	7.4136	7.2104	7.9967	7.9969	7.9970

Table 15. Comparison of information entropy.

Scheme	Information Entropy			Ranked by Average Value
Scheme	Red	Green	Blue	Ranked by Average Value
Ours	7.9971	7.9973	7.9972	1
Ref. [20]	-	-	-	-
Ref. [21]	7.9973	7.9969	7.9971	2
Ref. [22]	7.9893	7.9896	7.9903	6
Ref. [25]	7.9912	7.9912	7.9912	5
Ref. [26]	7.9965	7.9963	7.9964	3
Encrypted by Lorenz	7.9923	7.9952	7.9967	4

Table 16. The PSNR of the noisy cipher images.

Item	PSNR (dB)
Item	Red	Green	Blue
SPN with density 0.002	12.8633	11.2245	11.2165
SPN with density 0.005	25.2144	24.3228	23.7920
SN with variance 0.000002	21.6322	19.7540	19.5796
GN with variance 0.000001	15.9396	14.2369	13.9492
1/8 data cropped at the lower-left corner	15.4233	14.2925	14.1868
1/4 data cropped at the top-left corner	12.5953	11.1434	11.1340
1/4 data cropped at the bottom-right corner	12.6446	11.2594	11.2602

Table 17. MSE and PSNR of the plaintext and ciphertext images for the proposed scheme.

Scheme	MSE			PSNR
Scheme	Red	Green	Blue	Red	Green	Blue
Peppers	8032	11,300	11,101	9.0826	7.6001	7.6773
Couple	14,051	15,907	16,173	6.6538	6.1150	6.0430
Female	9399.6	9117.2	6973.1	8.3997	8.5322	9.6965
Tree	8785.2	14,140	9685.3	8.6933	7.5578	8.2697

Table 18. Comparison of the implementation time.

Scheme	Encryption Speed
Scheme	Encryption Speed (Mbps)	Decryption Speed (Mbps)
AES	2.6907	2.6907
Wu et al. [22]	3.348	3.348
Chai et al. [21]	1.28	1.28
Ours	3.423	3.569

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Sun, M.; Yuan, J.; Li, X.; Liu, D.; Wei, X. LVGG-IE: A Novel Lightweight VGG-Based Image Encryption Scheme. Entropy 2024, 26, 1013. https://doi.org/10.3390/e26121013

AMA Style

Sun M, Yuan J, Li X, Liu D, Wei X. LVGG-IE: A Novel Lightweight VGG-Based Image Encryption Scheme. Entropy. 2024; 26(12):1013. https://doi.org/10.3390/e26121013

Chicago/Turabian Style

Sun, Mingliang, Jie Yuan, Xiaoyong Li, Dongxiao Liu, and Xinghai Wei. 2024. "LVGG-IE: A Novel Lightweight VGG-Based Image Encryption Scheme" Entropy 26, no. 12: 1013. https://doi.org/10.3390/e26121013

APA Style

Sun, M., Yuan, J., Li, X., Liu, D., & Wei, X. (2024). LVGG-IE: A Novel Lightweight VGG-Based Image Encryption Scheme. Entropy, 26(12), 1013. https://doi.org/10.3390/e26121013

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

LVGG-IE: A Novel Lightweight VGG-Based Image Encryption Scheme

Abstract

1. Introduction

2. Preliminaries

2.1. The VGG Network

2.2. The Lorenz Chaotic System

2.3. Property of the S-Box

3. The Proposed Image Encryption Scheme LVGG-IE

3.1. The LVGG-Based Key Seed Generator

3.2. Pseudorandom Generator Based on LVGG and Chaos

3.3. Design of the Dynamic S-Box

3.4. LVGG-IE-Based Image Encryption Scheme

4. Simulation and Security Analysis

4.1. Randomness of the Key

4.2. Security and Efficiency Analysis of the Dynamic S-Box and SC Layer

4.2.1. Nonlinearity Test

4.2.2. SAC Test

4.2.3. Security of the Dynamic S-Box and Dynamic SC Layer

4.2.4. Efficiency of Dynamic Generation for S-Box and SC Layer

4.3. Encryption Results

4.4. Key Space and Key Sensitivity

4.5. Histogram Analysis

4.6. Encryption Quality Analysis

4.6.1. Deviation from Ideality

4.6.2. Maximum Deviation

4.6.3. Irregular Deviation

4.7. Correlation Coefficient Between Adjacent Pixels Analysis

4.8. Differential Attack

4.9. Information Entropy Analysis

4.10. Robustness Analysis

4.11. Visual Quality Analysis

4.12. Performance Analysis

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI