Leveraging Data Locality in Quantum Convolutional Classifiers

Jeng, Mingyoung; Nobel, Alvir; Jha, Vinayak; Levy, David; Kneidel, Dylan; Chaudhary, Manu; Islam, Ishraq; Facer, Audrey; Singh, Manish; Baumgartner, Evan; Vanderhoof, Eade; Arshad, Abina; El-Araby, Esam

doi:10.3390/e26060461

Open AccessArticle

Leveraging Data Locality in Quantum Convolutional Classifiers

by

Mingyoung Jeng

,

Alvir Nobel

,

Vinayak Jha

,

David Levy

,

Dylan Kneidel

,

Manu Chaudhary

,

Ishraq Islam

,

Audrey Facer

,

Manish Singh

,

Evan Baumgartner

,

Eade Vanderhoof

,

Abina Arshad

and

Esam El-Araby

^*

Department of Electrical Engineering and Computer Science, University of Kansas, Lawrence, KS 66045, USA

^*

Author to whom correspondence should be addressed.

Entropy 2024, 26(6), 461; https://doi.org/10.3390/e26060461

Submission received: 5 March 2024 / Revised: 18 May 2024 / Accepted: 24 May 2024 / Published: 28 May 2024

(This article belongs to the Special Issue Quantum Computation, Communication and Cryptography)

Download

Browse Figures

Versions Notes

Abstract

:

Quantum computing (QC) has opened the door to advancements in machine learning (ML) tasks that are currently implemented in the classical domain. Convolutional neural networks (CNNs) are classical ML architectures that exploit data locality and possess a simpler structure than a fully connected multi-layer perceptrons (MLPs) without compromising the accuracy of classification. However, the concept of preserving data locality is usually overlooked in the existing quantum counterparts of CNNs, particularly for extracting multifeatures in multidimensional data. In this paper, we present an multidimensional quantum convolutional classifier (MQCC) that performs multidimensional and multifeature quantum convolution with average and Euclidean pooling, thus adapting the CNN structure to a variational quantum algorithm (VQA). The experimental work was conducted using multidimensional data to validate the correctness and demonstrate the scalability of the proposed method utilizing both noisy and noise-free quantum simulations. We evaluated the MQCC model with reference to reported work on state-of-the-art quantum simulators from IBM Quantum and Xanadu using a variety of standard ML datasets. The experimental results show the favorable characteristics of our proposed techniques compared with existing work with respect to a number of quantitative metrics, such as the number of training parameters, cross-entropy loss, classification accuracy, circuit depth, and quantum gate count.

Keywords:

convolutional neural networks; quantum computing; variational quantum algorithms; quantum machine learning

1. Introduction

The choice of an appropriate machine learning model for specific applications requires consideration of the size of the model since it is linked to the performance [1]. Considering the aforementioned factor, convolutional neural networks (CNNs)s are preferable to multi-layer perceptrons (MLPs)s because of their smaller size and reduced training time while maintaining high accuracy [2,3]. Preserving the spatiotemporal locality of data allows CNNs to reduce unnecessary data connections and therefore reduces their memory requirements [2,3]. This phenomenon reduces the number of required training parameters and thus incurs less training time [2,3].

In the context of quantum computing, great emphasis has been given to quantum-based machine learning, and, in recent years, various techniques have been devised to develop this field [4]. The contemporary quantum machine learning (QML) techniques can be considered as hybrid quantum–classical variational algorithms [5,6,7,8,9]. Generally, variational quantum algorithms (VQAs) utilizes parameterized rotation gates in fixed quantum circuit structures, usually called ansatz, and is optimized using classical techniques like gradient descent [5,6,7,8,9]. However, like MLPs, preserving data locality is challenging for QML algorithms. For instance, the multidimensionality of input datasets is ignored in contemporary QML algorithms and are flattened into one-dimensional arrays [5,6,7,8,9]. Furthermore, the absence of a generalizable technique for quantum convolution limits the capability of QML algorithms to directly adapt CNN structures.

In this work, we present an multidimensional quantum convolutional classifier (MQCC) to address the shortcomings of the existing CNN implementations in reconciling the locality of multidimensional input data. The proposed VQA technique leverages quantum computing to reduce the number of training parameters and time complexity compared with classical CNN models. Similar to the CNN structures, MQCC contains a sequence of convolution and pooling layers for multifeature extraction from multidimensional input data and a fully connected layer for classification.

The subsequent sections of this paper are organized in the following structure. Section 2 discusses fundamental background information regarding different basic and complex quantum operations. Section 3 highlights existing works that are related to the proposed techniques. The proposed methodology is introduced in Section 4 with details given to the constituent parts. The experimental results and the explanation of the used verification metrics are presented in Section 5. Further discussion about the obtained results is provided in Section 6. Finally, Section 7 concludes this work with potential future directions.

2. Background

In this section, we present background information pertaining to quantum computing and quantum machine learning. Here, we present the quantum gates and operations that are utilized in the proposed multidimensional quantum convolutional classifier (MQCC). In addition, interested readers may find fundamental details related to quantum information and computing in Appendix A.

2.1. Quantum Measurement and Reset

The quantum measurement operation of a qubit is usually and informally referred to as a measurement “gate”. The measurement gate is a nonunitary operation that can project the quantum state of a qubit |

ψ

〉 to the |0〉 or |1〉 basis states [10]. The likelihood of measuring any basis state can be obtained by taking the squared magnitude of their corresponding basis state coefficient. For an n-qubit register |

ψ

〉 with

2^{n}

possible basis states, the probability of measuring each qubit in any particular basis state |j〉, where

0 \leq j < 2^{n}

is given by

| c_{j} |^{2}

[11]. The classical output of n-qubit amplitude-encoded [12] data can be decoded as

ψ_{decoded - data}^{classical}

. This classical output vector can be reconstructed by the square root of the probability distribution

\sqrt{P (| ψ 〉)}

, as shown in (1), (2), and Figure 1. When amplitude-encoding [12] is used for encoding positive real classical data, the coefficients of the corresponding quantum pure state [10]|

ψ

〉 are also positive real, i.e.,

c_{j} \in R^{+}

, where

0 \leq j < 2^{n}

. Thus, the amplitudes of |

ψ

〉 are numerically equal in values to the coefficients of

ψ_{decoded - data}^{classical}

, i.e.,

| ψ 〉 = ψ_{decoded - data}^{classical}

. Therefore, the quantum state |

ψ

〉 can be completely determined from the measurement probability distribution such that

| ψ 〉 = \sqrt{P (| ψ 〉)}

only when the amplitudes of the quantum state are all of positive real values. Moreover, the probability distribution

P (| ψ 〉)

can be reconstructed by repeatedly measuring (sampling) the quantum state |

ψ

〉. In general, an order of

2^{n}

measurements is required to accurately reconstruct the probability distribution. In order to reduce the effects of quantum statistical noise, it is recommended to gather as many circuit samples (shots) [13] as possible.

P (| ψ 〉) = [\begin{matrix} p_{0} \\ p_{1} \\ ⋮ \\ p_{j} \\ ⋮ \\ p_{N - 2} \\ p_{N - 1} \end{matrix}] = [\begin{matrix} | c_{0} |^{2} \\ | c_{1} |^{2} \\ ⋮ \\ | c_{j} |^{2} \\ ⋮ \\ | c_{N - 2} |^{2} \\ | c_{N - 1} |^{2} \end{matrix}], where p_{j} = {| c_{j} |}^{2}, and 0 \leq j < N

(1)

ψ_{decoded - data}^{classical} = \sqrt{P (| ψ 〉)} = [\begin{matrix} | c_{0} | \\ | c_{1} | \\ ⋮ \\ | c_{j} | \\ ⋮ \\ | c_{N - 2} | \\ | c_{N - 1} | \end{matrix}]

(2)

The reset operation sets the state of qubits to |0〉. This operation consists of a midcircuit measurement gate followed by a conditional X gate [14,15] such that the bit-flip operation is applied when the measured qubit is in state |1〉. The reset gate and its equivalent circuit are both shown in Figure 2 [15].

2.2. Classical-to-Quantum (C2Q)

There are a number of quantum data encoding techniques [12,16], each of which uses different methods to initialize quantum circuits from the ground state. Among the many methods, this work leverages the classical-to-quantum (C2Q) arbitrary state synthesis [12,16] operation to perform amplitude encoding and initialize an n-qubit state

| ψ_{0} 〉

, see Figure 3. The C2Q operation employs a pyramidal structure of multiplexed

R_{y}

and

R_{z}

gates. It should be noted that the

R_{z}

gates are only required for positive real data. Thus, for positive real data, the circuit depth is

2 \cdot 2^{n} - n - 2

, while for complex data, the circuit depth is

3 \cdot 2^{n} - n - 4

[12].

2.3. Convolutional Neural Networks (CNNs)

CNNs are one of the most widely used types of deep neural networks for image classification [17]. It consists of convolutional, pooling, and fully connected layers. The convolutional layer applies multiple filters to the input to create feature maps. The pooling layer reduces the dimensionality of each feature map but retains the most important information. Some of the most important pooling techniques include max-pooling, average pooling, and sum pooling. Fully connected layers are in the last layers of the CNN network, and their inputs correspond to the flattened one-dimensional matrix generated by the last pooling layer. Activation functions, which are essential for handling complex patterns, are used throughout the network. Finally, a softmax prediction layer is used to generate probability values for each of the possible output labels. The label with the highest probability is selected as the final prediction.

Figure 3. The quantum circuit for classical-to-quantum (C2Q) arbitrary state synthesis [16].

2.4. Quantum Machine Learning with Variational Algorithms

Variational quantum algorithms are a type of quantum–classical technique that facilitates the implementations of machine learning on noisy intermediate-scale quantum (NISQ) machines [5,7]. Current quantum devices are not able to maintain coherent states for sufficient periods, preventing current algorithms from performing meaningful optimization on the machine learning model. Thus, VQAs combine classical optimization algorithms with parameterized quantum circuits, or ansatz. Here, the ansatz takes on the role of the model [5]. One specific type of VQA is the variational quantum classifier (VQC), which is used for classification problems. Existing VQCs [6,8,9] have been shown to be effective for classifying datasets with high accuracy and few training parameters in both simulation and current quantum processors.

3. Related Work

In this section, we discuss the existing related works with an emphasis on quantum machine learning. Our discussion focuses on commonly used data encoding techniques, existing implementations of the convolution and quantum convolution algorithms, and related quantum machine learning (QML) algorithms. Moreover, we also discuss existing quantum convolutional classification algorithms that leverage data locality.

3.1. Data Encoding

For encoding classical image data into the quantum domain, the commonly used methods are Flexible Representation of Quantum Images (FRQI) [18] and Novel Enhanced Quantum Representation (NEQR) [19]. In FRQI, positional and color information is encoded as amplitude encoding and angle encoding, respectively. In NEQR, positions of the pixels are encoded using amplitude encoding but color information is encoded using basis encoding, where q represents the number of bits/qubits allocated for color data. For

N = 2^{n}

data points, in terms of circuit width and depth, FRQI incurs

n + 1

and

O (4^{n})

, respectively, while NEQR incurs

n + q

and

O (q n 2^{n})

, respectively [20]. Although these techniques are employed in the existing quantum convolution techniques, their disadvantages are discussed below.

3.2. Convolution

We now discuss existing implementations of convolution and discuss their associated time complexity. These implementations consist of various classical and quantum techniques. In addition, we consider the shortcomings of existing quantum convolution methods.

3.2.1. Classical Convolution

Classical implementations of convolution are usually implemented directly, through general matrix multiplication (GEMM) or through the Fast Fourier transform (FFT). A data size N, running the direct implementation on CPUs, has complexity

O (N^{2})

[21], while the complexity of an FFT-based implementation is

O (N log N)

[21]. On GPUs, FFT-based convolution incurs a similar

O (N log N)

complexity [22], while the direct approach requires

O (N_{K} N)

FLOPS [23,24], where

N_{K}

is the filter size.

3.2.2. Quantum Convolution

The existing quantum convolution techniques [25,26,27,28,29] rely on fixed filter sizes and support only specific filters at a time, e.g., edge detection. They do not contain methods for implementing a general filter. Additionally, these techniques have a quadratic circuit depth, i.e.,

O (n^{2})

, where

n = ⌈ {log}_{2} N ⌉

is the number of qubits and N is the size of the input data. While these methods appear to show quantum advantage, these results do not include overhead incurred from data encoding. The related methods employ the FRQI and NEQR data encoding methods, leading to inferior performance compared with classical methods once the additional overhead is factored in. The authors in [30] propose an edge-detection technique based on quantum wavelet transform QWT and amplitude encoding, named quantum Hadamard edge detection (QHED), which is not generalizable for multiple convolution kernels or multidimensional data. Thus, their algorithm loses parallelism, increases circuit depth, and is difficult to generalize beyond capturing 1-D features. In [20], the authors have developed a quantum convolution algorithm that supports single feature/kernel and multidimensional data. In this work, we leverage the convolution method from [20] and generalize it to support multiple features/kernels in our proposed MQCC framework.

3.3. Quantum Machine Learning

There exist two primary techniques for quantum convolutional classification that may leverage data locality through convolution: quantum convolutional neural network (QCNN)s [31] and quanvolutional neural networks [32]. QCNNs are inspired by classical convolutional neural networks, employing quantum circuits to perform convolutions, and quanvolutional neural networks replace classical convolutional layers with quantum convolutional (or quanvolutional) layers.

3.3.1. Quantum Convolutional Neural Networks

The QCNN [31] and our proposed multidimensional quantum convolutional classifier (MQCC) are both VQAs models with structures inspired by CNNs. However, QCNNs borrow the superficial structure of CNNs without considering the underlying purpose. Specifically, the QCNN’s quantum ansatz is designed so that its “convolution” and “pooling” operations exploit the locality of qubits in the circuit (rather than the locality of data). However, unlike data locality, qubit locality does not have a practical purpose for machine learning in terms of isolating relevant input features. Moreover, by considering input data as 1-D, QCNNs do not leverage the dimensionality of datasets, which constitute a primary advantage of CNNs. MQCC, on the other hand, faithfully implements CNN operations in quantum circuits, offering performance improvements in circuit execution time (based on circuit depth) classification accuracy over contemporary implementations of QCNNs on classical computers.

3.3.2. Quanvolutional Neural Networks

Quanvolutional neural networks [32] are a hybrid quantum–classical algorithm named eponymously after the quanvolutional layer added to a conventional CNN. These quanvolutional layers serve to decimate a 2D image, which is then sequentially fed into a quantum device. In this manner, the quanvolutional layer effectively exploits data locality. Yet, the model’s dependency on classical operations, specifically the decimation of input data and the repeated serial data I/O transfer, vastly increases compute time. In contrast, the required convolution operation is incorporated into our proposed MQCC, reducing classical–quantum data transfer. Moreover, MQCC takes advantage of parallelism inherent to quantum computers, while quanvolutional neural networks do not. Together, this allows the MQCC to apply convolutional filters to window data in parallel.

4. Materials and Methods

In this section, we describe the materials and methods associated with the proposed multidimensional quantum convolutional classifier (MQCC). The proposed method mainly uses generalized quantum convolution, quantum pooling based on the quantum Haar transform (QHT) and partial measurement [11], and a quantum fully connected layer that is illustrated in this section. To the best of our knowledge, this work is the first to carry out the following:

Develop a generalizable quantum convolution algorithm for a quantum-convolution-based classifier that supports multiple features/kernels.
Design a scalable MQCC that uses multidimensional quantum convolution and pooling based on the QHT. This technique reduces training parameters and time complexity compared with other classical and quantum implementations.
Evaluate the MQCC model in a state-of-the-art QML simulator from Xanadu using a variety of datasets.

4.1. Quantum Fully Connected Layer

A fully connected classical neural network constitutes a collection of layers that each perform a linear transformation on

N_{in}

input features

x \in R^{N_{in}}

to generate

N_{out}

-feature output

y \in R^{N_{out}}

[2,3]. Each layer can be represented in terms of a multiply-and-accumulate (MAC) operation and an addition operation, as shown in (3), where

W \in R^{N_{out} \times N_{in}}

and

b \in R^{N_{out}}

represent the trainable weight and bias parameters, respectively. Here, we use bold symbols to represent classical quantities, e.g., vectors, while the Dirac notation to represent their quantum counterparts.

y = W^{T} x + b

(3)

The particular weights and biases that generate the

j^{th}

feature of the output,

y_{j}

, can be isolated by taking the

j^{th}

column of

W

,

w_{j}

, and the

j^{th}

term of

b

,

b_{j}

, as shown in (4), which can be directly implemented using quantum circuits. Section 4.1.1 discusses the quantum circuits for a single-feature output, and Section 4.1.2 generalizes the proposed technique for an arbitrary amount of features in the output.

\begin{matrix} y_{j} = (w_{j} \cdot x) + b_{j}, where 0 \leq j < N_{out} \end{matrix}

(4)

4.1.1. Single-Feature Output

For a single-feature output neural network, the weight parameters can be represented as a vector

w \in R^{N_{in}}

. Here,

w

can be expressed as a quantum state |w〉, as shown in (5), similar to the process of C2Q data encoding; see Section 2.2.

\begin{matrix} | w 〉 & = \frac{1}{| | w | |} [\begin{matrix} w \\ 0 \\ ⋮ \\ 0 \end{matrix}] \begin{matrix} ↕ N_{in} \end{matrix}↕ 2^{n_{in}}, where n_{in} = ⌈ {log}_{2} N_{in} ⌉ \end{matrix}

(5)

Similarly, for a single-feature output, Dirac notation of the MAC operation follows from (4), as shown in (6), where |

ψ

〉 corresponds to the input data.

y - b = 〈 w | ψ 〉

(6)

However, it is necessary to obtain a quantum operator to perform a parameterized unitary linear transformation from the weights vector |w〉 on the input data |

ψ

〉 using an inverse C2Q operation as shown in Figure 4 and described by (7).

\begin{matrix} U_{MAC} (| w 〉) & = [\begin{matrix} 〈 w | \\ 〈 \times | \\ ⋮ \\ 〈 \times | \end{matrix}]↕ 2^{n_{in}} \\ = U_{C 2 Q}^{†} (| w 〉) \end{matrix}

(7)

4.1.2. Multifeature Output

A multifeature output can be implemented in a naive approach using Single-Feature Output (7) for an

N_{out}

-feature output, where

N_{out} \leq N_{in}

, which can be obtained by encoding each weight vector

w_{j} : 0 \leq j < N_{out}

as a normalized row in

U_{MAC}

. However, the result might yield a nonunitary operator as the weight vectors can be arbitrary.

U_{MAC}

is unitary when each row is orthogonal to all other rows in the matrix, mathematically,

〈 W_{i} | W_{j} 〉 = δ_{i j} : \forall i, j \in [0, N_{out})

. As described in Appendix A.4, independently defined weights can be supported for each feature of the output by multiplexing

U_{MAC}

. Now, the generic fully connected operation,

U_{FC}

, can be generated as shown in (8), where

n_{out} = ⌈ {log}_{2} N_{out} ⌉

.

U_{FC} = [\begin{matrix} U_{MAC} (| w_{0} 〉) \\ ⋱ \\ U_{MAC} (| w_{2^{n_{out}} - 1} 〉) \end{matrix}]

(8)

By generating

N_{out}

replicas of the initial state,

| ψ_{0} 〉

, the operation can be parallelized, i.e.,

U_{MAC} (| w_{0} 〉) \dots U_{MAC} (| w_{2^{n_{out}} - 1} 〉)

transformations from (8).

Replication:

To replicate the initial state,

| ψ_{0} 〉

n_{out}

qubits, which extends the state vector to a total size of

2^{n_{in} + n_{out}}

, see (9) and Figure 5.

| ψ_{1} {〉 = | 0 〉}^{\otimes n_{out}} \otimes | ψ_{0} 〉 = [\begin{matrix} | ψ_{0} 〉 \\ 0 \\ ⋮ \\ 0 \end{matrix}] \begin{matrix} ↕ 2^{n_{in}} \end{matrix}↕ 2^{n_{in} + n_{out}}

(9)

By applying an

n_{out}

-qubit Hadamard (see Appendix A.3) operation to the relevant qubits the replicas can be obtained through superposition (see (10) and Figure 5), which generates the desired replicas scaled by a factor of

\frac{1}{\sqrt{2^{n_{out}}}}

to maintain normalization

〈 ψ_{2} | ψ_{2} 〉 = 1

.

\begin{matrix} | ψ_{2} 〉 = (H^{\otimes n_{out}} \otimes I^{\otimes n_{in}}) | ψ_{1} 〉 = \frac{1}{\sqrt{2^{n_{out}}}} [\begin{matrix} | ψ_{0} 〉 \\ ⋮ \\ | ψ_{0} 〉 \end{matrix}] \begin{matrix} ↕ 2^{n_{in}} \\ ⋮ \\ ↕ 2^{n_{in}} \end{matrix}↕ 2^{n_{in} + n_{out}} \end{matrix}

(10)

Applying the $U_{FC}$ Filter:

U_{FC}

can perform the MAC operation for the entire

N_{out}

-feature output in parallel with the set of replicas of

| ψ_{0} 〉

; see (11) and Figure 5.

\begin{matrix} | ψ_{3} 〉 = U_{FC} | ψ_{2} 〉 & = [\begin{matrix} U_{MAC} (| w_{0} 〉) | ψ_{0} 〉 \\ ⋮ \\ U_{MAC} (| w_{j} 〉) | ψ_{0} 〉 \\ ⋮ \\ U_{MAC} (| w_{2^{n_{out}} - 1} 〉) | ψ_{0} 〉 \end{matrix}] \begin{matrix} ↕ 2^{n_{in}} \\ ⋮ \\ ↕ 2^{n_{in}} \\ ⋮ \\ ↕ 2^{n_{in}} \end{matrix}↕ 2^{n_{in} + n_{out}} \\ = [\begin{matrix} 〈 w_{0} | ψ_{0} 〉 \\ \times \\ ⋮ \\ \times \\ ⋮ \\ 〈 w_{j} | ψ_{0} 〉 \\ \times \\ ⋮ \\ \times \\ ⋮ \\ 〈 w_{2^{n_{out}} - 1} | ψ_{0} 〉 \\ \times \\ ⋮ \\ \times \end{matrix}] \begin{matrix} ↕ 2^{n_{in}} \\ ⋮ \\ ↕ 2^{n_{in}} \\ ⋮ \\ ↕ 2^{n_{in}} \end{matrix}↕ 2^{n_{in} + n_{out}} \end{matrix}

(11)

Data Rearrangement:

The data rearrangement operation can be performed by applying perfect-shuffle gates; see Appendix A.6. It simplifies the output-feature extraction by gathering them into

N_{out}

data points at the top of the state vector; see (12) and Figure 5.

\begin{matrix} | ψ_{4} 〉 = \prod_{i = 0}^{n_{out} - 1} (RoL (n_{in} + n_{out} - i) \otimes I^{\otimes i}) | ψ_{3} 〉 & = [\begin{matrix} 〈 w_{0} | ψ_{0} 〉 \\ ⋮ \\ 〈 w_{j} | ψ_{0} 〉 \\ ⋮ \\ 〈 w_{2^{n_{out}} - 1} | ψ_{0} 〉 \\ \times \\ ⋮ \\ \times \end{matrix}] \begin{matrix} ↕ 2^{n_{out}} \end{matrix}↕ 2^{n_{in} + n_{out}} \end{matrix}

(12)

It is worth mentioning that instead of applying the auxiliary qubits at the most significant position, as shown in the decomposed and simplified fully connected circuit in Figure 6, auxiliary qubits can be applied at the least significant position to avoid perfect-shuffle permutations.

4.1.3. Circuit Depth of the Quantum Fully Connected Layer

As discussed in Section 2.2,

U_{MAC}

operation is implemented by applying the C2Q/arbitrary state synthesis operation with a depth of

(3 \cdot 2^{n_{in}} - n_{in} - 4)

fundamental single-qubit and CNOT gates. The depth is expected to increase by a factor of

2^{n_{out}}

when multiplexing

U_{MAC}

to implement an

N_{out}

-feature output [33], see (13).

\begin{matrix} Δ_{U_{FC}} (n_{in}, n_{out}) & = 2^{n_{out}} \cdot Δ_{U_{MAC} (n_{in})} \\ = 2^{n_{out}} (3 \cdot 2^{n_{in}} - n_{in} - 4) \\ = O (2^{n_{in} + n_{out}}) \end{matrix}

(13)

Figure 6. Decomposed and simplified quantum fully connected layer.

4.2. Generalized Quantum Convolution

The most significant part of the MQCC framework is the generalized quantum convolution operation with support for arbitrary, parameterized filters. Compared with the classical convolution operation, the convolution operation in the quantum domain achieves exponential improvement in time complexity due to its innate parallelism. The convolution operation consists of stride, data rearrangement, and multiply-and-accumulate (MAC) operations.

Stride:

The first step of quantum convolution is generating shifted replicas of the input data. Quantum decrementers controlled by additional qubits, called “filter” qubits are used for this purpose. The

U_{shift}^{- 1}

operator shown in Figure A8b shifts the replica by a single stride.

The stride operation,

U_{stride}

, is composed of controlled quantum decrementers, where each

U_{shift}^{- 1}

operation has a quadratic depth complexity; see (A15). Thus, the depth of the controlled quantum decrementer can be derived according to (14), where n corresponds to the number of qubits the decrementer is applied to and c reflects the number of control qubits.

\begin{matrix} Δ_{{cU}_{shift}^{\pm 1}} (n, c) & = \sum_{i = c}^{n + c - 1} Δ_{MCX} (i) \\ = \sum_{i = c}^{n + c - 1} (48 i - 196) \\ = \sum_{i = 0}^{n - 1} 48 i + 48 c - 196 n \\ = 24 n^{2} + 48 c - 220 n \\ = O (n^{2} c) for large n \end{matrix}

(14)

Multiply-and-Accumulate (MAC):

Kernels are applied to the strided replicas of the input data in parallel using the MAC operation; see Figure 4. In the MAC operations, kernels are applied to the contiguous set of data with the help of the inverse arbitrary state synthesis operation. One benefit achieved by using this MAC operation is the superposition of multiple kernels. The superposition of the kernel can be helpful for the classification of multiple features.

Data Rearrangement:

Data rearrangement is required to coalesce the output pieces of the MAC steps and create one contiguous piece of output. This step is performed using perfect shuffle permutation (PSP) operations described in Appendix A.6.

4.2.1. One-Dimensional Multifeature Quantum Convolution

The one-dimensional quantum convolution operation, with a kernel of size

N_{K}

terms, requires generating

N_{K}

replicas of the input data in a range of possible strides between

0 \leq k < N_{K}

. Therefore, a total of

N_{K} N

terms need to be encoded into a quantum circuit, including the

n_{k} = ⌈ {log}_{2} N_{K} ⌉

additional auxiliary qubits, denoted as “kernel” qubits, which are allocated the most significant qubits to maintain data contiguity.

Necessary

N_{K}

replicas of the input vector are created by using Hadamard gates; see Figure 7. Convolution kernels can be implemented using multiply-and-accumulate (MAC) operations; as such, it is possible to leverage

U_{MAC}

, as defined in Section 4.1, for implementing quantum convolution kernels. Given a kernel

K \in R^{N_{K}}

, the corresponding kernel operation

U_{K}

can be constructed from the normalized kernel

| K 〉

, as shown in (15).

U_{K} = U_{MAC} (| K 〉), where | K 〉 = \hat{K} = \frac{K}{| | K | |}

(15)

When applied to the

n_{k}

lower qubits of the state vector,

U_{K}

applies the kernel K to all data windows in parallel. However, in CNNs, convolution layers typically must support multiple convolution kernels/features. Fortunately, one major advantage of the proposed quantum convolution technique is that multiple features can be supported by multiplexing only the MAC operations—the stride and data rearrangement operations do not need to be multiplexed; see Figure 7. Accordingly, for

N_{F}

features,

n_{f} = ⌈ {log}_{2} N_{F} ⌉

must be added to the circuit and placed in superposition using Hadamard gates, similar to the process in (9). Thus, the depth complexity of

U_{stride}

can be expressed in terms of

Δ_{{cU}_{shift}^{- 1}} (n - j, c)

, as described by (16), where

c = 1

for all

0 \leq j < n_{k}

; see Figure 7. Similarly, the depth complexity of

U_{K}

can be expressed by (17). Finally, The depth of the proposed multifeature 1D quantum convolution can be obtained as (18).

\begin{matrix} Δ_{U_{stride}} (n, n_{k}) & = \sum_{j = 0}^{n_{k} - 1} Δ_{U_{shift}^{- 1}} (n - j, 1) = \sum_{j = 0}^{n_{k} - 1} [24 {(n - j)}^{2} + 48 - 220 (n - j)] \\ \leq 24 n_{k} n^{2} - 24 n_{k}^{2} n - 196 n_{k} n + 8 n_{k}^{3} - 98 n_{k}^{2} - 58 n_{k} \\ = O (n_{k} n^{2} - n_{k}^{2} n + n_{k}^{3}) for large n ≫ n_{k} \end{matrix}

(16)

\begin{matrix} Δ_{U_{K}} (n_{k}, n_{f}) & = 2^{n_{f}} \cdot Δ_{U_{mac} (n_{k})} \\ \leq 2^{n_{f}} (3 \cdot 2^{n_{k}} - n_{k} - 4) \\ = O (2^{n_{f} + n_{k}}) for large n_{f}, n_{k} \end{matrix}

(17)

\begin{matrix} Δ_{1 D conv} (n, n_{k}, n_{f}) & = Δ_{H} + Δ_{U_{stride}} (n, n_{k}) + Δ_{U_{K}} (n_{k}, n_{f}) + Δ_{SWAP} \\ = 1 + 24 n_{k} n^{2} - 24 n_{k}^{2} n - 196 n_{k} n + 8 n_{k}^{3} - 98 n_{k}^{2} - 58 n_{k} \\ + 2^{n_{f}} (3 \cdot 2^{n_{k}} - n_{k} - 4) + 3 \\ = O (n_{k} n^{2} - n_{k}^{2} n + 2^{n_{f} + n_{k}}), where n ≫ n_{f}, n_{k} \end{matrix}

(18)

Figure 7. One-dimensional quantum convolution circuit.

4.2.2. Multidimensional Multifeature Quantum Convolution

Multidimensional quantum convolution can be implemented by stacking multiple one-dimensional quantum circuits, as shown in Figure 8. A d-dimensional quantum convolution circuit can be constructed with a stacked kernel of 1-dimensional convolution circuits only when the multidimensional kernels are outer products of d instances of 1-dimensional kernels. The depth of d-D quantum convolution can be obtained as (19).

\begin{matrix} Δ_{d - D conv} (n, n_{k}, n_{f}) & = O (n_{k_{\max}} n_{\max}^{2} - n_{k_{\max}}^{2} n_{\max} + 2^{n_{k} + n_{f}}), where \\ n_{\max} & = {max}_{i = 0}^{d - 1} (n_{i}), n_{k_{\max}} = {max}_{i = 0}^{d - 1} (n_{k_{i}}), and n_{\max} ≫ n_{f}, n_{k_{\max}} \end{matrix}

(19)

4.3. Quantum Pooling

A critical part of CNNs is the pooling operation or downsampling of the feature maps. One widely used method is average pooling, where only the average of the adjacent pixels in the feature map is preserved, creating a smoothing effect [34].

4.3.1. Quantum Average Pooling using Quantum Haar Transform

The average pooling operation can be implemented using the quantum wavelet transform (QWT) [11], which has the advantage of preserving data locality using wavelet transform decomposition. It is a commonly used technique for dimension reduction in image processing [11]. In this work, we utilize the simplest and first wavelet transform, quantum Haar transform (QHT) [11] to implement quantum pooling operation. This operation is executed in two steps: the Haar wavelet operation and data rearrangement.

Haar Wavelet Operation:

For separating the high- and low-frequency components from the input data, H gates are applied in parallel. The number of H gates applied in QHT is equal to the levels of decomposition.

Data Rearrangement:

After separating the high- and low-frequency components, quantum rotate-right (RoR) operations are applied to group them accordingly. As mentioned before, the proposed framework is highly parallelizable regardless of the dimensions of the data, as the QHT operation can be applied to multiple dimensions of data in parallel.

As shown in Figure 9a, for a single-level of decomposition, H gates are applied on one qubit (the least significant qubit) per dimension, and for ℓ-level decomposition, shown in Figure 9b, l number of H gates are applied per dimension. Each level of decomposition reduces the size of the corresponding dimension by a factor of 2.

4.3.2. Quantum Euclidean Pooling using Partial Measurement

In machine learning applications, the average and maximum (max) pooling [34] are the most commonly used pooling schemes for dimension reduction. The two schemes differ in the sharpness of data features. On one hand, max pooling yields a sharper definition of input features, which makes it preferable for edge detection and certain classification applications [34]. On the other hand, average pooling offers a smoother dimension reduction that may be preferred in other workloads [34]. Thus, to accompany our implementation of quantum averaging pooling using QHT (see Section 4.3.1), it would be beneficial to have an implementation of quantum max pooling. However, such an operation would be nonunitary, creating difficulty for the implementation of quantum max pooling [35]. Therefore, instead of max pooling, we utilize an alternative pooling technique we denote as quantum Euclidean pooling.

Mathematically, average and Euclidean pooling are special cases of the p-norm [36], where for a vector of size N elements, the p-norm or

ℓ^{p}

norm of a vector

x \in C^{N}

is given by (20) for

p \in Z

[36]. The average pooling occurs for the 1-norm (

p = 1

) and Euclidean pooling occurs for the 2-norm (

p = 2

). A notable benefit of the Euclidean pooling technique is its zero-depth circuit implementation by leveraging partial measurement [35].

{| | x | |}_{p} = {(\sum_{i = 0}^{N} x_{i}^{p})}^{\frac{1}{p}}

(20)

This work leverages the multilevel, d-dimensional quantum Euclidean pooling circuit presented in [35]; see Figure 10. Here, for each dimension i,

ℓ_{i}

is the number of decomposition levels for dimension where

0 \leq i < d

[35].

4.4. Multidimensional Quantum Convolutional Classifier

The proposed multidimensional quantum convolution classifier framework, see Figure 11, resembles a CNN [2] structures. After a sequence of convolution pooling (CP) pairs, the model is finally connected to a fully connected layer, see Figure 6 and Figure 11. The total number of layers in the proposed model can be expressed in terms of CP pairs as

2 λ + 1

, where

λ

is the number of CP pairs.

Figure 10. Multilevel, d-D Euclidean pooling circuit.

Figure 11. High-level overview of the MQCC architecture.

It is worth mentioning that there is no advantage in changing the number of features among convolution/pooling layers in the MQCC because of the implementation constraints. Therefore, the total number of kernel features can be estimated globally instead of layer-by-layer.

The circuit width of the MQCC (21) can be derived from the number of convolution layers, pooling layers, and the fully connected layer. Input data are encoded using n qubits, and each convolution operation adds

n_{f} = ⌈ {log}_{2} N_{F} ⌉

qubits for

N_{F}

features and

n_{k} = ⌈ {log}_{2} N_{K} ⌉

qubits per layer for kernels. In addition, the fully connected operation contributes

n_{c} = ⌈ {log}_{2} N_{C} ⌉

qubits to encode

N_{C}

output features/classes. On the other hand, each Euclidean pooling layer frees ℓ qubits, which can then be reused by other layers.

\begin{matrix} n_{MQCC}^{Avg} & = n + n_{f} + λ n_{k} + n_{c} \\ n_{MQCC}^{Euclidean} & = n + n_{f} + n_{k} + max (0, (λ - 1) n_{k} - λ ℓ + n_{c}) \end{matrix}

(21)

The MQCC can be further parallelized in terms of circuit depth between the interlayer convolution/fully connected layers. This parallelism can be achieved by performing (multiplexed) MAC operations from the quantum convolution and fully connected layers in parallel with the stride operation from the previous layer(s) of quantum convolution. The circuit depth of MQCC can be derived as shown in (22).

\begin{matrix} Δ_{MQCC} & = {max}_{i = 0}^{d - 1} Δ U_{stride} (n_{i}, n_{k_{i}}) + λ (Δ_{H} + Δ_{SWAP}) \\ + \sum_{j = 0}^{λ - 2} max ({max}_{i = 0}^{d - 1} Δ_{U_{stride}} (n_{i} - j ℓ_{i}, n_{k_{i}}), Δ_{U_{K}} (n_{k}, n_{f})) \\ + max (Δ_{U_{FC}} (n - (λ - 1) ℓ, n_{c}), Δ_{U_{K}} (n_{k}, n_{f})) \end{matrix}

(22)

4.5. Optimized MQCC

Figure 12 presents a width-optimized implementation of MQCC, which we refer to as Quantum-Optimized Multidimensional Quantum Convolutional Classifier (MQCC Optimized). To reduce the required number of qubits, the convolution and pooling operations are swapped, which allows kernel qubits to be trimmed for each convolution layer, see Section 4.2. To achieve higher processing efficiency, trimmed qubits are reassigned to later layers of dimension reduction and run in parallel. Furthermore, only Euclidean pooling with partial measurements is used because of the inherent circuit depth efficiency. The circuit width of MQCC Optimized is shown in (23), where n is the number of qubits corresponding to the data size,

n_{f}

is the number of qubits corresponding to the features, and

n_{c}

is the number of qubits corresponding to the classes. If necessary, additional pooling operations can be applied to keep the circuit width at or below the absolute minimum number of qubits n by excluding qubits dedicated to features and classes. It should be noted that reordering convolution and pooling operations reduces the maximum number of convolution operations by 1.

n_{MQCC Optimized} = n + n_{f} + n_{c}

(23)

Accordingly, the depth of MQCC Optimized can be expressed as shown in (24).

\begin{matrix} Δ_{MQCC}^{opt} & = {max}_{i = 0}^{d - 1} Δ U_{stride} (n_{i} - ℓ_{i}, n_{k_{i}}) + λ (Δ_{H} + Δ_{SWAP}) \\ + \sum_{j = 1}^{λ - 1} max ({max}_{i = 0}^{d - 1} Δ_{U_{stride}} (n_{i} - j ℓ_{i}, n_{k_{i}}), Δ_{U_{K}} (n_{k}, n_{f})) \\ + max (Δ_{U_{FC}} (n - λ ℓ, n_{c}), Δ_{U_{K}} (n_{k}, n_{f})) \end{matrix}

(24)

To further reduce the depth of MQCC Optimized, we investigated replacing inverse-C2Q operations for MAC operations with different parameterized ansatz. More specifically, a common ansatz in QML, namely NLocal operation in Qiskit [37] or BasicEntanglerLayer in Pennylane [38], was utilized; see Figure 13. The depth of this ansatz is linear with respect to the data qubits (see (25)), which is a significant improvement over arbitrary state synthesis, which has a circuit depth of

O (2^{n})

for an n-qubit state. Although the ansatz could potentially reduce circuit depth, its structure lacks theoretical motivation or guarantees for high fidelity when modeling convolution kernels.

Δ_{α} (n, ℓ) = ℓ \cdot n + 1

(25)

Figure 12. Quantum-Optimized Multidimensional Quantum Convolutional Classifier (MQCC Optimized).

Figure 13. Alternate ansatz

(U_{ansatz})

option, distinct from

U_{MAC}

.

Figure 13. Alternate ansatz

(U_{ansatz})

option, distinct from

U_{MAC}

.

5. Experimental Work

In this section, we first detail our experimental setup, followed by the results for the proposed MQCC technique. Experiments were conducted using real-world, multidimensional image data to test both the individual and composite components of our techniques.

5.1. Experimental Setup

The MQCC methodology was validated by first evaluating its most important component, namely convolution using the metric of fidelity and then evaluating the complete technique, i.e., MQCC and MQCC optimized by conducting classifications experiments using 1D, 2D, and 3D datasets.

For the convolution experiments on 1D data, we used audio material published by the European Broadcasting Union for sound quality assessment [39]. Using a preprocessing step, the data were converted into a single channel, with the data size varying from

2^{8}

data points to

2^{20}

data points, sampled at a rate of

44.1

kHz.

For conducting 2D convolution experiments, we used 2D images that are either black and white or color Jayhawks [40], as shown in Figure 14. These images range in size from

(8 \times 8)

pixels to

(512 \times 512 \times 3)

pixels. For the 3D image experiments, we used hyperspectral images from the Kennedy Space Center (KSC) dataset [41]. The images were preprocessed and resized, with the sizes ranging from

(8 \times 8 \times 8)

pixels to

(128 \times 128 \times 128)

pixels. Simulations of quantum convolution operation were run using Qiskit SDK (v0.45.0) from IBM Quantum [13] over the given data. To demonstrate the effect of statistical noise on the fidelity (26), both noise-free and noisy (with 1,000,000 circuit samples/shots) simulation environments were evaluated.

Fidelity (X, Y) = \frac{〈 X, Y 〉}{{| | X | |}_{F} {| | Y | |}_{F}}

(26)

To evaluate the performance of the complete MQCC and the MQCC-optimized technique, they were tested against CNNs, QCNNs, and quanvolutional neural networks by their capabilities of binary classification on real-world datasets, such as MNIST [42], FashionMNIST [43], and CIFAR10 [44]. The classical components in these trials were run using PyTorch (v2.1.0) [45], while the quantum circuits used Pennylane (v0.32.0), a Xanadu QML-focused framework [46].

The experiments were performed on a cluster node at the University of Kansas [47]. The node consisted of a 48-Core Intel Xeon Gold 6342 CPU, three NVIDIA A100 80 GB GPUs (CUDA version 11.7) with PCIe 4.0 connectivity and 256 GB of 3200 MHz DDR4 RAM. To account for initial parameter variance in ML or noise in noisy simulations, experiments were repeated for 10 trials, with the median being displayed in graphs.

5.2. Configuration of ML Models

The different techniques fundamentally being ML models meant that they could share some parameters and metrics during their testing. For example, the log loss and the Adam optimizer [48] were shared by all the techniques, and the “feature-count” metric was shared between the CNN and MQCC, which have 4 features per convolution layer. The parameters that were unique to each model are discussed next.

Convolutional Neural Networks: In Figure 15, we show the classification accuracy of the CNN model on (

16 \times 16

) and (

28 \times 28

) FashionMNIST datasets, using average pooling, max pooling, and Euclidean pooling. The plots show the obtained accuracy with and without ReLU [49], which is an optional layer that can be appended to each pooling layer in a CNN. Based on the results, which show Max Pooling without ReLU to be the configuration with the best accuracy, we chose it to be the baseline configuration for CNN in our tests.

Quanvolutional Neural Networks: While quanvolutional neural networks were initially introduced without a trainable random quantum circuit in the quanvolutional layer, later work has suggested implementing parameterized and trainable quanvolutional layers. We, therefore, test both the trainable and nontrainable quanvolutional techniques, and Figure 16 demonstrates that the trainable variant outperforms the other method in the (

16 \times 16

) FashionMNIST dataset, although the differences are negligible on the (

28 \times 28

) dataset. This is used as evidence behind our decision to use the trainable variant of the quanvolutional neural network as the baseline for comparison with the other models.

Quantum Convolutional Neural Networks: We based our implementation of the QCNN on [50]; however, some modifications were made to the technique to work around limitations present in the data encoding method. When encoding data that are not of size

2^{n}

in each dimension, the original method flattens (vectorizes) the input before padding with zeros, as opposed to padding each dimension. However, this sacrifices the advantage of multidimensional encoding, where each dimension is mapped to a region of qubits. To ensure a level field between QCNN and MQCC, the (

16 \times 16

) and (

28 \times 28

) FashionMNIST datasets were tested both for the original (1D) and a corrected (2D) data encoding configuration of the QCNN, the results of which are shown in Figure 17. As expected, we see a clear improvement on the (

28 \times 28

) dataset, and based on this, we chose the corrected (2D) data encoding method as our baseline QCNN for comparison against other ML models.

5.3. Results and Analysis

We first present the results of the quantum convolution operations on data with varying dimensionalities. Then, we compare the fidelity results of the quantum convolution under a noisy simulation environment with reference to classical convolution implementation. Finally, we present the results for MQCC.

Quantum Convolution Results

The fidelity of the quantum convolution technique was tested in both a noise-free and noisy environment against a classical implementation using common

(3 \times 3)

and

(5 \times 5)

filter kernels. These kernels, as described in (27)–(33), include the Averaging

F_{avg}

, Gaussian blur

F_{blur}

, Sobel edge-detection

F_{Sx}

/

F_{Sy}

, and Laplacian of Gaussian blur (Laplacian)

F_{L}

filters. To enable a quantum implementation of these kernels, a classical preprocessing step zero-padded each kernel until the dimensions were an integer power of two. As negative values may occur in classical convolution, the magnitudes of the output values were cast into a single-byte range

[0, 255]

in a classical postprocessing step.

F_{avg}^{3 \times 3} = \frac{1}{9} [\begin{matrix} 1 & 1 & 1 \\ 1 & 1 & 1 \\ 1 & 1 & 1 \end{matrix}],

(27)

F_{avg}^{5 \times 5} = \frac{1}{25} [\begin{matrix} 1 & 1 & 1 & 1 & 1 \\ 1 & 1 & 1 & 1 & 1 \\ 1 & 1 & 1 & 1 & 1 \\ 1 & 1 & 1 & 1 & 1 \\ 1 & 1 & 1 & 1 & 1 \end{matrix}]

(28)

F_{blur}^{3 \times 3} = \frac{1}{16} [\begin{matrix} 1 & 2 & 1 \\ 2 & 4 & 2 \\ 1 & 2 & 1 \end{matrix}],

(29)

F_{blur}^{5 \times 5} = \frac{1}{273} [\begin{matrix} 1 & 4 & 7 & 4 & 1 \\ 4 & 16 & 26 & 16 & 4 \\ 7 & 26 & 41 & 26 & 7 \\ 4 & 16 & 26 & 16 & 4 \\ 1 & 4 & 7 & 4 & 1 \end{matrix}]

(30)

F_{Sx} = \frac{1}{4} [\begin{matrix} 1 & 0 & - 1 \\ 2 & 0 & - 2 \\ 1 & 0 & - 1 \end{matrix}],

(31)

F_{Sy} = \frac{1}{4} [\begin{matrix} 1 & 2 & 1 \\ 0 & 0 & 0 \\ - 1 & - 2 & - 1 \end{matrix}]

(32)

F_{L}^{3 \times 3} = \frac{1}{6} [\begin{matrix} 1 & 1 & 1 \\ 1 & - 8 & 1 \\ 1 & 1 & 1 \end{matrix}],

(33)

F_{L}^{5 \times 5} = \frac{1}{20} [\begin{matrix} 1 & 1 & 1 & 1 & 1 \\ 1 & 1 & 1 & 1 & 1 \\ 1 & 1 & - 24 & 1 & 1 \\ 1 & 1 & 1 & 1 & 1 \\ 1 & 1 & 1 & 1 & 1 \end{matrix}]

(34)

A 1D averaging kernel of sizes (

1 \times 3

) and (

1 \times 5

) was applied to audio files after preprocessing, described in Section 5.1, with data size ranging from

2^{8}

to

2^{20}

data points. Table 1 presents the reconstructed output data of this operation, with Figure 18 displaying the associated calculated fidelity.

The 2D averaging, Gaussian blur, Sobel edge-detection, and Laplacian kernels were applied to 2D black-and-white (B/W) and 3D RGB Jayhawk images, see Figure 14, ranging from (

8 \times 8

) to (

512 \times 512

) pixels and (

8 \times 8 \times 3

) pixels to (

512 \times 512 \times 3

) pixels, respectively.

The reconstruction from convolution operations in classical, noise-free, and noisy environments on (

128 \times 128

)- and (

128 \times 128 \times 3

)-pixel input images can be seen in Table 2 and Table 3, respectively.

Finally, a 3D averaging kernel of sizes (

3 \times 3 \times 3

) and (

5 \times 5 \times 5

) was applied to hyperspectral images from the KSC dataset [41]. The images were preprocessed and resized to a power of two, ranging from (

8 \times 8 \times 8

) pixels to (

128 \times 128 \times 128

) pixels in size. Table 4 shows the reconstructed output images from convolution operations in classical, noise-free, and noisy environments.

Compared with the expected, classically generated results, the noise-free quantum results tested at 100% fidelity across all trials. Therefore, in a noise-free environment, given the same inputs, the proposed convolution techniques have no degradation compared with classical convolution. The fidelity of noisy simulations using 1D audio, 2D B/W, 2D RGB, and 3D hyperspectral data are presented in Figure 18, Figure 19, Figure 20, and Figure 21, respectively. The fidelity degradation in these figures is due to the statistical noise where the constant shot count (number of circuit samples) becomes less and less sufficient to describe the increasing data size. We could improve this reduction from noise by increasing the number of shots, but our experiments were limited to 1,000,000 shots in simulation, which is the maximum number of shots allowed by the simulator.

6. Discussion

In this section, we discuss the results of our experiments with MQCC and compare them against the other models in terms of the number of required training parameters, the accuracy of the model, and the circuit depth of the implemented model. The number of qubits required by MQCC can be easily calculated using (23). The number of qubits required for data encoding and filter implementation can be obtained from the dimensions of the data and filter respectively, i.e.,

n = ⌈ {log}_{2} 128 ⌉ + ⌈ {log}_{2} 128 ⌉ + ⌈ {log}_{2} 3 ⌉ = 16

for

(128 \times 128 \times 3)

data,

n_{f} = ⌈ {log}_{2} 4 ⌉ = 2

qubits for four features. Similarly, the number of qubits required for feature classes can be calculated as, for example,

n_{c} = ⌈ {log}_{2} 2 ⌉ = 1

for 2 classes. All together, for input data encoded into n qubits, the optimized MQCC requires

n + n_{f} + n_{c} = 19

qubits.

6.1. Number of Parameters

Among the classical ML models evaluated, MQCC had the fewest trainable parameters; see Figure 22. This implies potential advantages such as reduced memory usage and faster performance when using classical gradient descent. Although the reduction in parameter decreases from (MLP to CNN) and then further from (CNN to MQCC), parameter reduction diminishes from (MLP to CNN) and further from (CNN to MQCC), and there is still a significant

83.62 %

decrease in parameter count.

6.2. Loss History and Accuracy

ML-based classifiers aim to maximize the accuracy of their classifications, measured by a loss function during training to estimate the accuracy that may be exhibited when deployed. Hence, Figure 23 and Figure 24 depict the performance of the ML models across the experimental datasets in their plotting of log-loss history and classification accuracy. The MNIST [42] dataset is not complex enough to effectively distinguish models; however, differences begin to emerge in the FashionMNIST [43] and CIFAR10 [44] datasets. MLP consistently achieves the highest accuracy across trials due to its larger parameter count, allowing for greater flexibility in adapting to nuances in input. The CNN showcases its ability to select relevant input features using convolution and data locality, demonstrating the second-highest accuracy. Among the tested models, QCNN generally performs the poorest, displaying its inability to properly leverage data locality. However, comparing the accuracy of MQCC and quanvolutional neural networks is inconclusive. Quanvolutional neural networks performed better on FashionMNIST, whereas MQCC performed better on CIFAR10.

6.3. Gate Count and Circuit Depth

Although comparing MQCC, MQCC Optimized, and QCNN with quantum metrics like gate count and circuit depth is viable, see Figure 25, it is challenging to include quanvolutional neural networks in the comparison due to their significant differences from the other models. These differences are due to the quantum component within quanvolutional neural networks constituting a small fraction of the entire algorithm, bringing it closer to a classical algorithm than a quantum algorithm. Meanwhile, comparing the techniques of MQCC and QCNN in Figure 25 highlights the rationale behind developing MQCC Optimized. Initially, MQCC performed worse than QCNN in gate count and circuit depth. However, after optimizations, MQCC matched the performance of QCNN and even outperformed it in the best-case scenarios. While the QCNN architecture appears more suitable for shallower quantum circuits than MQCC, it is because the high parallelization of each QCNN layer halves the active qubits.

Despite QCNN using half the active qubits per layer than MQCC, MQCC utilizes the extra qubits for weights and features, with each pooling layer reducing the qubit count by a constant amount,

n_{k}

. However, as QCNN structures are motivated by the classical convolution operation, they usually need more complex and deeper “convolution” and “pooling” ansatz to attain a higher accuracy.

Figure 25. Gate count and circuit depth of MQCC vs. QCNN: (a) gate count; (b) circuit depth.

6.4. Complexity Comparison with Classical Models

The proposed method can also be compared with the classical models in terms of temporal complexity. The temporal/depth complexity of MQCC Optimized can be easily derived using Figure 11 and Figure 12, as well as (24), as shown in (35), where

O_{s t r i d e}

and

O_{U_{k}}

are combined to obtain

O_{c o n v}

.

\begin{matrix} O_{M Q C C}^{o p t} (n, n_{k}, n_{f}, n_{c}) & = O_{s t r i d e} (n - ℓ, n_{k}) + O_{U_{k}} (n_{k}, n_{f}) + O_{F C} (n - λ ℓ, n_{c}) \\ = O_{c o n v} (n, n_{k}, n_{f}) + O_{F C} (n - λ ℓ, n_{c}) \end{matrix}

(35)

For a MAC-based fully connected layer, we can consider

U_{FC} = U_{MAC}

to obtain the complexity using (13) and (35), as shown in (36). Similarly, for an ansatz-based fully connected layer, we can consider

U_{FC} = U_{ansatz}

to obtain the complexity using (25) and (35), as shown in (37).

\begin{matrix} O_{M Q C C_{M A C}}^{o p t} & = O (n_{k} n^{2} - n_{k}^{2} n + 2^{n_{k} + n_{f}}) + O (2^{n + n_{c} - λ ℓ}) \\ = O (n^{2}) + O (2^{n}) = O (2^{n}) = O (N) \approx O_{F C} \end{matrix}

(36)

\begin{matrix} O_{M Q C C_{a n s a t z}}^{o p t} & = O (n_{k} n^{2} - n_{k}^{2} n + 2^{n_{k} + n_{f}}) + O (n) \\ = O (n^{2}) + O (n) = O ({({log}_{2} N)}^{2}) \approx O_{c o n v} \end{matrix}

(37)

The general expression for calculating the temporal complexity of classical CNNs is shown in (38). It can be broadly divided into three parts: the complexity of the convolutional layers, the complexity of the pooling layers, and the complexity of the fully connected layers. The complexity of the combined layers of convolution and pooling is primarily determined by the number of filters, the size of the filters, and the dimensions of the input feature maps, while the complexity of fully connected layers depends on the number of neurons in the layers. It is worth mentioning that in classical CNNs, convolutional layers dominate both the pooling and fully connected layers in the overall execution time, as expressed by (38).

\begin{matrix} O_{C N N}^{C l a s s i c a l} & = O_{c o n v} + O_{p o o l i n g} + O_{f u l l y_c o n n e c t e d_l a y e r} \approx O_{c o n v} \end{matrix}

(38)

A comparison of the depth/temporal complexity of MQCC Optimized against the classical method is shown in Table 5. It details the depth complexities of C2Q data encoding (I/O overhead), two variants of MQCC Optimized, i.e., MAC-based fully connected layer and ansatz-based fully connected layer, and three variants of classical CNN implementation. Among the classical CNN variants, i.e., Direct (CPU), FFT (CPU/GPU), and GEMM (GPU), we consider the Direct (CPU) case as the worst case and GEMM (GPU) as the best case. The MQCC-optimized algorithm with a MAC-based fully connected layer matches the best case. Considering the depth of the I/O circuit, which, in this case, is the depth of the C2Q method, the overall complexity of MQCC is still better than the worst case in classical methods. The MQCC-optimized algorithm, with an ansatz-based fully connected layer, has the least complexity among all the compared models. Although the I/O overhead represents the worst-case scenario for our proposed technique (see Table 5), the complexity of our proposed MQCC technique, including the I/O overhead, matches the best case in classical methods. It is worth emphasizing that the I/O overhead is not intrinsic to our proposed technique; rather, it is a general consideration for any data-intensive quantum application like quantum machine learning (QML) classification. Moreover, our proposed MQCC method provides two additional advantages over classical CNNs, being highly parallelizable and requiring fewer training parameters, see Figure 22, which ultimately leads to fewer resource requirements than classical CNNs.

7. Conclusions

In this paper, we presented a multidimensional quantum convolutional classifier (MQCC) that consists of quantum convolution, quantum pooling, and a quantum fully connected layer. We leveraged existing convolution techniques to support multiple features/kernels and utilized them in our proposed method. Furthermore, we proposed a novel width-optimized quantum circuit that reuses freed-up qubits from the pooling layer in the subsequent convolutional layer. The proposed MQCC additionally preserves data locality in the input data which has shown to improve data classification in convolutional classifiers. The MQCC methodology is generalizable for any arbitrary multidimensional filter operation and pertinent for multifeature extraction. The proposed method can also support data of arbitrary dimensionality since the underlying quantum pooling and convolution operations are generalizable across data dimensions. We experimentally evaluated the proposed MQCC on various real-world multidimensional images, utilizing several filters through simulations on state-of-the-art quantum simulators from IBM Quantum and Xanadu. In our experiments, MQCC achieved higher classification accuracy over contemporary QML methods while having a reduced circuit depth and gate count. In our future work, we are planning on expanding MQCC with additional convolution capabilities, such as arbitrary striding and dilation, and further optimizing it for deployment on real-world quantum processors. In addition, we will investigate using our proposed quantum techniques for real-life applications such as medical imaging and classification.

Author Contributions

Conceptualization: E.E.-A., M.J., D.K., and V.J.; methodology: E.E.-A., M.J., D.K., and V.J.; software: E.E.-A., M.J., D.K., and D.L.; validation: E.E.-A., M.J., V.J., A.N., D.K., and D.L.; formal analysis: E.E.-A., M.J., D.L., D.K., and V.J.; investigation: E.E.-A., M.J., V.J., A.N., D.K., D.L., I.I., M.C., A.F., E.V., M.S., A.A., and E.B.; resources: E.E.-A., M.J., A.N., D.L., V.J., M.C., D.K., A.F., I.I., E.B., M.S., A.A., and E.V.; data curation: E.E.-A., M.J., V.J., A.N., D.K., D.L., I.I., M.C., A.F., E.V., M.S., E.B., and A.A.; writing—original draft preparation: E.E.-A., A.N., M.J., V.J., D.K., D.L., I.I., and M.C.; writing—review and editing: E.E.-A., M.J., V.J., A.N., D.K., D.L., I.I., M.C., A.F., E.V., M.S., E.B., and A.A.; visualization: E.E.-A., M.J., V.J., A.N., D.K., D.L., I.I., M.C., A.F., E.V., M.S., E.B., and A.A.; supervision: E.E.-A.; project administration: E.E.-A.; funding acquisition: E.E.-A. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The audio samples used in this work are publicly available from the European Broadcasting Union at https://tech.ebu.ch/publications/sqamcd (accessed on 23 February 2024) as file 64.flac [39]. The hyperspectral data used in this work are publicly available from the Grupo de Inteligencia Computacional (GIC) at https://www.ehu.eus/ccwintco/index.php/Hyperspectral_Remote_Sensing_Scenes#Kennedy_Space_Center_(KSC) (accessed on 23 February 2024) under the heading Kennedy Space Center (KSC) [41].

Acknowledgments

This research used resources of the Oak Ridge Leadership Computing Facility, which is a DOE Office of Science User Facility supported under Contract DE-AC05-00OR22725.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A

In this section, we briefly discuss fundamental quantum topics including various quantum operations that are relevant to the proposed MQCC. These operations are generally applicable to quantum computing and quantum machine learning.

Appendix A.1. Quantum Bits and States

A single quantum bit, or qubit, can be represented by a normalized vector |

ψ

〉 with

N = 2^{1}

elements, called a state vector; see (A1). The state of a qubit is represented as a point on the surface of the Bloch sphere [51], as shown in Figure A1. To preserve the normalization property of a state vector, a qubit’s state can only be expressed as a point on the sphere’s surface [10].

| ψ 〉 = c_{0} | 0 〉 + c_{1} | 1 〉 = [\begin{matrix} c_{0} \\ c_{1} \end{matrix}], where c_{0}^{2} + c_{1}^{2} = 1

(A1)

Figure A1. The Bloch sphere.

For n qubits, the state vector |

ψ

〉 has

N = 2^{n}

elements. As shown in (A2), each element

c_{j} \in C

of |

ψ

〉 represents the amplitude/coefficient of the

j^{t h}

entry in |

ψ

〉, or the basis state |j〉 [10].

| ψ 〉 = \sum_{j = 0}^{N - 1} c_{j} \cdot | j 〉 = [\begin{matrix} c_{0} \\ c_{1} \\ ⋮ \\ c_{j} \\ ⋮ \\ c_{N - 2} \\ c_{N - 1} \end{matrix}], where \sum_{j = 0}^{N - 1} | c_{j} |^{2} = 1, and 0 \leq j < (N = 2^{n})

(A2)

Appendix A.2. Rotation Gates

The rotation gates,

R_{x} (θ)

,

R_{y} (θ)

, and

R_{z} (ϕ)

, are parameterized versions of the Pauli single-qubit gates, shown in (A3), (A4) and (A5), respectively [33]. In QML, rotation gate parameters are trained to produce better models [5,7].

R_{x} (θ) = [\begin{matrix} cos (\frac{θ}{2}) & - i sin (\frac{θ}{2}) \\ - i sin (\frac{θ}{2}) & cos (\frac{θ}{2}) \end{matrix}]

(A3)

R_{y} (θ) = [\begin{matrix} cos (\frac{θ}{2}) & - sin (\frac{θ}{2}) \\ sin (\frac{θ}{2}) & cos (\frac{θ}{2}) \end{matrix}]

(A4)

R_{z} (ϕ) = [\begin{matrix} e^{- \frac{i ϕ}{2}} & 0 \\ 0 & e^{\frac{i ϕ}{2}} \end{matrix}]

(A5)

Appendix A.3. Hadamard Gate

The Hadamard gate is a fundamental quantum gate that puts the qubit into a superposition of two basis states, the |0〉 and |1〉 states; see (A6) and Figure A2 [10]. Similarly, by applying Hadamard gates on n qubits independently, an n-qubit superposition can be created that comprises of

2^{n}

basis states, as shown in (A7).

H = \frac{1}{\sqrt{2}} [\begin{matrix} 1 & 1 \\ 1 & - 1 \end{matrix}]

(A6)

Figure A2. Hadamard gate diagram.

H^{\otimes n} \cdot {| 0 〉}^{\otimes n} = \frac{1}{\sqrt{2^{n}}} \sum_{j = 0}^{2^{n} - 1} | j 〉

(A7)

Appendix A.4. Controlled-NOT (CNOT) Gate

CNOT gates can form a fundamental set of basis gates in conjunction with single-qubit rotation gates [33]; see (A8) and Figure A3. CNOT gates can be used to entangle two qubits, where a measurement of one qubit can provide information about the second qubit [33].

CNOT = cX = (| 0 〉 〈 0 | \otimes I) + (| 1 〉 〈 1 | \otimes X) = [\begin{matrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 0 & 1 \\ 0 & 0 & 1 & 0 \end{matrix}]

(A8)

Figure A3. Controlled-NOT gate diagram.

All gates, single- or multiqubit, can be extended to have control qubit(s), as shown for a general U in (A9) and Figure A4 [33].

cU = (| 0 〉 〈 0 | \otimes I) + (| 1 〉 〈 1 | \otimes U) = [\begin{matrix} I & 0 \\ 0 & U \end{matrix}]

(A9)

Figure A4. Controlled-U gate diagram.

When a general multiqubit gate is extended to have more control qubits, it becomes an n-qubit multiple-controlled U (MCU) gate. In the context of CNOT gates, the operation becomes a multiple-controlled-not (MCX) gate [52]. For an n-qubit MCX gate with

n - 1

control qubits and 1 target qubit (see (A10)), the depth of an MCX gate with the addition of a single extra qubit can be found in (A11) [52].

MCX (n) = \sum_{i = 0}^{2^{n - 1} - 2} (| i 〉 〈 i | \otimes I) + (| 2^{n - 1} - 1 〉 〈 2^{n - 1} - 1 | \otimes X)

(A10)

Δ_{MCX} (n) \leq 48 n - 196 = O (n) for large n

(A11)

The most general controlled gate is the multiplexer, see (A12), which defines a quantum operation to be applied on

n_{t a r g e t}

qubits for each state permutation of some control qubit(s),

n_{c o n t r o l}

[33]. In quantum circuits, the square box notation is used to denote a multiplexer operation, as shown in Figure A5 [33].

U_{m u x} = \sum_{i = 0}^{2^{n_{c}} - 1} | i 〉 〈 i | \otimes U_{i} = [\begin{matrix} U_{0} \\ ⋱ \\ U_{2^{n_{c o n t r o l} - 1}} \end{matrix}] \begin{matrix} ↕ 2^{n_{t a r g e t}} \\ ⋮ \\ ↕ 2^{n_{t a r g e t}} \end{matrix} ↕ 2^{n_{t a r g e t} + n_{c o n t r o l}}

(A12)

Figure A5. The square box notation for a quantum multiplexer [33].

Here,

U_{i}

is a matrix of size

2^{n_{t a r g e t}} \times 2^{n_{t a r g e t}}

defining the unitary operations/gates applied on each data qubit for the corresponding values i, where,

0 < i < 2^{n_{c o n t r o l} - 1}

.

Appendix A.5. SWAP Gate

In a quantum circuit, SWAP gates are applied to interchange the position of two qubits, as shown in (A13) and Figure A6. The SWAP gate can be decomposed into three CNOT gates [10]; see Figure A6.

SWAP = [\begin{matrix} 1 & 0 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 0 & 1 \end{matrix}]

(A13)

Figure A6. Swap gate diagram and decomposition.

Appendix A.6. Quantum Perfect-Shuffle Permutation (PSP)

The quantum perfect-shuffle permutations (PSPs) are operations that perform a cyclical rotation of given input qubits and can be implemented using SWAP gates. Figure A7 shows the rotate-left (RoL) and rotate-right (RoR) PSP operations [11] with

(n - 1)

SWAP operations, or

3 (n - 1)

equivalent CNOT operations. More information about CNOT and SWAP gates can be found in Appendix A.4 and Appendix A.5, respectively.

Figure A7. Quantum Perfect-Shuffle Permutations [11]. (a) RoL:

| q_{n - 1} q_{n - 2} \dots q_{0} 〉 \to | q_{n - 2} \dots q_{0} q_{n - 1} 〉

; (b) RoR:

| q_{n - 1} \dots q_{1} q_{0} 〉 \to | q_{0} q_{n - 1} \dots q_{1} 〉

.

Figure A7. Quantum Perfect-Shuffle Permutations [11]. (a) RoL:

| q_{n - 1} q_{n - 2} \dots q_{0} 〉 \to | q_{n - 2} \dots q_{0} q_{n - 1} 〉

; (b) RoR:

| q_{n - 1} \dots q_{1} q_{0} 〉 \to | q_{0} q_{n - 1} \dots q_{1} 〉

.

Appendix A.7. Quantum Shift Operation

The quantum shift operation also performs a cyclic rotation of the basis states [20] similar to the quantum perfect-shuffle operations. Here, the basis states of the n-qubit state |

ψ

〉 are shifted by

+ k

or

- k

positions by applying a shift operation

U_{shift}^{\pm k}

; see (A14). Here, the value of k determines the type of operation such that when

k > 0

, and the corresponding shift operation is denoted as a quantum incrementer, as shown in Figure A8a [30,53], and when

k < 0

, the corresponding shift operation is denoted as a quantum decrementer [30,53], as shown in Figure A8b. The quantum shift operation plays a vital role in convolution operation by striding the filter operation over data windows [25].

\begin{matrix} U_{shift}^{\pm k} | ψ 〉 & = \sum_{i = 0}^{2^{n} - 1} c_{i} | j 〉, where j = (i \pm k) mod 2^{n} \\ = (\prod_{i = 0}^{k - 1} U_{shift}^{\pm 1}) | ψ 〉 = {(U_{shift}^{\pm 1})}^{k} | ψ 〉 \end{matrix}

(A14)

Figure A8. Quantum incrementer, decrementer, and construction of quantum shift operation: (a) quantum incrementer by 1; (b) quantum decrementer by 1; (c) quantum shift operation by

\pm k

, where

k = \sum_{0}^{j = n - 1} k_{j} 2^{j}

.

Figure A8. Quantum incrementer, decrementer, and construction of quantum shift operation: (a) quantum incrementer by 1; (b) quantum decrementer by 1; (c) quantum shift operation by

\pm k

, where

k = \sum_{0}^{j = n - 1} k_{j} 2^{j}

.

Each quantum shift operation

U_{shift}^{\pm 1}

acting on an n-qubit state can be decomposed into a pyramidal structure of n× MCX gates in a series pattern [20], as shown in Figure A8a,b. In terms of fundamental single-qubit and CNOT gates, the depth of a quantum shift operation with

\pm 1

can be demonstrated as quadratic [52]; see (A15). Generalized quantum shift operations

U_{shift}^{\pm k}

can be derived using

k \times U_{shift}^{\pm 1}

operations, from the expression in (A14), and the corresponding circuit depth can be derived as shown in (A16).

\begin{matrix} \begin{matrix} Δ_{U_{shift}^{- 1}} (n) & = \sum_{i = 0}^{n - 1} Δ_{MCX} (i) \\ = \sum_{i = 0}^{n - 1} 48 i - 196 \\ \leq 24 n^{2} - 220 n = O (n^{2}) for large n \end{matrix} \end{matrix}

(A15)

\begin{matrix} \begin{matrix} Δ_{U_{shift}^{\pm k}} (n, k) & = k \cdot Δ_{U_{shift}^{\pm 1}} (n) = O (k n^{2}) \end{matrix} \end{matrix}

(A16)

Now,

U_{shift}^{\pm k}

can be expressed as a sequence of controlled shift operations, as shown in (A17). Such operations can be denoted as

U_{shift}^{\pm k_{j} 2^{j}} (n)

, where

0 \leq j < n

and

(n)

indicates that the shift operation is applied to an n-qubit state, as shown in (A18). Instead of applying sequential

U_{shift}^{\pm 1} (n)

operations, each

U_{shift}^{\pm 2^{j}} (n)

operation can be performed using a single

U_{shift}^{\pm 1} (n - j)

operation; see (A18). Therefore, a more depth-efficient decomposition is shown in Figure A8, and the corresponding depth is provided in (A19).

k = {(k_{n - 1} k_{n - 2} \dots k_{j} \dots k_{1} k_{0})}_{2} = \sum_{j = 0}^{n - 1} k_{j} 2^{j} : k_{j} \in {0, 1}

(A17)

\begin{matrix} U_{shift}^{\pm k} (n) & = U_{shift}^{\pm \sum_{j = n - 1}^{0} k_{j} 2^{j}} (n) = \prod_{j = n - 1}^{0} U_{shift}^{\pm k_{j} 2^{j}} (n) \\ = \prod_{j = n - 1}^{0} (U_{shift}^{\pm k_{j}} (n - j) \otimes I^{\otimes j}) \end{matrix}

(A18)

\begin{matrix} Δ_{U_{shift}^{\pm k}}^{opt} (n) & \leq \sum_{j = 0}^{n - 1} Δ_{U_{shift}^{- 1}} (n - j) = 24 \sum_{j = 0}^{n - 1} j^{2} - 220 \sum_{j = 0}^{n - 1} j \\ \leq 8 n^{3} - 122 n^{2} + 114 n = O (n^{3}) for large n \end{matrix}

(A19)

References

Villalobos, P.; Sevilla, J.; Besiroglu, T.; Heim, L.; Ho, A.; Hobbhahn, M. Machine learning model sizes and the parameter gap. arXiv 2022, arXiv:2207.02852. [Google Scholar]
LeCun, Y.; Kavukcuoglu, K.; Farabet, C. Convolutional networks and applications in vision. In Proceedings of the 2010 IEEE International Symposium on Circuits and Systems, Paris, France, 30 May–2 June 2010; IEEE: New York, NY, USA, 2010; pp. 253–256. [Google Scholar]
Lecun, Y.; Bengio, Y.; Hinton, G. Deep learning. Nature 2015, 521, 436–444. [Google Scholar] [CrossRef] [PubMed]
Chakraborty, S.; Dey, L. Computing for Data Analysis: Theory and Practices; Springer: Singapore, 2023; pp. 161–178. [Google Scholar]
Peruzzo, A.; McClean, J.; Shadbolt, P.; Yung, M.H.; Zhou, X.Q.; Love, P.J.; Aspuru-Guzik, A.; O’Brien, J.L. A variational eigenvalue solver on a photonic quantum processor. Nat. Commun. 2014, 5, 4213. [Google Scholar] [CrossRef]
Havlíček, V.; Córcoles, D.; Temme, K.; Harrow, W.; Kandala, A.; Chow, J.; Gambetta, M. Supervised learning with quantum-enhanced feature spaces. Nature 2019, 567, 209–212. [Google Scholar] [CrossRef]
Farhi, E.; Goldstone, J.; Gutmann, S. A quantum approximate optimization algorithm. arXiv 2014, arXiv:1411.4028. [Google Scholar]
Farhi, E.; Neven, H. Classification with quantum neural networks on near term processors. arXiv 2018, arXiv:1802.06002. [Google Scholar]
Schuld, M.; Bocharov, A.; Svore, M.; Wiebe, N. Circuit-centric quantum classifiers. Phys. Rev. A 2020, 101, 032308. [Google Scholar] [CrossRef]
Williams, C.P. Explorations in Quantum Computing, 2nd ed.; Springer: London, UK, 2011. [Google Scholar]
Jeng, M.; Islam, S.I.U.; Levy, D.; Riachi, A.; Chaudhary, M.; Nobel, M.A.I.; Kneidel, D.; Jha, V.; Bauer, J.; Maurya, A.; et al. Improving quantum-to-classical data decoding using optimized quantum wavelet transform. J. Supercomput. 2023, 79, 20532–20561. [Google Scholar] [CrossRef]
El-Araby, E.; Mahmud, N.; Jeng, M.J.; MacGillivray, A.; Chaudhary, M.; Nobel, M.A.I.; Islam, S.I.U.; Levy, D.; Kneidel, D.; Watson, M.R.; et al. Towards Complete and Scalable Emulation of Quantum Algorithms on High-Performance Reconfigurable Computers. IEEE Trans. Comput. 2023, 72, 2350–2364. [Google Scholar] [CrossRef]
IBM Quantum. Qiskit: An Open-Source Framework for Quantum Computing. Zenodo. Available online: https://doi.org/10.5281/zenodo.2573505 (accessed on 28 February 2024).
Nation, P. How to Measure and Reset a Qubit in the Middle of a Circuit Execution. 2021. Available online: https://www.ibm.com/quantum/blog/quantum-mid-circuit-measurement (accessed on 26 February 2024).
Mi, A.; Deng, S.; Szefer, J. Securing Reset Operations in NISQ Quantum Computers. In Proceedings of the 2022 ACM SIGSAC Conference on Computer and Communications Security, Los Angeles, CA, USA, 7–11 November 2022; pp. 2279–2293. [Google Scholar]
Mahmud, N.; Jeng, M.; Nobel, M.A.I.; Chaudhary, M.; Ul Islam, S.M.I.; Levy, D.; El-Araby, E. Efficient Data Encoding and Decoding for Quantum Computing. In Proceedings of the 2022 IEEE International Conference on Quantum Computing and Engineering (QCE), Broomfield, CO, USA, 18–23 September 2022; pp. 765–768. [Google Scholar] [CrossRef]
Albawi, S.; Mohammed, T.A.; Al-Zawi, S. Understanding of a convolutional neural network. In Proceedings of the 2017 International Conference on Engineering and Technology (ICET), Antalya, Turkey, 21–23 August 2017; IEEE: Piscataway, NJ, USA, 2017; pp. 1–6. [Google Scholar]
Le, Q.; Dong, F.; Hirota, K. A flexible representation of quantum images for polynomial preparation, image compression, and processing operations. Quantum Inf. Process. 2011, 10, 63–84. [Google Scholar] [CrossRef]
Zhang, Y.; Lu, K.; Gao, Y.; Wang, M. NEQR: A novel enhanced quantum representation of digital images. Quantum Inf. Process. 2013, 12, 2833–2860. [Google Scholar] [CrossRef]
Jeng, M.; Nobel, A.; Jha, V.; Levy, D.; Kneidel, D.; Chaudhary, M.; Islam, I.; Rahman, M.; El-Araby, E. Generalized Quantum Convolution for Multidimensional Data. Entropy 2023, 25, 11. [Google Scholar] [CrossRef]
Burrus, C.S.S.; Parks, T.W. DFT/FFT and Convolution Algorithms: Theory and Implementation; John Wiley & Sons, Inc.: Hoboken, NJ, USA, 1991. [Google Scholar]
Podlozhnyuk, V. FFT-Based 2D Convolution. NVIDIA. 2007. Available online: https://developer.download.nvidia.com/compute/cuda/1.1-Beta/x86_64_website/projects/convolutionFFT2D/doc/convolutionFFT2D.pdf (accessed on 18 February 2024).
NVIDIA. CUTLASS Convolution. Github. 2023. Available online: https://github.com/NVIDIA/cutlass/blob/main/media/docs/implicit_gemm_convolution.md (accessed on 18 February 2024).
NVIDIA. Convolution Algorithms. Github. 2023. Available online: https://docs.nvidia.com/deeplearning/performance/dl-performance-convolutional/index.html#conv-algo (accessed on 18 February 2024).
Fan, P.; Zhou, R.; Hu, W.; Jing, N. Quantum image edge extraction based on classical Sobel operator for NEQR. Quantum Inf. Process. 2018, 18, 24. [Google Scholar] [CrossRef]
Ma, Y.; Ma, H.; Chu, P. Demonstration of Quantum Image Edge Extration Enhancement through Improved Sobel Operator. IEEE Access 2020, 8, 210277–210285. [Google Scholar] [CrossRef]
Zhang, Y.; Lu, K.; Gao, Y. QSobel: A novel quantum image edge extraction algorithm. Sci. China Inf. Sci. 2015, 58, 1–13. [Google Scholar] [CrossRef]
Zhou, R.; Yu, H.; Cheng, Y.; Li, F. Quantum image edge extraction based on improved Prewitt operator. Quantum Inf. Process. 2019, 18, 261. [Google Scholar] [CrossRef]
Li, P. Quantum implementation of the classical Canny edge detector. Multimed. Tools Appl. 2022, 81, 11665–116941. [Google Scholar] [CrossRef]
Yao, X.; Wang, H.; Liao, Z.; Chen, M.; Pan, J.; Li, J.; Zhang, K.; Lin, X.; Wang, Z.; Luo, Z.; et al. Quantum Image Processing and Its Application to Edge Detection: Theory and Experiment. Phys. Rev. X 2017, 7, 031041. [Google Scholar] [CrossRef]
Cong, I.; Choi, S.; Lukin, M.D. Quantum convolutional neural networks. Nat. Phys. 2019, 15, 1273–1278. [Google Scholar] [CrossRef]
Henderson, M.; Shakya, S.; Pradhan, S.; Cook, T. Quanvolutional neural networks: Powering image recognition with quantum circuits. Quantum Mach. Intell. 2020, 2, 2. [Google Scholar] [CrossRef]
Shende, V.; Bullock, S.; Markov, I. Synthesis of quantum-logic circuits. IEEE Trans. Comput. Aided Des. Integr. Circuits Syst. 2006, 25, 1000–1010. [Google Scholar] [CrossRef]
Bieder, F.; Sandkühler, R.; Cattin, P. Comparison of methods generalizing max-and average-pooling. arXiv 2021, arXiv:2103.01746. [Google Scholar] [CrossRef]
Jeng, M.; Nobel, A.; Jha, V.; Levy, D.; Kneidel, D.; Chaudhary, M.; Islam, I.; Baumgartner, E.; Vanderhoof, E.; Facer, A.; et al. Optimizing Multidimensional Pooling for Variational Quantum Algorithms. Algorithms 2024, 17, 82. [Google Scholar] [CrossRef]
PyTorch. torch.nn.LPPool1d [Computer Software]. 2023. Available online: https://pytorch.org/docs/stable/generated/torch.nn.LPPool1d.html (accessed on 18 February 2024).
IBM Quantum. qiskit.circuit.library.NLocal [Computer Software]. 2023. Available online: https://qiskit.org/documentation/stubs/qiskit.circuit.library.NLocal.html (accessed on 18 February 2024).
Xanadu. qml.BasicEntanglerLayers [Computer Software]. 2023. Available online: https://docs.pennylane.ai/en/stable/code/api/pennylane.BasicEntanglerLayers.html (accessed on 18 February 2024).
Geneva, Switzerland. Sound Quality Assessment Material: Recordings for Subjective Tests. 1988. Available online: https://tech.ebu.ch/publications/sqamcd (accessed on 18 February 2024).
Brand Center, University of Kansas. Jayhawk Images. Available online: https://brand.ku.edu/ (accessed on 18 February 2024).
Graña, M.; Veganzons, M.A.; Ayerdi, B. Hyperspectral Remote Sensing Scenes. Available online: https://www.ehu.eus/115ccwintco/index.php/Hyperspectral_Remote_Sensing_Scenes#Kennedy_Space_Center_(KSC) (accessed on 18 February 2024).
Deng, L. The MNIST database of handwritten digit images for machine learning research. IEEE Signal Process. Mag. 2012, 29, 141–142. [Google Scholar] [CrossRef]
Xiao, H.; Rasul, K.; Vollgraf, R. Fashion-MNIST: A Novel Image Dataset for Benchmarking Machine Learning Algorithms. arXiv 2017, arXiv:1708.07747. [Google Scholar]
Krizhevsky, A. Learning Multiple Layers of Features from Tiny Images. 2009. Available online: https://www.cs.toronto.edu/~kriz/learning-features-2009-TR.pdf(accessed on 18 February 2024).
Paszke, A.; Gross, S.; Massa, F.; Lerer, A.; Bradbury, J.; Chanan, G.; Killeen, T.; Lin, Z.; Gimelshein, N.; Antiga, L.; et al. Pytorch: An imperative style, high-performance deep learning library. Adv. Neural Inf. Process. Syst. 2019, 32, 8026–8037. [Google Scholar]
Bergholm, V.; Izaac, J.; Schuld, M.; Gogolin, C.; Ahmed, S.; Ajith, V.; Alam, M.S.; Alonso-Linaje, G.; AkashNarayanan, B.; Asadi, A.; et al. PennyLane: Automatic differentiation of hybrid quantum-classical computations. arXiv 2022, arXiv:1811.04968. Available online: https://pennylane.ai/ (accessed on 18 February 2024).
KU Community Cluster, Center for Research Computing, University of Kansas. Available online: https://crc.ku.edu/systems-services/ku-community-cluster (accessed on 18 February 2024).
Kingma, D.P.; Ba, J. Adam: A method for stochastic optimization. arxiv 2014, arXiv:1412.6980. [Google Scholar]
Arora, R.; Basu, A.; Mianjy, P.; Mukherjee, A. Understanding deep neural networks with rectified linear units. arxiv 2014, arXiv:1611.01491. [Google Scholar]
Hur, T.; Kim, L.; Park, D. Quantum convolutional neural network for classical data classification. Quantum Mach. Intell. 2022, 4, 3. [Google Scholar] [CrossRef]
Anton, F.K.; Franco, N. Quantum Bits with Josephson Junctions; Springer: Cham, Switzerland, 2019. [Google Scholar]
Balauca, S.; Arusoaie, A. Efficient Constructions for Simulating Multi Controlled Quantum Gates. In Proceedings of the Computational Science—ICCS, 22nd International Conference, London, UK, 21–23 June 2022; Springer International Publishing: Cham, Switzerland, 2022; Volume 25, pp. 179–194. [Google Scholar] [CrossRef]
Li, X.; Yang, G.; Torres, M.; Zheng, D.; Wang, L. A Class of Efficient Quantum Incrementer Gates for Quantum Circuit Synthesis. Int. J. Mod. Phys. B 2014, 28, 1350191, (Preprint). [Google Scholar] [CrossRef]

Figure 1. Full quantum state measurement diagram.

Figure 2. Reset gate and equivalent circuit.

Figure 4. Quantum multiply-and-accumulate (MAC) operation using inverse arbitrary state synthesis.

Figure 5. Quantum fully connected layer with an

N_{out}

-feature output.

Figure 5. Quantum fully connected layer with an

N_{out}

-feature output.

Figure 8. Multidimensional quantum convolution circuit.

Figure 9. Quantum Haar transform circuits: (a) single-level One-dimensional (1-D) QHT circuit; (b) multilevel One-dimensional (1-D) QHT circuit.

Figure 14. High-resolution, multidimensional, real-world input data used in experimental trials: (a) 2D B/W image [40]; (b) 3D RGB image [40]; (c) 3D hyperspectral image [41].

Figure 15. Classification accuracy of convolutional neural network (CNN) configurations on the FashionMNIST dataset: (a) (

16 \times 16

) dataset; (b) (

28 \times 28

) dataset.

Figure 15. Classification accuracy of convolutional neural network (CNN) configurations on the FashionMNIST dataset: (a) (

16 \times 16

) dataset; (b) (

28 \times 28

) dataset.

Figure 16. Classification accuracy of quanvolution neural network configurations on the FashionMNIST dataset: (a) (

16 \times 16

) dataset; (b) (

28 \times 28

) dataset.

Figure 16. Classification accuracy of quanvolution neural network configurations on the FashionMNIST dataset: (a) (

16 \times 16

) dataset; (b) (

28 \times 28

) dataset.

Figure 17. Classification accuracy of quantum convolutional neural network (QCNN) configurations on the FashionMNIST dataset: (a) (

16 \times 16

) dataset; (b) (

28 \times 28

) dataset.

Figure 17. Classification accuracy of quantum convolutional neural network (QCNN) configurations on the FashionMNIST dataset: (a) (

16 \times 16

) dataset; (b) (

28 \times 28

) dataset.

Figure 18. Fidelity of 1D convolution (averaging) filters with unity stride on 1D audio data (sampled with 1,000,000 shots).

Figure 19. Fidelity of 2D convolution filters with unity stride on 2D B/W data (sampled with 1,000,000 shots).

Figure 20. Fidelity of 2D convolution filters with unity stride on 3D RGB data (sampled with 1,000,000 shots).

Figure 21. Fidelity of 3D convolution (averaging) filters with unity stride on 3D hyperspectral data (sampled with 1,000,000 shots).

Figure 22. Number of training parameters for ML models for tested data sizes.

Figure 23. Loss history of ML models on various datasets: (a) MNIST (

16 \times 16

); (b) MNIST (

28 \times 28

); (c) FashionMNIST (

16 \times 16

); (d) FashionMNIST (

28 \times 28

); (e) CIFAR10 (

16 \times 16 \times 3

); (f) CIFAR10 (

32 \times 32 \times 3

).

Figure 23. Loss history of ML models on various datasets: (a) MNIST (

16 \times 16

); (b) MNIST (

28 \times 28

); (c) FashionMNIST (

16 \times 16

); (d) FashionMNIST (

28 \times 28

); (e) CIFAR10 (

16 \times 16 \times 3

); (f) CIFAR10 (

32 \times 32 \times 3

).

Figure 24. Classification accuracy of ML models on various datasets: (a) MNIST (

16 \times 16

); (b) MNIST (

28 \times 28

); (c) FashionMNIST (

16 \times 16

); (d) FashionMNIST (

28 \times 28

); (e) CIFAR10 (

16 \times 16 \times 3

); (f) CIFAR10 (

32 \times 32 \times 3

).

Figure 24. Classification accuracy of ML models on various datasets: (a) MNIST (

16 \times 16

); (b) MNIST (

28 \times 28

); (c) FashionMNIST (

16 \times 16

); (d) FashionMNIST (

28 \times 28

); (e) CIFAR10 (

16 \times 16 \times 3

); (f) CIFAR10 (

32 \times 32 \times 3

).

Table 1. The 1D convolution (averaging) filters applied to 1D audio samples [39].

Data Size (No. of Sample Points)	(3) Averaging Classical/ Noise-Free	(3) Averaging Noisy ( $10^{6}$ Shots)	(5) Averaging Classical/ Noise-Free	(5) Averaging Noisy ( $10^{6}$ Shots)
256
4096
65,536
1,048,576

Table 2. Two-dimensional convolution kernels applied to a (

128 \times 128

) bw image [40].

Table 2. Two-dimensional convolution kernels applied to a (

128 \times 128

) bw image [40].

Kernel	( $3 \times 3$ ) Kernel Classical/Noise-Free	( $3 \times 3$ ) Kernel Noisy ( $10^{6}$ Shots)	( $5 \times 5$ ) Kernel Classical/Noise-Free	( $5 \times 5$ ) Kernel Noisy ( $10^{6}$ Shots)
Average
Gaussian
Sobel-X
Sobel-Y
Laplacian

Table 3. Two-dimensional convolution kernels applied to a (

128 \times 128 \times 3

) rgb image [40].

Table 3. Two-dimensional convolution kernels applied to a (

128 \times 128 \times 3

) rgb image [40].

Kernel	( $3 \times 3$ ) Kernel Classical/Noise-Free	( $3 \times 3$ ) Kernel Noisy ( $10^{6}$ Shots)	( $5 \times 5$ ) Kernel Classical/Noise-Free	( $5 \times 5$ ) Kernel Noisy ( $10^{6}$ Shots)
Average
Gaussian
Sobel-X
Sobel-Y
Laplacian

Table 4. The 3D convolution (averaging) filters applied to 3D hyperspectral images (bands 0, 1, and 2) [41].

Data Size ( $N_{rows} \times N_{cols} \times N_{bands}$ )	( $3 \times 3 \times 3$ ) Averaging Classical/ Noise-Free	( $3 \times 3 \times 3$ ) Averaging Noisy ( $10^{6}$ Shots)	( $5 \times 5 \times 5$ ) Averaging Classical/ Noise-Free	( $5 \times 5 \times 5$ ) Averaging Noisy ( $10^{6}$ Shots)
( $8 \times 8 \times 8$ )
( $16 \times 16 \times 16$ )
( $32 \times 32 \times 32$ )
( $64 \times 64 \times 64$ )
( $128 \times 128 \times 128$ )

Table 5. Comparison of depth/time complexity of proposed MQCC-optimized against classical CNN.

a Depth complexity of C2Q [12] data encoding (I/O) technique
	$O_{I / O} (2^{n}) = O_{I / O} (N)$
b Complexity of proposed technique compared with classical convolutional neural network
MQCC Optimized with MAC-based fully connected layer	MQCC Optimized with ansatz-based fully connected layer	Direct (CPU) [21]	FFT (CPU/GPU) [21,22]	GEMM (GPU) [23,24]
$\begin{matrix} O_{alg} (2^{n}) \equiv O_{alg} (N) \\ O_{alg} + I / O (2^{n} + 2^{n}) \equiv \\ O_{alg} + I / O (N) \end{matrix}$	$\begin{matrix} O_{alg} (n^{2}) \equiv O_{alg} ({(log N)}^{2}) \\ O_{alg} + I / O (n^{2} + 2^{n}) \equiv \\ O_{alg} + I / O (N) \end{matrix}$	$\begin{matrix} O_{alg} (4^{n}) \equiv \\ O_{alg} (N^{2}) \end{matrix}$	$\begin{matrix} O_{alg} (n 2^{n}) \equiv \\ O_{alg} (N log N) \end{matrix}$	$\begin{matrix} O_{alg} (2^{(n + n_{k})}) \equiv \\ O_{alg} (N_{K} N) \end{matrix}$

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Jeng, M.; Nobel, A.; Jha, V.; Levy, D.; Kneidel, D.; Chaudhary, M.; Islam, I.; Facer, A.; Singh, M.; Baumgartner, E.; et al. Leveraging Data Locality in Quantum Convolutional Classifiers. Entropy 2024, 26, 461. https://doi.org/10.3390/e26060461

AMA Style

Jeng M, Nobel A, Jha V, Levy D, Kneidel D, Chaudhary M, Islam I, Facer A, Singh M, Baumgartner E, et al. Leveraging Data Locality in Quantum Convolutional Classifiers. Entropy. 2024; 26(6):461. https://doi.org/10.3390/e26060461

Chicago/Turabian Style

Jeng, Mingyoung, Alvir Nobel, Vinayak Jha, David Levy, Dylan Kneidel, Manu Chaudhary, Ishraq Islam, Audrey Facer, Manish Singh, Evan Baumgartner, and et al. 2024. "Leveraging Data Locality in Quantum Convolutional Classifiers" Entropy 26, no. 6: 461. https://doi.org/10.3390/e26060461

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Leveraging Data Locality in Quantum Convolutional Classifiers

Abstract

1. Introduction

2. Background

2.1. Quantum Measurement and Reset

2.2. Classical-to-Quantum (C2Q)

2.3. Convolutional Neural Networks (CNNs)

2.4. Quantum Machine Learning with Variational Algorithms

3. Related Work

3.1. Data Encoding

3.2. Convolution

3.2.1. Classical Convolution

3.2.2. Quantum Convolution

3.3. Quantum Machine Learning

3.3.1. Quantum Convolutional Neural Networks

3.3.2. Quanvolutional Neural Networks

4. Materials and Methods

4.1. Quantum Fully Connected Layer

4.1.1. Single-Feature Output

4.1.2. Multifeature Output

Replication:

Applying the U FC Filter:

Data Rearrangement:

4.1.3. Circuit Depth of the Quantum Fully Connected Layer

4.2. Generalized Quantum Convolution

Stride:

Multiply-and-Accumulate (MAC):

Data Rearrangement:

4.2.1. One-Dimensional Multifeature Quantum Convolution

4.2.2. Multidimensional Multifeature Quantum Convolution

4.3. Quantum Pooling

4.3.1. Quantum Average Pooling using Quantum Haar Transform

Haar Wavelet Operation:

Data Rearrangement:

4.3.2. Quantum Euclidean Pooling using Partial Measurement

4.4. Multidimensional Quantum Convolutional Classifier

4.5. Optimized MQCC

5. Experimental Work

5.1. Experimental Setup

5.2. Configuration of ML Models

5.3. Results and Analysis

Quantum Convolution Results

6. Discussion

6.1. Number of Parameters

6.2. Loss History and Accuracy

6.3. Gate Count and Circuit Depth

6.4. Complexity Comparison with Classical Models

7. Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

Appendix A

Appendix A.1. Quantum Bits and States

Appendix A.2. Rotation Gates

Appendix A.3. Hadamard Gate

Appendix A.4. Controlled-NOT (CNOT) Gate

Appendix A.5. SWAP Gate

Appendix A.6. Quantum Perfect-Shuffle Permutation (PSP)

Appendix A.7. Quantum Shift Operation

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Applying the $U_{FC}$ Filter: