Estimation of Convex Polytopes for Automatic Discovery of Charge State Transitions in Quantum Dot Arrays

Krause, Oswin; Brovang, Bertram; Rasmussen, Torbjørn; Chatterjee, Anasua; Kuemmeth, Ferdinand

doi:10.3390/electronics11152327

Open AccessArticle

Estimation of Convex Polytopes for Automatic Discovery of Charge State Transitions in Quantum Dot Arrays

by

Oswin Krause

^1,*

,

Bertram Brovang

²

,

Torbjørn Rasmussen

²

,

Anasua Chatterjee

²

and

Ferdinand Kuemmeth

^2,*

¹

Department of Computer Science, University of Copenhagen, 2100 Copenhagen, Denmark

²

Center for Quantum Devices, Niels Bohr Institute, University of Copenhagen, Universitetsparken 5, 2100 Copenhagen, Denmark

^*

Authors to whom correspondence should be addressed.

Electronics 2022, 11(15), 2327; https://doi.org/10.3390/electronics11152327

Submission received: 24 May 2022 / Revised: 8 July 2022 / Accepted: 15 July 2022 / Published: 27 July 2022

(This article belongs to the Special Issue Machine Learning and Artificial Intelligence in Quantum Computing Platforms)

Download

Browse Figures

Review Reports Versions Notes

Abstract

:

In spin based quantum dot arrays, material or fabrication imprecisions affect the behaviour of the device, which must be taken into account when controlling it. This requires measuring the shape of specific convex polytopes. We present an algorithm that automatically discovers count, shape and size of the facets of a convex polytope from measurements by alternating a phase of model-fitting with a phase of querying new measurements, based on the fitted model. We evaluate the algorithm on simulated polytopes and devices, as well as a real 2 × 2 spin qubit array. Results show that we can reliably find the facets of the convex polytopes, including small facets with sizes on the order of the measurement precision.

Keywords:

quantum dot arrays; large margin; convex polytopes; polytope estimation; active learning

1. Introduction

Quantum devices controlled by gate voltages have wide-ranging applications, spanning from quantum computation, spintronics as well as the exploration of fundamental physics [1]. An important class are spin based quantum dot arrays, which are candidates for universal quantum computers [2]. In these devices, electrostatic forces are used to trap single electrons at discrete locations, so called quantum dots. Finding the correct control parameters to confine the desired amounts of electrons (e.g., one electron on each dot) or to move single electrons between different quantum dot locations is a key challenge and primary bottleneck in developing these devices.

In this work, we tackle the problem of controlling the electron transitions between quantum dot locations. We present an algorithm that is capable of discovering the set of possible electron transitions as well as their correct control parameters and demonstrate its performance on a real device. Our algorithm is based on a connection to computational geometry and phrases the optimization problem as estimating the facets of a convex polytope from measurements. While this problem is NP-hard, an approximated solution performs well and thus our algorithm represents the first practical automatic tuning algorithm which has the prospects to accurately learn the set of state-transitions in more than two dimensions and which can be scaled to more than 2 or 3 quantum dots on real devices. We demonstrate its practicality on a simulated device with four quantum dots as well as a real device with four quantum dots (three qubit dots plus one sensor dot), for which a full discovery of all transitions is already outside the scope of human tuning capabilities. Our contributions are the following:

We develop an algorithm that aims to find a sparse approximate solution of a convex polytope from measurements with as few facets as possible. For this we extend the non-convex large margin approach introduced by [3].
We proof a lemma that can be used to test the correctness of a learned polytope from measurements. Our result leads to an active learning scheme that iteratively improves the polytope estimate by adding informative samples that systematically disproves the previous solution.
We show applicability of our algorithm on a real quantum dot array, specifically a foundry-fabricated silicon device that is currently being developed for spin qubit applications.

In this work, we target capacitively coupled quantum dot arrays such as the one depicted in Figure 1a (see also [4]). In this device, each gate-electrode is able to create a quantum dot below it, by choosing appropriate gate-voltages. We will refer to the vector of electron occupations on each dot as the state of the device. Due to the weak tunnel coupling of dots, the ground state of the device is essentially determined by classical physics and can be described well via the constant interaction model [5]. In this model, the set of control parameters associated with a specific ground state forms a convex polytope. Its boundary is formed by the intersection of linear boundaries, each representing a state transition to another state [1]. With this polytope, the control task can be solved directly by selecting a control parameter path that leaves the current state through the desired state transition into the neighbouring target state. This way it is possible to program core operations in the array. For example, to entangle the spin of two electrons on neighbouring dots via Heisenberg exchange coupling [6], an experimenter must identify the transition that moves both electrons on the same dot and then pick a trajectory through this transition and back to the original state. For example, in a linear device with three quantum dots with one electron on each dot (i.e., a device in state [1, 1, 1]), entangling the middle electron with its neighbours requires finding the transitions to states [0, 2, 1] and [1, 2, 0], or alternatively [2, 0, 1] and [1, 0, 2].

Unfortunately, due to manufacturing imprecisions, the parameters of the constant interaction model are unknown and nonlinear effects lead to small deviations of the model, even though ground-states of weakly coupled quantum-dot arrays typically resemble convex polytopes. Thus, each manufactured device must be tuned individually in order to find the correct control parameters.

The problem of finding the control parameters becomes daunting in larger devices, as each added dot requires at least one additional gate electrode with its own control parameter. Qubit-qubit connectivities required for quantum simulations and quantum computing also constraints the geometrical layout of the quantum dots and their gate electrodes on a chip: Instead of individual isolated qubit dots with dedicated sensor dots, the current trend is to fabricate dense arrays of coupled qubit dots that are monitored by as few proximal sensor dots as possible. This complicates the tune-up in several ways. First, individual gate voltages no longer act locally, meaning that changing the gate voltage of one qubit also affects the potential of other, nearby qubits. Second, the signal of each sensor dot responds to charge changes of multiple dots, making it more difficult to determine which of the dots is making a transition. For example, in our device, a single charge sensor is responsible for monitoring all quantum dots, which means that its signal must be interpreted and does not directly relate to individual electron transitions [7].

Currently, experts find the correct control parameters manually using ad-hoc tuning protocols [8,9], which are tedious and time consuming. This limits devices to require tuning of at most three control parameters at once. The field has made first steps into automating parts of this process. For the problem of automating state identification, i.e., estimating the electron count on each quantum dot given a set of control parameters, estimation algorithms can be grouped in two main directions. The first direction uses CNN-based approaches [10,11], which requires dense rastering of image planes within the parameter space. An alternative approach explores the use of line searches to obtain a dataset of state transitions which are modeled using deep neural networks [12,13,14], which allows human experts to label the discovered regions. Both approaches have only been applied to devices with at most two quantum dots and a maximum of three tunable parameters.

While promising, these approaches are not easily transferred to the task of finding state transitions. While a perfect model for state-identification can also be used to solve the task of state transitions, it is difficult to reach the required accuracy around the boundaries. As the important transitions are routinely very small, high resolution measurements are required to find them. However, obtaining a dataset with the required resolution in higher dimensions is very expensive. For the line-search based approaches mentioned above, it has been shown [15] that for a naive measurement design that does not have any knowledge of the position of transitions, the number of line-searches required to correctly identify all transitions of a state increases exponentially with the dimensionality. The key difficulty lies in obtaining a grid of measurements fine enough to obtain enough points on the smallest facets of the polytope to estimate the slopes. Since the hyperareas of facets can vary significantly (see Figure 1c), the grid must be very fine to discover all facets of interest reliably.

Even if enough data is available, evaluating the learned model for the task of finding state transitions is difficult. Typical machine-learning methodology focuses on measuring accuracy, e.g., the fraction of correctly classified points in the dataset. However, the effect of a missing facet on the accuracy can be very small. Indeed, in our approach, we obtain measures of correctly classified volume (as measured by intersection over union, IoU) above 99.9%, while the learned polytopes can still contain significant errors in the boundaries.

The use of deep-neural networks adds another challenge, as these models are difficult to interpret. Even if the model learns to correctly classify the boundaries between states, it does not directly answer the question whether a transition between two states exists or what the position and size of the transition is. Thus, the experimenter would have to search for the desired transition in the learned model.

In this work, we approach the difficult tuning step of finding the transitions after the initial tuning of the barrier gates and tuning of the sensor dots. We assume, that the experimenter already found a point inside a state of interest and needs to find the exact location of state-transitions to neighbouring states. We do not assume that the experimenter knows which transitions exists, but we assume that the experimenter can correctly identify and label any detected transition. Further, we require that polytopes have finite volume in gate-voltage space. This means, that either all dots are occupied by at least one electron, or the voltage space is bounded from below. This does not add a limitation in practice, as lower bounds are typically imposed by the device software to protect the device from damage.

While our model depends on weakly tunnel coupled quantum dots to ensure that the ground states form convex polytopes, we do not require that it abides to the constant interaction model exactly. For example, in the constant interaction model, the polytopes for states [1, 1, 1] and [2, 3, 1] are just translated versions of each other. This is not even true in well-behaved real devices, and our model does not assume this.

Inspired by the aforementioned line search based approaches, we use line searches to obtain an initial dataset, which is iteratively improved upon.

We combine this with an interpretable model of the polytope which learns individual state transitions. In line with the device limitations and measurement challenges outlined above, we will only assume that the sensor reliably detects that a state transition occurred, but not which state the device transitioned to. Using such a sensor, it is possible to locate a state transition in control parameter space using a line search procedure starting from a point within the convex polytope of a state. The resulting line search brackets the position of a state transition by pairs of control parameters

(x^{+}, x^{-})

, each defining one point inside and outside of the (unknown) polytope. This allows for high precision measurements and locates the boundary within a margin of

δ = ∥ x^{+} - x^{-} ∥

, which is a tuneable parameter chosen based on the trade-off between line search precision and short measurement times.

Under the conditions outlined above, finding the true polytope from measurements is a hard machine learning task. Even if all parameters of the constant interaction model are known, computing the boundary of the polytope requires solving the subspace intersection problem [16], with the number of intersections exponential in the number of quantum dots. Indeed, already finding the device state given a set of known parameters of the constant interaction model requires solving an integer quadratic program, which is NP-hard in the number of quantum dots [17]. Thus, it is unlikely that learning the polytope from measurements is substantially easier. In fact, if we are given a training set of points which are labeled according to whether they are in- or outside the polytope, then finding the optimal large margin polytope is NP-hard. (See [3] for a review on recent results). When relaxing the task to allowing a polytope with more facets than the optimal polytope, it has been shown that there exists a polynomial time algorithm which fits ℓ points to a polytope that has at most a factor of

O (log ℓ)

more facets than the optimal polytope and has strong approximation guarantees [3]. However, the algorithmic complexity in our setting is

ℓ^{O (1 / δ^{2})}

and thus too large for practical application. It is clear that allowing even more facets will make the task substantially easier, e.g., by adding one facet for each point outside the polytope, the task can be solved using

O (ℓ)

classification tasks, each separating one point outside the polytope from all points inside it. Practical algorithms therefore forego finding the optimal number of facets and use an approximate polynomial time solver [18,19]. Howevever, there is a lack of numerically stable general purpose optimization algorithms that aim for a minimal amount of facets and which go beyond convergence in a volume metric (for example Hausdorff in [19]). This is especially problematic in our task, as each facet of the unknown polytope constitutes an operational resource (relocation of individual electrons in the array). Producing an estimate with too many (unphysical) facets might make it impractical to filter out the relevant facets. Our approach is similar to [3] in that we repeatedly solve convex relaxations of a large margin classification problem. In contrast to the related work, we use a numerically stable second order solver instead of stochastic gradient descent and design the problem such that it produces solutions with as few facets as possible. Random perturbations to the solutions found allow the discovery of different local optima from which we choose the best.

Our machine learning task extends over pure polytope estimation from measurements as our goal is not only to find the optimal polytope given a fixed set of points, but a polytope that generalizes well and accurately reflects the true polytope with all its facets. This is difficult to achieve using an independent and identically distributed dataset as facets with small surface area are unlikely to be found using naive random sampling. Instead, a scheme is required that actively improves our dataset and ensures that we find all relevant facets that are detectable by our hardware. We base our scheme on a theoretical result which can be used to assess whether two polytopes are the same. Intuitively, we must perform validation measurements on each facet to show that it is a facet of the true polytope, but we must also search for small (still undetected) facets that might be hiding near vertices of the current polytope. The resulting algorithm performs a line search in the directions of the corners and through each facet of our current best estimate of the polytope. If our polytope is correct, all state transitions of our model will be contained between the pairs

(x^{+}, x^{-})

of the performed line searches. Otherwise, we obtain examples of new points which we can add to our dataset and fit a new convex polytope. This process is repeated until we either run out of computation time or all performed line searches are correct. The details of our algorithm are given in the method section.

To investigate the performance and quality of our algorithm, we test it on three different setups: Two simulated problems in 3 or 4 dimensions based on either a device simulation or using constructed polytopes from Voronoi regions, and as third application the real device shown in Figure 1a, where we activate two qubit dots (and the sensor dot) and let human experts verify the result. We further apply the algorithm to the same device with all three qubit dots activated. However, in this case no ground truth is available for comparison. In all simulated experiments, we compare our algorithm to an idealized baseline algorithm that additionally has access to the exact state information while estimating the polytope from measurements and otherwise uses the same active learning protocol. This algorithm assumes a more powerful device with a sensor signal that provides exact state information. (In principle, such a device can be realized by careful design of the charge sensor.) This idealized algorithm is useful to differentiate between the impact of our active learning scheme and the impact of approximately solving the NP-hard estimation problem.

The structure of the paper is as follows: We begin by defining the notation of our paper. In Section 2.2, we introduce the constant interaction model which links between polytope estimation and charge transitions in quantum dot devices. In Section 2.3, we describe the main algorithm by first introducing a meta algorithm for active learning followed by a description of our algorithm for fitting a polytope to measurements. We describe our experiments in Section 3, present our results in Section 4 and end with a discussion and conclusion of the paper in Section 5.

2. Materials and Methods

We will first briefly describe our mathematical notation, before we introduce the constant interaction model as additional background.

We then describe our main contribution, an algorithm that can be used to find the facets of a convex polytope from measurements. First, we introduce a meta algorithm that describes our active-learning procedure. It relies on an algorithm to estimate a convex polytope from measurements, which we describe afterwards.

2.1. Notation

A convex polytope

P \subset R^{d}

is defined as

P = {x \in R^{d} ∣ A_{k}^{T} x + b_{k} \leq 0, k = 1, \dots, N}

, where

A \in R^{N \times d}

,

b \in R^{N}

, N is the number of inequality constraints,

A_{k}

is a column vector representing the k-th row of A and

A_{k}^{T}

denotes transposition of the vector

A_{k}

. In this work, we treat all vectors as column vectors, except when marked by a transposition. We will denote the boundary of P as

\partial P = {x \in P ∣ \exists k : A_{k}^{T} x + b_{k} = 0}

. A facet of P is a set of points

f_{l} = {x \in P ∣ A_{l}^{T} x + b_{l} = 0} \subset \partial P

. We will also refer to

A_{l}

as the normal of

f_{l}

. We define as the inside of

f_{l}

the open set of points

{x \in f_{l} ∣ x \notin f_{k}, k \neq l}

. We will call the extreme points of P its vertices. We assume P to be finite and non-degenerate and thus there exist exactly N unique facets.

2.2. The Constant Interaction Model

In the following, we give a short introduction to the constant interaction model of quantum dot devices [5]. A quantum dot is modeled as a capacitor which can be charged with single electrons. The state of the device

s \in N^{n_{D}}

is the vector of electron occupancies, where,

s_{i}

is the number of electrons on the ith quantum dot. When applying a vector of gate voltages

V_{g} \in R^{n_{g}}

, the system will assume the state that minimizes the free energy:

F (s, V_{g}) = \frac{1}{2} {(| e | s + C_{D g} V_{g})}^{T} C_{D D}^{- 1} (| e | s + C_{D g} V_{g})

(1)

Here,

| e |

is the charge of a single electron and the matrices

C_{D D} \in R^{n_{D} \times n_{D}}

(symmetric positive definite) and

C_{D g} \in R^{n_{D} \times n_{g}}

are capacitance matrices which model the interaction strength between dots and between dots and gates, respectively. The state that achieves the minimum free energy given a fixed

V_{g}

can be computed by iterating over all possible states:

s^{*} (V_{g}) = arg min_{s \in N^{d}} F (s, V_{g}) = arg min_{s \in N^{d}} \frac{{| e |}^{2}}{2} s^{T} C_{D D}^{- 1} s + | e | s^{T} C_{D D}^{- 1} C_{D g} V_{g}

Since (1) consists of terms which are at most linear in

V_{g}

, the boundary between neighbouring states s and r in state parameter space lies on a plane where the free energy of both states is equal and the plane fulfills

F (s, V_{g}) - F (r, V_{g}) = n_{s r}^{T} V_{g} + b_{s r} = 0 .

Here,

n_{s r} = C_{D g}^{T} C_{D D}^{- 1} (s - r)

is the normal of the plane and

b_{s r} = | e | / 2 (s^{T} C_{D D}^{- 1} s - r^{T} C_{D D}^{- 1} r)

is its offset. Since all boundaries are linear, the set of gate voltages

V_{g}

that lead to a given state s form a convex polytope created by the intersection of the planes:

\begin{matrix} P (s) & = \{V_{g} \in R^{n_{g}} ∣ s^{*} (V_{g}) = s\} \\ = \{V_{g} \in R^{n_{g}} ∣ \forall r \in N^{n_{D}} : F (s, V_{g}) - F (r, V_{g}) \leq 0\} \\ = \{V_{g} \in R^{n_{g}} ∣ \forall r \in N^{n_{D}} : n_{s r}^{T} V_{g} + b_{s r} \leq 0\} \end{matrix}

(2)

While the definition of (2) runs over an infinite number of states, the number of facets which compose the boundary are finite. We will refer to states r which share a facet with s in

P (s)

as neighbouring states. Thus, the polytope carries information about which state transitions to neighbouring states exist and can be used to control the device.

2.3. A Meta Algorithm for Finding Polytopes

We employ a meta algorithm to iteratively improve an estimate

\hat{P}

of a convex polytope P based on line searches. For this, we use an algorithm

A

which computes

\hat{P} = A (X)

, where X is a training dataset of previously collected measurements. Our goal is to use active learning to propose new line search directions in order to add new points to X. The meta algorithm is based on the following lemma:

Lemma 1.

Let P and

\hat{P}

be two convex polytopes in

R^{d}

. Let

c_{k}, k = 1, \dots, {\hat{N}}_{c}

be the vertices of

\hat{P}

and

f_{l}

,

l = 1, \dots, \hat{N}

its facets with normals

{\hat{A}}_{l}

. If

1.: $c_{k} \in \partial P, \forall k = 1, \dots, {\hat{N}}_{c}$
2.: for each facet $f_{l}$ , there exists a point $x_{l}$ inside of $f_{l}$ such that $x_{l} \in \partial P$ ,

then

P = \hat{P}

.

Proof.

From the first point, we directly obtain

\hat{P} \subseteq P

due to the definition of convexity. To show

P \subseteq \hat{P}

, pick any

l \in {1, \dots, \hat{N}}

. Since

x_{l} \in \partial P

, there must exist a direction

p_{l} \in R^{d}

such that all

x \in P

fulfill the constraint

p_{l}^{T} x - p_{l}^{T} x_{l} \leq 0

. Since

x_{l}

lies inside a facet of

\hat{P}

, we also have

{\hat{A}}_{l}^{T} x - {\hat{A}}_{l}^{T} x_{l} \leq 0

for all

x \in \hat{P}

. Finally, as

\hat{P} \subseteq P

and

x_{l}

is inside of

f_{l}

, we have

p = {\hat{A}}_{l}

. As this holds for all facets of

\hat{P}

, we obtain

P \subseteq {x \in R^{d} ∣ {\hat{A}}_{l}^{T} x - {\hat{A}}_{l}^{T} x_{l} \leq 0, \forall l = 1, \dots, \hat{N}} = \hat{P}

. □

This suggests a simple algorithm for iteratively improving an estimate of a convex polytope

\hat{P}

, by generating a set of query points R on the boundary of

\hat{P}

which includes the vertices of

\hat{P}

as well as a point inside each facet. To obtain points inside the facets, we take all vertices which are on a given facet and compute their mean. Then, for each point

R_{i}

, we perform a line search from a point

o \in P \cap \hat{P}

(for example the mean of points in the training set) in direction of

R_{i} - o

and search for the intersection with P. All intersections (or a bracket containing the likely intersection interval) are added to X. The algorithm terminates if all new points in X lie sufficiently close to the boundary of

\hat{P}

, i.e.,

dist (\partial \hat{P}, x) < ϵ_{end}, \forall x \in X

. Otherwise, we repeat using the updated training set.

2.4. Large Margin Polytopes

The algorithm introduced above works well given an exact line search. In this case, we could choose

A

as a convex hull algorithm. However, this has two disadvantages:

In the presence of noise in the line search, points within a facet will never lie exactly on a hyperplane.
Even with exact line searches, the number of facets found by the algorithm tends to increase over time, far exceeding the true number of facets. This is because line searches will almost surely not hit a true vertex of P, resulting in additional facets.

Both problems can be understood by seeing the convex hull as a non-parametric model for estimating a convex polytope. Each sampled point increases the complexity of the model, which makes it prone to overfitting, and thus requires regularization. Instead of using a convex hull, we model this estimation problem via regularized large margin classification [20]. To this end, we require that the line search does not return an estimate of the point

x \in \partial P

, but instead a confidence interval

(x^{-}, x^{+})

, where

x^{-} \in P

and

x^{+} \notin P

. For this, an existing line search can be adapted, for example by adding confidence bounds to its estimate. As a decision function to optimize, we use

f (x) = max_{k \in {1, \dots, \hat{N}}} {\hat{A}}_{k}^{T} x + {\hat{b}}_{k},

(3)

as with this choice it holds

\hat{P} = {x \in R^{d} ∣ f (x) \leq 0} = {x \in R^{d} ∣ {\hat{A}}_{k}^{T} x + {\hat{b}}_{k} \leq 0, k = 1, \dots, \hat{N}}

.

\hat{A}

and

\hat{b}

are the optimizable parameters and

\hat{N}

must be chosen larger than the number of facets in P. With this model, the task is to find a function f which separates the training set of points

x_{j}^{-}, j = 1, \dots, ℓ

from

x_{j}^{+}, j = 1, \dots, ℓ

. Our approach is similar to the large margin approach in [3]. The soft-constrained large margin primal optimization problem can be stated as

\begin{matrix} min_{\hat{A}, \hat{b}} & Ω (\hat{A}) + \frac{C}{ℓ} \sum_{i = 1}^{ℓ} {(ξ_{i}^{+})}^{2} + \frac{C}{ℓ} \sum_{i = 1}^{ℓ} {(ξ_{i}^{-})}^{2} \end{matrix}

(4)

\begin{matrix} s . t . & max_{j} {\hat{A}}_{j}^{T} x_{i}^{-} + {\hat{b}}_{j} \leq - 1 + ξ_{i}^{-}, i = 1, \dots, ℓ \end{matrix}

(5)

\begin{matrix} \land & max_{j} {\hat{A}}_{j}^{T} x_{i}^{+} + {\hat{b}}_{j} \geq 1 - ξ_{i}^{+}, i = 1, \dots, ℓ \\ \land & ξ_{i}^{-} \geq 0 \land ξ_{i}^{+} \geq 0, i = 1, \dots, ℓ, \end{matrix}

(6)

where

Ω : R^{\hat{N} \times d} \to R

is a regularizer with complexity parameter

ξ_{i}^{+}

and

ξ_{i}^{-}

are slack variables. The complexity parameter C balances the penalisation of the model error (as measured by the slack variables) and penalisation of model complexity (as measured by

Ω

). Thus, a larger C leads to more complex models that fit the data better. The constraint (6) turns the problem into a non-convex optimization problem, and to find the global optimum, we would need to infer correctly for each

{\hat{A}}_{j}

those pairs of points that are separated by it (or, in the context of the application, the state of the device associated with

x_{i}^{+}

).

The problem can be relaxed and solved iteratively using a sequence of convex problems with solutions

({\hat{A}}^{(t)}, {\hat{b}}^{(t)})

,

t = 1, 2, \dots

This can be achieved via a convex relaxation of (6) based on the fact that if we find any hyperplane

{\hat{A}}_{s_{i}}, {\hat{b}}_{s_{i}}

for which

{\hat{A}}_{s_{i}}^{T} x_{i}^{+} + {\hat{b}}_{s_{i}} > 1 - ξ_{i}^{+}

, we automatically fulfill constraint (6). Thus, in iteration t of the algorithm, we can take the previous estimate

{\hat{A}}^{(t - 1)}, {\hat{b}}^{(t - 1)}

to find

s_{i}

and solve the subproblem

\begin{matrix} min_{\hat{A}, \hat{b}} & Ω (\hat{A}) + \frac{C}{ℓ} \sum_{i = 1}^{ℓ} {(ξ_{i}^{+})}^{2} + \frac{C}{ℓ} \sum_{i = 1}^{ℓ} {(ξ_{i}^{-})}^{2} \\ s . t . & max_{j} {\hat{A}}_{j}^{T} x_{i}^{-} + {\hat{b}}_{j} \leq - 1 + ξ_{i}^{-} \end{matrix}

(7)

\begin{matrix} \land & {\hat{A}}_{s_{i}}^{T} x_{i}^{+} + {\hat{b}}_{s_{i}} \geq 1 - ξ_{i}^{+}, s_{i} = arg max_{s} \{{({\hat{A}}_{s}^{(t - 1)})}^{T} x_{i}^{+} + {\hat{b}}_{s}^{(t - 1)}\} \\ \land & ξ_{i}^{-} \geq 0 \land ξ_{i}^{+} \geq 0, i = 1, \dots, ℓ . \end{matrix}

(8)

In the algorithm by [3],

Ω

was chosen as the squared

L_{2}

-norm of the rows of

\hat{A}

,

Ω (\hat{A}) = \sum_{i = 1}^{\hat{N}} {∥ {\hat{A}}_{i} ∥}^{2}

and the problem was solved via stochastic gradient descent using a variant of the Pegasos [21] solver, that re-estimated

s_{i}

in (8) during each stochastic gradient step. Additionally, the solver in [3] would assign points to boundaries that are not assigned enough points to prevent a sparse solution. In initial experiments, this solver did not lead to usable or even plausible results in our setting. This is likely due to the ill-conditioning introduced by creating pairs of samples which are very close to the boundary. The resulting ill-conditioning of up to

10^{8}

necessitates second order optimization algorithms.

We therefore included the following changes: To minimize the number of facets of the polytope, we use a regularizer, which enforces setting whole rows

{\hat{A}}_{i}

to zero. To this end, we propose to use the

L_{2, 1}

regularizer,

L_{2, 1} (\hat{A}) = \sum_{i = 1}^{\hat{N}} {∥ {\hat{A}}_{i} ∥}_{2}

. This regularizer can be seen as a generalization of 1-norm regularization to impose sparsity on row vectors and has among other applications been used successfully in the context of joint feature selection for multi task learning [22]. With this regulariser, the problem (7) becomes a second order cone program (SOCP) which we minimize in each iteration using a second order solver before re-computing

s_{i}

. We stop the optimization when the

s_{i}

in two iterations are the same. To save computation time, we prune the solution of each subproblem by removing all hyperplanes with

{\hat{A}}_{i} = 0

as they won’t get assigned new points and thus they would remain zero due to the regularization term. Finally, we use a restarting strategy to find a good local optimum as described in Algorithm 1. We first create an initial value

{\hat{A}}^{init}, {\hat{b}}^{init}

by computing the convex hull of the points

x_{j}^{-}

. We subtract 1 from

{\hat{b}}^{init}

so that the initial solution fulfills (5) with

ξ^{-} = 0

. Afterwards, we repeatedly solve problem (7) to obtain a first local optimum

{\hat{A}}_{sol}^{(0)}, {\hat{b}}_{sol}^{(0)}

. In our experience, this solution still has too many facets. Thus, we repeatedly add noise to this solution, solve the problem and return the best solution found. We denote the number of repeats of finding the solution as

n_{repeat}

and noise is added to the solution by adding normal distributed noise with mean 0 and variance

σ^{2}

to each entry of

{\hat{A}}_{sol}^{(0)}

and

{\hat{b}}_{sol}^{(0)}

. As an ablation study, we applied the same strategy to the original optimization problem of [3], but numerical difficulties made it impossible to solve the problem reliably.

Algorithm 1 Algorithm for approximate solution of the polytope estimation problem

1:: input $X^{+}, X^{-}$ , $n_{repeat} \in N^{+}$ , $C > 0$ , $σ > 0$
2:: output Estimated polytope parameters ${\hat{A}}^{best}, {\hat{b}}^{best}$
3:: ${\hat{A}}^{init}, {\hat{b}}^{init} \leftarrow$ convex_hull_polytope $(X)$
4:: ${\hat{b}}^{init} \leftarrow {\hat{b}}^{init} - 1$
5:: Solve (7) starting from ${\hat{A}}^{init}, {\hat{b}}^{init}$ and prune to obtain ${\hat{A}}_{sol}^{(0)}, {\hat{b}}_{sol}^{(0)}$ with value $L^{(0)}$
6:: for $i = 1, \dots, n_{repeat}$ do
7:: ${\hat{A}}^{noise} \leftarrow {\hat{A}}_{sol}^{(0)} + N (0, σ^{2})$
8:: ${\hat{b}}^{noise} \leftarrow {\hat{b}}_{sol}^{(0)} + N (0, σ^{2})$
9:: Solve (7) starting from ${\hat{A}}^{noise}, {\hat{b}}^{noise}$ and prune to obtain ${\hat{A}}_{sol}^{(i)}, {\hat{b}}_{sol}^{(i)}$ with value $L^{(i)}$
10:: $i^{*} \leftarrow arg {min}_{i} L^{(i)}$
11:: ${\hat{A}}^{best}, {\hat{b}}^{best} \leftarrow {\hat{A}}_{sol}^{(i^{*})}, {\hat{b}}_{sol}^{(i^{*})}$

3. Experiments

We implemented our algorithm by solving problem (7) with cvxpy [23,24] using the ECOS solver for SOCP problems [25]. We compute halfspace intersections, convex hulls and their volumes using the Qhull library [16]. To save computation time, we include new point pairs

(x^{-}, x^{+})

into the training dataset only if it does not already contain any pair

(x_{i}^{-}, x_{i}^{+})

with

∥ x_{i}^{-} - x^{-} ∥ < ϵ_{close}

. The algorithm terminates when the distance between the sampled points

(x_{i}^{-}, x_{i}^{+})

to the boundary of the polytope is smaller than

ϵ_{end}

. To assess the performance of our algorithms, we compared our approach, where possible, to an algorithm that has perfect information regarding which point pair is cut by which hyperplane. For this, we additionally compute the state

s_{i}

of

x_{i}^{+}

, pick

\hat{N}

as the number of different observed states and solve (7) using the obtained

s_{i}

. This way, the problem becomes convex and can be solved efficiently. We refer to this algorithm as our baseline.

To quantify the quality of the obtained convex polytopes, we define a matching error between the set of facets in the ground truth

f_{l} = {x \in P ∣ A_{l}^{T} x + b_{l} = 0} \in F

and the facets in the estimated convex polytope

{\hat{f}}_{k} = {x \in \hat{P} ∣ {\hat{A}}_{k}^{T} x + {\hat{b}}_{k} = 0} \in \hat{F}

. For this, we compute the minimum angle between the normal of the facet

f_{l}

and the normals of all

{\hat{f}}_{k}

: We report an error if there is no

{\hat{f}}_{k}

with angle smaller than 10 degrees:

E = \frac{1}{| F |} \sum_{l = 1}^{| F |} I \{min_{k} arccos (\frac{A_{l}^{T} {\hat{A}}_{k}}{∥ A_{l} ∥ ∥ {\hat{A}}_{k} ∥}) \cdot \frac{180}{π} > 10\}

This is a sufficient metric for estimating closeness of the facets: as the stopping condition of the algorithm already ensures that the estimated polytope must be very similar to the true polytope in terms of intersection over union, it is only the directions of the facet hyperplanes and their number that needs to be assessed here. To verify this claim, we also compute the intersection over union between the estimated and true ground truth polytopes:

IoU (P, \hat{P}) = \frac{Vol (P \cap \hat{P})}{Vol (\hat{P}) + Vol (\hat{P}) - Vol (P \cap \hat{P})}

To test our algorithms, we devise three experiments: Two simulated problems as well as one application on a real device. As application, we consider the device shown in Figure 1a, consisting of a narrow silicon nanowire through which electrons can flow from source (S) to drain (D). Four gate electrodes (G

_{1, \dots, 4}

) are capacitively coupled to the nanowire, and under correct tuning of their voltages (

V_{1, \dots, 4}

), quantum dots are formed that confine single electrons in small regions (tens of nanometers) of the nanowire. One of the gate electrodes (G

_{4}

) is connected to a high frequency circuit and produces a sensor signal that responds very quickly to any rearrangements of electrons inside the nanowire [4]. This sensor signal remains constant under voltage changes (line searches) of gate electrodes G

_{1}

, G

_{2}

, and G

_{3}

, as long as the state does not change. Experimentally, this is accomplished by compensating

V_{4}

for any voltage changes applied to G

_{1, 2, 3}

, such that the potential of the sensor dot is independent of the three control voltages

V_{1, 2, 3}

. This requires estimating the coefficients of a linear compensation function, which was done once before the algorithm is run. As soon as the charge state changes, i.e., a state boundary is encountered, the sensor signal drops and generates one

(x_{i}^{-}, x_{i}^{+})

pair. The device in Figure 1a can create up to four quantum dots. We applied our algorithm to a state with dot 1 and dot 2 each occupied by one electron, and use the two gate voltage parameters

V_{1}

and

V_{2}

to control this

[1, 1]

configuration. (Dot 3 is kept empty by keeping

V_{3}

fixed at a sufficiently negative voltage). We repeat this experiment tuned such that an electron occupies the dot under G

_{3}

as well and estimate the polytope using all three gate voltage parameters in the

[1, 1, 1]

configuration.

For the simulated experiments, we implement a line search which returns point pairs such that

∥ x_{i}^{+} - x_{i}^{-} ∥ < δ

, where we vary

δ

across the experiments to measure the impact of the quality of the line search. As our first simulated problem, we generate a set of polytopes via Voronoi regions. For this, we sample 30 points from a d-dimensional normal distribution in

R^{d}

with zero mean and diagonal covariance matrix

Σ

with entries

Σ_{i i} = 2 \cdot 10^{\frac{i}{d}}

. For each set, we compute a Voronoi triangulation. We discard the sample if the origin is not inside a closed Voronoi region or this region extends outside the set

{[- 10, 10]}^{d}

. On this problem, we can use the Voronoi region around the origin as ground truth. This way, we obtain 100 convex polytopes with between 6 and 12 facets for

d = 3

and

d = 4

. For the baseline algorithm, we use the index of the closest point as state of the Voronoi region.

As our second problem, we simulate a quantum dot array with 3 or 4 qubit dots using the constant interaction model. For simplicity, we omit an additional sensor dot that would normally sense the charge transitions within these triple- and quadruple-dot devices. Specifically, we generate 100 different device simulations by choosing one set of realistic device capacitances and adding noise to simulate variations in device manufacturing. To be able to use a similar tuning as in the Voronoi experiment, we re-scale the parameter space by a factor of 100 so that the polytopes cover roughly the

{[- 10, 10]}^{d}

volume. As the ground truth polytope for each simulated device we compute that region in control-voltage space that has a ground state with a [1, 1, 1] or [1, 1, 1, 1] charge configuration, i.e., exactly one electron per dot. These calculations yield 14 facets for all simulated triple-dot devices, and 30 facets for all simulated quadruple-dot devices. Errors made by the algorithm in finding the correct facets are discussed below.

As parameters, we chose in all experiments

ϵ_{end} = 1.5 δ

,

ϵ_{close} = δ

and

n_{repeat} = 10

. In the simulated experiments, we vary

δ \in {0.01, 0.05, 0.1, 0.2}

, choose

σ = 0.001 / δ

and

C = 75 / δ

. For the baseline algorithm, we use the same parameters except

C = 750 / δ

. We obtain the initial dataset X by performing 100 line searches in random directions starting from a known point inside P (e.g., the origin for the Voronoi region based dataset). For the real device, we implemented a line search in hardware using a DAC voltage generator. Instead of using the single point

x_{i}

returned by the hardware, we add a confidence interval

x_{i}^{\pm} = x_{i} \pm δ / 2

with

δ = 0.001

, which takes into account the experimental measurement uncertainty. This choice of

δ

is equivalent to the

δ = 0.1

setting in the simulated device on the re-scaled parameters. Further, we pick

σ = 0.05 / δ

and

C = 10 / δ

. The initial dataset was obtained via line searches using 10 random directions.

4. Results

In our experiments, we first investigated how close the learned polytope is to the true polytope on the simulated devices, depending on the line search accuracy

δ = ∥ x^{+} - x^{-} ∥

. For this, we computed the intersection over union of true and estimated polytopes and found an expected value between 0.9996 (

δ = 0.01

) and 0.995 (

δ = 0.02

), which are both very close to the optimum of 1. We did neither observe any relevant differences to the baseline algorithm, nor large differences between 3 and 4 dimensions. We then measured how many of the linear decision functions in the ground truth polytope were not approximated well in the estimated polytope. The recorded number of matching errors on the two simulated datasets as a function of the precision of the line search,

δ

, can be seen in Figure 2. For both datasets, the error increased with

δ

, where for the simulated devices, we saw a far steeper increase than for the Voronoi regions. For small values of

δ

, we observed less than 0.1 matching errors, which increased to 4.5 at

δ = 0.2

in 4D. Here, the baseline algorithm performed better.

To investigate the nature of the errors, we computed the hyper volume of the state transitions as a measure of size of the facet in the polytope. The histograms of volumes as well as relative distributions of errors are shown in Figure 3. As can be seen from the figure, most errors appeared on facets with small volume. Further, we observed that facets with small volumes were estimated more reliably as the line search accuracy improved. To give a more qualitative impression of the errors observed, we visualized two examples of an error by creating random cuts through the 4D parameter space of the simulated device in such a way that an unmatched facet in the ground truth is visible within the cut. See Figure 4 for common examples, extracted from a problem instance with 4 errors and obtained with

δ = 0.2

.

For the real device, the obtained model of our algorithm is shown in Figure 1 in red color. In 2D, human experts verified the correctness of the state transitions and annotated the resulting states, based on a dense 2D raster scan shown in the background of Figure 1b [4]. We can see that the algorithm found two very small facets which are invisible on the raster scan. In 3D no ground truth was available, but qualitatively the 10 facets found by the algorithm (six large ones and two pairs of thin slabs in Figure 1c) are in line with how the experts understand this device: The 6 large transitions correspond to adding or removing one electron from each qubit dot, whereas the two pairs of smaller facets correspond to moving an electron between dot 1 and dot 2, or between dot 2 and dot 3. We further verified in simulations of the constant interaction model, including a simulation of the sensor dot, that our polytope is consistent with specific choices of the device parameters.

Finally, we investigated the run time and convergence of the algorithm. We measured the convergence speed of the algorithm in terms of the imprecision of the fit measured by our stopping criterion. For this, we measure the maximum distance a point in the training set

X_{i + 1}

to the boundary of the polytope

{\hat{P}}_{i}

estimated using the dataset

X_{i}

. To be exact, we compute

{max}_{x \in X_{i + 1}} dist (\partial {\hat{P}}_{i}, x)

. If this distance is smaller than

ϵ_{end}

, the algorithm terminates. To get a better understanding of the expected behaviour, we define a number of precision targets

t_{j}

,

j = 1, \dots, 20

geometrically spaced between 10 and

ϵ_{end}

. We then measure after each iteration how many precision targets have been reached and store the size of the dataset

| X_{i} |

. In Figure 5 we report as an average over all trials the fraction of targets reached after a certain number of line-searches as well as the number of line-searches needed to ensure that 75% of runs terminate (i.e., reach all targets). We see that in all cases, 75% of runs terminate within 300 (3D) and 1300 (4D) line-searches with approximately 95% of total targets reached. However, our results also show that the remaining 25% of runs can take a long time to reach the final targets. This behaviour seems to not be influenced by the choice of

δ

.

The run time of the algorithm for the real device (including line searches) was less than a minute for two dots and 30 min for three dots (15 min measurement time for 180 line searches). Computing an instance of the 4D problem on the simulated device takes approximately one hour on a single core CPU.

5. Discussion

Our results show that it is possible to reliably estimate the polytope associated with state transitions out of a specific charge state in a quantum dot array. Overall, it appears that the algorithm finds a good estimate of the true polytope volumes and facet shapes within the control parameter space formed by the array’s gate voltages. While the underlying optimization problem is non-convex, we did not find signs of bad local optima. This is likely because we verify the estimated polytope using new measurements which are designed to find potential modeling errors. Thus, a local optimum is likely caught and results in an new training iteration. This hypothesis is strengthened by the fact that we observe that with higher precision of the line search the distribution of errors is shifted towards smaller facets, showing that we are better at disproving local optima.

An example of the superiority of our approach to rastering can be seen in Figure 1b. Here, we can see that the algorithm managed to find two very small facets (labeled

[0, 2]

and

[2, 0]

) which are invisible on the raster scan. Operationally, these are two very important facets in the device as they amount to transitions of electrons between quantum dots, while the other transitions are electrons entering or leaving the array. For the purposes of quantum computation, such inter-dot tunneling processes effectively turn on wave-function overlap between two electrons, which is a key resource to manipulate their spin states via Heisenberg exchange coupling [6]. The same holds for the 3D polytope estimated in Figure 1c, where small facets corresponding to inter-dot transitions have been found.

The results in Figure 1c also highlight another challenge of using existing 2D raster scan approaches for devices with higher dimensions. While 2D raster-scan based approaches can be used to correctly identify the position or existence of some transition on the scan, the correct interpretation of the transition can become difficult. For example, when creating a 2D scan from Figure 1c by fixing the

V_{2}

component to a constant value, the resulting shape of the 2D slice of the polytope changes significantly depending on the chosen value of

V_{2}

, as different transitions become visible on the slice. Thus, from a single slice alone it becomes difficult to accurately assess the actual state transition. However, using the full 3D figure, it becomes relatively easy for an expert to label most of the transitions.

Compared to random sampling, our sampling approach using active learning seems to be superior. For random sampling, the number of samples requires depends largely on the relative size of a facet compared to the overall surface area [15]. Thus, for the small size of facets on our polytope, an average of 1500 samples is significantly less than expected via random sampling.

Still, our algorithm is not perfect. Comparing the performance of our algorithm to the idealized baseline algorithm indicates that the majority of errors are introduced by the difficulties of solving the NP-hard polytope estimation problem, while our active learning scheme seems to works very well. Thus, our work will likely profit from continued development of improved and faster estimation algorithms.

We have successfully applied our algorithm to a real 2 × 2 quantum dot array, constituting the first automatic discovery of the full set of state transitions in the literature for a device with more than 2 dots.

As the underlying estimation problems are NP-hard and we observed a steep increase of run-time already going from 3D to 4D, we foresee practical limits of our algorithm for controlling large quantum dot arrays. However, in practice, one may not need to find all facets in order to implement desired qubit functionalities. For example, universal quantum computers can be constructed entirely from single- and two-qubit operations, meaning that many facets associated with a multi-dot state may be operationally irrelevant (such as facets corresponding to multi-electron co-tunneling transitions, which are likely very small facets that are hard to find). Alternatively, it might be possible to partition the device in blocks of smaller arrays, for which polytopes can be estimated independently. For example, for a linear array of many quantum dots, it might be possible to estimate all relevant local transitions of a quantum dot using its two-neighbourhood in the dot connection graph, a 5-dimensional subspace, which we have shown is already within our estimation capabilities. Another possible solution is to use more and better sensor signals to identify the state transitions.

A potential shortcoming is our assumption of linear state transitions, which form a convex polytope. This assumption is not true for devices that exhibit device instabilities or strongly tunnel coupled quantum dots, for which the approximation of the ground states via the constant interaction model is invalid and the ground states can not be well approximated by convex polytopes anymore. However, stable, non-hysteretic materials are essential prerequisites for quantum processors, and recent material improvements already led to very stable materials. Thus, we expect that future weakly coupled devices will fulfill this assumption to a much higher degree than devices that are available today.

6. Conclusions

We have introduced an algorithm for automatic learning of the Coulomb diamonds of weakly-coupled quantum dot arrays, in which state transitions have approximately linear boundaries in gate-voltage space. Based on measurement data obtained via linear raster scans, our algorithm learns the shape and number of state transitions from a given state, as long as the transitions are sufficiently large compared to the measurement precision. We showed the effectiveness of our algorithm on simulated 2 × 2 devices and a real triple-dot device capactively coupled to an additional sensor dot.

Our algorithm can be applied to partially automate the difficult tuning step of discovering and operating inter-dot transitions for a device that already has sufficiently well tuned barrier gates and sensor compensation and for which the important states of interest have been identified. The human task simplifies to labeling the transitions, while discovering their existence and locations is done by the algorithm. We foresee that this will greatly simplify the tuning of intermediate size arrays with 4–6 dots, where human tuning capabilities are limited.

Future work must strive to improve the computation time required for finding the solution, as the current algorithm already requires several hours of computation time, which is the limiting factor in applying it to larger devices. Moreover, our algorithm is envisioned to be embedded in a suite made up of similar and sequentially carried out algorithms; for example, future work should strive to automate labeling of transitions and automating the initial tuning steps. Finally, the assumption of approximately linear state transitions is not fulfilled in devices with strong inter-dot tunnel coupling and future work should strive to enable the application to curvilinear transitions.

Author Contributions

Contributions: O.K. developed the algorithm and designed, built and executed the experiments on the simulated device. A.C., T.R. and B.B. developed the simulator and conducted the experiments on the real device. F.K. led the project. All authors contributed to writing the article. All authors have read and agreed to the published version of the manuscript.

Funding

F.K. and A.C. acknowledge support from the Independent Research Fund Denmark. This work received funding from the European Union under grant agreement No. 688539 and 951852.

Institutional Review Board Statement

Not Applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Code and results are available at https://erda.ku.dk/archives/07289ebce9f8f4347e00dbd79bbaf75d/published-archive.html (accessed on 20 May 2022).

Conflicts of Interest

The authors declare no conflict of interest. The funding sponsors had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, and in the decision to publish the result.

References

Nazarov, Y.V.; Nazarov, Y.; Blanter, Y.M. Quantum Transport: Introduction to Nanoscience; Cambridge University Press: Cambridge, UK, 2009. [Google Scholar]
Loss, D.; DiVincenzo, D.P. Quantum computation with quantum dots. Phys. Rev. A 1998, 57, 120. [Google Scholar] [CrossRef] [Green Version]
Kantchelian, A.; Tschantz, M.C.; Huang, L.; Bartlett, P.L.; Joseph, A.D.; Tygar, J.D. Large-margin convex polytope machine. Adv. Neural Inf. Process. Syst. 2014, 27, 3248–3256. [Google Scholar]
Ansaloni, F.; Chatterjee, A.; Bohuslavskyi, H.; Bertrand, B.; Hutin, L.; Vinet, M.; Kuemmeth, F. Single-electron control in a foundry-fabricated two-dimensional qubit array. arXiv 2020, arXiv:2004.00894. [Google Scholar]
Schröer, D.; Greentree, A.; Gaudreau, L.; Eberl, K.; Hollenberg, L.; Kotthaus, J.; Ludwig, S. Electrostatically defined serial triple quantum dot charged with few electrons. Phys. Rev. B 2007, 76, 075306. [Google Scholar] [CrossRef] [Green Version]
Chatterjee, A.; Stevenson, P.; De Franceschi, S.; Morello, A.; de Leon, N.; Kuemmeth, F. Semiconductor Qubits In Practice. Nat. Rev. Phys. 2021, 3, 157–177. [Google Scholar] [CrossRef]
Chanrion, E.; Niegemann, D.J.; Bertrand, B.; Spence, C.; Jadot, B.; Li, J.; Mortemousque, P.A.; Hutin, L.; Maurand, R.; Jehl, X.; et al. Charge detection in an array of CMOS quantum dots. Phys. Rev. Appl. 2020, 14, 024066. [Google Scholar] [CrossRef]
Botzem, T.; Shulman, M.D.; Foletti, S.; Harvey, S.P.; Dial, O.E.; Bethke, P.; Cerfontaine, P.; McNeil, R.P.; Mahalu, D.; Umansky, V.; et al. Tuning methods for semiconductor spin qubits. Phys. Rev. Appl. 2018, 10, 054026. [Google Scholar] [CrossRef] [Green Version]
Van Diepen, C.; Eendebak, P.T.; Buijtendorp, B.T.; Mukhopadhyay, U.; Fujita, T.; Reichl, C.; Wegscheider, W.; Vandersypen, L.M. Automated tuning of inter-dot tunnel coupling in double quantum dots. Appl. Phys. Lett. 2018, 113, 033101. [Google Scholar] [CrossRef]
Kalantre, S.S.; Zwolak, J.P.; Ragole, S.; Wu, X.; Zimmerman, N.M.; Stewart, M.D.; Taylor, J.M. Machine learning techniques for state recognition and auto-tuning in quantum dots. NPJ Quantum Inf. 2019, 5, 1–10. [Google Scholar] [CrossRef] [Green Version]
Teske, J.D.; Humpohl, S.S.; Otten, R.; Bethke, P.; Cerfontaine, P.; Dedden, J.; Ludwig, A.; Wieck, A.D.; Bluhm, H. A machine learning approach for automated fine-tuning of semiconductor spin qubits. Appl. Phys. Lett. 2019, 114, 133102. [Google Scholar] [CrossRef]
Zwolak, J.P.; Kalantre, S.S.; McJunkin, T.; Weber, B.J.; Taylor, J.M. Ray-based classification framework for high-dimensional data. arXiv 2020, arXiv:2010.00500. [Google Scholar]
Zwolak, J.P.; McJunkin, T.; Kalantre, S.S.; Neyens, S.F.; MacQuarrie, E.; Eriksson, M.A.; Taylor, J.M. Ray-Based Framework for State Identification in Quantum Dot Devices. PRX Quantum 2021, 2, 020335. [Google Scholar] [CrossRef]
Zwolak, J.P.; McJunkin, T.; Kalantre, S.S.; Dodson, J.; MacQuarrie, E.; Savage, D.; Lagally, M.; Coppersmith, S.; Eriksson, M.A.; Taylor, J.M. Autotuning of Double-Dot Devices In Situ with Machine Learning. Phys. Rev. Appl. 2020, 13, 034075. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Weber, B.J.; Kalantre, S.S.; McJunkin, T.; Taylor, J.M.; Zwolak, J.P. Theoretical bounds on data requirements for the ray-based classification. SN Comput. Sci. 2022, 3, 1–10. [Google Scholar] [CrossRef]
Barber, C.B.; Dobkin, D.P.; Huhdanpaa, H. The quickhull algorithm for convex hulls. ACM Trans. Math. Softw. 1996, 22, 469–483. [Google Scholar] [CrossRef] [Green Version]
Chaovalitwongse, W.A.; Androulakis, I.P.; Pardalos, P.M. Quadratic Integer Programming: Complexity and Equivalent Forms. In Encyclopedia of Optimization; Springer: Boston, MA, USA, 2009; pp. 3153–3159. [Google Scholar]
Gottlieb, L.A.; Kaufman, E.; Kontorovich, A.; Nivasch, G. Learning convex polytopes with margin. In Proceedings of the Advances in Neural Information Processing Systems, Montreal, QC, Canada, 4–6 December 2018; pp. 5706–5716. [Google Scholar]
Gardner, R.J.; Kiderlen, M.; Milanfar, P. Convergence of algorithms for reconstructing convex bodies and directional measures. Ann. Stat. 2006, 34, 1331–1374. [Google Scholar] [CrossRef] [Green Version]
Boser, B.E.; Guyon, I.M.; Vapnik, V.N. A training algorithm for optimal margin classifiers. In Proceedings of the Fifth Annual Workshop on Computational Learning Theory, Pittsburgh, PA, USA, 27–29 July 1992; pp. 144–152. [Google Scholar]
Shalev-Shwartz, S.; Singer, Y.; Srebro, N.; Cotter, A. Pegasos: Primal estimated sub-gradient solver for svm. Math. Program. 2011, 127, 3–30. [Google Scholar] [CrossRef] [Green Version]
Liu, J.; Ji, S.; Ye, J. Multi-task feature learning via efficient l2, 1-norm minimization. arXiv 2012, arXiv:1205.2631. [Google Scholar]
Diamond, S.; Boyd, S. CVXPY: A Python-embedded modeling language for convex optimization. J. Mach. Learn. Res. 2016, 17, 1–5. [Google Scholar]
Agrawal, A.; Verschueren, R.; Diamond, S.; Boyd, S. A rewriting system for convex optimization problems. J. Control Decis. 2018, 5, 42–60. [Google Scholar] [CrossRef]
Domahidi, A.; Chu, E.; Boyd, S. ECOS: An SOCP solver for embedded systems. In Proceedings of the European Control Conference (ECC), Zurich, Switzerland, 17–19 July 2013; pp. 3071–3076. [Google Scholar]

Figure 1. Automatic discovery algorithm implemented on a real quantum dot array with a sensor dot coupled to either a double dot or a triple dot. (a) Micrograph of the device. Individual electrons originating from the source (S) or drain (D) reservoirs can be trapped by gate-induced quantum dots (circles), located below electrostatic gate electrodes G

_{1, \dots, 4}

and controlled by voltages

V_{1, \dots, 4}

. For the targeted

[1, 1]

state, each qubit dot (red) contains one electron, whereas the sensor dot (black) is used to generate a high bandwidth sensor signal. (b) 2D map of the sensor signal as a function of the control voltages

V_{1}

and

V_{2}

, acquired in high resolution to illustrate the convex polytope of this device configuration. For all yellow pixels, the qubit array is in the

[1, 1]

state. (Whenever

V_{1, \dots, 3}

is changed,

V_{4}

is not held constant but compensates the sensor dot potential for capacitive cross talk from G

_{1}

, G

_{2}

and G

_{3}

, see experiments.) Our algorithm estimates state transitions to other states (red lines), based on a small number of point pairs

(x^{+}, x^{-})

(blue and orange dots) obtained via line searches. Target states of transitions are annotated by

[x, y]

. (c) Estimated state boundaries of the 3D polytope associated with single-electron occupation of three quantum dots (under G

_{1}

, G

_{2}

and G

_{3}

). The ground truth of this triple-dot system is already tedious to measure and unknown for this device tuning.

Figure 1. Automatic discovery algorithm implemented on a real quantum dot array with a sensor dot coupled to either a double dot or a triple dot. (a) Micrograph of the device. Individual electrons originating from the source (S) or drain (D) reservoirs can be trapped by gate-induced quantum dots (circles), located below electrostatic gate electrodes G

_{1, \dots, 4}

and controlled by voltages

V_{1, \dots, 4}

. For the targeted

[1, 1]

state, each qubit dot (red) contains one electron, whereas the sensor dot (black) is used to generate a high bandwidth sensor signal. (b) 2D map of the sensor signal as a function of the control voltages

V_{1}

and

V_{2}

, acquired in high resolution to illustrate the convex polytope of this device configuration. For all yellow pixels, the qubit array is in the

[1, 1]

state. (Whenever

V_{1, \dots, 3}

is changed,

V_{4}

is not held constant but compensates the sensor dot potential for capacitive cross talk from G

_{1}

, G

_{2}

and G

_{3}

, see experiments.) Our algorithm estimates state transitions to other states (red lines), based on a small number of point pairs

(x^{+}, x^{-})

(blue and orange dots) obtained via line searches. Target states of transitions are annotated by

[x, y]

. (c) Estimated state boundaries of the 3D polytope associated with single-electron occupation of three quantum dots (under G

_{1}

, G

_{2}

and G

_{3}

). The ground truth of this triple-dot system is already tedious to measure and unknown for this device tuning.

Figure 2. Average number of matching errors made by the algorithm as function of line search precision (see text). We show results on the 3D and 4D problems for our algorithm and the idealized baseline. (a): Voronoi region problem. (b): Problems drawn from the quantum device simulator.

Figure 3. Histograms of the volumes of the facets of the convex polytopes in 4D, the relative distribution of matching errors and the error rate at a given volume and for different

δ

. (Top): Voronoi problem. (Bottom): Simulated quantum devices.

Figure 3. Histograms of the volumes of the facets of the convex polytopes in 4D, the relative distribution of matching errors and the error rate at a given volume and for different

δ

. (Top): Voronoi problem. (Bottom): Simulated quantum devices.

Figure 4. Examples of typical errors of the algorithm with

δ = 0.2

on the simulated device. Shown are two different 2D cuts through the same 4D volume. Cuts are chosen such that a wrongly estimated facet appears as a vertical line through the origin (blue dashed line). Red lines represent the constraints of estimated facets that intersect with the image plane. For better visibility of errors, areas are colored based on a partitioning of the plane by the ground truth constraints. Each area depicts a region where a certain set of ground truth constraint agree on the point belonging to the convex polytope. The true area of the polytope where all constraints agree is depicted by dark grey. An estimated constraint is correct if it correctly separates the areas. Units are displayed relative to the chosen origin. (a): Error in estimating the correct angle of a small facet. The facet is found but significantly tilted to the right in comparison to the dotted line, which can be seen as it passes through the light purple area on top. (b): Missing edge due to small facet size.

Figure 4. Examples of typical errors of the algorithm with

δ = 0.2

on the simulated device. Shown are two different 2D cuts through the same 4D volume. Cuts are chosen such that a wrongly estimated facet appears as a vertical line through the origin (blue dashed line). Red lines represent the constraints of estimated facets that intersect with the image plane. For better visibility of errors, areas are colored based on a partitioning of the plane by the ground truth constraints. Each area depicts a region where a certain set of ground truth constraint agree on the point belonging to the convex polytope. The true area of the polytope where all constraints agree is depicted by dark grey. An estimated constraint is correct if it correctly separates the areas. Units are displayed relative to the chosen origin. (a): Error in estimating the correct angle of a small facet. The facet is found but significantly tilted to the right in comparison to the dotted line, which can be seen as it passes through the light purple area on top. (b): Missing edge due to small facet size.

Figure 5. Estimated cumulative distribution function (ECDF) depicting the fraction of reached precision targets over number of line searches for the simulated device. For each run there are 20 reachable targets, geometrically spaced between

ϵ_{end} = 1.5 \cdot δ

and 10. Let

X_{i}

be the training set used to train model

{\hat{P}}_{i}

in the i-th iteration of Algorithm 1. The jth target

t_{j}

is considered reached if we find an i such, that

{max}_{x \in X_{i + 1}} dist (\partial {\hat{P}}_{i}, x) < t_{j}

. The

| X_{i} |

for the smallest i fulfilling this is used as the number of line searches required. Vertical dashed lines represent the point where 75% of runs are terminated.

Figure 5. Estimated cumulative distribution function (ECDF) depicting the fraction of reached precision targets over number of line searches for the simulated device. For each run there are 20 reachable targets, geometrically spaced between

ϵ_{end} = 1.5 \cdot δ

and 10. Let

X_{i}

be the training set used to train model

{\hat{P}}_{i}

in the i-th iteration of Algorithm 1. The jth target

t_{j}

is considered reached if we find an i such, that

{max}_{x \in X_{i + 1}} dist (\partial {\hat{P}}_{i}, x) < t_{j}

. The

| X_{i} |

for the smallest i fulfilling this is used as the number of line searches required. Vertical dashed lines represent the point where 75% of runs are terminated.

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Krause, O.; Brovang, B.; Rasmussen, T.; Chatterjee, A.; Kuemmeth, F. Estimation of Convex Polytopes for Automatic Discovery of Charge State Transitions in Quantum Dot Arrays. Electronics 2022, 11, 2327. https://doi.org/10.3390/electronics11152327

AMA Style

Krause O, Brovang B, Rasmussen T, Chatterjee A, Kuemmeth F. Estimation of Convex Polytopes for Automatic Discovery of Charge State Transitions in Quantum Dot Arrays. Electronics. 2022; 11(15):2327. https://doi.org/10.3390/electronics11152327

Chicago/Turabian Style

Krause, Oswin, Bertram Brovang, Torbjørn Rasmussen, Anasua Chatterjee, and Ferdinand Kuemmeth. 2022. "Estimation of Convex Polytopes for Automatic Discovery of Charge State Transitions in Quantum Dot Arrays" Electronics 11, no. 15: 2327. https://doi.org/10.3390/electronics11152327

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Estimation of Convex Polytopes for Automatic Discovery of Charge State Transitions in Quantum Dot Arrays

Abstract

1. Introduction

2. Materials and Methods

2.1. Notation

2.2. The Constant Interaction Model

2.3. A Meta Algorithm for Finding Polytopes

2.4. Large Margin Polytopes

3. Experiments

4. Results

5. Discussion

6. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI