The Lattice Geometry of Walsh-Function-Based Adaptive Optics

Hu, Qi; Xiao, Yuyao; Cui, Jiahe; Turcotte, Raphaël; Booth, Martin J.

doi:10.3390/photonics9080547


by Qi Hu, Yuyao Xiao, Jiahe Cui, Raphaël Turcotte and Martin J. Booth *

Department of Engineering Science, University of Oxford, Oxford OX1 3PJ, UK

* Author to whom correspondence should be addressed.

Photonics 2022, 9(8), 547; https://doi.org/10.3390/photonics9080547

Submission received: 10 June 2022 / Revised: 16 July 2022 / Accepted: 30 July 2022 / Published: 4 August 2022

(This article belongs to the Special Issue Various Applications of Methods and Elements of Adaptive Optics)


Abstract

We show that there is an intrinsic link between the use of Walsh aberration modes in adaptive optics (AO) and the mathematics of lattices. The discrete and binary nature of these modes means that there are infinite combinations of Walsh mode coefficients that can optimally correct the same aberration. Finding such a correction is hence a poorly conditioned optimisation problem that can be difficult to solve. This can be mitigated by confining the AO correction space defined in Walsh mode coefficients to the fundamental Voronoi cell of a lattice. By restricting the correction space in this way, one can ensure there is only one set of Walsh coefficients that corresponds to the optimum correction aberration. This property is used to enable the design of efficient estimation algorithms to solve the inverse problem of finding correction aberrations from a sequence of measurements in a wavefront sensorless AO system. The benefit of this approach is illustrated using a neural-network-based estimator.

Keywords: lattice geometry; Walsh functions; adaptive optics

1. Introduction

Many adaptive optics (AO) methods have been developed to compensate phase aberrations in a range of applications including astronomy, ophthalmology and microscopy [1,2,3]. All AO systems are limited, in some way, by the capabilities of the adaptive element, typically a deformable mirror (DM) or a spatial light modulator (SLM), that corrects the aberrations. One such limitation is in the range of phase functions that the element can correct. The correction space of an AO element is defined by the range of phase functions that can be imparted by the device. For a pixelated AO device, such as an SLM or segmented DM, the correction space is defined by the set of accessible pixel states, which could be represented by the set of phase values for each pixel.

In many AO systems, it is preferable to design the system around a set of orthogonal modes for representation and control of the wavefront, rather than localized wavefront modulations. For example, wavefront-sensorless AO systems usually use a modal basis [4,5]. This method involves the sequential application of predetermined bias aberrations, the acquisition of a set of measurements of an appropriate quality metric, and then estimation of the required correction aberration. The conventional approach to sensorless AO is to use knowledge of the forward problem (that is, how the quality metric is affected by input aberrations) to inform the design of an efficient estimation scheme that, in effect, solves the inverse problem of finding the optimal correction aberrations from the set of metric measurements. Such estimation can be performed using optimisation algorithms or neural networks (NN) [6,7].

It is known that control using modes defined across the whole pupil provides stronger modulation of the optimisation metric than individual pixels or subregions of the pupil [5]. Such whole-pupil modulation hence provides more robust operation, particularly in low-light level imaging scenarios when the signal-to-noise ratio (SNR) is low. For such pixel-based sensorless AO systems, Walsh modes are an appropriate choice. Walsh modes are a set of orthogonal functions that represent phase patterns across a pixelated pupil, where the number of pixels is equal to a power of 2 [8,9]. Each Walsh mode consists of an equal number of pixels taking each of the binary values $+1$ or $-1$. For the different modes, a different combination of pixels takes the positive and negative values.

The binary nature of Walsh modes means that the range of each mode that must be searched to find the optimum correction is finite. This contrasts with other modal bases built upon continuous functions, such as Zernike polynomials, which would have unbounded range (albeit limited by the stroke of the adaptive correction element).

However, care needs to be taken when considering combinations of Walsh modes, as multiple combinations of modes can have the same effect on the system. This means that there are multiple potential solutions to the inverse problem of finding the optimal set of Walsh mode coefficients that optimise aberration correction. These multiple solutions can cause complications in defining an estimator to solve the inverse problem. Solving the inverse problem would be considerably simplified if we could ensure that there was only one optimal solution in the search space.

We show that there are properties of the Walsh modes that link the operation of these sensorless AO systems to the mathematics of lattices [10]. We discuss how these mathematical properties can aid the design of aberration estimation algorithms by constraining the search space. Specifically, we show heuristically that through understanding of the lattice geometry, we can define a unique finite search space, in terms of combinations of Walsh mode coefficients, that contains a single optimum correction. This permits the implementation of an efficient NN-based optimisation scheme that can measure and correct any combination of $N$ Walsh modes of any coefficient value using only $2N + 1$ metric measurements. We show that a simple NN can be trained to solve the inverse problem if the search space is constrained using the lattice model, whereas the correct combination of Walsh mode coefficients cannot reliably be found for a nonconstrained search space.

2. Optical System Model

For the purposes of modelling, we considered the simple sensorless adaptive optics system shown in Figure 1. Such a model has been extensively used for analysis of such sensorless systems [11,12], as the principle of operation is readily extendable to similar optical systems, including applications in laser material processing, free-space communications, and laser scanning microscopy.

The input beam to the system is collimated and has uniform amplitude. The input phase aberration is $\Phi(\mathbf{r})$ and the phase $\Psi(\mathbf{r})$ is added by the adaptive element (AE), which could be a pixelated SLM or a segmented DM. These are both considered to be added at the pupil $P$, which is taken to have unit radius; $\mathbf{r}$ is the normalised coordinate vector in the pupil. The lens performs a Fourier transform of the pupil field to provide a focal field. A vanishingly small pinhole detector is placed on axis at the centre of the focus and detects a signal $I$ that corresponds to the on-axis intensity at the focus. This is equivalent to the power of the zero-frequency component of the Fourier transform, which is equivalent to the squared modulus of the mean value of the pupil field. Mathematically, this can be expressed as

$$I = \left| \frac{1}{\pi} \iint_{P} \exp\left\{ i \left[ \Phi(\mathbf{r}) + \Psi(\mathbf{r}) \right] \right\} \mathrm{d}^{2}\mathbf{r} \right|^{2} \tag{1}$$

where the maximum, aberration-free signal is normalised to 1.

3. Representation of Aberrations as Walsh Modes

For simplicity, let us assume $\Phi(\mathbf{r}) = 0$ so that all aberrations can be represented within $\Psi(\mathbf{r})$. Let us also assume that the adaptive element is a pixelated device, where each of the $N$ pixels can introduce a piston phase. The phase introduced by the adaptive element could be expressed as

$$\Psi(\mathbf{r}) = \sum_{l=1}^{N} \alpha_{l}\, \eta_{l}(\mathbf{r}) \tag{2}$$

where $\eta_{l}(\mathbf{r})$ are the phase influence functions of each pixel, which have value 1 within the pixel area and 0 elsewhere; $\alpha_{l}$ are the coefficients of these influence functions, which correspond to the pixel phase values. Alternatively, we could represent the AE phase as

$$\Psi(\mathbf{r}) = \sum_{k=0}^{N-1} \beta_{k}\, \omega_{k}(\mathbf{r}) \tag{3}$$

where $\omega_{k}(\mathbf{r})$ are functions that take binary values of $-1$ or $+1$ in each pixel region, such that the $l$th pixel takes on the $l$th value of the $k$th Walsh function of length $N$, $W_{k}^{N}[l]$ [8]; $\beta_{k}$ are the coefficients of these functions $\omega_{k}(\mathbf{r})$. For a given sequence length $N = 2^{\gamma}$, where $\gamma$ is an integer, there are $N$ orthogonal Walsh functions, each of which consists of $N/2$ elements of value $-1$ and $N/2$ elements of value $+1$, except for the first function, which consists entirely of 1s (see examples in Figure 2). We follow the convention that the Walsh function index starts at $k = 0$. From the above relationships, it is clear that each pixel phase can be represented as

$$\alpha_{l} = \sum_{k=0}^{N-1} \beta_{k}\, W_{k}^{N}[l] \tag{4}$$

or alternatively in matrix–vector format as

$$\mathbf{a} = \mathbf{W}^{T} \mathbf{b} \tag{5}$$

where $\mathbf{a}$ is a vector of length $N$ that contains the phase value of each pixel, $\mathbf{b}$ is a vector of length $N$ that contains the coefficient of each Walsh function and $\mathbf{W}$ is an $N \times N$ Walsh–Hadamard matrix consisting of values $\pm 1$ [13,14]. The rows of this matrix correspond to each of the Walsh functions. The matrix $\mathbf{W}$ provides the mapping between the Walsh coefficients and the pixel values. For Hadamard matrices, $\mathbf{W}\mathbf{W}^{T} = N \mathbf{I}_{N}$, where $\mathbf{I}_{N}$ is the identity matrix of size $N$ [13,14]. Hence, we can invert Equation (5) as

$$\mathbf{b} = \frac{1}{N} \mathbf{W} \mathbf{a} \tag{6}$$

Note that for a set of Walsh functions to be defined, we require $N = 2^{\gamma}$, where $\gamma$ is an integer, and we assume this throughout the paper. However, Hadamard matrices can also exist for other orders of the form $N = 4\gamma$, where $\gamma$ is an integer [13,14]. For simplicity, these other matrices will not be considered in this current analysis.
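The mapping between pixel phases and Walsh coefficients in Equations (5) and (6) can be checked numerically. The following is a minimal sketch, not the authors' code: it assumes the Sylvester (natural-order) construction of the Walsh–Hadamard matrix, which may order the rows differently from the Walsh functions shown in Figure 2, and the function names and example coefficients are illustrative only.

```python
def hadamard(n):
    """Sylvester construction of an n x n Hadamard matrix (n must be a power of 2).

    Assumption: natural (Hadamard) row ordering; the paper may use another ordering.
    """
    if n < 1 or n & (n - 1):
        raise ValueError("n must be a power of 2")
    h = [[1]]
    while len(h) < n:
        h = [row + row for row in h] + [row + [-x for x in row] for row in h]
    return h


def walsh_to_pixels(b):
    """Equation (5): a = W^T b."""
    n = len(b)
    w = hadamard(n)
    return [sum(w[k][l] * b[k] for k in range(n)) for l in range(n)]


def pixels_to_walsh(a):
    """Equation (6): b = W a / N."""
    n = len(a)
    w = hadamard(n)
    return [sum(w[k][l] * a[l] for l in range(n)) / n for k in range(n)]


N = 8
W = hadamard(N)
# W W^T should equal N times the identity matrix.
gram = [[sum(W[j][l] * W[k][l] for l in range(N)) for k in range(N)] for j in range(N)]

b = [0.0, 0.3, -0.2, 0.0, 0.1, 0.0, 0.0, 0.5]  # illustrative Walsh coefficients
a = walsh_to_pixels(b)                          # pixel phases
b_back = pixels_to_walsh(a)                     # round trip recovers b
```

The round trip works only because the rows of $\mathbf{W}$ are mutually orthogonal with squared norm $N$, which is the property used to obtain Equation (6).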

4. Lattice Symmetry of Aberration States

We can define the aberration state fully as the vector of pixel values $\mathbf{a}$. Thus, we can consider that a point at position $\mathbf{a}$ in the $N$-dimensional space of pixel values is equivalent to an aberration state where any coefficient $\alpha_{l}$ is replaced by $\alpha_{l} + 2\pi q$, where $q$ is an arbitrary integer. This reveals a repetitive structure in each coordinate of the $N$-dimensional space that results in a lattice-type symmetry. Hence, in this space, there is an infinite number of points that represent a given aberration state, and these points are arranged in a transformed integer lattice $Z_{N}$ that is scaled by a factor of $2\pi$ and offset by the pixel value $\alpha_{l}$ along each dimension. Furthermore, the lattice structure is based around a fundamental unit that is an $N$-dimensional cube of side length $2\pi$; this fundamental unit is known as a Voronoi cell [10].
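The $2\pi$ periodicity in each pixel coordinate can be illustrated with a short numerical sketch. It assumes the discretised on-axis signal for equal-area pixels (the mean of the pixel phasors, squared in modulus, as in Equation (7)); the phase values and integer offsets are arbitrary illustrative choices.

```python
import cmath
import math


def signal(a):
    """On-axis intensity for pixel phases a, assuming equal-area pixels."""
    return abs(sum(cmath.exp(1j * x) for x in a) / len(a)) ** 2


a = [0.4, -1.1, 2.0, 0.7]   # arbitrary pixel phases (radians)
q = [3, -2, 0, 5]           # arbitrary integer offset for each pixel
a_shifted = [x + 2 * math.pi * n for x, n in zip(a, q)]

# Both points of the pixel-value space describe the same aberration state.
difference = abs(signal(a) - signal(a_shifted))
```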

The matrix–vector operation of Equation (6) can now be interpreted as a rotation or reflection (as $\mathbf{W}/\sqrt{N}$ is an orthogonal matrix) combined with a scaling by $1/\sqrt{N}$ of the vector $\mathbf{a}$ to give the Walsh coefficient vector $\mathbf{b}$. The lattice symmetry is hence maintained in a rotated and scaled form when the state is described by $\mathbf{b}$. When represented by the vector $\mathbf{a}$, any Walsh function consists of equal magnitude amounts ($+1$ or $-1$) of each pixel value, so the vector must be directed along certain body diagonals of the cubic Voronoi cell. After transformation, these body diagonals lie along the axes of the vector space containing $\mathbf{b}$. This lattice symmetry will be used for derivations later in this article.

5. Effects of Pixels and Modes on Signal Modulation

Let us assume that each pixel of the AE has equal area, so that the integration used in Equation (1) can be replaced by a summation. Equal areas are appropriate if the amplitude profile at the pupil is uniform; for nonuniform illumination, the pixel area should be varied to provide the same total power in each pixel (e.g., the pixels could be larger near the edge of the pupil for a Gaussian illumination profile). No constraint is placed here on the position or shape of the pixels. Assuming here that $\Phi(\mathbf{r}) = 0$, the signal is

$$I = \left| \frac{1}{N} \sum_{l=1}^{N} \exp(i \alpha_{l}) \right|^{2} = \left| \frac{1}{N} \sum_{l=1}^{N} \exp\left( i \sum_{k=0}^{N-1} \beta_{k} W_{k}^{N}[l] \right) \right|^{2} \tag{7}$$

If the arbitrarily chosen $l$th pixel is varied and all other pixels have the same value (here arbitrarily set to zero), then

$$I(\alpha_{l}) = \left(1 - \frac{D}{2}\right) + \frac{D}{2} \cos \alpha_{l} \tag{8}$$

where the modulation depth is $D = 4(N-1)/N^{2}$. For all Walsh functions other than $W_{0}^{N}[l]$, the signal is also cyclic as a function of $\beta_{k}$:

$$I(\beta_{k}) = \cos^{2} \beta_{k} \tag{9}$$

where the modulation depth has the maximum possible value of 1 and the period in terms of $\beta_{k}$ is $\pi$. The effects of single pixel and modal variations are shown in Figure 3.
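Equations (8) and (9) can be compared against a direct evaluation of Equation (7). The sketch below is illustrative only: it assumes a Sylvester-ordered Walsh–Hadamard matrix (the ordering does not affect the result), and the helper names are hypothetical.

```python
import cmath
import math


def hadamard(n):
    """Sylvester construction of an n x n Hadamard matrix (n a power of 2)."""
    h = [[1]]
    while len(h) < n:
        h = [row + row for row in h] + [row + [-x for x in row] for row in h]
    return h


def signal(a):
    """Equation (7): squared modulus of the mean pixel phasor."""
    return abs(sum(cmath.exp(1j * x) for x in a) / len(a)) ** 2


N = 8
W = hadamard(N)
D = 4 * (N - 1) / N ** 2          # modulation depth for a single pixel


def single_pixel(alpha, l=0):
    """Signal when only pixel l is varied and all others are zero."""
    a = [0.0] * N
    a[l] = alpha
    return signal(a)


def single_mode(beta, k):
    """Signal when only Walsh mode k is applied with coefficient beta."""
    return signal([beta * W[k][l] for l in range(N)])


pixel_direct = single_pixel(1.3)
pixel_formula = (1 - D / 2) + (D / 2) * math.cos(1.3)    # Equation (8)
mode_direct = [single_mode(0.7, k) for k in range(1, N)]
mode_formula = math.cos(0.7) ** 2                         # Equation (9)
```

For $N = 8$ the single-pixel modulation depth is $D = 28/64 \approx 0.44$, compared with a depth of 1 for any non-piston Walsh mode.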

If a combination of Walsh modes is present with small coefficients, we can use a Maclaurin expansion of the exponential in Equation (7) to give

$$I \approx 1 - \sum_{j} \sum_{k} \beta_{j} \beta_{k} \left( \frac{1}{N} \sum_{l} W_{j}^{N}[l]\, W_{k}^{N}[l] \right) + \left[ \sum_{k} \beta_{k} \left( \frac{1}{N} \sum_{l} W_{k}^{N}[l] \right) \right]^{2} \tag{10}$$

The term in the final bracket, $\frac{1}{N} \sum_{l} W_{k}^{N}[l]$, is equal to zero except when $k = 0$, in which case it has value 1. The orthogonality property of the Walsh functions means that $\frac{1}{N} \sum_{l} W_{j}^{N}[l]\, W_{k}^{N}[l]$ in the second term is equivalent to the Kronecker delta function $\delta_{jk}$. Hence, the signal is approximately

$$I \approx 1 - \sum_{k=1}^{N-1} \beta_{k}^{2} \tag{11}$$

This is equivalent to the well-known approximation of the Strehl ratio as $1 - \phi_{rms}^{2}$, where $\phi_{rms}$ is the root mean square value of the aberration, which in this case is equal to $\sqrt{\sum_{k} \beta_{k}^{2}}$.
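A quick numerical comparison shows how closely Equation (11) tracks the exact signal of Equation (7) for small coefficients. This is a sketch under the same assumptions as before (Sylvester-ordered Walsh–Hadamard matrix, illustrative coefficient values chosen here rather than taken from the paper).

```python
import cmath


def hadamard(n):
    """Sylvester construction of an n x n Hadamard matrix (n a power of 2)."""
    h = [[1]]
    while len(h) < n:
        h = [row + row for row in h] + [row + [-x for x in row] for row in h]
    return h


def signal(a):
    """Equation (7): squared modulus of the mean pixel phasor."""
    return abs(sum(cmath.exp(1j * x) for x in a) / len(a)) ** 2


N = 8
W = hadamard(N)
# Small illustrative Walsh coefficients; b[0] is the piston term.
b = [0.0, 0.02, -0.03, 0.01, 0.0, 0.04, -0.01, 0.02]
a = [sum(W[k][l] * b[k] for k in range(N)) for l in range(N)]   # Equation (5)

exact = signal(a)
approx = 1 - sum(x * x for x in b[1:])                          # Equation (11)
```

The discrepancy between `exact` and `approx` is of fourth order in the coefficients, so the approximation degrades quickly once the rms aberration approaches a radian.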

6. Defining a Well-Corrected System

If we define our system to be "well-corrected" when the root-mean-square (rms) phase error is below a chosen value, such that $\phi_{rms} \leq \epsilon$, then the system will be well-corrected when $I \geq 1 - \epsilon^{2}$. We can also express the second term in Equation (11) in terms of the length-$(N-1)$ vector $\mathbf{b}'$, which is equivalent to $\mathbf{b}$ with the piston coefficient removed, as

$$I \approx 1 - |\mathbf{b}'|^{2} \tag{12}$$

Our condition for being well-corrected is hence equivalent to requiring that $|\mathbf{b}'| \leq \epsilon$. Interpreted geometrically, this means that any point within an $(N-1)$-dimensional spherical volume of radius $\epsilon$ centred on the point where $I = 1$ will be considered well-corrected.

In practice, the total aberration in a system will be the sum of the input aberration and that introduced by the AE, that is $\Phi(\mathbf{r}) + \Psi(\mathbf{r})$. The values of $\mathbf{b}'$ discussed here consolidate these two sources of aberrations to represent the residual aberrations, such that we seek a perfect correction for which $\mathbf{b}' = \mathbf{0}$. We will also assume for this analysis that the input aberration consists entirely of modes that can be corrected by the AE.

7. Vector Space Representation of Well-Corrected States

The signal variation as a function of pixel values was given in Equation (7) and can be expressed alternatively as

$$I = \left| \frac{1}{N} \sum_{l=1}^{N} \exp(i \alpha_{l}) \right|^{2} = \frac{1}{N^{2}} \left| 1 + \sum_{l=2}^{N} \exp(i \alpha_{l}') \right|^{2} \tag{13}$$

where $\alpha_{l}' = \alpha_{l} - \alpha_{1}$. From this, we find the value $I = 1$ can only be obtained if $\alpha_{l}' \bmod 2\pi = 0$ for all $l$. We derive this result by considering the phasor sum of the terms in the final modulus expression: the maximum signal is only obtained when all of the exponential terms in the summation are real and equal to 1.

In a more general case where all pixels are offset by a mean pixel phase value $c$ rather than the first pixel phase value, we could state that $I = 1$ only if each element of the vector $\mathbf{a}$ has a value $\alpha_{l} = c + 2n\pi$, where $n$ is an arbitrary integer that may differ between pixels. We can also express the signal explicitly as $I(\mathbf{a})$; this is a function of the vector $\mathbf{a}$, which describes a point in an $N$-dimensional space. In this way, we can see that $I(\mathbf{a})$ has maximum value 1 at the origin of this space when $c$ is zero. Furthermore, we see that there is an infinite number of points in this space at which $I(\mathbf{a}) = 1$. For example, on each of the axes, there are points where $I(\mathbf{a}) = 1$ that are equally spaced at steps of $2\pi$. Varying the value of $c$ is equivalent to adding a constant phase to every pixel (or equivalent to adding the piston mode to the whole pupil) and thus has no effect on the signal. We deduce therefore that there are infinite lines of $I(\mathbf{a}) = 1$ parallel to the vector $(1, 1, \dots, 1)^{T}$. As the value of $c$ has no effect on the signal, we can set this arbitrarily to zero without affecting further analysis. This is equivalent to removing the piston mode. It is also equivalent to taking the $(N-1)$-dimensional subspace that includes the origin and is orthogonal to the direction $(1, 1, \dots, 1)^{T}$. The positions of the maxima in this slice would be equivalent to the positions of a scaled version of the integer lattice $Z_{N}$, as explained in Section 4, projected along the direction $(1, 1, \dots, 1)^{T}$. An illustration is provided in Figure 4.

8. Lattice Representation of States after Removal of Piston

The removal of the piston mode is equivalent to the removal of the first row of the matrix $\mathbf{W}$ to create the reduced matrix $\mathbf{W}'$ and removing the corresponding element of $\mathbf{b}$ to obtain a reduced vector $\mathbf{b}'$ so that

$$\mathbf{a} = \mathbf{W}'^{T} \mathbf{b}' \tag{14}$$

As row–column products represent dot products between Walsh vectors, the following relationship is valid: $\mathbf{W}' \mathbf{W}'^{T} = N \mathbf{I}_{N-1}$, so Equation (14) can be inverted as

$$\mathbf{b}' = \frac{1}{N} \mathbf{W}' \mathbf{a} \tag{15}$$

We can interpret Equation (15) as a transformation from an $N$-dimensional vector of pixel values $\mathbf{a}$ to an $(N-1)$-dimensional vector of Walsh modal coefficients $\mathbf{b}'$.

The matrix $\mathbf{W}'$ is, however, a redundant representation, as it has more columns ($N$) than its rank ($N-1$). This is rectified by removal of any one of the columns to create the matrix $\mathbf{W}''$; we choose arbitrarily to remove the first column. In order to maintain the form of Equation (15), we remove the first element of the vector $\mathbf{a}$ to produce a new vector $\mathbf{a}'$. From a practical perspective, this means that the pixel value $\alpha_{1}$ is a dependent parameter determined by the other pixel values because of removal of the piston mode. Equation (15) then becomes

$$\mathbf{b}' = \frac{1}{N} \mathbf{W}'' \mathbf{a}' \tag{16}$$

The operation performed by matrix $\mathbf{W}''$ is to map the vector $\mathbf{a}'$ to the corresponding vector $\mathbf{b}'$. Similarly, the operation of $\mathbf{W}''$ would be to transform (project) the positions of the maxima of $I(\mathbf{a}) = 1$, which were located on lines passing through a scaled integer lattice $Z_{N}$ (as illustrated in Figure 4), to another lattice in the $(N-1)$-dimensional space spanned by $\mathbf{b}'$.

We can determine the properties of this new lattice by considering its Gram matrix, which is the matrix of the inner products between its lattice vectors [10]. The Gram matrix is hence given by

$$\mathbf{G} = \frac{1}{N} \mathbf{W}''^{T} \mathbf{W}'' = \frac{1}{N} \begin{pmatrix} N-1 & -1 & \cdots \\ -1 & N-1 & \cdots \\ \vdots & \vdots & \ddots \end{pmatrix} \tag{17}$$

where the factor of $1/N$ has been chosen so that the basis vectors are equivalent to Walsh functions with normalised vector magnitudes. $\mathbf{G}$ is equivalent to the Gram matrix of the so-called $A_{N-1}^{*}$ lattice [10], which is an $(N-1)$-dimensional analogue of the body centred cubic (BCC) lattice in three dimensions. It follows that the maxima in the $(N-1)$-dimensional space spanned by $\mathbf{b}'$ must be located at the lattice points of a scaled $A_{N-1}^{*}$ lattice.
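The structure of the Gram matrix in Equation (17) can be reproduced directly from a Walsh–Hadamard matrix. The sketch below is an illustration rather than the authors' code; it assumes the Sylvester construction, for which the first row and first column are all ones, and uses $N = 8$ as an example.

```python
def hadamard(n):
    """Sylvester construction of an n x n Hadamard matrix (n a power of 2)."""
    h = [[1]]
    while len(h) < n:
        h = [row + row for row in h] + [row + [-x for x in row] for row in h]
    return h


N = 8
M = N - 1
W = hadamard(N)
# W'': remove the first row (piston mode) and the first column (reference pixel).
W2 = [row[1:] for row in W[1:]]

# G = (1/N) W''^T W'', Equation (17).
G = [[sum(W2[k][i] * W2[k][j] for k in range(M)) / N for j in range(M)]
     for i in range(M)]

diagonal = {G[i][i] for i in range(M)}
off_diagonal = {G[i][j] for i in range(M) for j in range(M) if i != j}
```

Each diagonal entry equals $(N-1)/N$ and each off-diagonal entry equals $-1/N$, because two distinct columns of $\mathbf{W}$ are orthogonal and the removed first row contributed $+1$ to every column product.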

Understanding the symmetries of this lattice thus provides an understanding of the symmetries of the function $I(\mathbf{b}')$. For example, the response of the signal around each lattice point should be identical. In other words, $I(\mathbf{b}' - \mathbf{d}_{m}) = I(\mathbf{b}')$ for all $m$, where $\mathbf{d}_{m}$ represents an arbitrary lattice point. As there is an infinite number of lattice points, there are infinitely many combinations of the $(N-1)$ Walsh coefficients that can provide the optimal correction. Furthermore, correction to a precision of $\phi_{rms} \leq \epsilon$ can be achieved by finding a setting for the adaptive correction device that places $\mathbf{b}'$ within a sphere of radius $\epsilon$ centred upon any of the lattice points.
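The translation symmetry $I(\mathbf{b}' - \mathbf{d}_{m}) = I(\mathbf{b}')$ can be verified numerically. The sketch below assumes that the lattice points in coefficient space are the images $\mathbf{d} = \frac{1}{N}\mathbf{W}'(2\pi\mathbf{q})$ of the pixel-space points $2\pi\mathbf{q}$ with integer $\mathbf{q}$, following Equation (15); the random seed, ranges and helper names are illustrative choices.

```python
import cmath
import math
import random


def hadamard(n):
    """Sylvester construction of an n x n Hadamard matrix (n a power of 2)."""
    h = [[1]]
    while len(h) < n:
        h = [row + row for row in h] + [row + [-x for x in row] for row in h]
    return h


N = 8
W_red = hadamard(N)[1:]          # W': piston row removed, shape (N-1) x N


def signal_from_walsh(b_red):
    """I(b'): pixel phases from Equation (14), then the signal of Equation (7)."""
    a = [sum(W_red[k][l] * b_red[k] for k in range(N - 1)) for l in range(N)]
    return abs(sum(cmath.exp(1j * x) for x in a) / N) ** 2


rng = random.Random(0)
b_red = [rng.uniform(-2.0, 2.0) for _ in range(N - 1)]   # arbitrary coefficients
q = [rng.randint(-3, 3) for _ in range(N)]               # arbitrary integer vector

# Lattice point in coefficient space: d = (1/N) W' (2 pi q).
d = [2 * math.pi * sum(W_red[k][l] * q[l] for l in range(N)) / N
     for k in range(N - 1)]
b_shifted = [x - y for x, y in zip(b_red, d)]

difference = abs(signal_from_walsh(b_red) - signal_from_walsh(b_shifted))
```

Subtracting `d` changes every pixel phase by an integer multiple of $2\pi$ plus a common piston term, which is why the signal is unchanged.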

9. Fundamental Correction Space

The lattice model allows us to define a fundamental correction space, that is, the range of $\mathbf{b}'$ we must search to find an optimal correction. This fundamental correction space is smaller than the correction space covered by all pixel values in the range 0 to $2\pi$ radians. The lattice symmetry of the function $I(\mathbf{b}')$ indicates that we need only search the Voronoi cell of the lattice in order to cover all possible states. Therefore, the search space is the Voronoi cell of the scaled $A_{N-1}^{*}$ lattice, whose properties are known [10]. The position of the cell's vertices can be readily calculated. For example, the Voronoi cell of the $A_{3}^{*}$ (or BCC) lattice is a truncated octahedron; this would be the Voronoi cell for an $N = 4$ pixel system and is illustrated in Figure 5.

Using the symmetries of the Voronoi cell, further general properties of this fundamental correction space can be derived. Moving along any of the axes from a lattice point at which $I(\mathbf{b}') = 1$, we encounter another lattice point at a distance $b = \pi$ (noting that this corresponds to the variation of a single Walsh function, the pixel values of which will be $\pm\pi$ for this value of $b$; see Equation (9)). Therefore, the halfway point between lattice points along the axis is at a distance $b = \pi/2$. Hence, the distance between two faces of the Voronoi cell along such an axis is $\pi$. By looking solely along the axes, one might assume that the search space is an $(N-1)$-dimensional cube of side length $\pi$, which would have a volume of $\pi^{N-1}$. However, the Voronoi cell's volume is given by

$$\pi^{N-1} \sqrt{\det \mathbf{G}} = \frac{\pi^{N-1}}{\sqrt{N}} \tag{18}$$

which is smaller than that of the encompassing hypercube by a factor of $\sqrt{N}$ [10]. Hence, for large numbers of pixels, the search space is considerably smaller than might be assumed if considering the pixel phases directly.
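The determinant used in Equation (18) can be confirmed with exact rational arithmetic. This sketch builds the Gram matrix directly from the closed form in Equation (17); the function names are hypothetical, and the Gaussian elimination routine is a generic textbook method rather than anything specific to the paper.

```python
from fractions import Fraction
import math


def gram_matrix(n):
    """Gram matrix of Equation (17) for an n-pixel system, as exact fractions."""
    m = n - 1
    return [[Fraction(n - 1, n) if i == j else Fraction(-1, n) for j in range(m)]
            for i in range(m)]


def determinant(matrix):
    """Determinant by Gaussian elimination with exact rational arithmetic."""
    m = [row[:] for row in matrix]
    size = len(m)
    det = Fraction(1)
    for c in range(size):
        pivot = next((r for r in range(c, size) if m[r][c] != 0), None)
        if pivot is None:
            return Fraction(0)
        if pivot != c:
            m[c], m[pivot] = m[pivot], m[c]
            det = -det
        det *= m[c][c]
        for r in range(c + 1, size):
            factor = m[r][c] / m[c][c]
            for j in range(c, size):
                m[r][j] -= factor * m[c][j]
    return det


def volume_ratio(n):
    """Voronoi-cell volume divided by the volume pi^(n-1) of the enclosing cube."""
    return math.sqrt(determinant(gram_matrix(n)))


dets = {n: determinant(gram_matrix(n)) for n in (2, 4, 8, 16)}
```

For example, `volume_ratio(16)` evaluates to 0.25, so for a 16-pixel device the fundamental correction space occupies a quarter of the volume of the cube of side $\pi$.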

10. Implications of the Lattice Structure for Sensorless AO

We have shown that searching the Voronoi cell of the $A_{N-1}^{*}$ lattice is sufficient to find the optimal correction in the whole Walsh coefficient space of the adaptive element. The lattice structure also means that this same cell repeats over the whole space. Consequently, if the aberration in the system can be accurately represented by a finite number of Walsh modes, then the necessary search space is finite. This contrasts with an aberration represented by a finite number of continuous modes, such as Zernike polynomials, where the search space would have to be infinite in extent to cover all possible coefficient values.

In modal sensorless AO correction schemes, a sequence of predetermined bias aberrations for a fixed set of correction modes is applied to the adaptive element and the corresponding signal values are recorded. From this set of measurements, the correction aberration is estimated using an appropriately chosen optimisation algorithm. When using continuous modes, such estimation can provide accurate correction for aberrations over a limited magnitude range but usually provides poor estimation outside this range. Using Walsh modes, however, the finite search space within one Voronoi cell of the lattice structure for a fixed set of modes means that it is possible to design a correction scheme that is accurate across all possible aberrations within the Walsh mode set. For the continuous modes, the bias aberrations span a finite range, such as in the typical configuration for sensorless AO of having equal magnitude positive and negative biases for each mode. However, the same configuration for Walsh modes in effect spans an infinite range, as the bias positions are also repeated in the lattice structure across the whole coefficient space.

If a sequence of intensity measurements is taken with bias aberrations defined as each Walsh mode with an amplitude of $\pi/2$, the set of measurements is related to the Walsh transform of the pixel values (see Appendix B). The sum of these measurements is equal to the intensity after aberration correction. This has important implications for the normalisation of measurements for use in aberration estimation algorithms.
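The sum rule for $\pi/2$ biases can be checked numerically in the simple model of Equation (7). The sketch below rests on two assumptions that go beyond the text here, since Appendix B is not reproduced: the sum runs over all $N$ Walsh modes including piston ($k = 0$), and the relevant Walsh transform is taken of the pixel phasors $\exp(i\alpha_{l})$. With those assumptions, a bias of $\frac{\pi}{2} W_{k}^{N}[l]$ multiplies each phasor by $i\,W_{k}^{N}[l]$, and the sum of the measurements equals the normalised corrected intensity of 1.

```python
import cmath
import math
import random


def hadamard(n):
    """Sylvester construction of an n x n Hadamard matrix (n a power of 2)."""
    h = [[1]]
    while len(h) < n:
        h = [row + row for row in h] + [row + [-x for x in row] for row in h]
    return h


def signal(a):
    """Equation (7): squared modulus of the mean pixel phasor."""
    return abs(sum(cmath.exp(1j * x) for x in a) / len(a)) ** 2


N = 8
W = hadamard(N)
rng = random.Random(1)
a = [rng.uniform(-math.pi, math.pi) for _ in range(N)]   # arbitrary aberration

# One measurement per Walsh mode, each with a bias of amplitude pi/2.
biased = [signal([a[l] + (math.pi / 2) * W[k][l] for l in range(N)])
          for k in range(N)]

# Squared modulus of the (normalised) Walsh transform of the pixel phasors.
phasors = [cmath.exp(1j * x) for x in a]
walsh_power = [abs(sum(W[k][l] * phasors[l] for l in range(N)) / N) ** 2
               for k in range(N)]

total = sum(biased)
```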

11. Neural Network for Solution of the Inverse Problem

We used the knowledge of the fundamental correction space to design estimators for a sensorless AO scheme. Specifically, the estimation process should solve the inverse problem to obtain aberration coefficients from the set of biased intensity measurements. We chose to demonstrate this with an NN estimator, whose design incorporates physical knowledge of the system. This method was chosen as it is more readily extendable to more advanced AO systems than conventional optimisation algorithms.

In this demonstration, we compare two similar NN-based methods for which the search space is defined differently. In the first case, it was assumed that each of the Walsh mode coefficients $a_{k}$ can take any value $-\pi/2 < a_{k} \leq \pi/2$. This was equivalent to taking any point in an $(N-1)$-dimensional cube in coefficient space (we refer to this as the "hypercube cell"). In the second case, the coefficients were chosen so that they lie only within the Voronoi cell centred at the origin (we refer to this as the "primary Voronoi cell"). This primary Voronoi cell would be a sub-region of the hypercube used in the first case. Based upon the previous analysis, it was known that there would be a single point corresponding to optimum correction in the primary Voronoi cell but multiple such points in the hypercube cell.

Having multiple global optima in the search space can be detrimental when using neural networks to perform such an optimisation. This is because such ill-conditioned problems have no unique answer and thus prevent convergence of the network training. We employed a bespoke NN architecture that was developed to take advantage of the particular physical process used in the sensorless AO scheme. The overall process and the NN architecture are outlined here. More details about the NN can be found in Appendix D. In order to adequately sample the space, a biasing scheme was chosen that used $2N - 1$ measurements. This corresponded to the application of positive and negative biases of magnitude $\pi/3$ for each of the $(N-1)$ Walsh modes, excluding piston; an additional nonbiased measurement was also included. For the $k$th mode, we denote the negatively biased measurement as $I_{k}^{-1}$, the positively biased measurement as $I_{k}^{+1}$ and the unbiased measurement as $I_{0}$.

The NN process was constructed as shown in Figure 6; a more detailed description of the architecture is given in Appendix D. The NN takes two separate sets of inputs, both of which rely on biased intensities (generated using Equation (1)): the first (Input1) directly uses these normalised intensity values, while the second (Input2) analytically processes the intensities based upon sinusoidal estimation to acquire a set of preliminary aberration coefficient estimates. The first input is passed into a convolutional neural network (CNN) followed by fully connected layers (FCL). It is then concatenated with the second input (Input2) and passed into fully connected layers to generate the outputs that correspond to the estimated Walsh coefficients. The rationale behind this dual input approach was that the learning task would be easier if based, in effect, on the differences between rough estimates and actual measurements rather than on the raw measurements themselves.

For the instance of $N = 8$, Input1 was structured as a matrix of the following form:

$$\mathrm{Input1} = \begin{pmatrix} I_{0} & I_{0} & I_{0} \\ I_{0} & I_{1}^{+1} & I_{1}^{-1} \\ I_{0} & I_{2}^{+1} & I_{2}^{-1} \\ I_{0} & I_{3}^{+1} & I_{3}^{-1} \\ I_{0} & I_{4}^{+1} & I_{4}^{-1} \\ I_{0} & I_{5}^{+1} & I_{5}^{-1} \\ I_{0} & I_{6}^{+1} & I_{6}^{-1} \\ I_{0} & I_{7}^{+1} & I_{7}^{-1} \end{pmatrix} \tag{19}$$

This structure was chosen so that the CNN block could interpret known correlations from biased measurement values. Indeed, the first CNN layer was structured so that it operated first on the 3-tuples of intensity values contained in each row of Input1, which were each expected to depend primarily on the corresponding coefficient $a_{k}$. Further CNN layers sought to operate on correlations between the different coefficients.

The rough estimation used for Input2 was based upon the knowledge that variations in a single Walsh mode coefficient led to a sinusoidal relationship with detected signal (see Appendix C). The value of each coefficient in Input2 was set to the value that provided the maximum value of this sinusoidal variation. This estimation provides only a rough value for correction, as it treats each coefficient separately and hence does not deal with the coupling effect between combinations of modes.
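One way such a per-mode rough estimate could be computed is a three-point sinusoidal fit. The sketch below is an assumption made for illustration, not the procedure of Appendix C (which is not reproduced here): it uses the fact that, in the model of Equation (7), the signal varies with a single Walsh bias $\beta$ as $A + R\cos(2\beta + \varphi)$, so the three measurements at $\beta = -\pi/3, 0, +\pi/3$ sample one full period at equal spacing and determine $\varphi$ exactly. The function name is hypothetical.

```python
import cmath
import math

BIAS = math.pi / 3   # bias magnitude stated in the text


def rough_coefficient(i_minus, i_zero, i_plus):
    """Bias value that maximises the fitted sinusoid for one Walsh mode.

    Assumes I(beta) = A + R*cos(2*beta + phi) sampled at beta = -pi/3, 0, +pi/3.
    The result lies in [-pi/2, pi/2).
    """
    # First Fourier coefficient of the three equally spaced samples: (3R/2) e^{i phi}.
    s = (i_minus * cmath.exp(2j * BIAS)
         + i_zero
         + i_plus * cmath.exp(-2j * BIAS))
    phi = cmath.phase(s)
    return -phi / 2      # the maximum occurs where 2*beta + phi = 0


# Example: a single-mode aberration x gives I(beta) = cos^2(x + beta), Equation (9).
x = 0.4
estimate = rough_coefficient(math.cos(x - BIAS) ** 2,
                             math.cos(x) ** 2,
                             math.cos(x + BIAS) ** 2)
```

In this single-mode example the maximising bias is the negative of the aberration coefficient, i.e. the correction that cancels it. When several modes are present the fit along each mode is still exact, but treating the modes one at a time ignores their coupling, which is the limitation noted in the text.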

Separate instances of this NN architecture were trained for the two scenarios. In the first case of the hypercube cell, the training and validation data were obtained by assigning a random value to each input coefficient in the range $-\pi/2 < a_{k} \leq \pi/2$. For the second case, the same combinations of coefficients were "wrapped" so that they lay only within the primary Voronoi cell (see Appendix D). In each case, the same calculated intensity measurements were used at the input. The difference between the two training processes lies in the coefficient labels that were used to calculate the loss function during network training: in the first case, the coefficient labels were defined throughout the hypercube cell; in the second case, the corresponding labels were wrapped into the primary Voronoi cell. Full details of the training procedure are provided in Appendix D.
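A possible wrapping procedure is sketched below. It is an assumption made for illustration and is not taken from Appendix D, which is not reproduced here. It relies on two facts that follow from Equations (14) and (15): $|\mathbf{b}'|^{2}$ equals the variance of the pixel phases, and equivalent states differ by integer multiples of $2\pi$ in each pixel. Finding the equivalent point closest to the origin therefore amounts to choosing the integer offsets that minimise the pixel-phase variance, and it is sufficient to test the $N$ candidates obtained by sorting the fractional parts of the phases. The function name and example values are hypothetical.

```python
import math


def hadamard(n):
    """Sylvester construction of an n x n Hadamard matrix (n a power of 2)."""
    h = [[1]]
    while len(h) < n:
        h = [row + row for row in h] + [row + [-x for x in row] for row in h]
    return h


def wrap_to_voronoi(b_red):
    """Return the lattice-equivalent point of b' that is closest to the origin."""
    n = len(b_red) + 1
    w_red = hadamard(n)[1:]                       # W': piston row removed
    # Pixel phases in units of 2*pi, from Equation (14).
    x = [sum(w_red[k][l] * b_red[k] for k in range(n - 1)) / (2 * math.pi)
         for l in range(n)]
    base = [math.floor(v) for v in x]
    # Pixels ordered by decreasing fractional part.
    order = sorted(range(n), key=lambda l: x[l] - base[l], reverse=True)
    best = None
    for j in range(n):
        q = base[:]
        for l in order[:j]:                       # round up the j largest fractions
            q[l] += 1
        r = [x[l] - q[l] for l in range(n)]
        mean = sum(r) / n
        spread = sum((v - mean) ** 2 for v in r)  # proportional to |b'|^2
        if best is None or spread < best[0]:
            best = (spread, r)
    r = best[1]
    # Back to Walsh coefficients, Equation (15).
    return [2 * math.pi * sum(w_red[k][l] * r[l] for l in range(n)) / n
            for k in range(n - 1)]


inside = wrap_to_voronoi([0.1, -0.2, 0.05])            # already in the cell
lattice_point = wrap_to_voronoi([math.pi, 0.0, 0.0])   # equivalent to the origin
beyond_face = wrap_to_voronoi([math.pi / 2 + 0.1, 0.0, 0.0])
```

For the $N = 4$ examples above, a point just beyond the cell face at $b = \pi/2$ on an axis is mapped to the corresponding point just inside the opposite face, consistent with the face-to-face distance of $\pi$ described in Section 9.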

Results are shown in Figure 7 for the case of $N = 8$, which corresponds to the correction of 7 Walsh modes. The loss function curves showed that only the scenario of confining the coefficients to within the primary Voronoi cell permitted the NN to converge. The mean squared phase error was reduced to 0.063 radians after correction, which corresponded to a Strehl ratio of approximately 0.6. In comparison, the loss for the hypercube case was significantly higher and the validation loss did not reduce while deviating from the training loss. This demonstrated the effectiveness of using prior knowledge about the lattice symmetries. It is expected that similar trends would be seen in scenarios with large pixel numbers, alongside more complicated NN architectures.

These results show clearly that the prior knowledge of the lattice geometry, and hence the need to confine the estimation to the primary Voronoi cell, is essential to effective training of the NN. When training using coefficients selected from the hypercube cell, the loss functions (which are related to wavefront estimation errors) do not converge sufficiently to provide good aberration correction. This is attributed to the existence of multiple optimal solutions within the hypercube cell that complicate the training process. However, this problem is avoided when using a training set where the labels were “wrapped” and the resulting labels were within the primary Voronoi cell, and thus only one optimal combination of Walsh mode coefficients is present.

12. Conclusions

The insight provided by the mathematical link between Walsh mode AO and lattices informs the design of efficient sensorless AO schemes. This is relevant to the solution of the inverse problem of how to estimate the aberration coefficients from metric measurements with different applied bias aberrations. The new insights are particularly important when using NNs to solve this inverse problem, as otherwise we suffer from the challenges caused by having multiple solutions for a given set of metric measurements, which requires more complicated networks.

Although the Walsh modes are pixelated, when sufficient pixels are used, they can provide a suitably accurate representation of low-order aberrations. While continuous modes, such as the Zernike polynomials, are commonly used, they are not guaranteed to provide a good fit to high-order aberrations in the system, nor to the correction device, which may well be pixelated in nature. The results presented here have relevance not just for correction of low-order aberrations but also for high-order scattering compensation, where previous schemes have been based around control of Walsh modes (or similar) [15,16,17].

The analysis presented in this paper was based around a simple AO focussing system. However, as the overall lattice geometry arises from the nature of the aberration representation, and not the AO system or the optimisation metric, a similar repetitive lattice structure and primary Voronoi cell will hold for any other AO system using Walsh modes as the basis. These results should therefore have relevance to any application of AO using pixelated correction devices.

Author Contributions

Conceptualization: M.J.B.; methodology: M.J.B. and Q.H.; software: M.J.B., Q.H. and Y.X.; validation: Q.H., J.C., R.T. and Y.X.; investigation: M.J.B., Q.H. and Y.X.; writing—original draft preparation: M.J.B. and Q.H.; writing—review and editing: R.T., J.C. and Y.X.; visualization: M.J.B. and Q.H.; supervision: M.J.B.; funding acquisition: M.J.B. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the European Research Council, grant number 695140 (AdOMiS).

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:

AO	Adaptive optics
DM	Deformable mirror
SLM	Spatial light modulator
NN	Neural networks
SNR	Signal to noise ratio
AE	Adaptive element
CNN	Convolutional neural network
FCL	Fully connected layers
WT	Walsh transform

Appendix A. Observations Based upon the Lattice Geometry

Various connections can be made between the operation of the Walsh mode-based AO system and the lattice representation outlined above. The following points address certain relationships between variations of individual pixels and the Walsh functions.

A single pixel variation is put into effect by a combination of all N Walsh modes, which can be obtained using Equation (5). As is clear from this equation, the coefficients in the vector $b$ are themselves given by the elements of a Walsh mode.
As we have removed the first (piston) mode, we can see that each reduced Walsh mode (i.e., without its first element) is a vector of length $N - 1$ that points in the direction corresponding to the variation of a single pixel. As these vectors each include all the $N - 1$ basis vectors in equal magnitude ($\pm 1$), these directions must correspond to some of the body diagonals between opposite vertices through the centre of the fundamental cubic cell that encloses the Voronoi cell.
For N pixels, there are $2 N$ body diagonals that correspond to single pixel variations (counting positive and negative directions separately). For $N > 4$, this is less than the total number of body diagonals of the $(N - 1)$-dimensional cube, which is $2^{N - 1}$. Hence, for $N > 4$, there are diagonals that do not correspond to single pixel variations, but would involve multiple changing pixels.
The “kissing number” $τ$ is a characteristic of a lattice that indicates how many nearest neighbours there are to any lattice point. The kissing number for the $A_{n}^{*}$ lattice is $τ = 2 n + 2$ (for $n \geq 2$ ), so for $A_{N - 1}^{*}$ we expect $τ = 2 N$ (p155 of [10]). This is equal to the number of body diagonals noted above as corresponding to single pixel variations. These single pixel variations correspond to the kissing directions, which are the closest spacing between lattice points.
Equation (8) shows how I varies with single pixel modulation. However, removal of the piston mode leads to some differences. Ensuring zero piston means that all pixels are modulated, but $N - 1$ pixels are shifted by the same value $- ξ$ , whereas the single desired pixel will be shifted by the value $(N - 1) ξ$ . The peak-to-peak amplitude would be $ψ = N ξ$ .
The mean square amplitude of such a mode must be given by

$\frac{(N - 1) ξ^{2} + {(N - 1)}^{2} ξ^{2}}{N} = (N - 1) ξ^{2} = \frac{N - 1}{N^{2}} ψ^{2}$

(A1)

Hence, the rms phase is related to the peak-to-peak phase by $ϕ_{\mathrm{rms}} = \frac{\sqrt{N - 1}}{N} ψ$.
The distance between these closest lattice points is equivalent to the rms phase required to shift one pixel so that it is $2 π$ radians different to the others. Setting $ψ = 2 π$ gives $ϕ_{\mathrm{rms}} = \frac{2 π \sqrt{N - 1}}{N}$. For large N, this varies approximately inversely with $\sqrt{N}$, such that as N increases the spacing between closest lattice points reduces. This is to be expected, as increasing N means a smaller pixel size and hence a smaller rms phase for a given phase variation of a single pixel.
Along these kissing directions, the minimum signal is obtained when the single pixel is $π$ out of phase with the other $N - 1$ pixels. This leads to a signal minimum of

$I = \frac{{(N - 2)}^{2}}{N^{2}}$

(A2)

which tends to 1 as N increases and, correspondingly, as the size of a single pixel decreases.

Hence, we can also use this lattice description to show that the signal varies only weakly in these kissing directions (corresponding to single pixels), whereas adjustment of Walsh modes provides more robust measurement. This confirms the results presented in Figure 3 using single pixel and modal variations. These properties are closely related to the extent of the Voronoi cell in these kissing directions. The Voronoi cell is in effect narrower in the kissing directions than in others. In general, larger signal modulations are obtained if one samples the function in the directions where the Voronoi cell has a larger extent.
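The single-pixel relationships in Equations (A1) and (A2) can be checked numerically. The following is a minimal sketch in Python with NumPy, assuming equal-area pixels and the normalised signal $I = {|\frac{1}{N} \sum_{l} \exp (i φ_{l})|}^{2}$ (Equation (7) with unit brightness). The helper names `single_pixel_mode` and `signal` are illustrative and do not come from the original work.

```python
import numpy as np

def single_pixel_mode(n_pixels: int, psi: float) -> np.ndarray:
    """Zero-piston pattern in which one pixel differs from the others by psi."""
    xi = psi / n_pixels
    phase = np.full(n_pixels, -xi)        # N - 1 pixels shifted by -xi
    phase[0] = (n_pixels - 1) * xi        # the single desired pixel
    return phase

def signal(phase: np.ndarray) -> float:
    """Normalised signal |mean(exp(i * phase))|^2 (assumed form of Equation (7))."""
    return float(np.abs(np.exp(1j * phase).mean()) ** 2)

for n in (4, 16, 64):
    mode = single_pixel_mode(n, 2 * np.pi)
    rms = np.sqrt(np.mean(mode ** 2))
    rms_formula = 2 * np.pi * np.sqrt(n - 1) / n   # Equation (A1) with psi = 2*pi
    i_min = signal(single_pixel_mode(n, np.pi))    # one pixel pi out of phase
    i_min_formula = (n - 2) ** 2 / n ** 2          # Equation (A2)
    print(n, rms, rms_formula, i_min, i_min_formula)
```

For $N = 4$ the minimum signal along a kissing direction is 0.25, whereas for $N = 64$ it is about 0.94, which illustrates how weakly the signal is modulated by a single small pixel.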

Appendix B. Sensorless AO and the Walsh Transform

Suppose we take a sequence of biased measurements, where we apply biases of each Walsh mode with a single positive amplitude of $π / 2$ radians. This is equivalent to shifting the measurement point from the body centre of the $(N - 1)$-dimensional Voronoi cell to the centre of each $(N - 2)$-dimensional facet centred along the positive axis of the corresponding Walsh mode. Due to the lattice structure, the facet in the positive direction is homologous to the facet in the negative direction. Therefore, by choosing this bias amplitude, we are simultaneously sampling both the positive and negative bias positions. When biasing the kth mode, we are increasing the (unknown) coefficient $β_{k}$ by $π / 2$. The complex pixel value of that mode is therefore

$\begin{matrix} \exp (i \{β_{k} + \frac{π}{2}\} W_{k}^{N} [l]) & = \exp (i \frac{π}{2} W_{k}^{N} [l]) \exp (i β_{k} W_{k}^{N} [l]) \\ & = i W_{k}^{N} [l] \exp (i β_{k} W_{k}^{N} [l]) \end{matrix}$

(A3)

where we have exploited the fact that the Walsh function has values only of $\pm 1$. We can now see that the kth biased signal measurement is given from Equation (7) by

$I_{k} = {|\frac{i}{N} \sum_{l = 1}^{N} \{\exp (i \sum_{m = 0}^{N - 1} β_{m} W_{m}^{N} [l]) W_{k}^{N} [l]\}|}^{2} = {|\frac{i}{N} \sum_{l = 1}^{N} x_{l} W_{k}^{N} [l]|}^{2}$

(A4)

where we have defined $x_{l}$ to represent the complex pixel values so that:

$x_{l} = \exp (i \sum_{k = 0}^{N - 1} β_{k} W_{k}^{N} [l])$

(A5)

The Walsh transform (WT) $X_{k}$ of a sequence $x_{l}$ of length N is defined conventionally as [8]:

$X_{k} = \frac{1}{N} \sum_{l = 0}^{N - 1} x_{l} W_{k}^{N} [l]$

(A6)

We can therefore derive

$I_{k} = {|X_{k}|}^{2}$

(A7)

In other words, we see that the kth biased intensity measurement is the modulus squared of the kth component of the WT of the complex pixel values $x_{l}$. As the first Walsh function is piston (hence all pixels have value 1), it is related to the first element of the WT, $X_{0}$. As biasing with piston has no effect on the measurement, this is the same as the unbiased measurement in our system. Hence, the unbiased measurement (corresponding to the centre of the Voronoi cell), together with the $N - 1$ biased measurements, forms a full set of intensity measurements. This set of intensity measurements is known as the spectral density of the WT of the set of complex pixel values. In practice, we cannot measure the complex values of $X_{k}$ directly, but we are able to quantify the Walsh spectral density through the intensity measurements with the applied bias modes.
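The identity in Equation (A7) can be illustrated with a short numerical sketch. It assumes Python with NumPy, a Sylvester-type Hadamard construction for the Walsh functions (natural rather than sequency ordering, which only permutes the modes), and the normalised signal of Equation (7). The names `hadamard` and `biased_signal`, the random seed and the example coefficients are hypothetical choices made for the illustration.

```python
import numpy as np

def hadamard(n: int) -> np.ndarray:
    """Sylvester construction: n x n matrix of +/-1 entries, n a power of two."""
    if n < 1 or n & (n - 1):
        raise ValueError("n must be a power of two")
    h = np.array([[1]])
    while h.shape[0] < n:
        h = np.block([[h, h], [h, -h]])
    return h

N = 8
W = hadamard(N)                        # W[k, l]: mode k at pixel l; row 0 is piston
rng = np.random.default_rng(0)         # fixed seed, arbitrary example aberration
beta = rng.uniform(-np.pi / 2, np.pi / 2, N)
beta[0] = 0.0                          # piston removed

pixel_phase = beta @ W                 # sum over k of beta_k * W_k[l]
x = np.exp(1j * pixel_phase)           # complex pixel values, Equation (A5)
X = (W @ x) / N                        # Walsh transform, Equation (A6)

def biased_signal(k: int, bias: float = np.pi / 2) -> float:
    """Signal measured with a bias of `bias` radians applied to mode k."""
    field = np.exp(1j * (pixel_phase + bias * W[k]))
    return float(np.abs(field.mean()) ** 2)

# k = 0 (piston bias) reproduces the unbiased measurement.
I = np.array([biased_signal(k) for k in range(N)])
spectral_density = np.abs(X) ** 2

print(np.allclose(I, spectral_density))   # checks Equation (A7)
```

The same arrays can be reused to confirm that a piston bias leaves the signal unchanged, by comparing `biased_signal(0)` with `biased_signal(0, 0.0)`.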

We now derive a relationship using the fundamental properties of the WT. In [8], we find an equivalent of Parseval’s theorem for WTs:

$\sum_{k = 0}^{N - 1} {|X_{k}|}^{2} = \frac{1}{N} \sum_{l = 1}^{N} {|x_{l}|}^{2}$

(A8)

From the definition of $x_{l}$, it is clear that its modulus is equal to one. Hence, we find that

$\sum_{k = 0}^{N - 1} {|X_{k}|}^{2} = 1$

(A9)

Equivalently, we find that the sum of the $N - 1$ biased measurements and the one unbiased measurement must add to one.

The importance of this result is that in a real experiment the measured signal is not actually normalised to one, but there is an unknown “brightness” that is a function of numerous experimental variables. This would be a multiplying factor in the expressions for I, which we have chosen to omit from the analysis for simplicity. We can use this result to obtain the “brightness” in a way that is independent of the input aberration, as it is simply the sum of the $N - 1$ biased measurements and the unbiased measurement.
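As a hedged illustration of this normalisation, the sketch below simulates noise-free measurements scaled by an unknown brightness and recovers that brightness as the sum of the unbiased reading and the $N - 1$ readings biased by $π / 2$. It assumes Python with NumPy and Walsh functions taken from a Kronecker-product Hadamard matrix. The brightness value 3.7, the random aberrations and the helper names `measure` and `estimate_brightness` are hypothetical.

```python
import numpy as np
from functools import reduce

H2 = np.array([[1, 1], [1, -1]])
W = reduce(np.kron, [H2] * 3)      # 8 x 8 Hadamard matrix; row 0 is piston
N = W.shape[0]

def measure(pixel_phase, brightness, bias_mode=None, bias=np.pi / 2):
    """Scaled signal; optionally bias one Walsh mode by `bias` radians."""
    phase = pixel_phase if bias_mode is None else pixel_phase + bias * W[bias_mode]
    return brightness * float(np.abs(np.exp(1j * phase).mean()) ** 2)

def estimate_brightness(pixel_phase, brightness):
    """Sum of the unbiased reading and the N - 1 biased readings."""
    readings = [measure(pixel_phase, brightness)]
    readings += [measure(pixel_phase, brightness, k) for k in range(1, N)]
    return sum(readings)

rng = np.random.default_rng(3)
true_brightness = 3.7              # hypothetical value, unknown in a real experiment
estimates = [
    estimate_brightness(rng.uniform(-np.pi, np.pi, N), true_brightness)
    for _ in range(5)
]
print(estimates)                   # each entry should match true_brightness
```

In a real system, measurement noise would perturb each reading, so the sum would only approximate the brightness; this sketch does not model that.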

Appendix C. Estimation of Coefficients Using Simple Sinusoidal Model

In this appendix, we present a method for rough initial estimation of individual Walsh mode coefficients. Let us redefine Equation (7) by including a factor A that is equivalent to an unknown maximum intensity:

$I = A {|\frac{1}{N} \sum_{l = 1}^{N} \exp (i \sum_{k = 0}^{N - 1} β_{k} W_{k}^{N} [l])|}^{2}$

(A10)

The signal varies with a single modal coefficient $β_{j}$ as

$\begin{matrix} I (β_{j}) & = A {|\frac{1}{N} \sum_{l = 1}^{N} \exp (i \sum_{k = 0; k \neq j}^{N - 1} β_{k} W_{k}^{N} [l]) \exp (i β_{j} W_{j}^{N} [l])|}^{2} \\ & = A {|\frac{1}{N} \sum_{l = 1}^{N} C [l] \exp (i β_{j} W_{j}^{N} [l])|}^{2} \end{matrix}$

(A11)

where the effects of the modes contained within the first exponential term have been subsumed into the complex coefficients $C [l]$.

We introduce a useful relationship, which takes advantage of the binary ($\pm 1$) values of the Walsh functions [9]:

$\exp (i β_{j} W_{j}^{N} [l]) = \cos (β_{j} W_{j}^{N} [l]) + i \sin (β_{j} W_{j}^{N} [l]) = \cos β_{j} + i W_{j}^{N} [l] \sin β_{j}$

(A12)

This leads to

$I (β_{j}) = A {|\frac{1}{N} \sum_{l = 1}^{N} C [l] \{\cos β_{j} + i W_{j}^{N} [l] \sin β_{j}\}|}^{2}$

(A13)

The exact form of this solution will depend upon $C [l]$ and hence on the other Walsh coefficients. However, the only terms in $β_{j}$ that can arise from the modulus square term are of the form of $\cos^{2} β_{j}$, $\cos β_{j} \sin β_{j}$, or $\sin^{2} β_{j}$, which can all be expressed as sinusoidal terms of argument $2 β_{j}$. Hence, we deduce that $I (β_{j})$ must be given by the form

$I (β_{j}) = A [p + q \cos (2 β_{j} + ζ)]$

(A14)

where p, q and $ζ$ depend on all values $β_{k \neq j}$. Let us simplify Equation (A14) to the form $I (θ_{j}) = U + V \cos (2 θ_{j})$, where we have defined $2 θ_{j} = 2 β_{j} + ζ$.

Consider applying a bias $b W_{j}^{N}$ so that the biased measurement is $I_{+} = U + V \cos (2 θ_{j} + 2 b)$. Then, we apply the negative bias $- b W_{j}^{N}$ so that the biased measurement is $I_{-} = U + V \cos (2 θ_{j} - 2 b)$. We also use the unbiased measurement $I_{0} = U + V \cos (2 θ_{j})$. If we use a bias of $b = \frac{π}{3}$, then we can obtain through elementary operations

$θ_{j} = \frac{1}{2} \tan^{- 1} [\sqrt{3} \frac{I_{+} - I_{-}}{I_{+} + I_{-} - 2 I_{0}}] = β_{j} + \frac{ζ}{2}$

(A15)

The value of $θ_{j}$ obtained through Equation (A15) was used as the initial rough estimate provided to the NN as Input2. The error between this estimate and the actual coefficient is given by $β_{j} - θ_{j} = - ζ / 2$.
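The three-measurement estimate of Equation (A15) can be sketched as follows, assuming Python with NumPy. The function name `sinusoidal_estimate`, the synthetic values of U and V, and the choice of the alternating Walsh function are illustrative assumptions. The principal value of the arctangent is used, so the returned angle lies between $- π / 4$ and $π / 4$.

```python
import numpy as np

def sinusoidal_estimate(i_plus, i_minus, i_zero):
    """Equation (A15) for a bias of pi/3: returns theta_j in (-pi/4, pi/4)."""
    ratio = (i_plus - i_minus) / (i_plus + i_minus - 2.0 * i_zero)
    return 0.5 * np.arctan(np.sqrt(3.0) * ratio)

b = np.pi / 3

# Check 1: synthetic sinusoid I(theta) = U + V cos(2 theta), hypothetical U and V.
def model(theta, u=0.5, v=0.3):
    return u + v * np.cos(2.0 * theta)

theta_true = 0.2
theta_est = sinusoidal_estimate(
    model(theta_true + b), model(theta_true - b), model(theta_true)
)

# Check 2: an 8-pixel signal in which only one non-piston Walsh mode is present,
# so that zeta = 0 and the estimate should equal the coefficient itself.
w_j = np.array([1, -1, 1, -1, 1, -1, 1, -1])

def walsh_signal(beta_j):
    return float(np.abs(np.exp(1j * beta_j * w_j).mean()) ** 2)

beta_true = -0.3
beta_est = sinusoidal_estimate(
    walsh_signal(beta_true + b), walsh_signal(beta_true - b), walsh_signal(beta_true)
)
print(theta_est, beta_est)
```

The denominator vanishes when $2 θ_{j} = \pm π / 2$, so a practical implementation would need to guard against that case; one possible option, not taken from the original work, is a two-argument arctangent.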

Appendix D. Description of the Neural Network Training and Network Architecture

The training data consisted of $2^{20}$ (about one million) simulated samples, and $2^{7}$ (128) samples were used for validation to avoid overfitting. Both sets of data were generated for the case of $N = 8$, corresponding to the correction of seven Walsh modes (excluding piston). The total number of training datasets was chosen to provide a sufficient representation of the seven-dimensional search space.

For each training dataset, seven values were randomly generated corresponding to the coefficients of seven polar Walsh modes ($β_{k}$). The coefficients followed a uniform distribution over the range of $- π / 2$ to $π / 2$. These coefficients were used as labels during network training in the case of the hypercube cell.

The sum of the products of each coefficient with its corresponding unit Walsh mode formed the phase aberration $Ψ (r)$ according to Equation (3). The mean phasor was calculated by averaging the complex field over the pupil. By subtracting the angle of the mean phasor of $Ψ (r)$ across the pupil (Equation (A16)) and wrapping the phase back within the range of $\pm π$, the aberration $Ψ^{'} (r)$ was calculated to be equivalent to $Ψ (r)$ and within the primary Voronoi cell.

$Ψ^{'} (r) = \arg (\exp \{i [Ψ (r) - \arg (\frac{1}{π} \int \exp [i Ψ (r)] d r)]\})$

(A16)

Using the orthogonality of Walsh modes, the new coefficients ${β_{k}}^{'}$ of the equivalent aberration $Ψ^{'} (r)$ could be calculated (Equation (A17)). The coefficients ${β_{k}}^{'}$ were used as the labels when training the network with a confined search space.

${β_{k}}^{'} = \frac{1}{π} \int Ψ^{'} (r) ω_{k} (r) d r$

(A17)
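The label-wrapping procedure of Equations (A16) and (A17) can be sketched for a discrete eight-pixel system. This is an illustration under stated assumptions rather than the authors' code: it uses Python with NumPy, replaces the pupil integrals by averages over equal-area pixels, and takes the Walsh functions from a Kronecker-product Hadamard matrix. The names `wrap_labels` and `signal` and the random seed are hypothetical.

```python
import numpy as np
from functools import reduce

H2 = np.array([[1, 1], [1, -1]])
W = reduce(np.kron, [H2] * 3)                 # W[k, l] for N = 8; row 0 is piston
N = W.shape[0]

def signal(phase):
    """Normalised signal |mean(exp(i * phase))|^2."""
    return float(np.abs(np.exp(1j * phase).mean()) ** 2)

def wrap_labels(beta):
    """Map 7 non-piston coefficients to wrapped coefficients, after (A16)-(A17)."""
    psi = beta @ W[1:]                                        # pixel phases
    mean_angle = np.angle(np.exp(1j * psi).mean())            # angle of mean phasor
    psi_wrapped = np.angle(np.exp(1j * (psi - mean_angle)))   # wrapped to (-pi, pi]
    beta_all = (W @ psi_wrapped) / N                          # project onto all modes
    return beta_all[1:], psi, psi_wrapped                     # piston term dropped

rng = np.random.default_rng(2)
beta = rng.uniform(-np.pi / 2, np.pi / 2, N - 1)   # label drawn from the hypercube
beta_wrapped, psi, psi_wrapped = wrap_labels(beta)

same_signal = np.isclose(signal(psi), signal(psi_wrapped))
# Subtracting the wrapped coefficients from the original aberration leaves only
# a piston term and multiples of 2*pi, so the corrected signal returns to one.
residual_signal = signal(psi - beta_wrapped @ W[1:])
print(same_signal, residual_signal)
```

The wrapped phase differs from the original only by a constant and by multiples of $2 π$ at individual pixels, which is why both give the same signal and why the wrapped coefficients still provide a full correction.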

For each set of data, two phase biases per Walsh mode were introduced. The bias amplitudes were chosen to be $- π / 3$ and $π / 3$. For each introduced bias phase, the intensity signal ($I_{-}$ and $I_{+}$) was computed using Equation (1). The same equation was used to calculate the signal when no bias phase was introduced ($I_{0}$). As discussed previously, phase modulation of mode 1 (piston) has no effect on the signal and was thus excluded from the collection of signal readings. A total of 15 signal readings (14 biased readings and 1 unbiased reading) was used as the input (Input1) of the network.
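To make the structure of Input1 concrete, the following sketch assembles the 15 readings for one simulated sample. It assumes Python with NumPy, a discrete eight-pixel signal model in place of Equation (1), and a Hadamard-ordered set of Walsh functions. The ordering of the readings within the vector and the name `make_input1` are assumptions made for the example, not details given in the text.

```python
import numpy as np
from functools import reduce

H2 = np.array([[1, 1], [1, -1]])
W = reduce(np.kron, [H2] * 3)          # 8 x 8; row 0 is piston, rows 1..7 are used

def signal(phase):
    """Normalised signal |mean(exp(i * phase))|^2 for equal-area pixels."""
    return float(np.abs(np.exp(1j * phase).mean()) ** 2)

def make_input1(beta, bias=np.pi / 3):
    """15 readings: I_0, then (I_-, I_+) for each of the 7 non-piston modes."""
    phase = beta @ W[1:]               # beta holds the 7 non-piston coefficients
    readings = [signal(phase)]
    for k in range(1, W.shape[0]):
        readings.append(signal(phase - bias * W[k]))
        readings.append(signal(phase + bias * W[k]))
    return np.array(readings)

rng = np.random.default_rng(1)
beta = rng.uniform(-np.pi / 2, np.pi / 2, 7)   # a hypercube-cell label
input1 = make_input1(beta)
print(input1.shape)
```

With zero aberration, the unbiased reading is one and every biased reading equals $\cos^{2} (π / 3) = 0.25$, which gives a quick sanity check of the model.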

In addition, as discussed previously, the signal varies with each mode coefficient with a period of $π$. A good approximation to the coefficient of each mode (${β_{k}}^{''}$) was obtained using Equation (A15). These approximations were also used as the separate input (Input2) to the two networks we trained. Figure A1 shows a few sets of $β_{k}$, ${β_{k}}^{'}$ and ${β_{k}}^{''}$, which were derived from the same initial aberration.

Figure A1. (a–p) 16 sets of randomly selected $β$, $β^{'}$ and $β^{''}$ derived from the same aberration, shown in blue, red and yellow plots, respectively. In some cases, $β^{''}$ (estimation using sinusoidal model) closely resembled $β^{'}$ (such as in case (f)) while in some other cases, $β^{''}$ could be different to $β^{'}$ (such as in (e) and (n)). There were also a small proportion of cases where $β$ was identical to $β^{'}$ (such as (n)), which corresponded to the initial aberration being within the primary Voronoi cell.

The full diagram of the NN architecture is shown in Figure A2. The network was built using TensorFlow Keras. For the first CNN layer, the padding was chosen to be “valid” and the strides were (1,1). For each of the second to fourth CNN layers, the padding was chosen to be “same” and the strides were (1,1). Each of the second to fourth convolutional layers was followed by a max-pooling layer with a pool size of $2 \times 1$. For all the layers (except the output layer), the nonlinear activation was chosen to be “tanh”. The activation of the output layer was linear. All kernels were initialised with the Glorot uniform initializer. The loss function was mean squared error (MSE). The optimizer was Adam.

Figure A2. Full neural network architecture. The blue boxes were the two inputs to the network. The yellow boxes were the trainable kernels of the CNN with the corresponding dimensions as shown. The orange boxes were the internal layers of the CNN and dense layers in the later stages. The green box represents the output of the CNN.

References

Booth, M.J. Adaptive optical microscopy: The ongoing quest for a perfect image. Light Sci. Appl. 2014, 3, e165.
Ji, N. Adaptive optical fluorescence microscopy. Nat. Methods 2017, 14, 374–380.
Hampson, K.M.; Turcotte, R.; Miller, D.T.; Kurokawa, K.; Males, J.R.; Ji, N.; Booth, M.J. Adaptive optics for high-resolution imaging. Nat. Rev. Methods Prim. 2021, 1, 68.
Booth, M.J.; Neil, M.A.A.; Juškaitis, R.; Wilson, T. Adaptive aberration correction in a confocal microscope. Proc. Natl. Acad. Sci. USA 2002, 99, 5788–5792.
Hu, Q.; Wang, J.; Antonello, J.; Hailstone, M.; Wincott, M.; Turcotte, R.; Gala, D.; Booth, M.J. A universal framework for microscope sensorless adaptive optics: Generalized aberration representations. APL Photonics 2020, 5, 100801.
Facomprez, A.; Beaurepaire, E.; Débarre, D. Accuracy of correction in modal sensorless adaptive optics. Opt. Express 2012, 20, 2598–2612.
Saha, D.; Schmidt, U.; Zhang, Q.; Barbotin, A.; Hu, Q.; Ji, N.; Booth, M.J.; Weigert, M.; Myers, E.W. Practical sensorless aberration estimation for 3D microscopy with deep learning. Opt. Express 2020, 28, 29044–29053.
Beauchamp, K. Walsh Functions and Their Applications; Nutrition, Basic and Applied Science; Academic Press: Cambridge, MA, USA, 1975.
Wang, F. Wavefront sensing through measurements of binary aberration modes. Appl. Opt. 2009, 48, 2865–2870.
Conway, J.H.; Sloane, N.J.A. Sphere Packings, Lattices and Groups, 3rd ed.; Grundlehren der mathematischen Wissenschaften, A Series of Comprehensive Studies in Mathematics; Springer: New York, NY, USA, 1999; Volume 290.
Booth, M.J. Wave front sensor-less adaptive optics: A model-based approach using sphere packings. Opt. Express 2006, 14, 1339–1352.
Antonello, J.; Verhaegen, M.; Fraanje, R.; van Werkhoven, T.; Gerritsen, H.C.; Keller, C.U. Semidefinite programming for model-based sensorless adaptive optics. J. Opt. Soc. Am. A 2012, 29, 2428–2438.
Weisstein, E.W. Hadamard Matrix—From Wolfram MathWorld. Available online: https://mathworld.wolfram.com/HadamardMatrix.html (accessed on 18 July 2022).
Sloane, N.J.A. Hadamard Matrices. Available online: http://neilsloane.com/hadamard/ (accessed on 18 July 2022).
Tang, J.; Germain, R.N.; Cui, M. Superpenetration optical microscopy by iterative multiphoton adaptive compensation technique. Proc. Natl. Acad. Sci. USA 2012, 109, 8434–8439.
Park, J.H.; Sun, W.; Cui, M. High-resolution in vivo imaging of mouse brain through the intact skull. Proc. Natl. Acad. Sci. USA 2015, 112, 9236–9241.
Kong, L.; Cui, M. In vivo neuroimaging through the highly scattering tissue via iterative multi-photon adaptive compensation technique. Opt. Express 2015, 23, 6145–6150.

Figure 1. Optical system used for modelling. The input wavefront contains a phase aberration $Φ$, which passes through a correction device imparting an additional phase $Ψ$. The beam is focussed onto a vanishingly small pinhole detector on the optical axis that admits the intensity I.

Figure 2. Illustrations of Walsh modes. (a) The Walsh functions $W_{k}^{8} [l]$ shown in numerical form. (b) The Walsh modes $W_{k}^{16} [l]$ shown as aberration basis modes over a square aperture. (c) The polar Walsh modes, equivalent to $W_{k}^{16} [l]$ shown as aberration basis modes over a circular aperture. In both (b) and (c), index $k = 0$ for the first mode is in the top left, and k increments in row-major order to $k = 15$ in the bottom right.

Figure 3. (a) Variation of signal for a 16-pixel system with a single pixel modulation showing low modulation depth and period $2 π$. (b) Variation of a Walsh mode in the same system showing full modulation and period $π$.

Figure 4. Illustration of the lattice geometry for intensity variation with pixel phase value. As it is not possible to represent higher order systems in a three-dimensional rendering, the example shown is for a three-pixel system. While this system does not use Walsh modes, it shows the same phenomena of piston invariance and lattice-like behaviour. The axes represent each of the pixel phase values in radians. The same volume rendering is shown from two different angles. The visible contours are set at $I = 0.01$ (blue) and $I = 0.8$ (orange) to show the positions of the zeros and the maxima, respectively. The function I is invariant with the piston mode, hence the elongation of the contours along the direction $(1, 1, 1)$. The lattice-like structure of the function is apparent, in this case in the form of the hexagonal lattice. This shows that there are many different combinations of aberration mode coefficients that provide a similar well-corrected state. Analogous behaviour is found in higher dimensions for the Walsh-mode-based systems.

Figure 5. (a) Illustration of the lattice geometry for the fundamental correction space of a four-pixel system, which corresponds to three Walsh modes after neglecting the piston mode. The axes represent each of the Walsh coefficient values in radians. The same volume rendering is shown from two different angles. The visible contours are set at $I = 0.01$ (blue) and $I = 0.8$ (orange) to show the positions of the zeros and the maxima, respectively. The BCC lattice geometry is apparent. (b) The Voronoi cell for the $A_{3}^{*}$ (or BCC) lattice, a truncated octahedron, is shown within an encompassing cube of side length $π$ radians.

Figure 6. Outline of the NN architecture and preprocessing of data. CNN: convolutional neural network; FCL: fully connected layer; OL: output layer.

Figure 7. (a) NN training and validation loss functions as mean square error (MSE) for the scenarios where the coefficient labels were defined throughout the hypercube cell (HC) or the primary Voronoi cell (VC) for $N = 8$. The insets on the right show a schematic representation in three dimensions of the difference between the two types of cells. (b) An illustrative example of correction of an initial aberration consisting of 8 pixels, equivalently 7 polar Walsh modes. The residual error of the VC-based correction was far lower than that of the HC-based correction. (c) Statistical summary of correction results from the NN validation set. Initial: distribution of initial input aberrations; Sinus: after estimation using sinusoidal model; HC: after correction using the hypercube cell method; VC: after correction with the Voronoi cell method. Error bars show the standard deviation of the distribution.

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

