Next Article in Journal
A Novel Piezo Inertia Actuator Utilizing the Transverse Motion of Two Parallel Leaf-Springs
Next Article in Special Issue
Dynamic Flow Control over Optical Properties of Liquid Crystal–Quantum Dot Hybrids in Microfluidic Devices
Previous Article in Journal
Triple-Band Surface Plasmon Resonance Metamaterial Absorber Based on Open-Ended Prohibited Sign Type Monolayer Graphene
Previous Article in Special Issue
Miniature Fourier Transform Spectrometer Based on Thin-Film Lithium Niobate
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Review

Hardware Error Correction for MZI-Based Matrix Computation

1
Key Laboratory of Optoelectronic Materials Chemistry and Physics, Fujian Institute of Research on the Structure of Matter, Chinese Academy of Sciences, Fuzhou 350002, China
2
University of Chinese Academy of Sciences, Beijing 100049, China
3
State Key Laboratory of Advanced Optical Communication Systems and Networks, School of Electronics, Peking University, Beijing 100871, China
4
Beijing Aijie Optoelectronic Technology Co., Ltd., Beijing 100190, China
5
Fujian Science & Technology Innovation Laboratory for Optoelectronic Information of China, Fuzhou 350108, China
*
Authors to whom correspondence should be addressed.
Micromachines 2023, 14(5), 955; https://doi.org/10.3390/mi14050955
Submission received: 20 March 2023 / Revised: 23 April 2023 / Accepted: 24 April 2023 / Published: 27 April 2023
(This article belongs to the Special Issue Integrated Photonics and Optoelectronics)

Abstract

:
With the rapid development of artificial intelligence, the electronic system has fallen short of providing the needed computation speed. It is believed that silicon-based optoelectronic computation may be a solution, where Mach–Zehnder interferometer (MZI)-based matrix computation is the key due to its advantages of simple implementation and easy integration on a silicon wafer, but one of the concerns is the precision of the MZI method in the actual computation. This paper will identify the main hardware error sources of MZI-based matrix computation, summarize the available hardware error correction methods from the perspective of the entire MZI meshes and a single MZI device, and propose a new architecture that will largely improve the precision of MZI-based matrix computation without increasing the size of the MZI’s mesh, which may lead to a fast and accurate optoelectronic computing system.

1. Introduction

Benefiting from the era of big data, the rapid growth of the Internet provides sufficient training datasets for functioning artificial intelligence (AI), which has also become a hot spot in the current technological revolution. However, artificial intelligence has conventionally relied on electronic processors, with its computing power being greatly determined by transistor numbers and capabilities; managing the massive amount of data necessitates additional computing resources. In the post-Moore era, the increasing transistor numbers can no longer keep up with the computing power demand of artificial intelligence, and electronic computation is therefore stuck in a bottleneck [1,2,3]. On the other hand, photons are bosons, which have the advantages of a higher transmission speed and can work well with electrons through the Einstein coefficients, which is conducive to the realization of ultra-high-speed optoelectronic computing. Consequently, more and more researchers are beginning to focus on optoelectronic computing [4,5,6,7,8,9].
The fabrication techniques for silicon photonics are advancing rapidly, enabling the creation of large-scale and complicated circuits and paving the way for low-loss and low-cost optoelectronic devices [10]. These devices can be produced using complementary metal–oxide–semiconductor (CMOS) fabs [11]. As a result, silicon-based optoelectronic computation has emerged as a prominent area of research [3]. At present, optoelectronic matrix computation mainly includes three implementation methods [12]: multi-plane light conversion (MPLC), the Mach–Zehnder interferometer method (MZI), and wavelength division multiplexing (WDM). The Mach–Zehnder interferometer method is simple and easy to integrate. It is one of the best choices to implement optoelectronic matrix computation [13].
The optoelectronic matrix computation implemented by the Mach–Zehnder interferometer is a kind of analog computation, in which computational precision is the most important and should be treated with special attention. However, the performance of silicon photonic devices is sensitive to the influence of environmental disturbances [14,15,16,17], fabrication [15,18], and device aging [15,19], including MZI devices. Particularly, the hardware errors generated by fabrication are the focus of research [20,21], which result in deviations between the matrix implemented by MZI and the actual needed matrix. The hardware errors involve beam splitter errors [20], phase errors [22], and errors caused by optical loss differences [23]. The hardware errors will accumulate with the increase of the scale of the MZI meshes, which limits the implementation of large-scale matrix computation based on MZI meshes. In order to overcome this limitation of hardware error, especially beam splitter error, researchers have proposed different MZI meshes or different MZI devices with higher precision.
In this paper, the Mach–Zehnder interferometer method of achieving optoelectronic matrix computation is introduced in Section 2. The origins of the hardware error of the Mach–Zehnder interferometer are analyzed in Section 3, followed by a synthesis of the studies of MZI hardware error correction in Section 4. Finally, a feasible approach to mitigate the MZI hardware error is proposed in Section 5.

2. Methods

In 1994, Reck et al. [24] proposed one type of triangular mesh based on the Mach–Zehnder interferometer and demonstrated that the unitary matrix transformation of any finite dimension could be implemented by MZI devices. They decomposed the N-dimensional unitary matrix into a series of two-dimensional unitary matrices, which were presented as a triangular mesh formed by the arrangement of MZIs, as shown in Figure 1.
A common Mach–Zehnder interferometer consists of an external phase shifter (φ), an internal phase shifter (θ), and two 50:50 beam splitters (directional couplers (DC)) or multimode interference couplers (MMI)), as shown in Figure 2.
For an MZI device, transmission matrix S is
S = C 4 P 3 C 2 P 1
where P1 and P3 are the transmission matrices of the external phase shifter and the internal phase shifter, respectively; C2 and C4 are the transmission matrices of the beam splitter on the left side of the MZI and the beam splitter on the right side of the MZI, respectively. Among them, the left and right beam splitters are considered as ideal 50:50 beam splitters. Hence, MZI transmission matrix S can be written as [25]
S = 1 / 2 i / 2 i / 2 1 / 2 e i θ 0 0 1 1 / 2 i / 2 i / 2 1 / 2 e i φ 0 0 1
S = i e i θ / 2 e i φ s i n θ / 2 c o s θ / 2 e i φ c o s θ / 2 s i n θ / 2
where φ and θ are phases produced by the external phase shifter and the internal phase shifter, respectively. Obviously, we can produce different phases through these two phase shifters to implement two-dimensional unitary matrices with different elements. For an N-dimensional unitary matrix, the MZI’s transmission matrix can be generalized as Tn,m (θ, φ):
T n , m θ , φ = 1 0 0 0 0 1 0 0 e i φ s i n θ / 2 c o s θ / 2 e i φ c o s θ / 2 s i n θ / 2 0 0 1 0 0 0 0 1
where n and m represent the transmission matrix of the MZI between the nth and mth input ports of the signal entering the mesh. The dimension of an N-dimensional unitary matrix, U(N), can be reduced by right multiplying Tn,m (θ, φ), namely,
U N T N , N 1 T N , N 2 T N , 1 = U N 1 0 0 1
The N-dimensional unitary matrix’s dimension can be continued to be reduced according to the above dimensionality reduction method, and finally, a diagonal matrix, D, with elements of modulo 1 is obtained.
Let
R N = T N , N 1 T N , N 2 T N , 1
U N R N R N 1 R 2 = 1 0 1 0 1 = D
It can be seen from the previous theoretical derivation that the final implementation of the N-dimensional unitary matrix will be a triangular mesh with N-1 MZIs in the N row and N-2 MZIs in the N-1 row. Based on the above theory, in 2017, Shen et al. [26] experimentally demonstrated a cascaded mesh of 56 programmable MZIs with this triangular mesh, which improved the computational speed and power efficiency compared with that of traditional electronic processors.

3. Hardware Error

The N-dimensional unitary matrix implemented by MZI meshes has some deviations from the needed matrix in actual computation. On the one hand, the triangular mesh of the Reck design leads to an inconsistent optical loss of each output, and the larger mesh is, the greater the optical loss difference of each output, and the higher the total losses. On the other hand, more importantly, the beam splitter in MZI cannot implement the ideal 50:50 beam splitter due to the hardware error generated by the fabrication and the non-uniformity of the material, which leads to the deviation of the transmission matrix and affects the final computation.
The non-ideal 50:50 beam splitter transmission matrix can be expressed as follows:
c o s φ i s i n φ i s i n φ c o s φ
where, when φ = π / 4 , it is the ideal 50:50 beam splitter; φ will generally deviate from π/4 for the non-ideal 50:50 beam splitter, which leads to the deviation of the MZI transmission matrix. For the transmission matrix of the beam splitter on the left side of the MZI and the beam splitter on the right side of the MZI, φ can be expressed as φ 1 and φ 2 , respectively, and then MZI transmission matrix S′ can be expressed as
S = c o s φ 2 i s i n φ 2 i s i n φ 2 c o s φ 2 e i θ 0 0 1 c o s φ 1 i s i n φ 1 i s i n φ 1 c o s φ 1 e i φ 0 0 1
S = i e i θ / 2 e i φ c o s φ 1 φ 2 s i n θ / 2 i c o s φ 1 + φ 2 c o s θ / 2 s i n φ 1 + φ 2 c o s θ / 2 + i s i n φ 1 φ 2 s i n θ / 2 e i φ s i n φ 1 + φ 2 c o s θ / 2 i s i n φ 1 φ 2 s i n θ / 2 c o s φ 1 φ 2 s i n θ / 2 + i c o s φ 1 + φ 2 c o s θ / 2
Obviously, the MZI’s transmission matrix changes because of two non-ideal beam splitters, which is inconsistent with the transmission matrix in the ideal case, which leads to the deviation between the N-dimensional unitary matrix implemented by the MZI meshes and the theoretical one.

4. Error Correction

In terms of the hardware error in the N-dimensional unitary matrix transformation implemented by the MZI meshes introduced in Section 3, researchers have made a lot of effort. This section will introduce the main hardware error correction methods for (1) the entire MZI meshes and (2) a single MZI device.

4.1. The Entire MZI Meshes

4.1.1. Rectangular Mesh

In 2016, Clements et al. [23] proposed a rectangular mesh, which was an improvement of the triangular mesh of Reck, as shown in Figure 3. Compared to the triangular mesh of Reck, the rectangular mesh has higher symmetry and a lower optical loss difference of each output. Additionally, the longest optical depth of the rectangular mesh is about half that of the triangular mesh, and the total optical loss is only half that of the triangular mesh.
The improvement of the rectangular mesh is specifically manifested in the process of decomposition of the N-dimensional unitary matrix into a series of two-dimensional unitary matrices. The decomposition process does not reduce the dimensions by right multiplying Tn,m(θ, φ), but by both right multiplying Tn,m(θ, φ) and left multiplying T−1n,m(θ, φ), as shown in Figure 4 and seen in the following:
T 4 , 5 T 3 , 4 T 2 , 3 T 1 , 2 T 4 , 5 T 3 , 4 U T 1 , 2 1 T 3 , 4 1 T 2 , 3 1 T 1 , 2 1 = D
that is,
U = T 3 , 4 1 T 4 , 5 1 T 1 , 2 1 T 2 , 3 1 T 3 , 4 1 T 4 , 5 1 D T 1 , 2 T 2 , 3 T 3 , 4 T 1 , 2
For a matrix Tn,m(θ, φ) and a diagonal matrix D, there are matrix T−1n,m(θ, φ) and another diagonal matrix D′, with T−1n,m(θ, φ) D′ = D Tn,m(θ, φ), and we obtain a right-multiplication dimensionality reduction operation similar to that in Equation (7).
U = D T 3 , 4 T 4 , 5 T 1 , 2 T 2 , 3 T 3 , 4 T 4 , 5 T 1 , 2 T 2 , 3 T 3 , 4 T 1 , 2
Shokraneh et al. [27] experimentally proved that due to the asymmetric distribution of MZIs in the triangular mesh, optical loss is greater on the triangular mesh than on the rectangular mesh in the computation process. They used a dataset that was perfectly classifiable to assess the classification performance of the two meshes in optical neural networks (ONN). Compared to the triangular mesh, the rectangular mesh is more phase-error-tolerant and loss-tolerant. Thus, the rectangular mesh is commonly adopted as the fundamental unit for constructing ONN in various research studies [28,29,30]. However, the beam splitter errors will still impact the computed result of the rectangular mesh.

4.1.2. Fourier Structure

In order to solve hardware errors introduced by the non-ideal 50:50 beam splitter, Lopez-Pastor et al. [31] presented one kind of mesh composed of Fourier transforms and phase masks, which can implement the unitary matrix transformation of any finite dimension.
For an N-dimensional unitary matrix, the above Equation (13) can be written as
U = D i = 1 N / 2 k = 1 N / 2 1 T 2 k χ k i , η k i j = 1 N / 2 T 2 j 1 θ j i , φ j i ,
k = 1 N / 2 1 T 2 k χ k i , η k i and j = 1 N / 2 T 2 j 1 θ j i , φ j i can be decomposed by the diagonal matrix, permutation matrix and circulant matrix, and the decomposition of the unitary matrix can be obtained as follows:
U = D G i = 1 N / 2 B i A i G ,
where A(i) and B(i) are
A i = E , G , H , p Γ θ i , E , G Γ φ i F ,
and,
B i = E , p G , H , Γ χ i , E , p G Γ η i F ,
Finally, The N-dimensional unitary matrix, U(N), can be decomposed into a product of phase masks and Fourier transforms. As shown in Figure 5, the gray rounded rectangles represent the Fourier transforms, and the colored rectangles represent the phase-mask diagonal matrices. Only two diagonal matrices per layer (denoted by red and yellow rectangles) depend on the unitary matrix being implemented, while the rest (denoted by blue rectangles) are fixed. In this structure, the MZI of each layer in the mesh is decomposed into the form of Fourier transforms and phase masks, and the Fourier transforms can be implemented by MMI. This mesh avoids the use of a large number of beam splitters, reduces the source of error to a certain extent, and is more conducive to the realization of a large-scale mesh for optoelectronic computation. However, this decomposition method is not as simple and easy to implement as the triangular mesh and rectangular mesh methods are. Additionally, an even-dimension unitary matrix requires 6N DFTs (the discrete Fourier transform) and 6N + 1 controllable phase masks, and thus the size of the mesh will also significantly increase.

4.1.3. Redundant Rectangular Mesh

Pai et al. [32] proposed two mesh improvements. The first is adding redundant tunable layers in the rectangular Mesh, called the redundant rectangular mesh (RRM), which can accelerate the optimization process of the mesh to implement the unitary matrix. The second is adding a redundant mesh composed of low-loss waveguide crossings or MZIs with fixed cross-state phase shifts, called the permuting rectangular mesh (PRM). As shown in Figure 6, Figure 6a is a schematic diagram of the RRM, where the green part represents the redundant tunable layers, and Figure 6b shows the PRM, where the gray part represents the additional low-loss waveguide crossings or MZIs with fixed cross-state phase shifts. This method of adding additional MZIs in the mesh increases the tunable degrees of freedom in the MZI’s mesh. For a given unitary matrix, there is a supersaturated implementation schemes. Some unitary matrices that cannot be realized due to MZI imperfections can be realized by new equivalent schemes brought by the extra degrees of freedom in the redundant mesh [33].
However, the redundant mesh increases the size of the MZI meshes, which inevitably leads to an increase in the optical loss of the entire mesh. In a small mesh, the increased optical loss is within an acceptable range, and the computational precision of the MZI meshes is better improved.

4.2. A Single MZI Device

In addition to the above improvements in the MZI’s mesh, more researchers focus on the improvement of a single MZI device. The improvement of a single MZI device is mainly carried out by adding redundant phase shifters and beam splitters to improve the tunable degrees of freedom of the MZI, so as to correct its hardware errors.
For a MZI with non-ideal beam splitters, where neither of the two beam splitters has an ideal 50:50 beam splitter ratio, the transmission matrix is given by Equation (10). When we input optical signal E1 from only one port of the MZI, the output optical powers Pout_1 and Pout_2 of the two output ports are
P o u t _ 1 = E 1 2 c o s 2 φ 1 + φ 2 c o s 2 θ / 2 + c o s 2 φ 1 φ 2 s i n 2 θ / 2 ,
P o u t _ 2 = E 1 2 s i n 2 φ 1 + φ 2 c o s 2 θ / 2 + s i n 2 φ 1 φ 2 s i n 2 θ / 2 ,
When φ 1 + φ 2 = π / 2 , P o u t _ 1 = E 1 2 c o s 2 φ 1 + φ 2 c o s 2 θ / 2 , complete extinction can be achieved. Suzuki et al. [34] proposed a MZI architecture, replacing a beam splitter in the common MZI with a MZI, as shown in Figure 7. They adjusted this MZI as a 50:50 beam splitter so that its split ratio was complementary to that of another beam splitter to meet the above complete extinction conditions. Moreover, they demonstrated this 2 × 2 MZI with the highest extinction ratio (50.4 dB).
Although this architecture improves the extinction ratio of the MZI and corrects its transmission matrix to a certain extent, the correction of the transmission matrix is not complete. When complete extinction is achieved, MZI transmission matrix S″ can be expressed as
S = i e i θ / 2 2 e i φ cos φ 2 s i n φ 2 s i n θ / 2 c o s θ / 2 + i cos 2 φ 2 s i n θ / 2 e i φ c o s θ / 2 i c o s 2 φ 2 s i n θ / 2 2 cos φ 2 s i n φ 2 s i n θ / 2 ,
It is evident that the transmission matrix of MZI undergoes a slight correction, yet it still deviates from the ideal MZI transmission matrix’s S (Equation (3)). Consequently, errors may still arise during matrix computations, especially when dealing with large-scale MZI meshes.
Miller et al. [35] presented a double Mach–Zehnder interferometer (DMZI), as shown in Figure 8. Two MZIs were used as tunable beam splitters to replace the two splitters on the left and right of the common MZI, forming a new MZI architecture.
Now, we will illustrate the error correction process of DMZI. For instance, considering the MZI on the left, we should adjust its beam splitters to a 50:50 split. The transmission matrix of two beam splitters in this MZI can be expressed as Pa and Pb:
P a = 1 2 R a i 1 2 + R a i 1 2 + R a 1 2 R a ,
P b = 1 2 R b i 1 2 + R b i 1 2 + R b 1 2 R b ,
where, Ra and Rb represent errors of the left beam splitter and the right beam splitter, respectively. The transmission matrix, PL, of the MZI as a beam splitter can be expressed as
P L = 1 2 R b i 1 2 + R b i 1 2 + R b 1 2 R b e i θ 0 0 1 1 2 R a i 1 2 + R a i 1 2 + R a 1 2 R a e i φ 0 0 1 ,
This can be rewritten as
P L = e i θ α b α a β b β a i e i θ α b β a + i β b α a i e i θ β b α a + i α b β a α b α a e i θ β b β ,
where, α b = 1 2 R b ; β b = 1 2 + R b ; α a = 1 2 R a ; β a = 1 2 + R a . When we input optical signals, E1, from only one port of MZI, the output optical power, Pout, of one of the two output ports is
P o u t = E 1 2 1 2 + 2 R a R b 1 4 R a 2 1 4 R b 2 × c o s θ ,
Thus, if this MZI is used as a 50:50 beam splitter, P 3 = 1 2 E 1 2 ; hence, from Equation (25)
R a R b = 1 4 R a 2 1 4 R b 2 × c o s θ ,
considering c o s 2 θ 1 ; hence, from Equation (26)
R a 2 R b 2 1 4 R a 2 1 4 R b 2 ,
which can be derived to give
R a 1 8 0.35 & R b 1 8 0.35 ,
Therefore, the split ratios of the fabricated power from 85:15 to 15:85 can be compensated by adjusting the split ratio of the two tunable beam splitters back to 50:50. Two redundant beam splitters and two redundant phase shifters can completely correct the transmission matrix of MZI. Wilkes et al. [36] proposed a configuration algorithm for this DMZI, eventually achieving a 60 dB extinction ratio.
However, the size of the MZI also increases significantly. The increase in optical loss brought about by redundant beam splitters and phase shifters is also a problem to be considered in large-scale MZI meshes. Moreover, the transmission matrix of a real MZI as a beam splitter is different from that of the ideal one; when the ideal MZI is used as a 50:50 beam splitter, its transmission matrix R, from Equation (3), is:
R = 1 1 1 1 ,
where,   x = c o s φ 1 φ 2 ; y = c o s φ 1 + φ 2 ; s = s i n θ / 2 ; c = c o s θ / 2 . The corresponding output optical phase of the four elements in the matrix R can be represented on a complex plane, as shown in Figure 9.
For a real MZI, when it is used as a 50:50 beam splitter, its transmission matrix R′, from Equation (10), is
R = s x i c y c 1 y 2 + i s 1 x 2 c 1 y 2 i s 1 x 2 s x i c y ,
where,   x = c o s φ 1 φ 2 ; y = c o s φ 1 + φ 2 ; s = s i n θ / 2 ; c = c o s θ / 2 . The corresponding output optical phase of the four elements in the matrix R′ can be represented on a complex plane, as shown in Figure 10.
By comparison with Figure 11 and Figure 12, it is evident that for a MZI with a beam splitter that is not an ideal 50:50 beam splitter, when it is used as a 50:50 beam splitter, the phase of the output optical signal will be altered, thus ultimately impacting the interference result in the subsequent optical path of the mesh and thus the final computed result.
Based on the above works, Hamerly et al. [20] proposed two MZI architectures (Figure 11 and Figure 12). One is of a three-splitter MZI, which can correct generic errors and achieve a full range of split ratios. To realize the full range of the split ratio, it changes the position of “forbidden regions” caused by the error of the beam splitter, which are some unitary matrices that cannot be implemented because of the beam splitter error. The “forbidden regions” are displaced away from the cross state, rather than being eliminated completely. However, similarly to the architecture proposed by Suzuki, it does not completely correct the transmission matrix of the MZI. Furthermore, this MZI architecture does not incorporate an external phase shifter. Therefore, when it is formed into a mesh, it cannot correct the phase error from the previous layeR′s MZIs.
The other one is MZI + crossing, which can only correct correlated device errors. Because the errors of the right and left beam splitters are consistent, the added cross waveguide rotates the “forbidden regions” by 180° to achieve complete extinction. Thus, this architecture has bandwidth tolerance. However, due to the errors of the right and left beam splitters being usually inconsistent, this architecture only exhibits good bandwidth tolerance but cannot correct the beam splitter error and eliminate limitations to matrix computation. Compared to the Suzuki design and Miller design, these two MZI architectures are smaller in size and do not add redundant phase shifters, but the hardware error correction is not good enough.
Bandyopadhyay et al. [21] used redundant phase shifters to correct hardware errors and proposed a method of adding phase shifters to two ports of the output end of an MZI to locally correct the hardware error within an individual MZI. In this method, no additional beam splitters are added, and the increase in the size of a single device is small, as shown in Figure 13.
For the MZI with non-ideal 50:50 beam splitters, the transmission matrix S′, from Equation (10), can be rewritten as:
S θ , φ = i e i θ / 2 e i φ c o s φ 1 φ 2 s i n θ / 2 i c o s φ 1 + φ 2 c o s θ / 2 s i n φ 1 + φ 2 c o s θ / 2 + i s i n φ 1 φ 2 s i n θ / 2 e i φ s i n φ 1 + φ 2 c o s θ / 2 i s i n φ 1 φ 2 s i n θ / 2 c o s φ 1 φ 2 s i n θ / 2 + i c o s φ 1 + φ 2 c o s θ / 2 ,
To implement a desired unitary S (Equation (3)), we should find the θ , φ of each S θ , φ = S. The condition to each S θ , φ = S produces the following expression for θ :
θ = 2 arcsin sin 2 θ / 2 cos 2 ( φ 1 + φ 2 ) cos 2 ( φ 1 φ 2 ) cos 2 ( φ 1 + φ 2 ) ,
Therefore, the beam splitter errors restrict θ to the range
2 φ 1 + φ 2 π 2 < θ < π 2 φ 1 φ 2 ,
Assuming that θ is in this range, the transmission matrix’s S θ , φ , from Equation (31), can be rewritten as
S = i e i θ / 2 e i φ e i δ a s i n θ / 2 e i δ b c o s θ / 2 e i φ e i δ c c o s θ / 2 e i δ d s i n θ / 2 ,
= i e i θ / 2 e i δ b 0 0 e i δ d e i φ e i δ a δ b s i n θ / 2 c o s θ / 2 e i φ e i δ c δ d c o s θ / 2 s i n θ / 2 ,
= i e i θ / 2 e i δ b 0 0 e i δ d e i φ + δ a δ b s i n θ / 2 c o s θ / 2 e i φ + δ a δ b c o s θ / 2 s i n θ / 2 ,
where δ a , δ b , δ c and δ d are phase errors the elements of S θ , φ , and for the unitary matrix requiring that δ a + δ d = δ b + δ c , we can correct those phase errors from Equation (36) to set S θ , φ as equal to S. In accordance with Equation (36), the architecture shown in Figure 13 can be obtained. For different unitary matrices, corresponding error correction procedures must be implemented, and the correction method is complicated. It is important to note that this architecture can only correct the phase error. Although the transmission matrix is corrected, θ is restricted, meaning that some matrices cannot be implemented accurately. When θ is not in this range, there is a deviation between the desired matrix and the actual matrix.

5. Discussion

To overcome hardware errors in MZI meshes during the implementation of matrix computation, the primary strategies include increasing the tunable degrees of freedom of MZI meshes. This involves adding redundant beam splitters and phase shifters. However, incorporating a redundant mesh and improving a single MZI device may result in an increase in the size of the MZI mesh, as shown in Table 1. Therefore, it is crucial to develop a MZI mesh that can perform high-precision computation without increasing the device’s size to achieve large-scale matrix computation.
In 2001, a study report [37] was conducted on tunable multimode interference couplers (MMI). By changing the position of the four-fold images of MMI out of the phase shift of the four fields, the split ratio of the two-fold images was controlled, which was then realized in the InP material. In 2008, May-Arrioja et al. [38] used local electrical modulation to control the split ratio, and in addition, changed the phase of the two-fold images [39,40] to realize the arbitrary split ratio of the self-image. A thermally modulated MMI using polymer materials was also introduced [41]. In 2019, Perez et al. [42] reported a thermally modulated dual-drive directional coupler (DD-DC) and experimentally proved that it can realize an arbitrary split ratio. The size of the tunable beam splitters proposed in these studies is relatively large, and some beam splitters are not integrated on a silicon platform. However, these studies provide us with a novel perspective; by applying thermal or electrical modulation interference to the local optical field of the static beam splitter, we can also create a tunable beam splitter on a silicon platform without needing to increase its size.
Here, we propose a new MZI architecture, where we replace the 50:50 beam splitter in the common MZI with a tunable DC/MMI that takes into account hardware error correction and mesh size, as shown in Figure 14. This improves the computational precision of the MZI’s mesh without increasing its size. By adjusting the split ratios of the tunable DC/MMI, we can eliminate its split ratio deviations from 50:50. For instance, for a tunable MMI, when the phase of optical fields at the position of the four-fold images in the multimode interferometer is altered, it affects the interference results of the optical fields, culminating in changes in the intensity of the two images at the two-fold image position. This enables the adjustment of the beam split ratios. The tunable MMI has the theoretical capability to adjust split ratios from 100:0 to 0:100, enabling the adjustment of any fabricated power split ratios in the physical 2 × 2 MMI to the ideal 50:50 split. This correction completely eliminates beam splitter errors and enables high-precision MZI-based matrix computation.

6. Conclusions

To summarize, this paper introduces the method of using a MZI’s mesh to implement any finite dimensional unitary matrix transformation, along with an analysis of the effects of hardware errors during matrix computation based on the MZI. Addressing MZIs’ hardware errors is crucial to achieving large-scale matrix computation. To eliminate the hardware error of an MZI, various improvement works have been carried out on the entire MZI mesh and a single MZI device. It is important to note that correcting these errors can lead to an increase in an MZI’s mesh size, which is also a significant concern. The trade-off between hardware error correction and mesh size requires more innovative works in artificial intelligence, materials, optics, device manufacturing and other related fields.
In this paper, a new MZI architecture is proposed which replaces the 50:50 beam splitter in the common MZI with the tunable DC/MMI. This MZI is designed to take into account both hardware error correction and mesh size concerns. The tunable beam splitter-based MZI provides a new approach for more accurate large-scale matrix computation based on the MZI’s mesh.

Author Contributions

Conceptualization, H.H. and Z.Z.; writing-original draft preparation, H.H.; writing-review and editing, H.H., Z.Z., P.X. and H.S. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Key Program of the National Natural Science Foundation of China (62035001).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

No new data were created or analyzed in this study. Data sharing is not applicable to this article.

Acknowledgments

The authors would like to thank the Beijing Aijie Optoelectronic Technology Co., Ltd. for the technical and partial finacial support.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Zhou, Z. Silicon Based Optoelectronics, 2nd ed.; Science Press: Beijing, China, 2021; pp. 430–447. [Google Scholar]
  2. Zhou, Z.; Xu, P.; Dong, X. Computing on Silicon Photonic Platform. Chin. J. Lasers 2020, 47, 0600001. [Google Scholar] [CrossRef]
  3. Xu, P.; Zhou, Z. Silicon-based optoelectronics for general-purpose matrix computation: A review. Adv. Photonics 2022, 4, 044001. [Google Scholar] [CrossRef]
  4. Carolan, J.; Harrold, C.; Sparrow, C.; Martín-López, E.; Russell, N.J.; Silverstone, J.W.; Shadbolt, P.J.; Matsuda, N.; Oguma, M.; Itoh, M.; et al. Universal linear optics. Science 2015, 349, 711–716. [Google Scholar] [CrossRef] [PubMed]
  5. Bogaerts, W.; Pérez, D.; Capmany, J.; Miller, D.A.B.; Poon, J.; Englund, D.; Morichetti, F.; Melloni, A. Programmable photonic circuits. Nature 2020, 586, 207–216. [Google Scholar] [CrossRef]
  6. Pérez-López, D.; López, A.; DasMahapatra, P.; Capmany, J. Multipurpose self-configuration of programmable photonic circuits. Nat. Commun. 2020, 11, 6359. [Google Scholar] [CrossRef] [PubMed]
  7. Dong, M.; Clark, G.; Leenheer, A.J.; Zimmermann, M.; Dominguez, D.; Menssen, A.J.; Heim, D.; Gilbert, G.; Englund, D.; Eichenfield, M. High-speed programmable photonic circuits in a cryogenically compatible, visible–near-infrared 200 mm CMOS architecture. Nat. Photonics 2022, 16, 59–65. [Google Scholar] [CrossRef]
  8. Enomoto, Y.; Yonezu, K.; Mitsuhashi, Y.; Takase, K.; Takeda, S. Programmable and sequential Gaussian gates in a loop-based single-mode photonic quantum processor. Sci. Adv. 2021, 7, j6624. [Google Scholar] [CrossRef]
  9. Fu, T.; Zang, Y.; Huang, Y.; Du, Z.; Huang, H.; Hu, C.; Chen, M.; Yang, S.; Chen, H. Photonic machine learning with on-chip diffractive optics. Nat. Commun. 2023, 14, 70. [Google Scholar] [CrossRef]
  10. Bogaerts, W.; Chrostowski, L. Silicon Photonics Circuit Design: Methods, Tools and Challenges. Laser Photonics Rev. 2018, 12, 1700237. [Google Scholar] [CrossRef]
  11. Boeuf, F.; Cremer, S.; Temporiti, E.; Fere, M.; Shaw, M.; Vulliet, N.; Orlando, B.; Ristoiu, D.; Farcy, A.; Pinguet, T.; et al. Recent progress in silicon photonics R and D and manufacturing on 300mm wafer platform. In Proceedings of the 2015 Optical Fiber Communications Conference and Exhibition (OFC), Los Angeles, CA, USA, 22–26 March 2015. [Google Scholar]
  12. Zhou, H.; Dong, J.; Cheng, J.; Dong, W.; Huang, C.; Shen, Y.; Zhang, Q.; Gu, M.; Qian, C.; Chen, H.; et al. Photonic matrix multiplication lights up photonic accelerator and beyond. Light. Sci. Appl. 2022, 11, 30. [Google Scholar] [CrossRef]
  13. Du, Y.; Su, K.; Yuan, X.; Li, T.; Liu, K.; Man, H.; Zou, X. Implementation of optical neural network based on Mach–Zehnder interferometer array. Iet Optoelectron. 2023, 17, 1–11. [Google Scholar] [CrossRef]
  14. Jacques, M.; Samani, A.; El-Fiky, E.; Patel, D.; Xing, Z.; Plant, D.V. Optimization of thermo-optic phase-shifter design and mitigation of thermal crosstalk on the soi platform. Opt. Express 2019, 27, 10456. [Google Scholar] [CrossRef] [PubMed]
  15. AlTaha, M.W. Automatic Tuning Circuits for Mach-Zehnder Interferometer Optical Switches; University of British Columbia: Vancouver, BC, Canada, 2017. [Google Scholar] [CrossRef]
  16. Zhang, Y.; Ashe, J.J. Designing a High-Performance Tec Controller; SPIE: Washington, DC, USA, 2002; Volume 4913, pp. 177–183. [Google Scholar] [CrossRef]
  17. Shastri, B.J.; Tait, A.N.; Ferreira De Lima, T.; Pernice, W.H.P.; Bhaskaran, H.; Wright, C.D.; Prucnal, P.R. Photonics for artificial intelligence and neuromorphic computing. Nat. Photonics 2021, 15, 102–114. [Google Scholar] [CrossRef]
  18. Xie, Y.; Shi, Y.; Liu, L.; Wang, J.; Priti, R.; Zhang, G.; Liboiron-Ladouceur, O.; Dai, D. Thermally-reconfigurable silicon photonic devices and circuits. IEEE J. Sel. Top. Quantum Electron. 2020, 26, 1–21. [Google Scholar] [CrossRef]
  19. Chung, H.S.; Chang, S.H.; Lee, J.C.; Lee, J.H.; Kim, K. Field experiment of 112 gb/s dual-carrier dqpsk signal transmission with automatic bias control of optical iq modulator. In Proceedings of the 2013 Optical Fiber Communication Conference and Exposition and the National Fiber Optic Engineers Conference (OFC/NFOEC), Anaheim, CA, USA, 17–21 March 2013. [Google Scholar]
  20. Hamerly, R.; Bandyopadhyay, S.; Englund, D. Asymptotically fault-tolerant programmable photonics. Nat. Commun. 2022, 13, 6831. [Google Scholar] [CrossRef] [PubMed]
  21. Bandyopadhyay, S.; Hamerly, R.; Englund, D. Hardware error correction for programmable photonics. Optica 2021, 8, 1247. [Google Scholar] [CrossRef]
  22. Yang, Y.; Ma, Y.; Guan, H.; Liu, Y.; Danziger, S.; Ocheltree, S.; Bergman, K.; Baehr-Jones, T.; Hochberg, M. Phase coherence length in silicon photonic platform. Opt. Express 2015, 23, 16890. [Google Scholar] [CrossRef]
  23. Clements, W.R.; Humphreys, P.C.; Metcalf, B.J.; Kolthammer, W.S.; Walsmley, I.A. Optimal design for universal multiport interferometers. Optica 2016, 3, 1460. [Google Scholar] [CrossRef]
  24. Reck, M.; Zeilinger, A.; Bernstein, H.J.; Bertani, P. Experimental realization of any discrete unitary operator. Phys. Rev. Lett. 1994, 73, 58–61. [Google Scholar] [CrossRef]
  25. Macho Ortiz, A.; Pérez López, D.; Capmany, J. Optical Implementation of 2 × 2 Universal Unitary Matrix Transformations. Laser Photonics Rev. 2021, 15, 2000473. [Google Scholar] [CrossRef]
  26. Shen, Y.; Harris, N.C.; Skirlo, S.; Prabhu, M.; Baehr-Jones, T.; Hochberg, M.; Sun, X.; Zhao, S.; Larochelle, H.; Englund, D.; et al. Deep learning with coherent nanophotonic circuits. Nat. Photonics 2017, 11, 441–446. [Google Scholar] [CrossRef]
  27. Shokraneh, F.; Geoffroy-Gagnon, S.; Liboiron-Ladouceur, O. Towards Phase-Error-and Loss-Tolerant Programmable MZI-Based Optical Processors for Optical Neural Networks. In Proceedings of the 2020 IEEE Photonics Conference (IPC), Vancouver, BC, Canada, 28 September–1 October 2020. [Google Scholar]
  28. Zhu, Y.; Zhang, G.L.; Li, B.; Yin, X.; Zhuo, C.; Gu, H.; Ho, T.; Schlichtmann, U. Countering variations and thermal effects for accurate optical neural networks. In Proceedings of the 2020 IEEE/ACM International Conference on Computer Aided Design (ICCAD), San Diego, CA, USA, 2–5 November 2020. [Google Scholar]
  29. Aoyama, K.; Sawada, H. Acceleration method for learning fine-layered optical neural networks. In Proceedings of the 2021 IEEE/ACM International Conference on Computer Aided Design (ICCAD), Munich, Germany, 1–4 November 2021. [Google Scholar]
  30. Sarantoglou, G.; Bogris, A.; Mesaritakis, C.; Theodoridis, S. Bayesian photonic accelerators for energy efficient and noise robust neural processing. IEEE J. Sel. Top. Quantum Electron. 2022, 28, 1–10. [Google Scholar] [CrossRef]
  31. López Pastor, V.; Lundeen, J.; Marquardt, F. Arbitrary optical wave evolution with Fourier transforms and phase masks. Opt. Express 2021, 29, 38441. [Google Scholar] [CrossRef]
  32. Pai, S.; Bartlett, B.; Solgaard, O.; Miller, D.A.B. Matrix Optimization on Universal Unitary Photonic Devices. Phys. Rev. Appl. 2019, 11, 064044. [Google Scholar] [CrossRef]
  33. Burgwal, R.; Clements, W.R.; Smith, D.H.; Gates, J.C.; Kolthammer, W.S.; Renema, J.J.; Walmsley, I.A. Using an imperfect photonic network to implement random unitaries. Opt. Express 2017, 25, 28236. [Google Scholar] [CrossRef]
  34. Suzuki, K.; Cong, G.; Tanizawa, K.; Kim, S.; Ikeda, K.; Namiki, S.; Kawashima, H. Ultra-high-extinction-ratio 2 × 2 silicon optical switch with variable splitter. Opt. Express 2015, 23, 9086. [Google Scholar] [CrossRef] [PubMed]
  35. Miller, D.A.B. Perfect optics with imperfect components. Optica 2015, 2, 747–750. [Google Scholar] [CrossRef]
  36. Wilkes, C.M.; Qiang, X.; Wang, J.; Santagati, R.; Paesani, S.; Zhou, X.; Miller, D.A.; Marshall, G.D.; Thompson, M.G.; O’Brien, J.L. 60 dB high-extinction auto-configured Mach-Zehnder interferometer. Opt. Lett. 2016, 41, 5318–5321. [Google Scholar] [CrossRef]
  37. Leuthold, J.; Joyner, C.W. Multimode interference couplers with tunable power splitting ratios. J. Light. Technol. 2001, 19, 700–707. [Google Scholar] [CrossRef]
  38. May-Arrioja, D.A.; LiKamWa, P. Reconfigurable 3-dB MMI splitter. In Proceedings of the 2008 Digest of the IEEE/LEOS Summer Topical Meetings, Acapulco, Mexico, 21–23 July 2008; pp. 39–40. [Google Scholar]
  39. Rosa, Á.; Gutiérrez, A.; Brimont, A.; Griol, A.; Sanchis, P. High performace silicon 2 × 2 optical switch based on a thermo-optically tunable multimode interference coupler and efficient electrodes. Opt. Express 2016, 24, 191. [Google Scholar] [CrossRef]
  40. Hassan, S.; Chack, D.; Pavesi, L. High extinction ratio thermo-optic based reconfigurable optical logic gates for programmable PICs. Aip Adv. 2022, 12, 055304. [Google Scholar] [CrossRef]
  41. Fan Wang, J.Y. Optical Switch Based on Multimode Interference Coupler. Ieee Photonic Tech. L 2006, 18, 421–423. [Google Scholar] [CrossRef]
  42. Perez, D.; Gutierrez, A.M.; Sanchez, E.; Dasmahapatra, P.; Capmany, J. Dual-Drive Directional Couplers for Programmable Integrated Photonics. In Proceedings of the 2019 IEEE Photonics Society Summer Topical Meeting Series (SUM), Ft. Lauderdale, FL, USA, 8–10 July 2019. [Google Scholar]
Figure 1. Triangular mesh.
Figure 1. Triangular mesh.
Micromachines 14 00955 g001
Figure 2. The common MZI.
Figure 2. The common MZI.
Micromachines 14 00955 g002
Figure 3. Rectangular mesh.
Figure 3. Rectangular mesh.
Micromachines 14 00955 g003
Figure 4. The decomposition method of the rectangular mesh. Reprinted under the terms of the CC-BY license [23]. Copyright 2016, Clements et al., published by Optica.
Figure 4. The decomposition method of the rectangular mesh. Reprinted under the terms of the CC-BY license [23]. Copyright 2016, Clements et al., published by Optica.
Micromachines 14 00955 g004
Figure 5. Mesh composed of Fourier transforms and phase masks. Reprinted under the terms of the OSA Open Access Publishing Agreement [31]. Copyright 2021, Lopez-Pastor et al., published by Opt. Express.
Figure 5. Mesh composed of Fourier transforms and phase masks. Reprinted under the terms of the OSA Open Access Publishing Agreement [31]. Copyright 2021, Lopez-Pastor et al., published by Opt. Express.
Micromachines 14 00955 g005
Figure 6. (a) Redundant rectangular mesh; (b) permuting rectangular mesh. Reprinted with permission from ref. [32]. Copyright 2019, APS.
Figure 6. (a) Redundant rectangular mesh; (b) permuting rectangular mesh. Reprinted with permission from ref. [32]. Copyright 2019, APS.
Micromachines 14 00955 g006
Figure 7. MZI architecture with a variable splitter as the front 3-dB splitter (MZI as a beam splitter in green frame).
Figure 7. MZI architecture with a variable splitter as the front 3-dB splitter (MZI as a beam splitter in green frame).
Micromachines 14 00955 g007
Figure 8. A double MZI architecture (MZI as a beam splitter in green frame).
Figure 8. A double MZI architecture (MZI as a beam splitter in green frame).
Micromachines 14 00955 g008
Figure 9. The corresponding output optical phase of the four elements in the matrix R.
Figure 9. The corresponding output optical phase of the four elements in the matrix R.
Micromachines 14 00955 g009
Figure 10. The corresponding output optical phase of the four elements in the matrix R′.
Figure 10. The corresponding output optical phase of the four elements in the matrix R′.
Micromachines 14 00955 g010
Figure 11. Three-splitter MZI.
Figure 11. Three-splitter MZI.
Micromachines 14 00955 g011
Figure 12. MZI + crossing.
Figure 12. MZI + crossing.
Micromachines 14 00955 g012
Figure 13. Bandyopadhyay et al.’s MZI architecture with a local error correction design.
Figure 13. Bandyopadhyay et al.’s MZI architecture with a local error correction design.
Micromachines 14 00955 g013
Figure 14. A new MZI architecture.
Figure 14. A new MZI architecture.
Micromachines 14 00955 g014
Table 1. Characteristics of major MZI schemes.
Table 1. Characteristics of major MZI schemes.
ArchitectureNumber of Beam SplittersNumber of Phase ShiftsSizeHardware Error Correction
The common MZI221none
Suzuki331.5+++
Miller442++++
3-splitter MZI321.2++
MZI + crossing22 + 1 crossing1.2+
Bandyopadhyay241.2++
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Hou, H.; Xu, P.; Zhou, Z.; Su, H. Hardware Error Correction for MZI-Based Matrix Computation. Micromachines 2023, 14, 955. https://doi.org/10.3390/mi14050955

AMA Style

Hou H, Xu P, Zhou Z, Su H. Hardware Error Correction for MZI-Based Matrix Computation. Micromachines. 2023; 14(5):955. https://doi.org/10.3390/mi14050955

Chicago/Turabian Style

Hou, Huihuang, Pengfei Xu, Zhiping Zhou, and Hui Su. 2023. "Hardware Error Correction for MZI-Based Matrix Computation" Micromachines 14, no. 5: 955. https://doi.org/10.3390/mi14050955

APA Style

Hou, H., Xu, P., Zhou, Z., & Su, H. (2023). Hardware Error Correction for MZI-Based Matrix Computation. Micromachines, 14(5), 955. https://doi.org/10.3390/mi14050955

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop