We simulate the optical system using the spectrum of plane-waves method [
27] for propagation between SPP and MC, while using ray tracing for propagation between the front face of MC and rear face of the lens, for evaluation of wavefront aberrations, and for optimization of aspherical shapes of MC interfaces. The Debye–Wolf diffraction integral [
28] was used to compute the focal spot. Two different aspherical lenses with NA 0.21 (low NA asphere) and 0.47 (high NA asphere) were used, to differentiate between effects caused by strong focusing and effects which are not caused by strong focusing. A simple Wolfram Mathematica code was written to perform the simulations, which is placed as a
supplementary material. We briefly describe the methods used for simulations in the first subsection, while we give the expression for all aspherical surfaces, including both lenses and MC for both variations, in the second subsection, and we present and discuss the results regarding the focal spot in the third subsection.
3.1. Methods Used for Numerical Simulations
The spectrum of plane-waves method is well known, easy to implement, and efficient for beams with nearly plane wavefront in the near-field. It involves Fourier transforming the input signal and adding a phase, in accordance with the dispersion relation of the waves and distance between input and output planes, to determine the output signal. Mathematically, it is described in Equation (8).
where
k0 =
ω/c and
k is the transversal wave-vector of a plane-wave. We use this formula in the case of variation 1 to simulate the propagation of the beam over the distance
z = 1 m, between SPP and MC, starting with a Gaussian beam with diameter 16 mm, after we multiplied the field with the complex transmission function of SPP (topological charge 𝓁 = −1).
Ray tracing is used everywhere in optics. Therefore, here, we discuss only the way in which we use it to compute the wavefront aberration produced by the combination MC + lens and how we reconstruct the field from ray information traced through an optical component. Firstly, let us define the ray vector (
t,
tz) which points in the direction of the ray and has the property that
k0 t =
k. Let us imagine a divergent bundle of rays that originates from a single point. One ray from the bundle intersects a plane surface placed at distance
z from the starting point, at coordinate
rs(
t) =
z t/tz, while the optical path length along the ray is
s(
t) =
z/tz. We can observe now that the complex amplitude produced by the plane-wave indexed by
k in the plane
z, which usually can be written as exp[i
kz (
k)
z + i
k·
r], where
kz (
k) = (
k20 −
k2)
1/2, could also be written as exp[i
k0 s(
k) + i
k·
r − i
k·
rs(
k)]. Therefore, we can conclude that information about the ray intersection with the surface and optical path length along the ray can be used to model the propagation of a plane-wave. Now, we have to determine the additional phase that each plane-wave acquires in the focal plan due to the aberrations of the system.
Equation (9) evaluates the aberrations in the focal plane, usually evaluated in the plane tangent to the rear face of the last component or on the surface of a sphere centered in the focal point. s1(k) is the optical path length along the ray, between the input plane and the plane tangent to the last surface, s2(k) is the optical path length along the ray indexed by k between this plane and the focal plane, and rs is the point of intersection between the ray and the focal plane. The approximate value from Equation (9) is equivalent to the wavefront aberration as is usually employed for the Debye–Wolf integral since it is given up to a constant by the optical path length between the input plane and the sphere of radius f centered in the focal point. We used the correct value from Equation (9), which also takes into consideration that the plane-waves are not directed exactly to the focal point.
To determine the complex amplitude of the beam in a certain plane after the focusing element, in the point indicated by vector
R, we simply take the complex amplitude in the input plane, in the point
r(
R) that gives the intersection between the input plane with the ray which passes through
R, and we multiply this value with a factor which accounts for the fact that the density of rays changes along the ray indexed by
R. Ray density is computed using a Voronoi mesh [
29] as the inverse of the area of a Voronoi cell; Voronoi mesh is a built-in function of Wolfram Mathematica. The complex amplitude of the beam in the output plane (after the lens) is given in Equation (10), where the phase factor exp[i
k0 s1(
R)] was dropped but is later included when the Debye–Wolf integral is performed, in Φ(
t).
For implementation of the Debye–Wolf integral, we used Equations (11)–(14) from Reference [
30], with wavefront aberration determined according to Equation (9) and the complex amplitude of the beam, denoted by
Vlens, determined using Equations (8) and (10). The complex amplitude of the beam in the output plane, in the point
R, is proportional to the complex amplitude of the plane-wave with direction given by the ray vector of the ray that passes through
R. Since there is a one-to-one correspondence between the position in the transversal plane, after the focusing element, and the transversal projection of ray vector
t, we can index the complex amplitude depending on
t as shown in Equation (11).
Matrix , whose expression is in Equation (12), gives the contribution of each plane-wave to the electric field components Ex, Ey, and Ez depending on its polarization, while θ(t) gives the orientation of the ray/plane-wave indexed by t, around the optical axis.
3.2. Expression of the Surface of MC
Surfaces of optical component MC should be conical shaped. Although the tip of the rear (convex) face can be produced in practice with enough sharpness, the front (concave face) cannot be made in a conical shape without a non-transmissive region that could be even larger than 2 mm in radius. It seems that a conical (aspherical) shape with a rounded tip and curvature radius larger than 2 mm on the front face will also highly distort the wavefront. The solution we found was to choose a conical (aspherical) shape with a rounded tip with curvature radius 5 mm (for variation 1) and to correct the shape of the rear face using higher aspherical coefficients. We would like to mention that spherical concave lenses with curvature radius less than 5 mm are commercially available. Gonzalez-Acuña and Guitiérrez-Vega [
31] determined the shape of the rear surface of a component, being given the equation for the front face, such that the component behaves as an axicon that tilts the rays with angle
β. We use these equations by setting
β = 0 to determine the shape of the rear surface and then we fit the data using the aspherical lens (Equation (13)).
The variation from the target value of mode conversion efficiency of MC for the waves which travel near the optical axis is not essential, in the case of variation 1 all modes are helical; therefore, this will not produce intensity on the optical axis in the focal plane. On the other hand, in the case of variation 2, the mode without topological charge is necessary to form the top-hat shape. Equations for shapes of front/rear interfaces of MC for variation 1 and 2 are presented in
Table 1; for completeness, we give also the expression of the front face of the two aspherical lenses used for simulation.
The back surface of MC is designed for an aperture diameter of 26 mm, while the clean aperture is considered 23 mm in diameter. It is very likely that better designs are possible for both versions of MC, while what we present is more a proof of concept. Aspherical coefficients α
2n have alternating signs and very high values, which means that they compensate for each other to produce a correction that is not easily described by a polynomial function. As a result, the designed surface presents small variations from the ideal surface, given by equations from Reference [
31], which scatter the rays. The geometrical spot for both cases is still sub-wavelength. For version 1 of MC, we could not properly fit the ideal surface near the tip; therefore, the waves that pass through the central region are not focused on the optical axis. MC for variation 2 looks more like a meniscus lens, having a larger curvature radius; therefore, this problem was not present.
3.3. Results
In
Figure 3a, we show the intensity profile of the beam right before (red) and after (blue) MC for variation 1 of the system, with green representing the conversion efficiency depending on the input coordinate. The beam is helical due to the presence of SPP, placed 1 m before MC. We can observe that the conversion efficiency is nearly 50%, except in the central region, where it is nearly zero. In
Figure 3b, we have a ray-fan diagram (red) which represents the position of the ray in the focal plane depending on the position before the MC. We can observe small oscillations near zero that are caused by the inability of aspherical lens formula to perfectly represent the ideal corrected rear surface of MC. In
Figure 3c (inset), we represent MC for variation 1 with the high-NA asphere in the longitudinal section, where the distance between components is shrunk to fit in the image. In
Figure 3d (inset) is the ray spot diagram of the system, where we can observe that the geometrical spot is smaller than the wavelength and, therefore, the system is nearly diffraction-limited.
The focal spot for variation 1A, which is polarized azimuthally, is presented in
Figure 4b, while a focal spot with radial polarization is presented in
Figure 4a for comparison. The radially polarized focal spot can be generated using a polarization rotator that rotates the polarization anti-clockwise instead of clockwise. The high-NA aspherical lens (0.47 NA) was used for focusing, in both cases. For both cases, the polarization state is the same after PR and in the focal plane because the beam is formed by a superposition between two helical beams with topological charge +1 and −1 with orthogonal circular polarization states. For the azimuthally polarized beam (variation 1A), the intensity profile in the focal plane is typical for a beam used for STED microscopy, having zero intensity on the optical axis. The radially polarized focal spot has non-zero intensity on the optical axis, and this behavior is a marker for high NA focusing, as pointed out, for example, in Reference [
22]. The presence of longitudinal electric field on the optical axis for the radially polarized helical beams was also experimentally demonstrated in Reference [
26]. This is proof that, by using a lens with 0.47 NA, we are in the strong focusing regime. We should observe a non-zero intensity on the optical axis for variation 1B and 1C if this effect is present, and we should be able to determine if the focal spot generated by variation 1B or by variation 1C is proper for STED microscopy.
Focal spots produced by variation 1B and variation 1C using the same focusing element are shown in
Figure 5a,b respectively, where the polarization state is also shown. We can observe in
Figure 5 that, around 2 μm from the optical axis, polarization becomes azimuthal. This may seem unexpected since the polarization for variation 1C is radial after the polarization rotator (Equations (6) and (7)). This change in polarization state suggests that the phase difference between the two circularly polarized modes that are superimposed increases with π. This phenomenon is attributed to the Gouy phase shift [
24], which is different for the mode with topological charge −3. For both cases, the intensity is zero on the optical axis, and, for both cases, the polarization is partially azimuthal. The main difference between the two configurations is that variation 1B produces a focal spot with axial symmetry, while, in the case of variation 1C, the focal spot is circularly symmetric. The focal spot generated by variation 1A is also circularly symmetric but less extended than in the case of variation 1C.
The expression of the complex electric field generated by a paraxial Gauss–Laguerre mode is given in Equation (14), where the only phase term which depends on the
z-coordinate and is not common for modes with different indexes is the Gouy phase term
ψp𝓁(
z) = (2
p + |𝓁| + 1) arctan(
z λ/(π w20)). Considering the focal plane as the origin, we can evaluate the Gouy phase in the plane of the lens, with the expression given in Equation (15). Other notations are as follows:
w(
z) is the
z-dependent half-diameter of the beam,
q(
z) is the complex beam parameter, and
Lp|𝓁| is the generalized Laguerre polynomial.
The mode with helical index −3 acquires a phase π with respect to modes with helical index −1, which justifies a change of sign in the Jones vector, as done in Equations (18) and (19).
Equations (16)–(19) describes the beam generated by variation 1B, represented in
Figure 5a using Jones vectors. For the Jones vector description, we consider an ideal MC; note that input polarization (before QWP) is (1, 1). Equation (16) represents the polarization state after the HWP (before MC), considering HWP oriented with the fast axis along
x and QWP oriented at 45°, as a superposition of right/left circularly polarized modes. The Jones vector for the beam in the plane of the lens is given in Equation (17). Equations (18) and (19) give the polarization state in the focal plane. We observe that the cos
θ dependence of the complex field amplitude in the focal plane, as well as the nearly azimuthal polarization state, can be explained by a change of sign in the Jones vector for the mode with topological charge −3. Polarization is not perfectly linear with azimuthal orientation because the modes do not have the same amplitude profile. Polarization is linear only in the region where the amplitude of the mode with topological charge −3 is equal to the amplitude of the mode with topological charge −1. This affirmation is correct also for the spot produced by variation 1C, represented in
Figure 5b.
We summarize the polarization states of the beam after each optical component and for each variation of the system 1A, 1B, 1C, and 2 in
Figure 6. Variation 2 is the configuration of the system for the generation of flat-top beams. Note that the first polarization state represented is the polarization state of the beam after QWP; note that MC for variation 2 is different from MC for variation 1A, 1B, and 1C, and also note that HWP produces a relevant effect only in the case of variation 1B for which it can be used to rotate the diffraction pattern. For variation 2, the polarization state along the optical axis is maintained through the system.
We also consider the case with QWP and HWP rotated simultaneously in the same direction, with the rotation angle of QWP twice the rotation angle of HWP (
φ = 2
γ). The Jones vector for the beam after the HWP is given in Equation (20) as a superposition between right and left circularly polarized modes, and we can see that the weight of each mode is controlled by the rotation angle
φ of the QWP. As a consequence, the beam in the plane of the lens (expression given in Equation (21)) is described as the summation of one azimuthally polarized beam with zero angular momentum (expression given in Equation (5)) and a radially polarized beam with non-zero angular momentum (with expression given Equation (7)). As we did before, we add a minus sign to the component with topological charge −3 in order to take into account the change in polarization due to propagation to the focal plane; the polarization state in the focal plane is described by Equations (22) and (23).
For the simulations presented in
Figure 7, we consider the low-NA aspherical lens, with back focal length 46 mm and clear aperture 23 mm in diameter, as the focusing element.
Figure 7 also indicates that the phenomenon that leads to the generation of the circularly non-symmetrical focal spot is not due to the strong focusing (presence of
Ez component), since the results are in perfect accord with simulations using a focusing element with higher NA, shown in
Figure 4 and
Figure 5.
In
Figure 7a–d,f–h,j–l, we represent the transition between variation 1A and 1C of the system resulting from the clockwise rotation of QWP by 10° and the clockwise rotation of HWP by 5°, which means
φ = 2
γ; cases
φ = 70°,
γ = 45° and
φ = 60°,
γ = 45° are represented in
Figure 7e,i, respectively. Rotation of HWP is necessary in order to preserve the orientation of the focal spot and can be used to further control the orientation, which can be observed from the fact that
Figure 7e does not have the same orientation as
Figure 7f, while
Figure 7i does not have the same orientation as
Figure 7j. We can see how the polarization state of the beam before MC (represented by black ellipses) affects the focal spot. Results from
Figure 7 are well explained by Equation (23).
Hao and Ledger [
26] used a similar optical system, using a radial analyzer instead of MC and a polarization rotator that rotated the polarization by 90°. They produced two different types of radially polarized vector beams depending on the input polarization on the radial analyzer. One of the vector beams produced a focal spot similar to that presented in
Figure 4a, and the other vector beam produced a focal spot similar to configuration 1C (
Figure 5b). They experimentally proved that the radially polarized focal spot possesses non-zero intensity along the optical axis. We wanted to point out that the focal spot which is similar to the focal spot represented in
Figure 5b is also radially polarized before the focusing element, and it is likely that the underlying phenomenon that generates this certain focal shape is the same. The configurations that can be produced using elliptical/linear input polarization are not evidenced in the paper, but they are likely to be different from our case since the linearly polarized beam cannot become circularly symmetric after the radial analyzer. Sánchez-López et al. [
32] also showed that the Gouy phase plays an important role in the change of the intensity profile of a vector beam near the focal point.
In
Figure 8, we show the focal spot produced by variation 2 of the optical system proposed herein. The optical layout is similar to the layout for variation 1A, except that MC has slightly different characteristics and SPP is not present. Unlike variation 1A, the beam is composed of a helical mode −2 with left circular polarization and a non-helical mode with right circular polarization, while the mode conversion efficiency is set around 80%. The profile flatness is good and is maintained for 5–10 µm. The edges are not sharp, but this is not a design problem; it is caused by the fact that the spot is almost diffraction-limited and the length in the transversal plane on which the intensity drops from the maximum value to zero is approximatively given by the radius of the diffraction-limited spot. The polarization state is circular near the optical axis. One of the major advantages is the fact that the spot is radially polarized near the edges of the flat profile; radial polarization, as pointed in Reference [
7], is ideal for Si/SiO
2 processing, especially for drilling boreholes.