1. Introduction
Accurate distance measurement is paramount in industrial settings, serving diverse applications such as manufacturing, optical metrology, robotics, and quality control. Various methods are available to measure distance and displacement, surface profiles, and vibrations [1]. The widely used methods for distance measurement include intensity-based sensors [2], triangulation sensors [3], time-of-flight sensors [4], confocal sensors [5,6,7,8,9,10,11], and interferometric sensors [12,13,14]. These techniques offer high precision, rapid response times, and non-contact measurement, enabling efficient and precise operations without direct interaction with the target.
Among the methods mentioned above, only interferometric and chromatic confocal sensors can achieve a precision finer than one micrometer, providing extremely high-resolution measurements [1,6,12]. The main disadvantage of interferometric sensors is that they are susceptible to slight motion and vibration [15].
Confocal sensors can be divided into two types: monochromatic confocal sensors [16,17] and polychromatic (chromatic) confocal sensors [5,6,7,8,9,11]. Their functionality is based on well-known confocal microscopy, in which a narrow beam of light from an aperture is focused on the sample, and the scattered or reflected light is detected by a sensor after passing through a pinhole. Because of this aperture, the out-of-focus light is rejected and the maximum intensity is obtained from the image plane [18].
Monochromatic sensors use a single wavelength, primarily from a laser, to illuminate the target with a small spot, and the reflected beam is gathered through a pinhole. The object’s surface profile is constructed by scanning the entire sample surface and recording the intensity of the spots. In contrast, the polychromatic sensor uses a broadband light source. The light is focused on the sample using a chromatic lens. The sensor collects the spectrum of the light reflected by the sample; because the spectral profile changes with the distance to the sample, a change in the distance from the sensor to the sample can be related to a shift in the spectrum’s peak. Polychromatic sensors can accurately measure the depth and topography of an object, even when the object contains various materials and colors. Hu et al. [19] developed a fast chromatic confocal sensor that is capable of working on different reflective surfaces. Similar work was done by Jeong et al. [20], who measured non-specular reflective surfaces with an axial resolution of about 130 µm.
A key comparison in high-precision optical measurement lies between laser interferometry and confocal sensing, as both methods offer excellent resolution and working distance ratios, typically in the micrometer range [1]. Although laser interferometry can achieve sub-nanometric resolution and accommodate larger working distances, confocal sensors present several advantages. Confocal sensing is more robust against vibrations, requires simpler calibration, and involves a less complex setup. Moreover, while this work primarily focuses on relative measurements, confocal sensors have the potential to provide absolute distance measurements by linking focus distances for different wavelengths with a calibration curve. In contrast, laser interferometry is generally more suited for displacement monitoring, rather than absolute distance measurement, making confocal sensing a versatile alternative in applications requiring precise distance tracking, such as industrial manufacturing [21].
Most works on confocal sensors in the recent literature have focused on working distances below 50 mm [22,23]. In this study, we present the development of a polychromatic confocal sensor that employs a robust algorithm to measure the relative displacement of the sample. The key advantage of this sensor, compared to conventional chromatic confocal sensors, is its ability to operate at longer working distances without altering the optical configuration of an existing industrial machine. This makes the sensor particularly suitable for applications involving the measurement of thickness and depth in extended samples through axial motion. Additionally, a Python-based simulator has been created to assist in the design process and predict the sensor’s performance, with the source code available for public use.
  2. Experimental Setup
The chromatic confocal sensor was purposefully designed to integrate into an existing optical setup for the wavefront phase measurement of 300 mm diameter optics (Figure 1), developed in [24].
The experimental setup is shown in Figure 2. The sensor can be split into four main components: illuminator, collector, signal conditioner, and spectrometer. The illuminator path consists of a tungsten-halogen light source (SLS201L, Thorlabs, Newton, NJ, USA), which delivers a stable 10 mW black-body-like spectrum spanning from 360 to 2600 nm, coupled to a multimode optical fiber patch cable with a 50 µm core. The other end of the optical fiber is at the focal plane of a positive achromatic doublet lens of 100 mm focal length, which acts as a collimator. The broadband collimated beam emerging from the collimator passes through a 50/50 non-polarizing beam splitter and is finally focused by a large singlet aspheric lens made of BK7 glass; the most relevant optical parameters are shown in Table 1. The singlet design of this lens causes a large focal shift between the extremes of the spectrum. The sample is typically placed at the midpoint of this range.
The collector path begins where the light reflected by the sample is partially collimated by the aspheric singlet lens. Again, the singlet design of the lens produces a difference in the collimation of different wavelengths, i.e., the wavelength of the best collimation changes if the sample is moved closer to or farther from the aspheric lens. The beam splitter directs the reflected light to an achromatic doublet of 30 mm focal length.
The signal conditioner is composed of three achromatic doublets, a spatial filter, and a pinhole. The first two achromatic doublets work as a Keplerian beam expander, expanding the beam to a diameter of 25 mm in order to enable the use of a spatial filter, blocking an inner disk of 20 mm in diameter. The last achromatic doublet lens focuses the filtered light into a 200 µm pinhole.
The spectrometer (CCS175, Thorlabs, Newton, NJ, USA) is coupled via a fiber bundle to the back of the pinhole. Its spectral range is from 500 to 1000 nm with a resolution of 0.16 nm. The full chromatic confocal sensor assembly is shown in Figure 3.
Finally, the displacement of the sample towards the main lens results in a shift of the spectrum’s maxima towards shorter wavelengths; a converse shift occurs when the sample is displaced further from the main lens.
  3. Simulator Software
To facilitate the design of the chromatic confocal sensor, a custom Python module was developed. Simulations were conducted considering the actual surfaces of the experimental setup. This module offers a straightforward approach to optical design for the chromatic confocal sensor; given the open-source nature of the software, it may also help other authors to develop their own sensors. Figure 6 illustrates the optical design along with ray tracing for the minimum and maximum wavelengths utilized in the experiment. Due to the design specifications of the optical setup, including the beam splitter and achromatic lenses, the simulations were limited to a spectrum of up to 800 nm, as the system was optimized for wavelengths ranging from 300 to 800 nm.
To simulate the propagation of light rays through an aspheric optical system, we developed a ray-tracing module using Python 3.12. This module leverages numerical optimization techniques and performance enhancements to accurately model ray interactions with complex lens surfaces. The core of the simulation is based on solving for the intersection points between light rays and the surfaces of aspheric lenses, as well as computing the subsequent refraction according to Snell’s law. All of the lenses are defined as aspheric surfaces. Those that are not aspheric will have the conic constant set to zero. The simulation workflow is detailed as follows:
  3.1. Ray–Surface Intersection
The surfaces of the aspheric lenses are defined by the following conic equation:
$$ z(y) = z_0 + \frac{y^2}{R\left(1 + \sqrt{1 - (1 + k)\,\dfrac{y^2}{R^2}}\right)} $$
where $R$ is the radius of curvature, $k$ is the conic constant, and $z_0$ is the offset of the surface. To find the intersection of a ray with this surface, a merit function was constructed to describe the geometry of the ray relative to the surface. The system of equations is solved numerically using fsolve from the SciPy Python package [25], which is based on Powell’s hybrid method [26]. The solver computes the ray’s coordinates ($y$, $z$) at the point of intersection with the lens, ensuring geometric consistency between the ray’s path and the surface.
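As a minimal sketch of this intersection step, the following Python snippet finds where a meridional ray meets a conic surface. It assumes the standard conic sag form for the surface and, to remain dependency-free, replaces fsolve with a hand-written Newton iteration; the function names (sag, sag_slope, intersect) and the numerical values are illustrative and are not taken from the published module.

```python
import math

def sag(y, R, k, z0):
    """Axial position z of a conic surface at height y (standard sag form)."""
    root = math.sqrt(1.0 - (1.0 + k) * y * y / (R * R))
    return z0 + y * y / (R * (1.0 + root))

def sag_slope(y, R, k):
    """Derivative dz/dy of the conic surface at height y."""
    return (y / R) / math.sqrt(1.0 - (1.0 + k) * y * y / (R * R))

def intersect(y_s, z_s, slope, R, k, z0, tol=1e-12, max_iter=50):
    """Return (y, z) where the ray y = y_s + slope * (z - z_s) meets the surface.

    Newton's method is used here as a stand-in for a general root finder.
    """
    z = z0  # initial guess: the surface vertex
    for _ in range(max_iter):
        y = y_s + slope * (z - z_s)
        residual = sag(y, R, k, z0) - z              # zero at the intersection
        derivative = sag_slope(y, R, k) * slope - 1.0
        step = residual / derivative
        z -= step
        if abs(step) < tol:
            break
    return y_s + slope * (z - z_s), z

# Ray parallel to the axis at height 10 mm hitting a sphere (k = 0) of R = 100 mm.
y_hit, z_hit = intersect(y_s=10.0, z_s=-50.0, slope=0.0, R=100.0, k=0.0, z0=0.0)
print(y_hit, z_hit)
```

The residual plays the role of the merit function: it vanishes only when the point on the ray also lies on the surface.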
  3.2. Surface Slope and Refraction
Once the ray crosses the lens surface, the surface slope at the intersection point is computed using the derivative of the surface equation. This slope is necessary to determine the tangent to the surface, which, in turn, is used to compute the angle of incidence. The refraction on the lens surface is modeled using Snell’s law:
$$ n_1 \sin\theta_1 = n_2 \sin\theta_2 $$
where $n_1$ and $n_2$ are the refractive indices of the incident and refracting media, respectively, and $\theta_1$ and $\theta_2$ are the angles of incidence and refraction. The refracted angle is then used to update the trajectory of the ray as it propagates through the lens.
  3.3. Simulation Workflow
The overall simulation workflow involves:
- Generating an array of rays, either randomly or uniformly distributed across the light source radius.
- Iteratively calculating the intersection points of each ray with each optical surface.
- Updating the ray’s trajectory, based on the calculated intersection and refraction.
- Storing the traced rays for further analysis.
Using numerical optimization and efficient computational techniques, this module enables the simulation of complex optical systems, making it suitable for applications in chromatic confocal systems and other optical designs.
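The refraction step of Section 3.2 can be illustrated with a short two-dimensional sketch. It assumes that ray angles are measured with respect to the optical axis (z) and that the surface is described by its local slope dz/dy at the intersection point; the function name refract and the sign conventions are choices made for this example, not those of the published module.

```python
import math

def refract(ray_angle, surface_slope, n1, n2):
    """Return the ray angle (rad, w.r.t. the optical axis) after refraction.

    ray_angle     : angle of the incoming ray, tan(ray_angle) = dy/dz
    surface_slope : dz/dy of the surface at the intersection point
    n1, n2        : refractive indices before and after the surface
    """
    # Normal of the curve z - z(y) = 0, expressed as an angle to the z axis.
    normal_angle = -math.atan(surface_slope)
    theta1 = ray_angle - normal_angle            # angle of incidence
    sin_theta2 = n1 * math.sin(theta1) / n2      # Snell's law
    if abs(sin_theta2) > 1.0:
        raise ValueError("total internal reflection")
    theta2 = math.asin(sin_theta2)               # angle of refraction
    return normal_angle + theta2                 # back to axis-referenced angle

# A ray parallel to the axis entering a convex glass surface above the axis
# is bent towards the axis (negative angle).
print(refract(0.0, 0.1, 1.0, 1.5))
```

In a full trace, this function would be called once per surface, directly after the intersection point and the surface slope have been computed.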
  3.4. Ray Tracing Through Optical Elements
The simulation handles multiple optical elements by iterating through all surfaces in the system. Rays are initialized at the light source, and their propagation is traced through successive elements. The refractive index of each element is obtained from a pre-defined dataset in a JSON-like file, and the ray’s intersection and refraction are recalculated at each surface.
The software can be used freely as a Python module and can be found at [27]. All simulations were carried out using version 0.1.34.
  4. Methods
The methodology entails leveraging the chromatic aberration exhibited by a lens to establish a working range, given by the focal lengths, corresponding to different wavelengths. The illumination of the primary lens with a collimated white-light beam generates this working range.
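To give a feeling for how chromatic aberration creates a working range, the sketch below estimates the focal length of a BK7 singlet at several wavelengths. It rests on strong simplifications that are assumptions of this example only: a thin plano-convex lens with f = R/(n − 1), the Sellmeier coefficients of Schott N-BK7, and a hypothetical radius of curvature of 320 mm chosen merely to give a focal length of roughly 620 mm. The actual lens is aspheric and its parameters are those of Table 1.

```python
import math

# Sellmeier coefficients for Schott N-BK7 (wavelength in micrometers).
B = (1.03961212, 0.231792344, 1.01046945)
C = (0.00600069867, 0.0200179144, 103.560653)

def n_bk7(wavelength_um):
    """Refractive index of N-BK7 from the Sellmeier equation."""
    lam2 = wavelength_um ** 2
    n_squared = 1.0 + sum(b * lam2 / (lam2 - c) for b, c in zip(B, C))
    return math.sqrt(n_squared)

def focal_length_mm(wavelength_um, radius_mm):
    """Thin plano-convex lens approximation: f = R / (n - 1)."""
    return radius_mm / (n_bk7(wavelength_um) - 1.0)

RADIUS_MM = 320.0  # hypothetical value, not taken from Table 1

for wavelength in (0.50, 0.65, 0.80):
    f = focal_length_mm(wavelength, RADIUS_MM)
    print(f"{wavelength * 1000:.0f} nm -> f = {f:.2f} mm")
```

Because the refractive index decreases with wavelength, shorter wavelengths focus closer to the lens, and the interval between the shortest and longest focal lengths defines the working range.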
Figure 8 shows two spectra that were captured at two positions of the sample, separated by 800 µm. Even though the spectrum has changed, the maximum value lies at the same wavelength, demonstrating that detecting the peak wavelength in a raw spectrum signal is not enough to resolve even such a large displacement. To mitigate this issue, two distinct methods were employed: low-pass filtering and Gaussian fitting.
   4.1. Low-Pass Filtering Method
The low-pass filtering method involves isolating and retaining only the lowest frequency components of the spectrum. This process is achieved by performing a Discrete Fourier Transform (DFT) on the detected spectral signal, as outlined in Equation (7). The DFT converts the spectral data from the wavelength domain into the frequency domain, enabling the separation of different frequency components. Mathematically, the DFT of a discrete signal $S(n)$ is given by
$$ X(k) = \sum_{n=0}^{N-1} S(n)\, e^{-i 2\pi k n / N}, \qquad k = 0, 1, \ldots, N-1 \tag{7} $$
where $S(n)$ represents the sampled spectrum, $N$ is the number of data points in the spectrum, and $X(k)$ are the frequency-domain coefficients, each corresponding to a specific frequency component of the signal. These coefficients indicate the contribution of various frequency components to the overall spectral data.
After applying the DFT, a rectangular low-pass filter is employed in the frequency domain to remove higher-frequency components and retain only the low-frequency parts of the signal. This filter essentially “cuts off” frequencies above a certain threshold, passing only those below it. Mathematically, the rectangular filter $H(k)$ can be defined as
$$ H(k) = \begin{cases} 1, & |k| \le k_{\mathrm{cutoff}} \\ 0, & \text{otherwise} \end{cases} $$
where $k_{\mathrm{cutoff}}$ is the cutoff frequency defining the boundary between the low and high frequencies. The filtered signal $X_{\mathrm{filtered}}(k)$ is then obtained by multiplying the DFT coefficients $X(k)$ by the filter $H(k)$:
$$ X_{\mathrm{filtered}}(k) = X(k)\, H(k) $$
By performing the inverse DFT (IDFT) on the filtered frequency components, the signal is transformed back into the wavelength domain. The resulting signal now contains only the smooth, slowly varying features, which correspond to the low-frequency components of the original spectrum.
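The DFT, rectangular filter, and inverse DFT chain can be sketched in a few lines of pure Python. This is an illustrative, unoptimized O(N²) implementation; the function names and the test signal are assumptions of this example, and a production implementation would normally rely on an FFT library. Since the DFT index runs from 0 to N − 1, the negative frequencies appear mirrored at N − k, so the filter keeps the bins for which min(k, N − k) does not exceed the cutoff.

```python
import cmath
import math

def dft(signal):
    """Discrete Fourier Transform of a real or complex sequence."""
    n_pts = len(signal)
    return [sum(signal[n] * cmath.exp(-2j * math.pi * k * n / n_pts)
                for n in range(n_pts))
            for k in range(n_pts)]

def idft(coeffs):
    """Inverse DFT; returns the real part, as the spectrum is real-valued."""
    n_pts = len(coeffs)
    return [sum(coeffs[k] * cmath.exp(2j * math.pi * k * n / n_pts)
                for k in range(n_pts)).real / n_pts
            for n in range(n_pts)]

def low_pass(signal, k_cutoff):
    """Rectangular low-pass filter applied in the frequency domain."""
    coeffs = dft(signal)
    n_pts = len(coeffs)
    filtered = [c if min(k, n_pts - k) <= k_cutoff else 0.0
                for k, c in enumerate(coeffs)]
    return idft(filtered)

# Synthetic example: a slow component plus a fast ripple.
N = 64
slow = [math.cos(2 * math.pi * n / N) for n in range(N)]
ripple = [0.5 * math.cos(2 * math.pi * 20 * n / N) for n in range(N)]
smoothed = low_pass([s + r for s, r in zip(slow, ripple)], k_cutoff=3)
print(max(abs(a - b) for a, b in zip(smoothed, slow)))
```

With the cutoff set below the ripple frequency, the filtered signal reproduces the slow component alone.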
In Figure 9, the impact of applying low-pass filtering on the spectrum is illustrated. Notably, the position of the maximum in the filtered spectrum no longer coincides exactly with the distinct peak of the unfiltered spectrum. The resulting spectrum is smoother, making it easier to track gradual variations in the signal, such as those caused by sample displacement.
  4.2. Gaussian Fitting Method
A similar approach was employed to fit a Gaussian distribution to the averaged spectral data. This fitting process was executed using a nonlinear least-squares optimization method, as outlined in [28]. Specifically, the nonlinear regression technique was utilized to estimate the parameters $\theta = \{\mu, \sigma, A\}$, where $\mu$ is the mean (or peak location), $\sigma$ is the standard deviation (which defines the width of the Gaussian), and $A$ is the amplitude of the distribution. The objective is to minimize the residual sum of squares between the observed spectral data points, $y_i$, and the predicted values of the Gaussian model $f(x_i, \theta)$, formalized in the cost function
$$ J(\theta) = \sum_{i} \left( y_i - f(x_i, \theta) \right)^2 $$
where $f(x_i, \theta)$ represents the Gaussian function, typically given by
$$ f(x_i, \theta) = A \exp\!\left( -\frac{(x_i - \mu)^2}{2\sigma^2} \right) $$
The nonlinear least-squares fitting process iteratively adjusts the parameter set, θ, to minimize the difference between the model and the measured data points. This is typically achieved using iterative optimization algorithms, such as gradient-based techniques, which are well suited for problems involving nonlinearities in the parameter space. The optimal parameter values correspond to the best-fit Gaussian distribution that characterizes the averaged spectra. This allows critical properties of the spectra, such as the peak location (i.e., the maximum of the distribution given by µ), to be precisely tracked. Monitoring these properties is essential for understanding spectral shifts.
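The fitting procedure can be sketched with a small Levenberg–Marquardt-style loop written in pure Python. This is only an illustration of nonlinear least-squares fitting of a Gaussian; it is not the routine of [28], and the function names, the damping schedule, and the synthetic noise-free spectrum are all assumptions of this example.

```python
import math

def gaussian(x, mu, sigma, amp):
    return amp * math.exp(-((x - mu) ** 2) / (2.0 * sigma ** 2))

def solve3(a, b):
    """Solve a 3x3 linear system by Gaussian elimination with pivoting."""
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(3):
        pivot = max(range(col, 3), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, 3):
            factor = m[r][col] / m[col][col]
            for c in range(col, 4):
                m[r][c] -= factor * m[col][c]
    x = [0.0, 0.0, 0.0]
    for r in (2, 1, 0):
        x[r] = (m[r][3] - sum(m[r][c] * x[c] for c in range(r + 1, 3))) / m[r][r]
    return x

def cost(xs, ys, theta):
    """Residual sum of squares between data and model."""
    return sum((y - gaussian(x, *theta)) ** 2 for x, y in zip(xs, ys))

def fit_gaussian(xs, ys, theta0, iterations=100):
    """Estimate theta = (mu, sigma, amp) by damped Gauss-Newton steps."""
    theta, lam = list(theta0), 1e-3
    current = cost(xs, ys, theta)
    for _ in range(iterations):
        mu, sigma, amp = theta
        jtj = [[0.0] * 3 for _ in range(3)]
        jtr = [0.0] * 3
        for x, y in zip(xs, ys):
            e = math.exp(-((x - mu) ** 2) / (2.0 * sigma ** 2))
            # Partial derivatives of the model w.r.t. mu, sigma, amp.
            grad = (amp * e * (x - mu) / sigma ** 2,
                    amp * e * (x - mu) ** 2 / sigma ** 3,
                    e)
            r = y - amp * e
            for i in range(3):
                jtr[i] += grad[i] * r
                for j in range(3):
                    jtj[i][j] += grad[i] * grad[j]
        for i in range(3):
            jtj[i][i] *= 1.0 + lam               # Levenberg-Marquardt damping
        trial = [t + s for t, s in zip(theta, solve3(jtj, jtr))]
        trial_cost = cost(xs, ys, trial)
        if trial_cost < current:                 # accept step, relax damping
            theta, current, lam = trial, trial_cost, lam / 10.0
        else:                                    # reject step, increase damping
            lam *= 10.0
    return theta

# Synthetic spectrum sampled every 0.16 nm (illustrative values only).
xs = [700.0 + 0.16 * i for i in range(200)]
ys = [gaussian(x, 715.3, 4.0, 1.0) for x in xs]
start = (xs[ys.index(max(ys))], 3.0, max(ys))    # initial guess from the data
mu_fit, sigma_fit, amp_fit = fit_gaussian(xs, ys, start)
print(mu_fit, sigma_fit, amp_fit)
```

The fitted value of µ can lie between two sampled wavelengths, which is what makes the fit useful for tracking peak shifts smaller than the sampling interval.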
  4.3. Spectrum Oversampling
In Figure 10, the impact of a displacement of 800 µm on the spectrum is demonstrated, where the peak of the distribution shifts by 0.498 nm. This corresponds to approximately 1606 µm of displacement per nanometer of spectral shift (800 µm / 0.498 nm); given the spectrometer resolution of 0.16 nm, the smallest resolvable displacement is therefore 1606 µm/nm × 0.16 nm ≈ 257 µm, i.e., roughly 250 µm. This sensitivity is constrained by the intrinsic resolution limit of the spectrometer, defined by the smallest wavelength interval it can reliably detect. To enhance the sensitivity and improve the precision of the displacement detection, the detected spectrum can be oversampled. This technique effectively increases the apparent resolution of the spectrometer, allowing for finer distinctions between spectral peaks. The oversampling process is achieved by linear interpolation between the actual data points obtained from the spectrometer. Linear interpolation works by estimating the values between the discrete data points that are directly measured by the spectrometer. If the spectrometer measures at wavelengths $\lambda_i$ and $\lambda_{i+1}$, linear interpolation constructs intermediate values, based on the assumption that the data between these points vary linearly. Mathematically, for a given point, $\lambda_k$, between $\lambda_i$ and $\lambda_{i+1}$, the interpolated value $S(\lambda_k)$ of the spectrum can be estimated as
$$ S(\lambda_k) = S(\lambda_i) + \frac{S(\lambda_{i+1}) - S(\lambda_i)}{\lambda_{i+1} - \lambda_i}\,(\lambda_k - \lambda_i) $$
where $S(\lambda_i)$ and $S(\lambda_{i+1})$ represent the spectrum intensities at the measured wavelengths $\lambda_i$ and $\lambda_{i+1}$, respectively, and $\lambda_k$ is the new interpolated wavelength. By performing this interpolation across the entire spectral range, additional virtual data points are created, effectively refining the wavelength resolution beyond the hardware limitations of the spectrometer.
The result of oversampling is a denser spectrum with more closely spaced data points, allowing for a more precise determination of the spectral peak position and improved sensitivity in detecting small displacements. This is particularly beneficial in applications where sub-nanometer shifts in the spectrum need to be detected but the physical resolution of the spectrometer is insufficient.
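The oversampling step amounts to inserting a fixed number of linearly interpolated points between each pair of measured samples. A minimal sketch follows; the function name oversample and the parameter points_between are hypothetical names chosen for this example.

```python
def oversample(wavelengths, intensities, points_between):
    """Insert `points_between` linearly interpolated samples between
    every pair of adjacent measured points."""
    new_wavelengths, new_intensities = [], []
    for i in range(len(wavelengths) - 1):
        wl_a, wl_b = wavelengths[i], wavelengths[i + 1]
        s_a, s_b = intensities[i], intensities[i + 1]
        for j in range(points_between + 1):
            t = j / (points_between + 1)         # fractional position in [0, 1)
            new_wavelengths.append(wl_a + t * (wl_b - wl_a))
            new_intensities.append(s_a + t * (s_b - s_a))
    # The last measured point closes the resampled spectrum.
    new_wavelengths.append(wavelengths[-1])
    new_intensities.append(intensities[-1])
    return new_wavelengths, new_intensities

# Three measured points with one interpolated value in each interval.
dense_wl, dense_s = oversample([0.0, 1.0, 2.0], [0.0, 10.0, 0.0], 1)
print(dense_wl)
print(dense_s)
```

For a spectrum of M measured points, the resampled spectrum contains (M − 1)(points_between + 1) + 1 points, and every measured point is preserved unchanged.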
  5. Calibration
To enhance the quality of the measurements, the chromatic confocal sensor must be calibrated by establishing a calibration curve. This calibration curve serves as a reference standard, enabling the straightforward tracking of sample displacement by correlating spectral changes with known displacements.
The calibration procedure consisted of mounting a 50 mm diameter flat mirror on a linear translation stage. The sample was positioned at 620 mm from the aspheric lens, and a total distance of 800 µm was scanned with a step size of 20 µm.
The experiment was carried out in a dark room environment, and the exposure time was set to 200 ms to ensure the optimal conditions for measurement accuracy. To increase the repeatability of the measurement, 3000 spectra were acquired for each sample position. The displacements were carried out in the direction away from the lens, resulting in the anticipated movement of the tracked peaks towards longer wavelengths.
This calibration step also works as a validation of the provided simulator software; for this, we simulated the same setup used for calibration. The comparison of calibration curves, both experimental and simulated, reveals a similarity, as depicted in Figure 11, and Table 2 provides a comparison of the slopes of both curves, indicating an estimated difference of 5%. It is important to note that the simulated spectra exhibit an infinite signal-to-noise ratio (SNR), and the extent of the light source is negligible; the spectral resolution was down-sampled by a factor of five, and the number of rays used was 220,000, suggesting potential areas for simulation enhancement. However, these findings suggest that the module serves as a reliable initial step in the design of a chromatic confocal sensor.
The discrepancy between the real and simulated data, apart from the factors mentioned above, is mainly due to the position of the sample inside the working range. As shown in Figure 5, the spacing between the foci of shorter wavelengths is larger than the spacing between those of longer wavelengths. Furthermore, within the range of longer wavelengths and over a sufficiently small range, the calibration curve can be approximated by a straight line.
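When the calibration curve is approximated by a straight line, it can be obtained by an ordinary least-squares fit of the detected peak wavelength against the known displacement, and then inverted to convert a measured peak into a displacement. The sketch below uses synthetic, noise-free data: the slope of 0.498 nm per 800 µm is taken from the shift reported in Section 4.3, while the 700 nm offset and the function names are assumptions made purely for illustration.

```python
def linear_fit(xs, ys):
    """Ordinary least-squares straight line y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x

def displacement_from_peak(peak_nm, slope, intercept):
    """Invert the calibration line: peak wavelength -> displacement (µm)."""
    return (peak_nm - intercept) / slope

# Synthetic calibration scan: 0 to 800 µm in 20 µm steps.
displacements_um = [20.0 * i for i in range(41)]
peaks_nm = [700.0 + (0.498 / 800.0) * d for d in displacements_um]  # hypothetical offset

slope, intercept = linear_fit(displacements_um, peaks_nm)
print(slope, intercept)
print(displacement_from_peak(700.249, slope, intercept))
```

With real data, the residuals of this fit indicate how well the straight-line approximation holds over the scanned range.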
Moreover, the wavelengths are normalized because the peak wavelength detected in the simulation does not match the one in the real data. This mismatch is due to the lens coatings, which strongly affect the transmitted intensity of each wavelength and, therefore, the detected peak wavelength, and which are not considered in the simulation.
Lastly, we only targeted the region where the highest sensitivity had been achieved experimentally. Furthermore, the motivation of this work was to measure the thickness of a silicon wafer from a long distance. The thickness of a single wafer is around 775 µm, which is the reason for the small range of 800 µm being used inside the working range.
  6. Results
In this section, we will show the effect of applying the different proposed methods to increase the sensitivity of the sensor.
Each displacement was executed in increments of 20 µm. At each position, three thousand spectra were captured, and the average of each group of one hundred spectra was computed to enhance the signal-to-noise ratio (SNR), resulting in thirty spectra per displacement. In Figure 12, the standard deviation (STD) for each sample position, based on these 30 averaged spectra, is compared. The upper section of the figure illustrates that the Gaussian fit generally exhibits a higher STD value across most of the sample positions, with the exception of two. The lower section displays the result of subtracting the frequency filter curve from the Gaussian fit curve. This difference shows that, apart from two sample positions, the values are predominantly positive, indicating that the uncertainty associated with the Gaussian fit is, on average, greater than that of the frequency filter within the examined range. The “Gaussian fit with 10 interpolated points” data appear as scattered points, since some standard deviation values were zero because no peak movement was detected, which is the same effect that is illustrated in Figure 8. The mean standard deviation with the largest oversampling outperforms the one with fewer interpolated points by a factor of 3.8.
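The averaging and repeatability analysis described above can be sketched as follows: consecutive groups of spectra are averaged point by point, a peak position is extracted from each averaged spectrum, and the standard deviation of those positions is computed. The raw maximum is used here as a placeholder for the peak detector, the sample standard deviation is an assumption (the text does not state which estimator was used), and the toy data are invented for the example.

```python
import statistics

def block_average(spectra, block_size):
    """Average consecutive groups of `block_size` spectra, point by point."""
    averaged = []
    for start in range(0, len(spectra) - block_size + 1, block_size):
        block = spectra[start:start + block_size]
        averaged.append([sum(column) / block_size for column in zip(*block)])
    return averaged

def peak_wavelength(wavelengths, spectrum):
    """Wavelength at which the spectrum reaches its maximum (placeholder
    for the filtered or fitted peak detector)."""
    index = max(range(len(spectrum)), key=lambda i: spectrum[i])
    return wavelengths[index]

def peak_std(wavelengths, spectra, block_size):
    """Sample standard deviation of the peak positions of averaged spectra."""
    peaks = [peak_wavelength(wavelengths, s)
             for s in block_average(spectra, block_size)]
    return statistics.stdev(peaks)

# Toy data: six 3-point spectra averaged in groups of three.
toy_wavelengths = [700.0, 700.16, 700.32]
toy_spectra = [[0, 1, 0], [0, 3, 0], [0, 2, 0],
               [1, 0, 0], [1, 0, 0], [1, 0, 0]]
print(block_average(toy_spectra, 3))
print(peak_std(toy_wavelengths, toy_spectra, 3))
```

With 3000 spectra and a block size of 100, block_average returns the thirty averaged spectra per position mentioned in the text.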
To justify the number of interpolated points, recall from Section 4.3 that a displacement resolution of approximately 250 µm per spectrometer resolution element (0.16 nm) was obtained experimentally, assuming a linear shift between the detected peaks. Thus, in order to detect movements of 20 µm, at least about ten additional values (250 µm / 20 µm ≈ 12.5) need to be interpolated to allow peak detection for such sample displacements. The higher value of 1000 interpolated points was motivated by the highest spectrometer resolution that can be found commercially, which is on the order of picometers.
Additionally, an interpolation between real spectrum values was performed; for Figure 13, the resolution was enhanced by a factor of ten, while for Figure 14, the resolution was increased by a factor of 1000, yielding a significantly higher resolution than the original results. Therefore, as previously stated, this interpolation step is crucial when the spectrometer’s resolution is insufficient to achieve the desired axial resolution.
  7. Conclusions
In conclusion, our study demonstrates the practicality and effectiveness of integrating a chromatic confocal sensor into a fixed optical setup, emphasizing the optimization of both the illumination and the sensing components. By carefully selecting and calibrating the chromatic lenses, we achieved a 20-micrometer resolution at a working distance of 620 mm, highlighting the system’s adaptability for various applications. The development of a custom Python module further streamlined the sensor design process, eliminating the need for expensive commercial software. However, the slope of the simulated calibration curve differed from the experimental one by approximately 5%, underlining the importance of validating simulation results with empirical testing.
Our findings also show that oversampling effectively enhances the axial resolution without the need for high-resolution spectrometers, and that low-pass filtering outperforms Gaussian fitting in peak detection. Looking ahead, we aim to optimize the extent of the light source’s fiber output to minimize the spot size and further improve the axial resolution. Additionally, we will explore the use of positive and negative lenses to address the challenges associated with the wide measurement range of the primary chromatic lens. Furthermore, in long-distance measurements, environmental factors are crucial for obtaining reliable and accurate data. In the present work, these factors were not studied since the experiment was carried out in a temperature-controlled environment (less than 0.5 °C temperature variation).
The current device has demonstrated its capability to measure the thickness of reflective samples, such as silicon wafers, and to track relative displacements. With proper calibration and the use of calibration curves, absolute distance measurements can also be achieved. By extending the working distance of chromatic confocal sensors from the typical range of hundreds of micrometers to tens of millimeters, as described by Berkovic et al. [1], to around 600 mm, our method broadens the scope of applications for these sensors.
Typically, confocal sensing offers the advantage of a wavelength versus distance response that is independent of the object’s properties, whether reflective, diffuse, light, or dark. However, in our current work, the long working distance introduces challenges, such as a reduced sensitivity to low-reflectivity or diffuse surfaces. Although low reflectivity can be compensated by increasing exposure times, diffuse surfaces remain difficult to measure and would be an interesting subject for future characterization.
The resolution is primarily influenced by three factors: noise in the light spectrum, meaning that a flat spectrum would ideally be preferable; spectrometer readout noise, which can be minimized by increasing the exposure times, an approach we attempted; and, most critically, the dispersion range of the main lens. These factors collectively impact the precision of the measurements. As discussed earlier, increasing the number of interpolation points between the real data reduces the positional uncertainty, as shown by the error bars in Figure 13 and Figure 14, which represent the standard deviation at each sample position. However, increasing the number of interpolation points indefinitely will not improve the sensor’s resolution further, since the factors above are physical limitations.
Another limitation is the system’s small tip/tilt tolerance of 0.38°, which could easily be addressed by incorporating a tip/tilt stage to correct the sample orientation. This was not a primary focus of our study, as we concentrated on highly reflective and flat surfaces, such as silicon wafers and mirrors. However, future experiments could explore the sensor’s performance with varying sample orientations and more complex surface types.
These advances offer significant potential for enhancing chromatic confocal sensor capabilities in various fields, where long-distance measurements with high accuracy are crucial, ranging from manufacturing to metrology. This paves the way for greater precision and reliability in measurement and imaging tasks.