1. Introduction
Generally, synchronous data transmission systems send only the data signal to reduce the bandwidth and power consumption. The receiver has to implement a CDR circuit to perform both the extraction of the transmitted data sequence from the noisy received signal and the recovery of the associated clock timing information [
1]. The data must be retimed to remove the jitter accumulated during transmission. The generated clock must have a frequency equal to the data rate and a low jitter because it is the main contributor to the retimed data jitter. To have an optimum sampling of the bits, the rising edges of the restored clock must coincide with the midpoint of each bit. In this case, the sampling occurs farthest from the previous and following data transitions, providing a maximum margin for jitter [
2].
In the field of CDR architectures, closed-loop systems, such as Phase Locked Loop (PLL) and Delay Locked Loop (DLL), excel in providing low-jitter performance, making them the preferred choice in many applications [
1,
2,
3,
4,
5,
6]. However, open-loop CDR structures offer distinct advantages, particularly in terms of low power consumption, fast locking time, and simplicity of design, often resulting in cost-effective solutions [
7]. The choice between closed-loop and open-loop CDRs ultimately depends on the specific requirements of the application. Closed-loop systems offer precision and stability but with higher complexity and power consumption. Conversely, open-loop CDRs provide an alternative, particularly suitable for burst-mode communication where speed and efficiency are required.
A 28 nm FDSOI technology allows us to create a complementary inverter by using the back-gate of the transistor to symmetrize its outputs [
3]. Complementary inverters allow us to implement back-gate auto-biasing feedback and to realize a QRO with an even number of inverters. By injecting a signal into this circuit, we create an ILO with low jitter [
4]. Thanks to the good performance of this oscillator, we can propose a low-cost and low-power ILCDR with an improved jitter and a fast-locking time for burst-mode applications.
This paper is organized as follows:
Section 2 introduces and compares the various types of open-loop CDR.
Section 3 presents the circuit implementation of the injection complementary QRO based on the UTBB-FDSOI technology. The proposed ILCDR architecture and the simulation results are described in
Section 4, followed by a conclusion of this work in
Section 5.
2. CDR Architectures
CDR can be classified into two major categories. The first structures are using feedback phase tracking like Phase Locked Loop (PLL) or Delay Locked Loop (DLL). These closed-loop architectures are the most used essentially for their low jitter [
1,
2,
5,
6]. The second ones are open loop structures with low power, fast locking time, and simple and low-cost design [
7].
Open-loop CDR structures aim at local area networks, passive optical networks, and Serializer/Deserializer applications in which jitter accumulation is not a major problem but need a burst-mode operation to extract a synchronous clock and recover the received data quickly for each asynchronous packet [
1,
7,
8,
9]. An open-loop CDR is usually based on a gated oscillator, a high-Q bandpass filter, or an ILO circuit [
7] as shown in
Figure 1.
Driven by the recovered clock, the D Flip-Flop (DFF) retimes the incoming data. The variable delay buffer provides an adjustable delay of the input data to align it with the clock edge (the best time is the midpoint of each bit). It can be controlled externally to correct the Process, Voltage, and Temperature (PVT) variations. The edge detector is generally an XOR gate that will generate pulses at the input data bit frequency that commands the oscillator.
The high-Q filter-based CDR, such as an LC filter, directly samples the clock but its integration on a chip is difficult and there is no input jitter rejection [
1,
7]. The gated oscillator circuit aligns this phase with every input data transition, so it also has no input jitter rejection and a strong sensitivity to PVT variations [
1].
We propose the implementation of an ILCDR to achieve a rapid acquisition time and robust jitter tolerance. Notably, this type of CDR exhibits an input-jitter-filtering nature, distinguishing it from CDRs relying on the high-Q filter or gated oscillator [
7,
10]. Additionally, injecting the output of the edge detector into the ILO forces it to lock onto this frequency. This allows for some process variation in the oscillator design, thereby enhancing PVT variation tolerance. Certainly, injecting a periodic signal into an oscillator leads to pulling or locking phenomena [
11]. Injection locking occurs when an oscillation source is shifted by an interference signal with a frequency very close to its own. In this case, the output frequency will be locked to the frequency of the interference signal instead of its own free-running frequency [
4]. The principle of a simple injection structure is shown in
Figure 2 [
4].
A three-stage ring oscillator oscillates at the frequency f
0. The introduction of a phase shift ϕ
0 in this loop modifies the oscillation frequency because the system phase deviates by a value equal to ϕ
0.
Figure 2b shows the change in the oscillation frequency from f
0 to f1 induced by the effect of ϕ
0. Assuming that ϕ
0 is generated by an injection voltage V
inj, the system then oscillates at the injection frequency f
inj [
4].
The ILCDR structure may include a closed-loop, typically designed with a PLL, to conduct the input data frequency tracking and adjust the free-running frequency of the ILO, as illustrated in
Figure 3.
In case of a long sequence of 0 or 1, the input frequency tracking system improves the BER of the ILCDR by reducing the difference between the free-running frequency and the input data frequency of the ILO [
7]. Adding a PLL increases the cost, complexity, power consumption, and die area of the CDR. In this study, our proposition aims to enhance the jitter of the standard architecture of the open-loop ILCDR without utilizing a PLL, thus maintaining a low-cost and low-power design.
4. Proposed ILCDR
4.1. Timing Analysis of the Proposed ILCDR
Figure 12 depicts a simplified block diagram of the injection CDR. The input signal is a PRBS7 at 868 Mbps [
4]. After the pulse generator, a pseudo-random pulse signal is obtained at the rate multiplied by 2, which is 1.736 Gbps. After injecting the pulse signal into the complementary RO, it outputs a periodic signal with a frequency of 868 MHz, which is the recovered clock signal. The recovered data signal is the output signal with a rate of 868 Mbps after sampling.
Equation (1) indicates the behavior of the injection current pulse:
where γ denotes the pulse width and t denotes the period of the injected pulse [
20]. Experimental results on the 8 quadrature-phase output signals of the complementary RO show that the maximum value of γ needs to be less than 0.5 T to achieve an acceptable jitter. Indeed, injection intensity is related to the duty cycle of the injection signal, which affects the injection performance. The best pulse signal width obtained to minimize the jitter of the ILCDR system is equal to a quarter-bit period (T
b/4) as shown in
Figure 13 [
4].
4.2. Architecture
Figure 14 illustrates the architecture of the proposed full-rate ILCDR, which includes a pulse signal generation block consisting of a DFF and an XNOR gate, an injection block, and a complementary RO block [
4]. The incoming data, NRZ code, whose power spectral density is zero at the frequency component of the clock or its multiples, the clock signal cannot be extracted. By passing the NRZ data through a DFF and an exclusive-OR gate, RZ data can be obtained from which the clock frequency components may be extracted [
21]. Injecting the pulse signal M
inj into a complementary ring oscillator outputs eight periodic signals with the same rate and different phases. One of these 8-phase signals is selected as the clock signal which contains a variable phase, so a variable delay consisting of several inverters is no longer required, which could simplify the circuit structure and thus reduce the power consumption and the chip size of the ILCDR. One of the 8 phases is manually selected in the simulation stage and is sent to the DFF. A binary selector and an 8-to-1 MUX will be used to select the phase for the manufacturing stage. The principle of manual selection is to obtain smaller jitter. Indeed, the output signal makes the injection pulse width match T
b/4 minimizing the jitter of the CDR circuit. The input data is XNORed with the output signal of the DFF to obtain a pulse signal with 2 times the bit rate. The recovery data D
out is obtained after the decision circuit DFF is clocked by the recovered clock CK
OUT.
In general, the QRO takes advantage of the back-gate control structure that reduces the error in each transition. Random jitter (thermal noise, flicker noise, etc.) can be corrected periodically using injection techniques. Furthermore, a DFF was used to implement the functions of pulse signal generation and recovery data sampling simultaneously.
4.3. Schematic Simulation Results
We utilized the 28 nm UTBB-FDSOI transistor model of STMicroelectronics. It is important to note that the V
DD in this environment is set to 1 V. We modulated V
DD to control the oscillation frequency, as depicted in
Figure 10. With a supply voltage of 0.5 V, the proposed ILCDR achieves a wide operation range of 868 Mbps. The dimensions of the RO transistors are 4.1 µm/30 nm for the PMOS and 2 µm/60 nm for the NMOS.
Figure 15 illustrates the results of transient simulations performed on the complementary ILCDR, where D
in is the input data, RO
out is the output of the injected complementary ring oscillator, CK is the recovered clock signal and D
out is the recovered data [
4]. Simultaneously,
Figure 16 shows the variation curve of frequency during the locking process. The figure illustrates that when the input data is PRBS7 random data with an 868 Mbps data rate, the locking time of the ILCDR is about 5 ns, and the clock frequency stabilizes around 868 MHz after locking [
4].
Based on over 10,000 cycles of transient simulations,
Figure 17 shows the jitter of the recovered clock and recovered data for the PRBS7 input data [
4]. The jitter is 7.4 ps (0.6%UI) of the clock and 2.3 ps (0.2%UI) of the PRBS7 data. The locking range is 860–1100 Mbps without errors of the input PRBS7, which is 28% of 868 Mbps. After the level shifter, the output signal reaches 1 V.
Figure 18 presents the phase noise when a periodic injection signal is introduced. The phase noise for the injection circuit is −122 dBc/Hz@1 MHz [
4].
It is obtained from transient simulation that when the input data is a random code of PRBS7 at 868 Mbps, the power consumption is 235 μW when the CDR circuit is locked. The percentage of power consumption of each part of the circuit is shown in
Figure 19, where the RO consumes 49 μW and the other modules consume 186 μW. The power efficiency of CDR is:
In our CDR circuit, the frequency of the RO is directly controlled by V
DD. Therefore, our focus will be on process and temperature variations.
Table 1 shows the output frequency of the CDR for the 868 Mbps PRBS7 input data in different cases, including the fastest case (FF corner with 125 °C) and the slowest case (SS with corner −40 °C) [
22].
Based on the results in
Table 1, it can be observed that temperature has a more significant impact on the structure for an input rate of 868 Mbps, leading to harmonic injection in both the fastest and slowest cases. In the fastest case, the output frequency is close to 2 × 868 MHz, resulting in 1.736 GHz. Conversely, in the slowest case, the output frequency is close to 868 MHz/2, resulting in 434 MHz.
A Monte Carlo, incorporating random statistical variation, was conducted on the ILCDR to analyze the resulting clock frequency, involving 500 data points. The histogram plot in
Figure 20 illustrates the approximate probability distribution [
4].
Despite the mean of the output clock frequency is 865 MHz (i.e., 864.8 MHz), a significant concentration of data points is observed near 868 MHz. The standard deviation is calculated to be 9.6 MHz. The MC simulation reveals that the variability attributed to the manufacturing process on the oscillation frequency remains low, approximately 3σ/Mean = 27/868 = 3.1% maximum relative deviation.
4.4. Layout Design and Post-Layout Simulation
Figure 21 illustrates the structural arrangement of the complementary inverter layout topology [
4]. In this configuration, a symmetrical layout of four transistors and their corresponding ports is displayed, taking advantage of the balanced arrangement between the various circuit elements. This symmetry serves a dual purpose: firstly, it helps to simplify the complex circuit layout and secondly, it has a key role in enhancing the overall stability of the system.
Figure 22 illustrates the configuration of the complementary inverter, detailing the precise placement and interconnections of the transistors and ports, with a layout area of approximately 8 × 8 μm
2 [
4].
Figure 23 graphically illustrates the layout configuration of a complementary RO [
4]. In this case, the fundamental objective is to systematically organize the four complementary inverters into a symmetrical layout. The symmetrical arrangement is designed to minimize the variations in parasitic effects between each complementary inverter, ensuring uniform rise and fall times. This enhances both the signal integrity and overall performance stability of the complementary ring oscillator.
Figure 24 provides a comprehensive view of the layout design of the complementary RO, including the start-up circuit [
4].
First, the schematic and layout of the complementary inverter were simulated for comparative analysis. The transient simulation results are shown in
Figure 25 [
4]. The results of the schematic simulation and the post-layout simulation are consistent to a large extent, with the main difference being that the rise time and the fall time of the post-layout simulation are both slightly extended.
Based on this framework, we performed a similar comparative analysis for the simulation of a complementary RO. In the schematic simulation, the oscillator operates at a frequency of 868 MHz with a supply voltage of 0.5 V. However, in post-layout simulations, the oscillation frequency decreases to 285 MHz with a supply voltage of 0.5 V. This reduction can be attributed to resistance effects and parasitic capacitance.
Figure 26 depicts the transient simulation results.
To compensate for resistance effects and parasitic capacitance in the layout, we have adjusted the VDD from 0.5 V to a higher level of 0.63 V based on the simulation data results. The aim was to achieve 868 MHz ILCDR operational functionality to compare with the schematic simulation result. This has been shown to help reduce the effects of parasitic components, thus contributing to the expected performance of the ILCDR. The adjusted ILCDR has a locking time of approximately 5 ns and the clock frequency is stabilized around the 868 MHz target frequency.
After confirming the proper functionality of the CDR circuit, we turned our focus to assessing the jitter performance of both the clock and the output data using eye diagram analysis.
Figure 27 presents these findings, indicating a clock jitter of around 26.7 ps (2.3% UI) and a recovered data jitter of approximately 11.9 ps (1% UI) [
4]. Moreover, when exposed to a periodic input signal, this CDR demonstrates a phase noise of −118 dBc/Hz@1 MHz. The ILCDR consumes 318 μW, and the power efficiency is 0.318/0.868 = 0.37 pJ/bit.
Table 2 offers a performance comparison between the schematic and post-layout simulations. The post-layout results reveal an increase in phase noise and peak-to-peak jitter compared to the schematic outcomes. This difference is attributed to the impact of parasitic resistance and parasitic capacitance associated with the metal interconnects on the RO’s performance. Furthermore, the elevation of V
DD to achieve the target frequency of 868 MHz leads to higher power consumption in post-layout simulation.
Table 3 enumerates the schematic and post-layout simulation outcomes for the complementary ILCDR under various processes and temperatures. Notably, the post-layout simulation results exhibit relatively greater stability during the process and temperature variations compared to the schematic because the V
DD is increased to 0.63 V in the post-layout (0.5 V in the schematic). Indeed, in our CDR circuit, the frequency of RO is directly controlled by V
DD. To improve frequency stability with respect to temperature, we are considering implementing the solution proposed by [
23], which is simple and low power.
5. Conclusions and State-of-the-Art Comparisons
A performance comparison between some recently published CDRs and our work is presented in
Table 4.
Thanks to FDSOI technology, we have proposed to implement a novel cross-coupled back-gate technique to improve analog and mixed-signal cells and to decrease the surface of the integrated circuit. Thanks to this technique, we implemented a QRO based on complementary logic and inverters. This type of QRO is very efficient for low-power and low-frequency applications. Based on the back-gate structure, the proposed ILCDR significantly simplifies the design and, therefore, greatly reduces power consumption and surface. The ILCDR extracts an 868 MHz clock signal for the same bit rate as random input data. Featuring back-gate and injection technology, it exhibits a desirable jitter performance.
Subsequently, we conduct a post-simulation analysis of the ILCDR. Despite the oscillation frequency being reduced to one-third of the schematic level due to resistance effects and parasitic capacitance in the post-simulation, we successfully achieved the expected frequency of the CDR by increasing the V
DD. In the post-layout simulations at 868 Mhz, the peak-to-peak jitter is 26.7 ps (2.3% UIp-p) and 11.9 ps (1% UIp-p) for the recovered clock and data signal, respectively. The power consumption for the ILCDR is 318μW and the power efficiency of 0.37 pJ/bit is relatively good. The power consumption rises by 35%, and the phase noise increases by 4 dB compared to the schematic simulation results. To provide a fairer comparison in
Table 4, we also conducted measurements in the post-layout simulation at 2.4 Ghz for wireless communications using IoT protocols by increasing V
DD to 0.7 V. In this case, the peak-to-peak jitter is 10.9 ps (2.6% UIp-p) for the recovered clock signal and 5.8 ps (1.4% UIp-p) for the data signal. The power consumption for the ILCDR increases to 897 μW, while its power efficiency remains at 0.37 pJ/bit. The ILCDR maintains its attributes of low jitter, low power, small surface area, and substantial energy efficiency. Our next step is to add a circuit [
23] to improve the stability of the frequency as a function of temperature.