Optimized Design of Direct Digital Frequency Synthesizer Based on Hermite Interpolation

Zhou, Kunpeng; Xu, Qiaoyu; Zhang, Tianle

doi:10.3390/s24196285

Open AccessArticle

Optimized Design of Direct Digital Frequency Synthesizer Based on Hermite Interpolation

by

Kunpeng Zhou

,

Qiaoyu Xu

^* and

Tianle Zhang

College of Mechanical and Electrical Engineering, Henan University of Science and Technology, Luoyang 471000, China

^*

Author to whom correspondence should be addressed.

Sensors 2024, 24(19), 6285; https://doi.org/10.3390/s24196285 (registering DOI)

Submission received: 6 August 2024 / Revised: 19 September 2024 / Accepted: 26 September 2024 / Published: 28 September 2024

(This article belongs to the Section Electronic Sensors)

Download

Browse Figures

Versions Notes

Abstract

:

To address the issue of suboptimal spectral purity in Direct Digital Frequency Synthesis (DDFS) within resource-constrained environments, this paper proposes an optimized DDFS technique based on cubic Hermite interpolation. Initially, a DDFS hardware architecture is implemented on a Field-Programmable Gate Array (FPGA); subsequently, essential interpolation parameters are extracted by combining the derivative relations of sine and cosine functions with a dual-port Read-Only Memory (ROM) structure using the cubic Hermite interpolation method to reconstruct high-fidelity target waveforms. This approach effectively mitigates spurious issues caused by amplitude quantization during the DDFS digitalization process while reducing data node storage units. Moreover, this paper introduces single-quadrant ROM compression technology to further diminish the required storage space. Experimental results indicate that, compared to traditional DDFS methods, the optimization scheme proposed in this work achieves a ROM resource compression ratio of 1792:1 and a 14-bit output Spurious-Free Dynamic Range (SFDR) of −88.134 dBc, effectively enhancing amplitude quantization precision and significantly lowering spurious levels. This significantly improves amplitude quantization precision and reduces spurious levels. The proposed scheme demonstrates notable advantages in both spectral performance and resource utilization efficiency, making it highly suitable for resource-constrained embedded systems and high-performance applications such as radar and communication systems.

Keywords:

direct digital frequency synthesis; hermite interpolation; FPGA; amplitude quantization

1. Introduction

Amid the rapid advancement of electronic device technology, the demand for high-frequency and high-precision signals continues to grow [1]. Frequency synthesis technology is widely used to generate frequency sources in electronic devices, with four main types currently in use: Analog Direct Frequency Synthesis (ADS), Phase-Locked Loop (PLL), Pulse Output DDS (PDDS), and Direct Digital Frequency Synthesis (DDFS). ADS and PLL technologies, however, rely on analog implementation, which requires numerous analog components, leading to complex circuitry, bulky size, and excessive spurious signals. While PDDS is implemented using more compact standard cells, its 1-bit output structure limits its ability to produce high frequencies and achieve a high Spurious-Free Dynamic Range (SFDR). DDFS has become a prominent research topic due to its superior frequency resolution, fast frequency switching capabilities, and wide output range. Nonetheless, the performance of DDFS, particularly its SFDR, is significantly constrained by errors caused by phase truncation due to the limited ROM size [2,3,4].

To address the spurious issues in DDFS, current research commonly employs interpolation or dithering schemes. Tang et al. [2] proposed a piecewise linear approximation method that reduces spurious signals caused by quantization errors. While this approach increases the amplitude quantization level to minimize quantization errors, it also requires more complex parameter calculations, resulting in higher ROM resource consumption. Choi et al. [3] designed a novel low-power DDFS topology that utilizes dithering techniques to improve SFDR, enhancing signal quality while reducing system power consumption. However, this method introduces additional background noise, which may impact the overall signal-to-noise ratio (SNR) performance. Several studies have also explored ROM-less DDFS designs, using various algorithms instead of ROM tables to implement the phase-to-sine mapping function. Although these approaches eliminate ROM storage and reduce chip area, they introduce high computational complexity, leading to increased power consumption and resource usage [4,5,6]. Furthermore, implementing polynomials of orders higher than three has been found impractical in terms of hardware feasibility [5]. Currently, no method can achieve high SFDR without increasing ROM resource consumption, a critical factor in DDFS design and implementation.

To address these challenges, this paper proposes a DDFS design scheme based on the cubic Hermite interpolation algorithm. The key innovations of this design include:

Utilizing the smooth characteristics of cubic Hermite interpolation to effectively address waveform spurs caused by accumulator truncation. A dual-port ROM structure is employed to obtain the parameters required for interpolation calculations.
By leveraging the derivative relationship between sine and cosine functions in the ROM table design, this approach avoids excessive ROM resource consumption. Additionally, a single-quadrant ROM compression method is introduced to further reduce ROM size.

Although this design requires significant FPGA logic resources and multipliers, their usage does not increase exponentially with bit width, unlike ROM. This scheme meets higher spectral purity requirements while reducing ROM resources, offering a promising new approach for the design of efficient DDFS systems.

2. DDFS Architectural Design

2.1. Traditional DDFS Architecture

In the traditional DDFS system architecture, which includes a phase accumulator, waveform lookup table, digital-to-analog converter (DAC), and low-pass filter (LPF) [7,8,9], as illustrated in Figure 1, the input data consists of a phase control word (

P w o r d

) and a frequency control word (

F w o r d

). With each system clock cycle,

f_{c l k}

, the phase register performs an accumulation calculation, using

F w o r d

as the increment and combining it with

P w o r d

to determine the precise phase information of the waveform at that moment [10]. In this process, the phase data must be truncated and used as the index address for the ROM lookup table. Each address in the lookup table corresponds to an approximate phase point within the 0 to

2 π

range. The lookup table maps this address information to sine amplitude values, which are then output through the DAC and LPF to generate a smooth waveform signal [11].

When the frequency register in a traditional DDFS is set to N bits and incremented by steps of

F w o r d

, the frequency register resets after every

2^{N} / F w o r d

system clock cycle. Simultaneously, the sine lookup table address also returns to its initial value after completing a cycle, allowing the entire DDFS system to output a complete sine wave [3,12,13,14,15], as shown in Figure 2. The output frequency

f_{o u t}

can be calculated as follows:

f o u t = \frac{F w o r d}{2^{N}} f c l k

(1)

In a traditional DDFS, the output signal

f o u t

is sampled at discrete intervals. The frequency spectrum

A (ω)

of this output signal

f o u t

can be expressed as follows [16]:

A (ω) = e^{- j ω T / 2} \cdot s i n c (\frac{ω T}{2}) \sum_{n = - \infty}^{+ \infty} δ [ω - (\frac{2 π n}{T} \pm ω_{o u t})]

(2)

where T is the sampling period;

s i n c (\frac{ω T}{2})

is the envelope of the spectrum;

e^{- j ω T / 2}

represents the phase shift of the output signal; and

\sum_{n = - \infty}^{+ \infty} δ (\cdot)

represents the summation of Dirac delta functions for each harmonic component. Within this spectrum, the fundamental frequency component we desire is as follows:

a_{1} (t) = s i n c (\frac{ω_{o u t} T}{2}) sin (ω_{o u t} t - \frac{ω_{o u t} T}{2}) = a_{1} sin (ω_{o u t} t - ϕ_{1})

(3)

In the DDFS system, let the phase quantization step size of the waveform lookup table be

Δ

. The quantization error due to this step size can be modeled as being uniformly distributed within the interval

[- \frac{Δ}{2}, \frac{Δ}{2}]

. The mean square error for such a uniform distribution is given by the following:

P_{quant} = \frac{Δ^{2}}{12}

(4)

Since the DDFS system samples the output signal at the clock frequency

f_{clk}

, the quantization noise power accumulates across the entire sampling bandwidth. Given that noise is generated at each sampling instance, the total quantization noise power is proportional to the clock frequency. Hence, the total quantization noise power is as follows:

P_{q} = \frac{Δ^{2}}{12} \cdot f_{clk}

(5)

Quantization noise is not uniformly distributed across the frequency spectrum; instead, it follows a specific pattern. For the sinusoidal output of the DDFS, the quantization noise spectrum, after filtering, primarily concentrates around the fundamental frequency of the signal. This distribution can be described using the sinc function. For the DDFS output signal, the quantization noise in the frequency spectrum is suppressed near the fundamental frequency by the low-pass filter, resulting in the error power

P_{q}

for traditional DDFS methods:

P_{q} = \frac{Δ^{2}}{12} \cdot f_{c l k} \cdot s i n c^{2} (\frac{ω_{o u t} T}{2})

(6)

2.2. FPGA-Based DDFS Architectural Design

To achieve high precision, stability, and rapid response, FPGAs are selected for DDFS implementation due to their parallel processing capabilities and configurable resources [17,18,19,20,21]. These features enable flexible adjustments in design parameters, such as the bit widths of the lookup table and phase accumulator. The architecture is implemented using Verilog in the Quartus tool [22], with the Register Transfer Level (RTL) as presented in Figure 3.

In traditional DDFS architectures, the lookup table ROM maps discrete phase values to D-bit wide sine wave amplitudes. Although an infinitely large D value would theoretically ensure error-free phase-to-amplitude conversion, practical FPGA implementations are constrained by ROM resources and DAC performance [4]. Consequently, the D bit width is limited, leading to amplitude errors in the lookup table and spurious components in the DDFS output spectrum [2,23].

3. Optimized Design of DDFS Architecture

To reduce spurious signals caused by quantization and to enhance the SFDR of the output signal, this study employs a cubic Hermite interpolation algorithm and a single-quadrant storage method to optimize the traditional DDFS architecture. The optimized architecture is shown in Figure 4.

3.1. Optimization of Interpolation Algorithm

In the field of signal processing, piecewise linear interpolation algorithms are commonly used due to their simplicity and ease of implementation. However, linear interpolation typically approximates the relationship between adjacent data points by merely connecting them, neglecting the overall continuity of the data distribution. This results in interpolation curves with discontinuous corners and irregular sawtooth patterns, as the model relies solely on local data for predictions, limiting its accuracy based on data density and distribution [2,24,25]. In contrast, cubic Hermite interpolation provides a smoother and more accurate approximation. It not only considers the values at the data points but also includes the first-order derivatives at these points, offering a more coherent and precise depiction of the overall trend. This method is especially effective in processing signals with high-frequency components [17,18,26]. As demonstrated in Figure 5, a comparison of the effects of piecewise linear interpolation and cubic Hermite interpolation on a sine wave signal clearly shows the superior interpolation quality of the cubic Hermite method.

To address amplitude quantization errors in DDFS systems, this paper optimizes the process using the cubic Hermite interpolation algorithm, following these main research steps:

Approximate the curve between two sampling points as a cubic polynomial;
Specify both function values and first-order derivatives at the two sampling points;
Solve for the coefficients of the polynomial;
Use the polynomial for interpolation between the two sampling points.

The cubic Hermite interpolation algorithm is detailed by Gibin et al. [18]. The expression for the cubic Hermite interpolation function

I_{h} (x)

over the interval [

x_{k}

,

x_{k + 1}

] is provided by Equation (7):

\begin{matrix} I_{h} (x) = {(\frac{x - x_{k + 1}}{x_{k} - x_{k + 1}})}^{2} (1 + 2 \frac{x - x_{k}}{x_{k + 1} - x_{k}}) f_{k} + {(\frac{x - x_{k}}{x_{k + 1} - x_{k}})}^{2} (1 + 2 \frac{x - x_{k + 1}}{x_{k} - x_{k + 1}}) f_{k + 1} \\ + {(\frac{x - x_{k + 1}}{x_{k} - x_{k + 1}})}^{2} (x - x_{k}) f_{k}^{'} + {(\frac{x - x_{k}}{x_{k + 1} - x_{k}})}^{2} (x - x_{k + 1}) f_{k + 1}^{'} \end{matrix}

(7)

where

f_{k}

and

f_{k + 1}

are nodal function values, and

f_{k}^{'}

and

f_{k + 1}^{'}

are nodal derivative values.

Direct FPGA implementation of Equation (7) requires a large number of multipliers and dividers, leading to the consumption of substantial FPGA logic units. Therefore, the equation is transformed into a form using basis functions [18], as shown in Equation (8):

H (x) = h_{0} y_{a} + h_{1} y_{b} + h_{2} m_{a} + h_{3} m_{b}

(8)

where a, b are the endpoints of the interpolation interval;

y_{a}

is the signal value at node 1,

m_{a}

is the signal derivative value at node 1; and

h_{0}

,

h_{1}

,

h_{2}

, and

h_{3}

are Hermitian basis functions.

h_{0} = 2 t^{3} - 3 t^{2} + 1

(9)

h_{1} = - 2 t^{3} + 3 t^{2} = 1 - h_{0}

(10)

h_{2} = t^{3} - 2 t^{2} + t

(11)

h_{3} = t^{3} - t^{2}

(12)

The relative position t of the interpolated point x between the two nodes is

t = \frac{x - x_{a}}{x_{b} - x_{a}}

(13)

The processed cubic Hermite interpolation algorithm allows for a reduction in the utilization of FPGA logic units, as presented in Table 1.

The error of the cubic Hermite interpolation can be expressed as follows:

e_{H} (x) = f (x) - f_{H} (x)

(14)

Using Taylor expansion,

f (x)

can be approximated as follows:

f (x) \approx f (x_{a}) + f^{'} (x_{a}) (x - x_{a}) + \frac{1}{2} f^{″} (x_{a}) {(x - x_{a})}^{2} + \dots

(15)

The error after cubic Hermite interpolation is dominated by the fourth-order derivative term, which can be further expressed as follows:

e_{H} (x) \approx \frac{f^{(4)} (x)}{4!} \cdot {(x - x_{a})}^{2} {(x - x_{b})}^{2}

(16)

The error power is the integral mean of the error squared over the entire interval:

P_{H} = \frac{1}{T} \int_{x_{a}}^{x_{b}} {| e_{H} (x) |}^{2} d x

(17)

Substituting the formula into Equation (17),

P_{H} = \frac{1}{T} \int_{x_{a}}^{x_{b}} {(\frac{f^{(4)} (x)}{4!} \cdot {(x - x_{a})}^{2} {(x - x_{b})}^{2})}^{2} d x

(18)

For a sine wave

f (x) = s i n (ω x)

, the fourth-order derivative is as follows:

f^{(4)} (x) = ω^{4} sin (ω x)

(19)

Therefore, the error power can be expressed as follows:

P_{H} = \frac{ω^{8}}{{(4!)}^{2} T} \int_{x_{a}}^{x_{b}} {(x - x_{a})}^{4} {(x - x_{b})}^{4} {sin}^{2} (ω x) d x

(20)

Comparing the error power

P_{q}

(Equation (6)) from the traditional method with the error power

P_{H}

(Equation (20)) after Hermite interpolation reveals the following:

The error in the traditional method mainly stems from linear quantization noise. As the quantization step size $Δ$ increases, the error power also increases.
The error in Hermite interpolation is due to the approximation error of higher-order derivative terms. The interpolation method reduces the quantization error, particularly in cases of higher resolution and more complex signals, making the effect of the interpolation method more pronounced.

Specifically, in cases of high resolution and complex signals, the error power

P_{H}

of the Hermite interpolation method is significantly lower than that of traditional methods, demonstrating its superiority in enhancing signal processing accuracy. By reducing quantization noise and improving interpolation precision, the Hermite interpolation method exhibits outstanding performance in high-performance signal processing applications.

3.2. Single-Quadrant ROM Table Design

According to the cubic Hermite interpolation formula, calculating the interpolation requires both the function values

f (x_{k})

at each node

x_{k}

(

k = 0, 1, \dots, n

) and their derivative values

f^{'} (x_{k})

. This implies that, within a single system clock cycle

f_{c l k}

, the optimized architecture must retrieve four values from the ROM. Conventional LUT methods utilize four ROMs to output these values within one clock cycle, significantly increasing ROM resource consumption [9]. To address this, the paper proposes a ROM table design that combines the single-quadrant storage method with the trigonometric conversion relation.

Leveraging the symmetry of sine and cosine waveforms, only the sine sampling values within the range of 0 to

π / 2

need to be stored, a technique referred to as the single-quadrant storage method [27]. Also, according to the trigonometric relation Equation (21),

f^{'} (x) = {(s i n x)}^{'} = c o s x = s i n (\frac{π}{2} - x)

(21)

It can be shown that given a phase value x, the value of the derivative of the sine function at that moment in time can be obtained by finding the phase value corresponding to the angle

π / 2 - x

in the table. To obtain the magnitude and derivative values of the two nodes simultaneously within a short clock cycle, the optimized architecture uses a dual-port ROM table structure. The ROM table structure comprises n sets of

2 \times M

-bit data, where n stands for the number of nodes. The higher M bit of the data stores the function value of

f_{k}

(

k = 0, 1, \dots, n - 1

), and the lower M bit stores the function value

f_{k + 1}

(

k = 1, 2, \dots, n

). Moreover, as per the trigonometric function relationship, the derivative value of the node read from the ROM table structure is

f_{k}^{'} = f_{m}

(where

m = n - k

). This structure allows for the efficient retrieval of the function values needed for cubic Hermite interpolation within a shorter clock cycle, significantly improving the ROM’s real-time performance and resource efficiency.

Table 2 compares resource utilization between using and not using the single-quadrant method. While the total number of logical elements slightly increases with the method, due to the overhead it introduces, memory bit usage drops significantly. This suggests that the single-quadrant method significantly optimizes memory usage with minimal impact on logic resources.

3.3. Delayed Structural Design

After the phase accumulator obtains the phase information, it is analyzed to extract the address information for the amplitude and derivatives, followed by the execution of the ROM read operation. This process requires two clock cycles: during the first cycle, address decoding is performed, converting the input address into an index for the ROM lookup table. In the second clock cycle, the ROM outputs the data stored at the corresponding index [28].

Since the ROM lookup table requires two clock cycles to read data, the interpolation calculation module will experience a delay of two clock cycles in receiving the node’s amplitude and derivative information. To compensate for this, the optimized architecture incorporates a clock delay module, which employs pipeline delay techniques by adding two delay registers. These registers postpone the node’s phase data, current phase data, and sign information by two clock cycles to maintain synchronization.

3.4. Optimized Architecture Logic Design

Based on Equation (8) and the single-quadrant storage method for ROM table design, the traditional DDFS architecture is optimized using cubic Hermite interpolation. The specific steps are as follows:

Phase calculation and adjustment.
The phase of the node is calculated using a phase accumulator. A single-quadrant storage method is employed to perform phase reversal operations and sign changes at specific phase points (1/4, 1/2, and 3/4 cycles) to generate the desired sinusoidal phase data for the cycle.
Phase data handling.
After acquiring the phase data, the key bits (determined by the number of bits in the ROM storage table) are retained, while the remaining bits are cleared to obtain the node data $x_{a}$ . The other node data $x_{b}$ are then computed using the sum of $x_{a}$ and the interval width, along with the delayed structure, to ensure synchronized input of parameters for the cubic Hermite interpolation.
ROM table structure application.
A single-quadrant storage method and a dual-port ROM table structure are utilized. Based on the interval node data, the address information for the magnitude and derivative data of the two nodes in the interval are obtained from the ROM.
Data read and shift operations.
Using the obtained address information, the magnitude and derivative data for the two nodes in the interval are read from the ROM. Since each data entry in the ROM table contains information about two nodes in an interval, a shift operation is performed after reading to separately access the data for the two nodes.
Cubic Hermite interpolation calculations.
Based on the parameters derived from the previous steps, the cubic Hermite interpolation algorithm is executed to generate the interpolated waveform data.

The DDFS optimization architecture RTL based on cubic Hermite interpolation is shown in Figure 6, and Algorithm 1 provides the pseudocode for the cubic Hermite interpolation in DDFS systems.

Algorithm 1 Cubic Hermite interpolation for DDFS.

Input: Phase accumulator, ROM //Phase increment and ROM with nodes and derivatives
Output: Output waveform value

1:: Main Function: DDFS
2:: Initialize $p h a s e i n c r e m e n t$ , ROM
3:: while system running do
4:: $p h a s e = p h a s e + p h a s e i n c r e m e n t$
5:: $a d j u s t e d_p h a s e \leftarrow$ adjustPhase( $p h a s e$ )
6:: $x_{k}, f_{k}, f_{k}^{'} \leftarrow$ ROM[ $a d j u s t e d_p h a s e$ ]
7:: $x_{k + 1}, f_{k + 1}, f_{k + 1}^{'} \leftarrow$ ROM[ $a d j u s t e d_p h a s e$ + 1]
8:: $w a v e f o r m_v a l u e \leftarrow$ $s i g n \cdot$ cubicHermiteInterpolation( $p h a s e$ , $x_{k}$ , $x_{k + 1}$ , $f_{k}$ , $f_{k + 1}$ , $f_{k}^{'}$ , $f_{k + 1}^{'}$ )
9:: output $w a v e f o r m_v a l u e$ through DAC and LPF
10:: end while
11:
12:: Function cubicHermiteInterpolation(x, $x_{k}$ , $x_{k + 1}$ , $f_{k}$ , $f_{k + 1}$ , $f_{k}^{'}$ , $f_{k + 1}^{'}$ ):
13:: $t = \frac{x - x_{k}}{x_{k + 1} - x_{k}}$ //Calculate the relative position
14:: $h_{0} = 2 t^{3} - 3 t^{2} + 1$ //Hermitian basis function
15:: $h_{1} = t^{3} - 2 t^{2} + t$
16:: $h_{2} = - 2 t^{3} + 3 t^{2}$
17:: $h_{3} = t^{3} - t^{2}$
18:: $i n t e r p o l a t e d_v a l u e = h_{0} \cdot f_{k} + h_{1} \cdot (x_{k + 1} - x_{k}) \cdot f_{k}^{'} + h_{2} \cdot f_{k + 1} + h_{3} \cdot (x_{k + 1} - x_{k}) \cdot f_{k + 1}^{'}$
19:: return $i n t e r p o l a t e d_v a l u e$
20:
21:: Function adjustPhase( $p h a s e$ ) //Adjust phase using single-quadrant storage method
22:: if $p h a s e < \frac{π}{2}$ then
23:: $s i g n = 1$
24:: $a d j u s t e d_p h a s e = p h a s e$
25:: else if $p h a s e < π$ then
26:: $s i g n = 1$
27:: $a d j u s t e d_p h a s e = π - p h a s e$
28:: else if $p h a s e < \frac{3 π}{2}$ then
29:: $s i g n = - 1$
30:: $a d j u s t e d_p h a s e = p h a s e - π$
31:: else
32:: $s i g n = - 1$
33:: $a d j u s t e d_p h a s e = 2 π - p h a s e$
34:: end if
35:: return $a d j u s t e d_p h a s e, s i g n$

4. Experiment and Analysis

To verify the effectiveness of the DDFS optimization architecture based on the cubic Hermite interpolation algorithm proposed in this paper, validation was conducted from both simulation and experimental perspectives. The parameter settings for the optimized architecture were identical to those of the traditional DDFS, as shown in Table 3.

4.1. ModelSim Software Simulation

The proposed architecture was designed on a Windows 10 operating system using Quartus II 15.0, with waveform simulations performed in ModelSim SE, and the simulation data processed in MATLAB. During the simulation code design, the frequency control word (

F w o r d

= 268,740,399) and the phase control word (

P w o r d

= 0) were initialized to achieve an output frequency of 6,257,100 Hz. The following outlines the specific steps for implementing co-simulation with ModelSim and MATLAB.

Initially, a DDFS hardware architecture optimized with the cubic Hermite interpolation algorithm was designed using Verilog. ModelSim simulation tools were then utilized to load test waveforms. The generated sinusoidal analog outputs, as shown in Figure 7, demonstrate that the optimized architecture, based on the cubic Hermite interpolation algorithm, can achieve high-quality waveform reconstruction with smooth signal transitions.

In Figure 7, current_x refers to values obtained from the phase accumulator output, while x represents processed phase information. This involves dividing one period of current_x into four equal parts and starting an inverse operation at each 1/4 period marker. In this architecture, x also serves as the address information, used for forward and reverse reading of amplitude information from the ROM lookup table. The y_symbol is the sign bit, which is set to 0 when the phase is in the third or fourth quadrant. After cubic Hermite interpolation calculations and determining whether to invert the signal based on the sign bit, the final output, y, is produced, ensuring smooth waveform output.

To analyze the spectral information of the simulated waveforms, MATLAB was used to process the waveform files. Initially, in the FPGA simulation code, the $fdisplay() function was utilized for each clock cycle to log the simulated waveform data. After running the ModelSim simulation tool, waveform data for both the optimized cubic Hermite interpolation and traditional DDFS were obtained. These files were imported into MATLAB using the fopen() function to extract the amplitude of the time-domain waveforms. The Blackman window function was applied to process the data, reducing spectral leakage. A Fast Fourier Transform (FFT) was then performed, and a compensation factor for the window function was applied to obtain frequency domain information. Finally, spurious peak search operations were conducted to determine the maximum spurious amplitude and calculate the amplitude–frequency characteristics.

The spectral analysis results of the simulated waveforms are shown in Figure 8. In Figure 8a, the amplitude–frequency curve of the traditional DDFS architecture is displayed. The main frequency amplitude reaches 77.9 dB, while the sub-maximum spurious frequency amplitude is −6.486 dB. Further calculations reveal that the SFDR of the traditional architecture is −84.0262 dBc. Figure 8b presents the amplitude–frequency curve after applying cubic Hermite interpolation optimization. The main frequency amplitude of the optimized architecture is 77.26 dB, and the sub-maximum spurious frequency amplitude is −13.38 dB.

In Figure 8a, the traditional DDFS exhibits prominent spurious components at frequencies such as 10.07 MHz and 22.58 MHz, which result from deviations in stored values caused by finite word lengths and quantization within the ROM table, introducing quantization noise and degrading spectral purity. In contrast, Figure 8b shows a reduction in spurious components around 18.78 MHz and 28.44 MHz, demonstrating that the optimized architecture effectively reduces quantization noise and suppresses spurious components. Additionally, the SFDR of the optimized DDFS system reaches −90.6438 dBc, an improvement of 6 dBc over the traditional DDFS, indicating an expanded dynamic range.

4.2. FPGA Platform Experiment

The experimental test platform comprises a DC power supply, an FPGA-based DDFS frequency synthesizer, an oscilloscope, and coaxial cables. The FPGA module used is the Altera FPGA Cyclone II EP4CE10F17C8 chip, where digital logic circuits such as the phase accumulator and cubic Hermite interpolation module were designed and implemented. The DAC module utilizes the Texas Instruments 14-bit parallel input DAC chip, DAC904, which supports an update rate exceeding 165 MSPS, ensuring optimal dynamic performance for this experiment. A low-pass filter module was employed to remove noise and interference from the DAC output waveform. The Keysight MSO-X3054A oscilloscope was selected to measure spurious suppression performance. Figure 9a illustrates the amplitude–frequency curve of the traditional DDFS method, while Figure 9b shows the curve after applying the cubic Hermite interpolation algorithm.

From Figure 9, it is calculated that the SFDR of the DDFS system optimized by the cubic Hermite interpolation is −88.134dBc, and the SFDR of the traditional DDFS is −81.961 dBc. As seen in Figure 9b, some residual spurious components remain, which can be attributed to the limitations of cubic Hermite interpolation in capturing higher-order terms. Nevertheless, compared to the traditional method, the interpolation approach significantly reduces errors. The noticeable improvement in the noise floor and the reduction in spurious signals in the spectrum further confirm this enhancement.

To evaluate the performance of the optimized scheme across different operating frequencies, several experiments were conducted. As shown in Figure 10, the traditional DDFS architecture displays fluctuating SFDR values, ranging from approximately −82 dBc to −84 dBc, indicating inconsistent spectral purity. In contrast, the linear interpolation-based scheme exhibits more stable SFDR performance, maintaining around −86 dBc. However, the cubic Hermite interpolation method achieves the best results, with SFDR values consistently around −88 dBc across the frequency range, reflecting a significant improvement in spectral purity.

The analysis also highlights some discrepancies between the FPGA platform experiment results and the ModelSim simulation. These differences are likely due to the inability of the simulation model to fully replicate the effects of actual FPGA internal signals, such as timing hazards and LUT mismatches. Moreover, parasitic parameters of FPGA pins and PCB wiring may cause signal distortion and attenuation, increasing non-harmonic components and resulting in larger measurement errors.

Figure 11 shows the phase noise characteristics of the traditional DDFS compared to the interpolation-optimized DDFS. Both designs exhibit similar phase noise behavior at lower offset frequencies, with the noise floor decreasing as the offset frequency increases. However, beyond 20 kHz, the interpolation-optimized DDFS demonstrates a clear improvement over the traditional design, particularly in the higher offset frequency range, where it maintains a lower noise floor. This result emphasizes the effectiveness of the proposed interpolation technique in minimizing phase noise.

Table 4 compares the proposed design with several recent DDFS implementations that achieve high-quality SFDR performance. In earlier DDFS studies, the highest SFDR recorded was 74 dBc using the second-order parabolic equation, the new CORDIC SFDR value is 72.2 dB. More recent studies employing impulse DDFS, while structurally simpler, achieved only 41 dBc. In contrast, the proposed optimized design attains an SFDR of 88.134 dBc, significantly outperforming other DDFS designs, whether FPGA-based or simulated.

Referring to Table 5, the resource utilization of the four methods is compared. It can be calculated that the total ROM compression ratio of the method proposed in this article is 1792:1, which is significantly higher than the previous best value of 843:1 [15]. This demonstrates that, in addition to achieving better SFDR performance, the method also achieves a superior resource compression ratio. However, it should be noted that this method also occupies a large quantity of logic resources and multipliers.

5. Conclusions

This paper outlines the fundamental principles of traditional DDFS systems, identifying key issues such as excessive ROM resource consumption and limited SFDR performance. To address these challenges, an optimized DDFS architecture based on cubic Hermite interpolation is proposed. The experimental results demonstrate the effectiveness of this approach in reducing ROM usage and improving SFDR, as it significantly mitigates spurious interference caused by amplitude quantization during the DDFS digitalization process. This presents a promising new strategy for designing high-precision frequency sources with a wide dynamic range.

While the proposed method shows clear advantages in ROM resource compression and SFDR enhancement, further improvements are possible. Specifically, optimizing the algorithm’s implementation could help reduce FPGA logic resource usage. Such refinements will broaden the applicability of this optimized DDFS scheme, making it better suited for systems that require high-performance frequency sources, such as radar and communication systems.

Author Contributions

Conceptualization, K.Z.; methodology, K.Z.; software, K.Z.; validation, K.Z.; formal analysis, K.Z.; investigation, K.Z. and Q.X; resources, K.Z.; data curation, K.Z. and T.Z.; writing—original draft preparation, K.Z.; writing—review and editing, Q.X. and K.Z.; visualization, K.Z.; supervision, Q.X.; project administration, K.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data used to support the findings of this study are available from the corresponding author upon request. The data are not publicly available due to privacy.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Horlbeck, M.; Scheiner, B.; Weigel, R.; Lurz, F. Fast rf-synthesizer based on direct digital synthesis for an instantaneous frequency measurement system. In Proceedings of the 2021 IEEE Topical Conference on Wireless Sensors and Sensor Networks (WiSNeT), San Diego, CA, USA, 17–22 January 2021; pp. 1–4. [Google Scholar]
Tang, S.; Li, C.; Hou, Y. A suppressing method for spur caused by amplitude quantization in dds. IEEE Access 2019, 7, 62344–62351. [Google Scholar] [CrossRef]
Choi, J.M.; Yoon, D.H.; Jung, D.K.; Seong, K.; Han, J.S.; Lee, W.; Baek, K.H. Design and analysis of low power and high sfdr direct digital frequency synthesizer. IEEE Access 2020, 8, 67581–67590. [Google Scholar] [CrossRef]
Wang, C.; Sulistiyanto, N.; Shih, H.Y.; Lin, Y.C.; Wang, W. Power-effective rom-less ddfs design approach with high sfdr performance. J. Signal Process. Syst. 2019, 92, 213–224. [Google Scholar] [CrossRef]
Wang, C.C.; Lou, P.Y.; Tsai, T.Y.; Shih, H.Y. 74-dBc sfdr 71-Mhz four-stage pipeline rom-less ddfs using factorized second-order parabolic equations. IEEE Trans. on Very Large Scale Integr. (VLSI) Syst. 2019, 27, 2464–2468. [Google Scholar] [CrossRef]
Beheshti, M.; Jannesari, A. A 2-ghz rom-less direct digital frequency synthesizer based on an analog sine-mapper circuit. In Proceedings of the 2016 24th Iranian Conference on Electrical Engineering (ICEE), Shiraz, Iran, 10–12 May 2016; pp. 1603–1608. [Google Scholar]
Wu, X.; Zhao, Z.; Pan, Y.P.; Jiang, M.M. Design of arbitrary waveform generator based on labview. In Proceedings of the 2020 Chinese Automation Congress (CAC), Shanghai, China, 6–8 November 2020; pp. 6382–6386. [Google Scholar]
Yang, D.; Xu, J.; Du, H. A high performance frequency synthesis method based on pll and dds. In Proceedings of the 2020 International Conference on Microwave and Millimeter Wave Technology (ICMMT), Shanghai, China, 17–20 May 2020; pp. 1–3. [Google Scholar]
Bommi, R.M.; Raja, S.S. High performance reversible direct data synthesizer for radio frequency applications. Mob. Netw. Appl. 2019, 24, 224–233. [Google Scholar] [CrossRef]
Huang, L.; Tian, S.; Liu, K.; Guo, G.; Xiao, Y.; Zhao, W.; Yang, X. The design of a wide bandwidth time marker generator. Rev. Sci. Instruments 2018, 89, 115103. [Google Scholar] [CrossRef] [PubMed]
Sharma, A.; Sun, Y.; Simpson, O. Design and implementation of a re-configurable versatile direct digital synthesis-based pulse generator. IEEE Trans. Instrum. Meas. 2021, 70, 1–14. [Google Scholar] [CrossRef]
Xiao, Y.; Chen, Y.; Liu, K.; Huang, L.; Yang, X. A sampling rate selecting algorithm for the arbitrary waveform generator. IEEE Access 2019, 7, 83761–83770. [Google Scholar] [CrossRef]
Yan, C.; Sun, J.; Liu, W. An efficient high sfdr pdds using high-pass-shaped phase dithering. IEEE Trans. Very Large Scale Integr. (VLSI) Syst. 2021, 29, 2003–2007. [Google Scholar] [CrossRef]
Hou, Y.; Li, C.; Tang, S. An accurate dds method using compound frequency tuning word and its fpga implementation. Electronics 2018, 7, 330. [Google Scholar] [CrossRef]
Jeng, S.S.; Lin, H.C.; Lin, C.H. A novel rom compression architecture for ddfs utilizing the parabolic approximation of equi-section division. IEEE Trans. Ultrason. Ferroelectr. Freq. Control 2012, 59, 2603–2612. [Google Scholar] [CrossRef] [PubMed]
Calbaza, D.E.; Savaria, Y. Jitter model of direct digital synthesis clock generators. In Proceedings of the 1999 IEEE International Symposium on Circuits and Systems (ISCAS), Orlando, FL, USA, 30 May–2 June 1999; pp. 1–4. [Google Scholar]
Das, K.; Pradhan, S.N. Field-programmable gate array-based design for real-time computation of ensemble empirical mode decomposition. Int. J. Circuit Theory Appl. 2021, 49, 2312–2328. [Google Scholar] [CrossRef]
George, G.C.; Moitra, A.; Caculo, S.; AmalinPrince, A. Efficient architecture for implementation of hermite interpolation on fpga. In Proceedings of the 2018 Conference on Design and Architectures for Signal and Image Processing (DASIP), Porto, Portugal, 10–12 October 2018; pp. 7–12. [Google Scholar]
Samila, A.; Hotra, O.; Majewski, J.A. Implementation of the configuration structure of an integrated computational core of a pulsed NQR sensor based on FPGA. Sensors 2021, 21, 6029. [Google Scholar] [CrossRef] [PubMed]
Chen, Y.; Yang, M.; Long, J.; Xu, D.; Blaabjerg, F. A dds-based wait-free phase-continuous carrier frequency modulation strategy for emi reduction in fpga-based motor drive. IEEE Trans. Power Electron. 2019, 34, 9619–9631. [Google Scholar] [CrossRef]
D’souza, A.V.; Ravi, D.J. In-phase and quadrature-phase sinusoidal signal generation using dds technique. IETE J. Res. 2021, 69, 4273–4280. [Google Scholar] [CrossRef]
Damnjanović, V.D.; Petrović, M.L.; Milovanović, V.M. A parameterizable chisel generator of numerically controlled oscillators for direct digital synthesis. In Proceedings of the 2021 24th International Symposium on Design and Diagnostics of Electronic Circuits & Systems (DDECS), Vienna, Austria, 7–9 April 2021; pp. 141–144. [Google Scholar]
Bio, M.; Gietler, H.; Plazonic, J.; Ley, M.; Zangl, H.; Scherr, W. Prototyping for a dds-based i/q reference signal generation on a capacitive sensing chip in 65nm cmos using systemc ams, c hls and vhdl. In Proceedings of the 2021 Austrochip Workshop on Microelectronics (Austrochip), Linz, Austria, 14 October 2021; pp. 37–40. [Google Scholar]
Petrinović, D.; Brezović, M. Spline-based high-accuracy piecewise-polynomial phase-to-sinusoid amplitude converters. IEEE Trans. Ultrason. Ferroelectr. Freq. Control 2011, 58, 711–729. [Google Scholar] [CrossRef] [PubMed]
Boukhtache, S.; Blaysat, B.; Grédiac, M.; Berry, F. Alternatives to bicubic interpolation considering fpga hardware resource consumption. IEEE Trans. Very Large Scale Integr. (VLSI) Syst. 2021, 29, 247–258. [Google Scholar] [CrossRef]
George, G.C.; Moitra, A.; Caculo, S.; Prince, A.A.; Buch, J.J.U.; Pathak, S.K. A novel and efficient hardware accelerator architecture for signal normalization. Circuits Syst. Signal Process. 2019, 39, 2425–2441. [Google Scholar] [CrossRef]
Wijaya, R.I.; Ros, S.; Bagus, E.S.; Dadan, M. FPGA-based I/Q chirp generator using first quadrant DDS compression for pulse compression radar. AIP Conf. Proc. 2016, 1755, 170005. [Google Scholar]
Turner, S.E.; Cali, J.D. Phase coherent frequency hopping in direct digital synthesizers and phase locked loops. IEEE Trans. Circuits Syst. I Regul. Pap. 2020, 67, 1815–1823. [Google Scholar] [CrossRef]
Annafianto, N.F.R.; Jabir, M.V.; Burenkov, I.A.; Ugurdag, H.F.; Battou, A.; Polyakov, S.V. FPGA Implementation of a Low Latencyand High SFDR Direct Digital Synthesizer for Resource-Efficient Quantum-Enhanced Communication. In Proceedings of the 2020 IEEE East-West Design & Test Symposium (EWDTS), Varna, Bulgaria, 4–7 September 2020; pp. 1–8. [Google Scholar]

Figure 1. Block diagram of traditional DDFS structure.

Figure 2. DDFS phase–waveform mapping relationships.

Figure 3. Traditional DDFS architecture RTL.

Figure 4. Optimization architecture diagram.

Figure 5. Comparison of two interpolation algorithms.

Figure 6. The optimized architecture RTL.

Figure 7. ModelSim simulation.

Figure 8. MATLAB calculation of the amplitude–frequency curve through the ModelSim simulation data. (a) represents the amplitude–frequency curve of the traditional DDFS, and (b) represents the amplitude–frequency curve after the cubic Hermite interpolation optimization.

Figure 9. Measured oscilloscope output results; (a) is the traditional DDFS, and (b) is the method proposed in this paper.

Figure 10. SFDR performance of different output frequencies.

Figure 11. Single sideband phase noise.

Table 1. Resource utilization.

Logic Unit	Equation (7)	Optimized
Adder	19	11
Multiplier	14	11
Defaulter	6	1

Table 2. Comparison of resource utilization.

	Total Logic Elements	Total Memory Bits
Using the single-quadrant method	1394	128
Single-quadrant method not used	1326	507

Table 3. Parameter settings.

Parameter Name	Set Value
$f_{c l k}$	100 MHz
$F w o r d$	32 bit
$P w o r d$	14 bit
DAC	14 bit

Table 4. SFDR performance comparison.

	[5] TVLSI	[4] JSPS	[3] ACCESS	[29] EWDTS	[13] TVLSI	This Work
Year	2018	2019	2020	2020	2021	2023
Process (nm CMOS)	180	FPGA	65	FPGA	45	FPGA
$F_{word}$ (bits)	32	32	32	32	25	32
Output resolution (bits)	24	24	9	16	25	14
Clock rate (MHz)	71.9	100	2000	251	2000	100
Verification	Mears.	Mears.	Sim.	Mears.	Sim.	Mears.
SFDR (dBc)	74	68.4242	70.8	72.2	41	88.134

Sim. = Simulation, Meas. = Measurement

Table 5. Resource utilization comparison of four methods.

Resource Utilization Rate	Traditional Method	Piecewise Linear Approximation Method [2]	Pulse Width Modulation Method [11]	Proposed Method
Device	EP4CE10F17C8	EP1C12Q24017	EP4CE55F23C6	EP4CE10F17C8
Total logic elements	94/10,320 (<1%)	1104/12,060 (9%)	36,011/55,856 (64%)	1394/10,320 (14%)
Total memory bits	229,376/423,936 (54%)	180,275/23,9616 (75%)	1,286,144/2,396,160 (54%)	128/423,936 (<1%)
Total pins	19/180 (11%)	19/173(11%)	181/325 (56%)	19/180 (11%)
Embedded multiplier 9-bit elements	0/46 (0%)	N/A	184/308 (60%)	46/46 (100%)
Total PLLs	1/2 (50%)	1/2 (50%)	2/4 (50%)	1/2 (50%)

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Zhou, K.; Xu, Q.; Zhang, T. Optimized Design of Direct Digital Frequency Synthesizer Based on Hermite Interpolation. Sensors 2024, 24, 6285. https://doi.org/10.3390/s24196285

AMA Style

Zhou K, Xu Q, Zhang T. Optimized Design of Direct Digital Frequency Synthesizer Based on Hermite Interpolation. Sensors. 2024; 24(19):6285. https://doi.org/10.3390/s24196285

Chicago/Turabian Style

Zhou, Kunpeng, Qiaoyu Xu, and Tianle Zhang. 2024. "Optimized Design of Direct Digital Frequency Synthesizer Based on Hermite Interpolation" Sensors 24, no. 19: 6285. https://doi.org/10.3390/s24196285

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Optimized Design of Direct Digital Frequency Synthesizer Based on Hermite Interpolation

Abstract

1. Introduction

2. DDFS Architectural Design

2.1. Traditional DDFS Architecture

2.2. FPGA-Based DDFS Architectural Design

3. Optimized Design of DDFS Architecture

3.1. Optimization of Interpolation Algorithm

3.2. Single-Quadrant ROM Table Design

3.3. Delayed Structural Design

3.4. Optimized Architecture Logic Design

4. Experiment and Analysis

4.1. ModelSim Software Simulation

4.2. FPGA Platform Experiment

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI