1. Introduction
Recently, the scale and frequency of disasters and accidents have increased, leading to a significant rise in damages. To enhance efficient disaster preparedness and expedite damage relief, various broadcasting and communication systems develop emergency alert technologies [
1,
2,
3,
4,
5,
6]. The ATSC (Advanced Television Systems Committee) 3.0 standard is defined as the state-of-the art terrestrial digital broadcasting system specification [
7,
8] and one of the systems that offers disaster services. The ATSC 3.0 standard includes a wake-up function for emergency alerts [
7]. This feature enables devices in standby or sleep mode to recognize emergency alert signals and automatically awaken to deliver emergency messages to individuals.
To enhance spectral efficiency and robustness, the ATSC 3.0 standard employs various improved physical layer techniques, including orthogonal frequency division multiplexing (OFDM) with low-density parity-check (LDPC), layered division multiplexing (LDM), and others [
8]. Moreover, the ATSC 3.0 broadcasting standard includes the advanced emergency alert (AEA) information service, which is a pivotal element. To provide the AEA information service, the ATSC 3.0 physical layer system transmits the emergency alert wake-up signal through a bootstrap signal. In case of the disaster situation, the emergency alert wake-up signal is transmitted, and the UHD TV can be woken up and activated after receiving the wake-up signal. As a result, people can receive the emergency alert and watch the emergency broadcasting contents through the activated TV. This service creates significant new value for broadcasters and authorities, as well as viewers in disaster situation [
9].
1.1. Related Works
ATSC 3.0 utilizes a frame structure composed of three primary components: the bootstrap, preamble, and one or more subframes. The bootstrap provides a universal entry point into a broadcast waveform and has been designed to be a very robust signal and receptacle even at low signal levels [
7,
8,
9]. The bootstrap consists of four OFDM symbols. The initial OFDM symbol within the bootstrap signal is identical across all transmission frames and features repetition patterns in the time domain. Using these characteristics, signal discovery, coarse synchronization, and initial channel estimation are performed. The other symbols contain an emergency alert wake up field as well as system configuration parameters, respectively. The wake-up signal composes of two bits, with each bit located in bootstrap symbols 1 and 2, respectively. Accordingly, some intricate decoding algorithms have been proposed to ensure reliable decoding of the wake-up bits at a low signal-to-noise ratio (SNR) levels [
10,
11]. In recent times, a machine learning method based on neural networks (NN) has garnered significant attention in the field of wireless communication [
12,
13,
14,
15,
16,
17]. In particular, the references [
18,
19,
20] cover reception method based on baseband signal. Reference [
18] proposed a deep learning-based signal detection technique for the multiple-input multiple-output (MIMO) system with multipath fading channel, and the proposed technique can achieve a performance very similar to an optimal algorithm-based detection technique. Reference [
19] demonstrates the use of two-layer neural networks for pilot design and deep neural networks (DNN) for channel estimation, achieving superior performance compared to traditional estimation techniques. Furthermore, reference [
20] designed and implemented a convolutional neural network (CNN)- and DNN-based approach to model the non-linear transfer function between MIMO transmitters and receivers. Simulation results of the proposed technique show better robustness can be achieved in a condition of an imperfect channel knowledge compared with conventional algorithms. Lastly, in [
21], the DNN structure is studied for ATSC 3.0 and can detect only the first bootstrap symbol even under low SNR levels.
1.2. Motivation and Contribution
The fundamental concern of the paper is to detect the wake-up signal after the synchronization bootstrap symbol. Conventional bootstrap detection methods’ emergency alert systems often rely on a full ATSC 3.0 demodulator. This requirement limits the applicability of these methods, particularly in scenarios where a full demodulator might not be readily available (e.g., low-power devices or disaster situations). Additionally, conventional methods typically involve complex signal processing techniques like Fast Fourier Transform (FFT), frequency synchronization, and interleaving, which can be computationally expensive and decrease overall system efficiency. Therefore, a new deep learning-based wake-up signal detection method that solves the limitations identified above is needed. Although the study conducted in [
21] has demonstrated significant potential, there are several aspects that could further improve the performance of bootstrap detection. Compared to [
21], the additional research conducted can be summarized as follows:
- -
Improve DNN structure by adding four fully connected (FC) layer to learn high-dimensional complex patterns and features.
- -
Reconstruct an appropriate training dataset to achieve high detection performance in wireless channel and prevent overfitting.
- -
Optimize received complex signal data used as an input DNN structure.
In this paper, a deep learning-based wake-up signal detection method is proposed for terrestrial UHD emergency alert service based on ATSC 3.0. The proposed method comprises two functional blocks. The initial block focuses on bootstrap signal detection and offset estimation. Within this bootstrap detection block, the first bootstrap symbol is identified, and the time-offset value of the received first bootstrap symbol is estimated. The second block is the emergency wake-up signal detection block, capable of detecting and demodulating the wake-up bits using a deep learning method. Both DNN and CNN structures are considered for bootstrap detection and emergency wake-up signal detection. DNN is well-suited for tasks requiring time synchronization due to their ability to handle temporal sequences effectively. The proposed DNN structure comprises multiple layers of neurons, allowing it to learn and extract these temporal features from the received signal. CNNs are effective in spatial data processing and analysis, making them suitable for wake-up bit detection. Wake-up bits are embedded within the bootstrap signal and can be identified by examining spatial patterns within the signal. Therefore, the proposed method can be efficiently used in a serious disaster situation to announce emergency alert contents. In addition, the proposed method can detect an emergency alert signal without an ATSC 3.0 demodulator, such as a UHD TV, making it efficiently applicable to various systems. This paper is organized as follows:
Section 2 describes bootstrap generation and the structure of the ATSC 3.0 standard. The proposed deep learning-based bootstrap detection method is described in
Section 3. In
Section 4, simulation results and discussions are presented.
Section 5 concludes this paper.
2. Bootstrap Generation and Structure of ATSC 3.0 Standard
The bootstrap facilitates the provision of multiple wireless-based services via time multiplexing within a single RF channel. Serving as a low-level indicator to support these services, it allows a receiver to detect and identify the bootstrap signal. Consequently, this identification provides guidance on accessing the services available through a signal [
9]. The bootstrap serves as a universal entry point into an ATSC 3.0-based broadcast waveform, employing a fixed configuration universally recognized by all receiver devices. It carries essential information to facilitate the processing and decoding of the signal associated with a detected bootstrap. In
Figure 1, the functional block diagram of the bootstrap generator is presented. Initially, the sequence generator combines a Zadoff–Chu (ZC) sequence with a pseudo-noise (PN) sequence, which includes major and minor versions. The resulting sequence is then transformed into a time-domain sequence through Inverse Fast Fourier Transform (IFFT). After IFFT, cyclic shifts in the time domain are performed to convey information signaled via the bootstrap symbols.
Finally, the cyclically shifted sequence is transformed into two structures (CAB and BCA) as shown in
Figure 2. The initial symbol (bootstrap symbol 0: BS_0) specifically designed for synchronization detection employ the CAB variant. Subsequently, the remaining bootstrap symbols conform to the BCA variant. In the CAB structure, part B consists of the last 504 samples of part A with a frequency and phase shift applied to the originating frequency domain sequence, while part C consists of the last 520 samples of part A. In the BCA structure, part C consists of the last 520 samples of part A, but part B consists of the first 504 samples of part C with a frequency shift applied to the originating frequency domain sequence.
The complete bootstrap comprises 24 signaling bits. With the exception of the initial bootstrap symbol, each symbol contains essential signaling information. This information encompasses parameters crucial for the configuration of the ATSC 3.0 system, as well as OFDM parameters essential for preamble demodulation.
Table 1 shows the signaling information incorporated in the bootstrap, along with the corresponding allocation of bits for each piece of information.
The concatenation of these two bits results in the formation of a 2 bit value representing emergency wake-up information. Consequently, four distinct states emerge, and the presence of at least one ‘1’ in this concatenated value signifies the reception of an emergency alert message. The precise meaning of the wake-up field, as defined in [
7], is as follows.
- -
00: No active emergency message.
- -
01, 10, and 11: Rotating through these values will inform the receiver that there is either a new emergency message or that there is new and substantial information being added to an existing message.
The channel bandwidth of payloads within ATSC 3.0 systems typically occupies 6 MHz. However, the bootstrap symbols exhibit a fixed bandwidth of 4.5 MHz, centered within the RF channel. The system operates at a sampling frequency of 6.144 Msamples/s, employing an FFT size of 2048. This configuration results in a subcarrier spacing of 3 kHz, as illustrated in
Figure 3 and
Table 2.
In the UHD broadcasting signal transmission, bootstrap symbols are firstly transmitted. In the time domain, the received bootstrap signal can be written as follows:
where
h is the channel impulse response which captures the distortion and delay introduced by the transmission channel. The delay is determined by the number of taps in the filter with length (L), and finally
h is defined as
.
is
n-th bootstrap symbol where
is
k-th sample of
n-th bootstrap symbol; finally,
represents additive white Gaussian noise (AWGN) and has the same dimensions as
and
. It has the constant power spectral density (PSD) across all frequencies and the noise samples are independent of
and
.
4. Simulation Results and Discussions
This paper employs the digital terrestrial television broadcasting channel model for experiments with computer simulations, as cited in [
13,
20]. This channel named RC20 represents a scenario where the signal is exclusively received through the Rician fading with 20 reflected paths where it guarantees a stable direct path from a main transmitter. The channel model is defined as follows: the profile of the RC20 channel is depicted in
Table 5, and the received signal in the time domain through the channel is expressed by (6). The proposed deep learning-based method demonstrates the detection of the wake-up signal without compensating for these channels.
where
is the input signal and
denotes the number of echoes, which is set to 20. Additionally,
and
represent the attenuation, relative delay and phase shift from scattering of the
i-th path as listed in
Table 5, respectively. The Rician factor
K, defined as the ratio of the power of the direct path (line-of-sight ray) to that of the reflected paths, is expressed as follows:
When the Rician factor
K is set to 10 dB, the attenuation for this case is as follows:
In this simulation, the following ATSC 3.0 system parameters [
9] are considered for the 14 bits excluding the emergency alert wake up field used when generating the bootstrap symbols.
- -
The minimum time interval to the next frame: 100 ms.
- -
System bandwidth: 6 MHz.
- -
Sample rate of post-bootstrap: 6.912 MHz.
According to the aforementioned system parameters, the bootstrap signal and ATSC 3.0 payload signal is generated for performance evaluation. The generated signal is received through the fading channel and AWGN as shown in (1). Subsequently, the received signal is fed into the deep learning-based bootstrap detector. The proposed method is entirely conducted on the time domain without utilizing FFT.
For the training process, the following SNR environments are considered for the proposed deep learning-based emergency alert wake-up signal detection method:
- -
Bootstrap signal detection: SNR = [−19, −16, −13 dB].
- -
Wake-up signal detection: SNR = [−22, −19, −16 dB].
In our test, if any training SNR with near BER =
is chosen, the learning process of all considered deep learning architectures can give a rational performance. In addition, training data augmentation is performed by oversampling at four times the symbol rate and shift decimation. This approach provides a more diverse set of examples, helping the model generalize better to unseen instances and enabling it to learn more representative features, thereby leading to improved performance on real-world data. Finally, each DNN and CNN model is trained using 100,000 datasets with a learning rate of 0.005 considered at epoch 30 and a batch size of 100.
Figure 8 illustrates the bootstrap detection performance of the proposed method. To benchmark its performance, we compare it with the performance of an existing bootstrap detection technique as referenced in [
10]. The results indicate a significant improvement in performance compared to the existing iterative algorithm-based detection technique [
10]. Notably, the proposed method demonstrates robust detection of the bootstrap signal across all SNR ranges. Moreover, it is capable of detecting both the bootstrap signal and the time-offset value simultaneously.
Figure 9 illustrates the detection error rate performance of the wake-up bits, where the detection error rate signifies the errors encountered when missing all two wake-up bits. For performance evaluation, we compare our proposed method with an existing algorithm-based wake-up bit detection method as outlined in [
4]. In low SNR, the performance of our proposed method exhibits a slight decrease compared to the existing method. However, in a high SNR case, our proposed method surpasses the performance of [
4]. Notably, our proposed method demonstrates approximately a 1 dB enhancement in performance compared to the algorithm-based method when the detection error rate reaches
. We confirm that learning through various SNR and oversampled datasets yields a good wake-up signal detection. Additionally, through this performance evaluation, we can further verify that the time offset obtained from the previous bootstrap detection was well compensated. Lastly, the proposed technology applying deep learning has several advantages over existing signal processing technologies:
- -
It effectively models intricate nonlinear relationships, including channel characteristics, interference, noise, and other factors.
- -
It learns the interactions between system components, leading to more efficient optimization and improved overall system performance.
Next, this paper conducts additional experiments utilizing the ATSC 3.0-based RF signal and a commercial Software-Defined Radio (SDR) platform to assess the effectiveness of the proposed deep learning-based method in a real environment. The receiver structure of experiments in
Figure 10 were designed to validate the performance of our proposed method under realistic conditions. The DekTec DTU-315 device is used as the transmitter for generating the ATSC 3.0 RF signal. The transmitted signals were received using a commercial SDR platform named the Nooelec 820T2. This platform provides functions such as analog-to-digital conversion (ADC), down-conversion, and digital filtering. The experiments parameters are set as follows:
- -
Center frequency: 768 MHz.
- -
Channel bandwidth: 6 MHz.
- -
ADC sampling rate: 1.536 MHz.
In this setup, the Nooelec 820T2 device supports a maximum sampling rate of 2560 kHz. Therefore, we used a sampling rate of 1.536 MHz, which is one-quarter of the ATSC 3.0 sampling clock rate of 6.144 MHz. The laboratory test environment applying the above parameters is shown in
Figure 11 and
Figure 12 which shows the wake-up signal detection performance of the proposed deep learning-based method in a real environment. Through this performance validation, we have demonstrated that the proposed deep learning-based technique effectively detects the 2 bits wake-up signal in real environment disaster scenarios. However, compared to the simulated performance in
Figure 9, a performance gap of approximately 12 dB is observed. This is attributed to a loss of approximately 6.02 dB due to the ADC sampling rate and additional loss incurred by hardware implementation.
Finally,
Table 6 shows a comparison between conventional and proposed methods. The conventional correlation-based method receives input in sample units and performs synchronization using all four bootstrap signals. In contrast, the proposed method conducts bootstrap detection utilizing only the first bootstrap symbol of length 3072 as input for the DNN. The proposed method implements bootstrap demodulation in the time domain without employing channel compensation. By omitting the FFT step, this approach simplifies the procedure. Additionally, it applies CNN instead of the conventional maximum likelihood (ML) decision based on absolute cyclic shift. However, the proposed method has the limitation of acquiring the 2 bits wake-up signal, not the entire bootstrap demodulation signal.
5. Conclusions
In this paper, a deep learning-based emergency alert wake-up signal detection method is proposed for the ATSC 3.0 UHD TV system. The wake-up bits, transmitted via the bootstrap signal, serve to notify the public of emergency alert situations. Consequently, the accurate and rapid detection of these wake-up bits holds significant importance for safeguarding lives and property. The proposed method exhibits enhanced performance in detecting the bootstrap symbol and demodulating the wake-up bits. After the training process, the simplicity of the proposed method’s operations enables fast detection of emergency alert situations. Furthermore, the designed deep learning-based detector can identify and demodulate the emergency alert wake-up signal without requiring an ATSC 3.0 demodulator. In conclusion, the proposed method presents a novel approach to wake-up signal detection in ATSC 3.0 emergency alert systems, leveraging deep learning techniques. By employing DNN and CNN, we effectively extract complex patterns and relationships from the received signal. Our approach addresses limitations of traditional methods by operating directly in the time domain, eliminating the need for full demodulation and enabling efficient detection on resource-constrained devices. Furthermore, our proposed method opens up new possibilities for wake-up signal decoding, offering receivers greater flexibility in selecting decoding mechanisms. The proposed method can be implemented in set-top boxes or dedicated disaster receivers. It can also be applied to affordable SDR platforms, allowing for integration with other devices. This flexibility contributes to improved system performance and improves quick recognition of disaster situations.