1. Introduction
Precipitation is an important research area in many fields of natural science [
1,
2,
3]. In the water cycle, precipitation is the main way that water is transferred from the atmosphere to the ground [
4]. Precipitation changes the salinity and temperature distribution of the upper ocean, further affecting the buoyancy of the ocean [
5]. The erosion caused by precipitation is closely related to the evolution of landforms [
6]. Therefore, the question of how to accurately monitor precipitation has always been a research focus. In recent years, researchers have been working on the improvement of precipitation-detection technology and have made considerable progress, e.g., the optical disdrometer [
7,
8,
9], dual-polarized radar [
10,
11], spaceborne dual-frequency radar [
12], etc. However, due to the complexity of and variability within the precipitation phenomenon itself, there are still obstacles to the accurate measurement of precipitation. Almost all the existing instruments and methods can only be said to have a good detection capability for certain types of hydrometeors, and it is difficult to fully consider the complexity of precipitation. To further improve the ability of precipitation monitoring under the condition of existing technologies, a feasible method is to identify hydrometeor types before making quantitative precipitation estimates. In addition, the identification of hydrometeor type is also of great significance for microwave communication. For example, when there is a melting layer in the air or melting snow near the ground, microwave signals will be seriously attenuated [
13]. At such times, communication devices with an automatic transmission power control (ATPC) function will increase the transmission power to ensure normal communication.
A considerable amount of literature has been published on hydrometeor classification. There are two main ways to do this. One way is by in situ measurement. For example, Yuter et al. [
14] separated the hydrometeors with the coexistence of rain and snow according to the velocity–size relationship measured by a Parsivel disdrometer. Praz et al. [
15] developed an algorithm based on the geometric and texture features of hydrometeor images to identify six hydrometeor types and evaluated the degree of riming by multi-angle snowflake camera (MASC). The second way is microwave remote sensing. According to the polarization and attenuation characteristics under backscatter, polarimetric parameters (such as the reflectivity (
ZH), differential reflectivity (
ZDR), specific differential phase (
KDP), linear depolarization ratio (LDR), and copolar correlation coefficient (
ρHV)) of dual-polarized radar are used to identify hydrometeor types [
16,
17]. Hassan et al. [
18] proposed a new fuzzy logic hydrometeor classification scheme applied to the French X-, C-, and S-Band polarimetric radars. Marzano et al. [
19] estimated the type of hydrometeor from C-band dual-polarized radar by a Bayesian approach. Such methods can detect a wide range of precipitation, but the temporal and spatial resolution is limited, and it is difficult to represent near-surface precipitation due to the existence of radar elevation. Therefore, it is still a tremendous challenge to obtain information relating to hydrometeor types near the ground with a high spatial and temporal resolution.
In recent years, due to the low cost, wide distribution, representation of the real surface conditions, and other factors, the observation of precipitation via a terrestrial microwave link has gradually become an attractive research direction [
3]. The basic principle is to retrieve the precipitation information according to the microwave attenuation caused by precipitation. The inversion of precipitation intensity [
20] and the reconstruction of precipitation field [
21] are the main areas that have been studied. In addition, this technique can be used as a supplement to weather radar measurements [
22,
23]. Furthermore, a few studies have begun to focus on dual-frequency [
24] or dual-polarization [
25] to inverse the hydrometeor size distribution (HSD). In fact, the attenuation of a multi-frequency dual-polarization microwave link also contains the information of hydrometeor type, which can be used to distinguish hydrometeor type. However, there have been few studies on the classification of hydrometeor based on microwave link. Holt et al. [
26] studied the variations in the differential phase and attenuation of several precipitation processes at 12.8 GHz and 17.6 GHz, and indicated the potential of the differential phase for use in the recognition of snow and sleet. Cherkassky et al. [
27] proposed a method to distinguish pure rain and sleet based on physical characteristics by using the receiving signal level of a commercial microwave link, which was tested with three links (18.36 GHz (vertical polarization), 19.37 GHz (horizontal and vertical polarization)). However, it only considered three hydrometeor types, pure rain, snow, and a mixture, and the data volume was small.
Unlike weather radar with several fixed frequencies (such as, S-, C-, or X-band), microwave links have multiple frequencies, ranging from several GHz to dozens of GHz. With the development of communication technology, even W-band (92–114.5 GHz) and D-band (141–174.8 GHz) microwave backhaul links may be used in the future, which will greatly stimulate the potential of microwave-link precipitation measurement. Compared with the classification of hydrometeor type based on dual-polarized radar that is widely used, the classification of hydrometeor type based on microwave link has the following advantages:
The detection object reflects the real precipitation situation near the ground;
Higher spatial and temporal resolution;
It can be widely distributed in remote mountainous areas;
It is low-cost, being based on commercial microwave communication equipment.
In order to explore the method of hydrometeor classification based on microwave links and make constructive suggestions for its practical application in the next step, this paper proposes a new method of hydrometeor-type identification via multiple-frequency microwave links. According to the frequency band of the typical microwave link, the identification results of hydrometeor types using microwave links with different frequency bands under different rain-cell conditions were investigated via numerical simulation. The major contributions of this paper are as follows: (1) A method to distinguish hydrometeor types based on the dual-polarization information of microwave links is proposed, and simulation experiments were carried out at single-frequency, dual-frequency and tri-frequency; (2) some criteria for frequency selection are suggested for future actual experimentation.
This paper is organized as follows: the next section analyzes the differences in the microphysical characteristics and the attenuation value distribution for different types of hydrometeors.
Section 3 details the hydrometeor classification method and the microwave-link simulation experiment, while the performance of the various classification models and the influence of relative position and relative length of precipitation cell on it are analyzed in
Section 4. In addition, several key issues are discussed in
Section 5. Finally, the conclusion and summary are stated.
4. Experimental Results
Combinations of one, two, or three of the frequencies 15 GHz, 18 GHz, 25 GHz, 38 GHz, 50 GHz, 60 GHz, 70 GHz, and 80 GHz were used, and the horizontal and vertically polarized hydrometeor attenuation rates of each frequency were used as feature variables.
Figure 5 shows the scatter distributions of horizontal- and vertical-polarization attenuation rates for different types of hydrometeors at each frequency. As seen from the figure, compared with other types of hydrometeors, the attenuation difference between horizontal and vertical polarizations of graupel was more obvious. In addition, as the frequency increased, the dispersion of points became wider.
It was simply assumed that the length of all links was 1 km to facilitate the comparison of the effects of different frequencies on the performance of the classification model. However, in order to be closer to an actual situation, longer links were also simulated, and the differences in the relative length and relative position between them and the precipitation cell were also taken into account.
The accuracy was used to evaluate the performance of model. Assume that XY is the number of samples of hydrometeor type X that is classified as hydrometeor type Y, and rain, graupel, and wet snow are represented by A, B and C, respectively. The definition of accuracy was
The numbers of rain, graupel, and wet snow samples were 11,102, 7489, and 5984, respectively.
4.1. Single-Frequency Models
Figure 6a shows the test set accuracy of eight single-frequency models, and the model numbers are shown in
Table 2. It can be seen that, for ELM, the accuracy of the model increased with the frequency, for which a possible reason is that low-frequency signals tend to be drowned out by noise. The model accuracies for frequencies of 50 GHz, 60 GHz, 70 GHz, and 80 GHz were all over 80% (80.2%, 81.2%, 82.6%, and 83.0%, respectively) and the mean test set accuracy of the single-frequency models was 75.8% for ELM. It is worth noting that the accuracy of the single-frequency model at high frequencies (especially at 80 GHz) could reach the same performance as the dual-frequency and tri-frequency models (as shown in
Section 4.2 and
Section 4.3). Therefore, if only a single-frequency microwave link is actually available, then the highest possible frequency should be selected. In addition, the results showed that the variation trend of different algorithms with model number was basically the same, but ELM was better than DT and then PNN on the whole (the difference in accuracy between the three algorithms was about 5–15% for the same model number). This indicated that the influence of different frequency combinations (model number) on the accuracy of classification results was relatively stable rather than random, and the overall performance can be improved through the improvement of algorithms (this was true not only for single-frequency models, but also for dual-frequency and tri-frequency models, as shown in
Figure 6b,c. To avoid misunderstanding, all performance analyses in the following article refer to the ELM algorithm by default.
4.2. Dual-Frequency Models
Figure 6b presents the test set accuracies obtained from dual-frequency models. There were a total of 28 dual-frequency models, which were obtained by pairing the eight frequencies. The correspondence between frequency combinations and model numbers is shown in
Table 3. Overall, for ELM, the accuracies of the dual-frequency models were higher than those of the single-frequency models (the mean and maximum accuracies of dual-frequency models were 80.7% and 84.4%, respectively). More than two-thirds of the models had an accuracy that was than 80% for ELM. In addition, the accuracy fluctuated obviously with the change in the model number. It is interesting to note that when one frequency was fixed while the other frequency was increasing, the accuracy of the model was constantly improved (e.g., from Model 1 to Model 7, Model 8 to Model 13, Model 14 to Model 18, and Model 19 to Model 22). Furthermore, the accuracy of the test set increased with the overall frequency (for example, the overall accuracy of Models 19–22 was higher than that of Models 14–18, which was higher than that of Models 8–13, which was finally higher than that of Models 1–7). This trend indicates that when a dual-frequency microwave link is used for hydrometeor type identification, the two frequencies should be selected with as much difference as possible or with as high an overall value as possible.
4.3. Tri-Frequency Models
It was found above that the performance of the dual-frequency models was improved compared with that of the single-frequency models. Therefore, this section discusses whether the accuracy of the tri-frequency models can be further improved. The test set accuracies of 56 tri-frequency models are shown in
Figure 6c, and the corresponding frequency combinations are shown in
Table 4. As shown in the figure, except for Models 1, 2, 7, and 22, the accuracies of the test sets of most models were above 80%, and the mean accuracy of all tri-frequency models was 83.2% for ELM. The accuracy reached a maximum of 85.6% at Model 52 for ELM. Similar to the dual-frequency models, the accuracies of the tri-frequency models fluctuate with the model number. However, compared with the dual-frequency models, the fluctuation range is relatively small. Except for a few irregular cases, the accuracy still increases with the overall frequency or frequency difference.
4.4. Precipitation Cell
Although in many cases we assumed that the precipitation field was evenly distributed within the detection range, this assumption obviously introduced great uncertainty relative to the scale of the microwave link. The typical size of a precipitation cell is 5–10 km [
50]. For high-frequency links (for E-band, the link length may be less than 1 km, about 1/10 of the size), the length of precipitation cells may need less consideration. For the low-frequency links (where the link length may be more than ten kilometers or even twenty kilometers long), it is necessary to discuss the precipitation cell length.
A well-known precipitation cell model is the EXCELL model [
51], which assumes an exponential precipitation rate profile. Similarly, assuming an exponential distribution of surface precipitation rates,
where
S1 (mm h
−1) is the peak precipitation rate,
x (km) is the position relative to the origin,
xcell (km) is the peak position of precipitation cell, and
c is the parameter associated with the effective scope of the cell. For the sake of discussion, the coordinate system shown in
Figure 7 was defined. In order to control variables, the length of the precipitation cell was assumed to be 10 km, and only the influence of the location of the link relative to the precipitation cell and the length of the link on the classification model are analyzed below. Here,
c was set to 0.5 (precipitation rate from the center of the cell to the boundary will be reduced to about 10%). Precipitation intensity
S0 calculated from HSD data (according to Equation (9)) was used to determine the value of
S1. Assuming that
S0 represents the average precipitation rate of cell,
Thus, the value of
S1 can be obtained:
for
xcell = 10 km,
S1 = 2.72 mm h
−1. In addition, we used the HSD data obtained by disdrometer to fit the relationship between the Gamma parameters of different types of precipitation and the precipitation rate (see
Table A2 in the
Appendix A). Thus, according to an HSD sample, the HSD distribution of a precipitation cell can be obtained. Further, the locations of the link relative to the precipitation cell in three cases are shown in
Figure 7. For Case 1 (0 <
xt <
xr <
xcell,
xt and
xr represent the locations of the link transmitter and receiver, respectively), the link was completely within the coverage of the precipitation cell. For Case 2 (0 <
xt <
xcell <
xr), one end of the link was outside the precipitation cell and the other was inside. For Case 3 (
xt <0 <
xcell <
xr), both ends of the link were outside the precipitation cell. For each case, specific locations and lengths of different links were simulated and similar classification models ware established as above.
For Case 1, we assumed that the length of the link was 6 km, and the four location conditions relative to the precipitation cell were simulated ((
xt,
xr) was equal to (0, 6), (1, 7), (2, 8) and (3, 9), respectively, corresponding to getting closer and closer to the center of the precipitation cell) (as shown in
Figure 8a–c. In general, due to the longer link length, the accuracies of the classification model were significantly improved (compared with those provided
Section 4), no matter whether the model being used was a single-frequency, dual-frequency, or tri-frequency model (there are many combinations with accuracy greater than 95% in the dual-frequency and tri-frequency models). The increase of link length reduced the error, which corresponded to the analysis conclusion of
Section 5.2. In addition, the trend of accuracy changing with the model number was basically unchanged (this was true not only for Case 1, but also for Cases 2 and 3, as shown in
Figure 8d–i. Further, as the link came closer to the center of the precipitation cell (from 0–6 km to 1–7 km and then to 3–9 km), the accuracy of the model increased as a whole, but it decreased slightly as a whole when it was away from the precipitation cell (from 2–8 km to 3–9 km).
For Case 2, the length of the link was set to 10 km, and the four location conditions ((
xt,
xr) was equal to (2, 12), (4, 14), (6, 16), and (8, 18), respectively) were simulated, corresponding to getting farther and farther away from the precipitation cell) (as shown in
Figure 8d–f). As the link was far away from the precipitation cell, the accuracy decreased on the whole, especially for 8–18 km, where only one fifth of the link was covered by precipitation cell. However, even in this case, the accuracies of some single-frequency models were still as high as 80%, and those of the dual-frequency and tri-frequency models were above 90%. As precipitation existed in only part of the links, the attenuation rate caused by precipitation as a feature variable was obviously reduced, but it still worked well, probably because the polarization information was introduced into the model. The potential relationships between precipitation-induced attenuation rates under horizontal and vertical polarization at different frequencies may be key to ensuring that the model can still maintain a good effect.
For Case 3, the precipitation cell waas completely contained within the link range, and the four location conditions ((
xt,
xr) was equal to (–1, 11), (–2, 12), (–3, 13), and (–4, 14), respectively) were simulated (as shown in
Figure 8g–i). The results showed that the increase of link length had little effect on the accuracies of the models.
6. Conclusions and Prospects
In this paper, using HSD data measured by a Parsivel disdrometer, microwave link simulation experiments were carried out under different frequencies (15 GHz, 18 GHz, 25 GHz, 38 GHz, 50 GHz, 60 GHz, 70 GHz, and 80 GHz) and polarization modes (horizontal or vertical). The attenuation rates caused by rain, graupel, wet snow, and dry snow were calculated and used as feature variables to establish single-frequency, dual-frequency, and tri-frequency hydrometeor type identification models based on the ELM algorithm.
As the attenuation signal of dry snow was too weak, only rain, graupel, and wet snow were considered in the classification model. The classification results showed that, on the whole, the performance of the tri-frequency model was better than those of the dual-frequency model and the single-frequency model (the mean accuracies of the test sets were 83.2%, 80.7%, and 75.8%, and the highest accuracies are 85.6%, 84.4%, and 83.0%, respectively). For the dual-frequency and tri-frequency models, the accuracies were found to increase with the overall frequency or frequency difference. In addition, the performance of the model decreased with increases in noise level. When the noise coefficient was 0.5 dB, models with an accuracy greater than 80% still existed. When the noise factor was increased to 1.0 dB, there were still many models with an accuracy greater than 75%. Furthermore, the classification model achieved good performance under different precipitation cell and link combinations.
In addition to quantitative precipitation estimation, the identification of hydrometeor types using microwave links has become a hot research area. In this paper, a preliminary attempt was made to identify the hydrometeor type by using multi-frequency microwave links. In the future, measured link data will be collected to verify the simulations. However, there are still certain problems. For example, the problem of detecting dry snow remains difficult to solve using only attenuation information. However, it has been reported that the differential phase may be a good indicator for detecting dry snow [
26], which is our next step. In addition, this study showed that the model classification effect obtained using high-frequency microwave links (60 GHz, 70 GHz, 80 GHz) was better than those obtained using low-frequency links, and these frequencies belong only to the frequency range of E-band commercial microwave links, which is a necessary part of the new generation of 5G networks. With further study of the E-band microwave link network, its potential in the field of hydrometeor type identification may be further explored in the future.