4.1. SNR Prediction
The initial testing results present the MSE between the predictions and labels on the test set for various channels and for the five observed algorithms. The OI is considered a baseline algorithm as it is simple to implement and is often used in the literature, while the ML models were expected to offer improvement. In terms of MSE performance (as well as spectral efficiency), the implemented LR and SVM performed the same or worse than the regular NN, so for an easier comparison, only the results of the NN are presented in this and further sections as it will be a good representative of the best performing simple ML algorithms. The MSE values for all observed scenarios are shown in
Table 4, while
Figure 2,
Figure 3 and
Figure 4 show the same metric visually for light shadowing, average shadowing and heavy shadowing, respectively.
When observing the values from
Table 4, it can be concluded that based on different scenarios, the MSE can have quite a large range of values, from 0.2 dB
2 to 38.2 dB
2 for the outdated information. The vast range of these values shows that based on different channel characteristics, different expectations for SNR prediction quality should be present. The ranges of MSE for NN and NN2 have the same minimum value as the OI, and this value is so low that it can be concluded that the implementation of additional algorithms outside of OI can be completely redundant in certain scenarios, e.g.,
= 5 and higher expected SNR values. On the other hand, the maximum MSE for NN is quite lower than the maximum value for outdated information, amounting to 18.5 dB
2, which is less than half of the maximum value of the OI and almost half of the OI MSE value for that respective scenario. Notably, the NN2 has a consistently higher MSE than the NN but also provides an improvement when compared to the OI. This is to be expected as the loss function of the NN2 is not made to optimize for the MSE or exact prediction; rather, it is created to force the network to rarely overestimate the SNR value. For pure SNR prediction, this is impractical, but it will later be shown that this has benefits when observing practical applications and spectral efficiency.
The results shown in
Figure 2,
Figure 3 and
Figure 4 intuitively show the overall trends of the MSE in terms of various algorithms and channel scenarios. Notably, it can be observed that for all algorithms, the MSE is higher when the
parameter rises. This is expected as the higher values of
correspond to channels that have less predictable changes, and both OI and neural networks have difficulties performing when more rapid SNR changes are present. It is also evident that for a very low
, regardless of SNR, the algorithm performs quite similarly. As presented in
Table 4, improvements exist in most scenarios, but they are so minor that the development of special algorithms for the prediction of SNR can be considered redundant. On the other hand, for high
values, the improvements that the neural networks provide become more evident. It is interesting to note that the discrepancy between the performance of NN and NN2 also increases with the increase in
, regardless of shadowing conditions. This is most likely because a higher
creates a more unpredictable channel, and since the NN2 is penalized for overestimating the SNR, it starts to consistently predict much lower values in order to avoid the overestimating. This results in a higher MSE and, therefore, worse performance when compared to the NN approach.
Another interesting characteristic worth noting is that the OI has a higher MSE for heavy shadowing as opposed to average shadowing for s of 50 Hz and above (75 Hz and 100 Hz) for all SNR values. This is an interesting find, as heavy shadowing is considered a worse scenario than average shadowing. On the other hand, for unpredictable channels, such as those with high s values, heavy shadowing actually creates a more predictable pattern; although the SNR values are lower, more frequent shadowing occurrences create more regularity in the pattern and create a correlation between previous samples and future ones. The neural networks, however, compensate for these characteristics, and the same aspect is not prominent in neural networks’ performance.
4.2. Single Channel Spectral Efficiency
Considering the achieved results, it is clear that the neural networks can provide clear improvement in terms of SNR prediction when compared to the baseline OI approach, and the next step represents the evaluation of spectral efficiencies for all observed channels. The initial step is to analyze the outage probabilities if perfect SNR predictions would have been performed for the test channels, presented in
Table 5. These outage probabilities present the unavoidable error on the test set and are relevant for the interpretation of further results.
The results show that for the set threshold of 0.01 for single channel evaluation, outage probability is too high in some scenarios, making the set threshold unobtainable. These outage probabilities, however, should not impair the improvements in spectral efficiency that the neural networks should offer. The outage probabilities are more prominent for higher levels of shadowing, which is expected as 1 percent of the test set corresponds to 250 samples, and more frequent, heavier shadowing can easily make more than 250 samples have values lower than the minimum operational threshold.
Table 6 shows the results in terms of spectral efficiency and achieved error transmission rate for all considered conditions and algorithms.
Figure 5,
Figure 6 and
Figure 7 show the relationship between the SNR and the achieved spectral efficiency for light, average and heavy shadowing, respectively, while
Figure 8,
Figure 9 and
Figure 10 show the results visually in terms of improvement percentage compared to the OI spectral efficiency for light, average and heavy shadowing, respectively.
The results in
Table 6 show the wide range of performances that can be achieved for various scenarios and again point out that different channel characteristics can quite heavily influence the performance of the algorithms. Expectedly, as opposed to the MSE results, the expected SNR plays a significant role in achieving higher spectral efficiency. More specifically, the higher the expected SNR, the higher the achieved spectral efficiency. This stands regardless of the implemented algorithm or
value. It is also important to note that there are many scenarios for which the transmission error rate is higher than 0.01, essentially making reliable communication impossible, regardless of the spectral efficiency that can be achieved. As can be seen in
Table 5, all of the scenarios for which the desired error rate of 0.01 is not achieved have an unavoidable transmission error higher than 0.01. The implemented approach for margin determination does not make achieving an error of less than 0.01 on the test set certain, as the margin is determined on the training set and only then applied on the test set. However, the results indicate that this approach works quite well, as all the scenarios in which the error is larger than 0.01 correspond to the ones where the unavoidable outage probability is above 0.01.
The visual representations of the obtained spectral efficiencies are shown in
Figure 5,
Figure 6 and
Figure 7, indicating clearly how the trends of spectral efficiencies behave for different shadowing,
and SNR conditions. NN2 is consistently better than the OI, with NN also being better in most cases. For an SNR of 0 dB, the improvement seems negligible, but as the SNR increases, the difference between spectral efficiency becomes more prominent. The shapes of the presented curves also change based on the amount of shadowing. It can be seen in
Figure 5 that the spectral efficiency rises mostly linearly with the SNR for most of the light shadowing conditions, regardless of the algorithm, but the trends become more curved as the
increases. For average shadowing, the trends seem more curved, and for heavy shadowing, almost no curve looks linear. This conclusion also stands for all algorithms, meaning that regardless of the achieved improvement, the relationship between SNR and spectral efficiency has a different trend based on the type of channel that is observed.
Observing the results from
Figure 8, it can be seen more clearly that NN2 has a consistently better performance than OI and almost always a better performance than NN. The improvement in spectral efficiency that the NN2 provides is not drastic in comparison to OI or NN, as can be seen, but it is consistent. This clearly indicates the importance of considering different scenarios, as developing complex algorithms can provide limited improvement in certain ones. The consistency shows that the proposed method is conceptually good, but for practical applications, it is important to weigh the benefits of the spectral efficiency improvement against the complexity of integrating complex models into a system.
When comparing NN2 and NN, it is important to note that, as opposed to the simple SNR prediction, NN2 has better performance. This is due to the introduced margin and the need for a transmission error rate no greater than 0.01. Since there are many MODCODs considered for communication, their operations thresholds are not that far apart. Hence, if the neural networks predict an SNR value that is much higher than the operation point of the best possible MODCOD, no data will be transmitted, and a transmission error will occur. This is why the NN2 approach of underestimating values is useful because it is less likely to make such mistakes, so the determined margin ensuring a low error rate will not be as high and will not bring down the spectral efficiency improvements as much as it will for NN.
In terms of achieving a transmission error rate no greater than 0.01 for light shadowing specifically,
Figure 8 recapitulates that for an expected SNR of 0 dB, it is not possible, and no approach achieves this, but it is also shown that for the SNR of 3 dB and an
of 5 Hz, none of the algorithms could obtain a transmission error rate lower than 0.01. This channel does not fall under the category of difficult or unpredictable, as there is light shadowing, and the
is quite small. However, since the SNR is not high, it is always possible that the SNR values happen to be distributed in such a way that the outage probability is higher than 0.01, which is exactly what happened in this case. This stands in line with the results obtained for average and heavy shadowing, as for both, there was an error rate higher than 0.01 for all scenarios where the expected SNR was equal to 3 dB. One more interesting occurrence is that for an
of 100 Hz and an expected SNR of 3 dB, the NN did not achieve an error rate lower than 0.01, while NN2 and OI did. This simply shows that the errors that NN makes can be such that the SNR prediction itself is better, but the overestimating of values that sometimes occur can have a negative impact on reaching certain goals, such as low rates of transmission error. In terms of obtainable improvement for light shadowing, for a frequency range of 40 MHz, if the best relative improvement scenario is considered for NN2 (
= 75 Hz, SNR = 9 dB), the contribution of NN2 would be (1.12 − 0.92) b/s/Hz × 40 MHz = 8 Mb/s.
Observing average shadowing, similar patterns can be observed for light shadowing, with some changes. Firstly, for average shadowing, for an expected SNR of 3 dB, the transmission error rate was always higher than 0.01. This is because more frequent or heavier shadowing increases the intervals in which no communication can occur, thus raising the unavoidable error, which exceeds 0.01 in these scenarios. It can also be seen, in comparison to the low shadowing conditions, that for higher values, the achieved spectral efficiencies are overall quite lower, while for the lower values, this is not as prominent. This is to be expected as the combination of quicker changes in SNR in combination with more frequent shadowing makes predictions significantly more difficult, whereas if more shadowing but for slower changing SNR channels (lower ), the SNR pattern during shadowing can be more easily predicted and therefore not hinder the performance as severely. One more important observation is that although the absolute values are overall lower for higher when compared to the light shadowing, the relative improvement between OI and NN2 is more pronounced. This would indicate that for less favorable scenarios, such as average shadowing and a high , although the absolute spectral efficiency cannot be high, introducing more complex algorithms for SNR prediction could provide a significant benefit. Another occurrence that has happened for light shadowing was that in certain scenarios, NN has a transmission error rate higher than 0.01 while OI and NN2 do not. This has now happened for of 50 Hz and an expected SNR of 6 dB for the same reason described in the light shadowing scenario. Once again, in the results obtained for heavy shadowing, it can be seen that the expected SNR of 6 dB does not allow for communication under any conditions. For the average shadowing, in terms of obtainable absolute improvement (for a frequency range of 40 MHz), the best relative improvement scenario for NN2 ( = 100 Hz, SNR = 12 dB), the contribution of NN2 would be (0.54 − 0.32) b/s/Hz × 40 MHz = 8.8 Mb/s.
The analysis of the results obtained for heavy shadowing is quite similar to the one for previous scenarios. The transmission error rate was higher than 0.01 (due to outage probability) for almost all s and the expected SNR up to 9, with the only exception being the SNR of 9 dB and of 5 Hz. This shows that, as in the examples above, sometimes channel SNR values can play out in such a way that they allow for communication to be established in a way that is not possible for similar scenarios. One more important observation that stands for all shadowing conditions but can best be seen for heavy shadowing is that even in the scenarios where the transmission error rate is higher than 0.01, the improvements of spectral efficiency exist between OI and NN2 and the absolute value of spectral efficiency rises with the rise of the expected SNR. This is extremely important because even if a desired error rate is unattainable, this approach will still provide an improvement in spectral efficiency, which is crucial when considering multiple channels with different characteristics and the usability of the provided method. The relative improvement provided in certain scenarios for heavy shadowing is the highest among the observed scenarios and exceeds 100% in some cases. For the best relative improvement scenario for NN2 ( = 75 Hz, SNR = 12 dB), considering a frequency range of 40 MHz, the NN2 contribution amounts to (0.68 − 0.36) b/s/Hz × 40 MHz = 12.8 Mb/s.
4.3. Double Channel Spectral Efficiency
The final evaluation step is the one where the performance of the proposed method is evaluated for two communication channels. Here, only the NN2 and OI are compared for an easier overview of the results, especially considering that the NN2 approach has provided better results for the single channel spectral efficiency improvement.
Figure 11 shows the results for light shadowing,
Figure 12 for average shadowing and
Figure 13 for heavy shadowing.
Figure 11 shows how the combination of two channels can influence the performance of the proposed system. Each larger square represents a scenario where the channels
s are fixed (e.g., second row, third column,
1 = 25 Hz,
2 = 50 Hz), while the smaller squares correspond to various combinations of expected SNR. The type of square, such as red outline, regular outline and hatched, corresponds to the range of transmission errors for that scenario, and the color scale corresponds to the relative improvement in spectral efficiency. The relative improvement is above 0 for all scenarios, i.e., there are no scenarios where the OI outperformed the NN2. Secondly, the red squares outline the scenarios in which the desired transmission error rate was achieved, i.e., it was under 0.001. It can be seen that for a low
, it is always achievable, but as the
rises, this becomes more difficult, and for the
= 100 Hz, regardless of SNR, the proposed method achieved a transmission error rate lower than 0.01 but not lower than 0.001. On the other hand, it can be seen that the relative improvement of spectral efficiency provided by the NN2 is much more prominent for the scenarios with a higher
(as seen in dark blue) as opposed to the ones for lower
(seen in white or light blue). This shows that depending on the scenario, different goals can be achieved and that the final goal has to be considered through the design of the algorithm since the most straightforward solution (such as NN) might not provide the best results. Overall, scenarios where lower errors are obtainable present ones where SNR is easier to predict; hence, OI initially had good performance, which is why the relative improvement offered by the NN2 is not as high as for some other scenarios.
Figure 12 shows how the increase in shadowing affects the performance of the system. When compared to the light shadowing conditions, many of the results are in darker blue, showing a greater relative improvement than the one achieved for light shadowing. Secondly, it can be seen that aside from a couple of scenarios of both channels having an
of 25 Hz, an error rate lower than 0.001 could not be achieved if one of the channels does not have an
of 5 Hz. Thirdly, it can be seen that for some scenarios of higher
s and lower expected SNRs, not even an error rate of 0.01 could be achieved. This is due to the unavoidable error rate, in the same manner as it was present for single channel evaluation.
The results shown in
Figure 13 show several outcomes that could be considered expected and several ones that provide new information. Firstly, the scenarios where the unavoidable error is above 0.01 are more prominent, as can be seen for lower expected SNR scenarios where multiple fields are hatched. Secondly, there are more scenarios where the transmission error could be lower than 0.001 for higher
s when compared to the average shadowing. This might seem unexpected as more frequent and more heavy shadowing is not a favorable condition. However, it is possible that for higher
s, more frequent shadowing adds a level of order to the noisy signal, making the NN2 better at predicting what future SNR values will be. The “shadowed” parts of the signal provide very low SNR values, and if the NN2 can predict these values to be quite low, then the margin that is introduced might not need to be as high, and the overall performance could be better. The shadowed intervals have accurately low predicted SNRs, but regular parts of the signal could also have adequate predictions that will not be hindered by an extremely high margin.
When observing the error transmission ranges and the obtained results, it is important to note that the next order of magnitude, i.e., having a transmission error rate of 0.0001 or lower, was only unobtainable since the current test set has 25,000 samples, and such an error rate would imply no more than two samples could be allowed to be incorrectly transmitted. Furthermore, even with a larger test set, two channels and the considered shadowing conditions would probably not allow for such a low error to be theoretically obtainable anywhere where there is average or heavy shadowing. This could direct future work towards analyzing three or more channels or simply analyzing the performance obtainable when two channels of different shadowing levels are combined. These scenarios are understandably of interest but would simply be out of scope for this paper as the goal was to perform a sort of grid analysis in terms of ML algorithm performance for various channels and to evaluate whether a neural network that purposefully underestimates values could be of interest considering fixed transmission error rates.
4.4. Discussion
When discussing the results obtained in this paper, it is not feasible to perform a direct comparison with the metrics obtained from other related work as the experimental setup and goals are so diverse. On the other hand, when comparing the methods and approaches that are present, it is possible to see how the performed research is compatible with other approaches and what could be some of the limitations of the approach considered in this paper.
The results obtained in this paper show that neural networks can successfully be used to predict the SNR values for channels with various characteristics. This stands in line with the results obtained from existing work where different neural network architectures were also shown to be successful in predicting CSI in LEO satellite systems [
25,
26,
28]. The presented paper also provides a thorough evaluation of a multitude of different channels by using the implemented simulator and provides a novel approach in terms of improving spectral efficiency while keeping the transmission error rate above a fixed threshold. The observed thresholds for the transmission error rate (0.01 and 0.001) are not suitable for all applications, but the paper presents a foundation that can be used for future improvements and provides a proof of concept that can be combined with other optimization strategies, such as weather influence [
27] and energy efficiency [
20].
When discussing the barriers to the practical implementation of the proposed method in terms of energy consumption and computation time, it is important to note that the approach uses a relatively small convolutional neural network that contains 6261 parameters in combination with a simple subtraction of the estimated margin. The time needed to infer a prediction on a single batch containing 64 inputs is 18.7 ms on an AMD Ryzen 7 7840HS CPU. Depending on the usage, it is understood that the current complexity might present an obstacle, but smaller architectures of neural networks could be tested, and the length of the prediction could also be changed to not be a single point but rather several points so the need for inference is not as frequent.
The presented approach has several limitations that should be mentioned. The presented work analyzes three different factors (shadowing, , and expected SNR) but does not take into account other factors that could influence the channel, so this could be considered in future research. Furthermore, the paper observes a maximum of two satellites, and the scaling of the proposed method into larger systems, both in terms of implementation complexity and efficiency, is yet to be analyzed. The proposed solution is also focused on the DVB-S2X protocol, and its applications in different communication scenarios or with different satellite types were not in the scope of the paper. Finally, there is also an in-depth analysis of different neural network architectures, which was omitted in this paper as it is not the focus, but obtaining better results with smaller models always presents a broad area for future research.