A Machine Learning Approach for Path Loss Prediction Using Combination of Regression and Classification Models

Iliev, Ilia; Velchev, Yuliyan; Petkov, Peter Z.; Bonev, Boncho; Iliev, Georgi; Nachev, Ivaylo

doi:10.3390/s24175855

Open AccessArticle

A Machine Learning Approach for Path Loss Prediction Using Combination of Regression and Classification Models

by

Ilia Iliev

,

Yuliyan Velchev

,

Peter Z. Petkov

^*

,

Boncho Bonev

,

Georgi Iliev

and

Ivaylo Nachev

Department of Radio Communications and Video Technology, Faculty of Telecommunications, Technical University of Sofia, 1000 Sofia, Bulgaria

^*

Author to whom correspondence should be addressed.

Sensors 2024, 24(17), 5855; https://doi.org/10.3390/s24175855

Submission received: 14 August 2024 / Revised: 2 September 2024 / Accepted: 6 September 2024 / Published: 9 September 2024

(This article belongs to the Topic Advances in Wireless and Mobile Networking)

Download

Browse Figures

Versions Notes

Abstract

:

One of the key parameters in radio link planning is the propagation path loss. Most of the existing methods for its prediction are not characterized by a good balance between accuracy, generality, and low computational complexity. To address this problem, a machine learning approach for path loss prediction is presented in this study. The novelty is the proposal of a compound model, which consists of two regression models and one classifier. The first regression model is adequate when a line-of-sight scenario is fulfilled in radio wave propagation, whereas the second one is appropriate for non-line-of-sight conditions. The classification model is intended to provide a probabilistic output, through which the outputs of the regression models are combined. The number of used input parameters is only five. They are related to the distance, the antenna heights, and the statistics of the terrain profile and line-of-sight obstacles. The proposed approach allows creation of a generalized model that is valid for various types of areas and terrains, different antenna heights, and line-of-sight and non line-of-sight propagation conditions. An experimental dataset is provided by measurements for a variety of relief types (flat, hilly, mountain, and foothill) and for rural, urban, and suburban areas. The experimental results show an excellent performances in terms of a root mean square error of a prediction as low as

7.3

dB and a coefficient of determination as high as 0.702. Although the study covers only one operating frequency of 433

M

Hz

, the proposed model can be trained and applied for any frequency in the decimeter wavelength range. The main reason for the choice of such an operating frequency is because it falls within the range in which many wireless systems of different types are operating. These include Internet of Things (IoT), machine-to-machine (M2M) mesh radio networks, power efficient communication over long distances such as Low-Power Wide-Area Network (LPWAN)—LoRa, etc.

Keywords:

path loss prediction; radio propagation modeling; LoRa; neural network regression; neural network classification

1. Introduction

In recent years, radio access networks have developed at an unprecedented pace. They are generally the bottleneck in end-to-end network communications due to a number of natural phenomena in the propagation of electromagnetic waves, especially in mobile communications. The communication channel is dispersive in time and frequency. The radio interface is exposed to noise and multiple interference of an intra- and inter-system nature. On the other hand, with the evolution of new-generation cellular networks, the emergence of the Internet of Things (IoT) and the Internet of Everything (IoE) has not only expanded the application of wireless networks, but has necessitated the flexible use and reuse of the radio spectrum. Information transmission speeds, and the number and type of telecommunications services are steadily increasing. These changing conditions have entailed new approaches to radio interface construction, in terms of radio signals’ synthesis, modulation, and coding, and continuous adaptation of the radio link and its parameters to the propagation environment. The emergence of software-defined radio has enabled the construction of cognitive radio networks, at the core of which is a cognitive machine. The current trend is to implement cognitive radio using artificial intelligence to maximize the throughput of the communication channel under certain conditions of the electromagnetic wave propagation environment.

In radio communications, all elements of machine learning (ML) and self-learning are applicable due to the continuously changing radio communication environment and communication channel conditions. A key element of the cognitive process is adapting the relationship of the interrelated information and energy parameters of the channel. In order to provide a certain type of communication service, it is necessary to maintain within given limits the basic quality parameters, such as bandwidth, data rate, signal-to-noise interference ratio, binary error rate, etc.

The radio link power budget is used in two aspects. The first one is preliminary engineering planning of the the radio link. The second is the process of its use and adaptation by radio communication facilities. These processes are part of the functions of the physical, media access control (MAC) and radio link control (RLC) layers of the communication protocols. For example, they are an important element in interference reduction in cellular networks through transmitter power management procedures.

The radio link power budget requires the knowledge or prediction of electromagnetic wave propagation losses. Propagation loss models allow us to determine the received signal power as a function of distance and other parameters, and hence to predict the signal-to-noise ratio at the receivers. Given the distance and other parameters of the radio communication system, the terrain, and its characteristics and propagation conditions, the maximum allowable path loss of the radio communication can be determined. Alternatively, given the attenuation, the maximum radius of radio coverage can be calculated.

The losses are determined by several natural phenomena in the propagation of an electromagnetic wave, such as decrease in power density of the electromagnetic wave as a function of distance, diffraction, reflection, and scattering. In most cases, this necessitates the use of a multi-path propagation model, and, in the case of mobility, it is necessary to account for the appearance of fading and Doppler shift. Propagation models may include the influence of large-scale fading. Small-scale fading and Doppler shift are not included in the models because their influence is minimized in the signal processing stage at the transmitting or receiving side. On this basis, a number of models have emerged, which can be divided into three main groups: deterministic (analytical); stochastic; and empirical.

The deterministic models use exact analytical expressions derived from electrodynamics and are directly related to the propagation environment and the characteristics of the region of coverage. They use a multi-path propagation model and accurate 2D or 3D maps of the terrain with detailed electromagnetic characteristics of the obstacles. Many of them apply finite-difference time-domain methods for attenuation estimation [1,2]. Their accuracy is relatively high, but the computational complexity is significant. A classic representative of theses types of models is the Longley–Rice model [3].

The stochastic models account for the random nature of the large-scale fading, which is described by a log-normal distribution. Based on statistical parameters and a given coverage probability, an additional large-scale fading margin is included [4,5]. Based on a number of studies, the 3rd Generation Partnership Project (3GPP) organization proposes TR 38.901 stochastic empirical models for 5G NR for the frequency range 0.5–100

G

Hz

[6]. Models for different coverage scenarios are synthesized, such as Rural Macro, Urban Macro, Urban Micro, Indoor Factory, Indoor Hotspot, etc.

Empirical models are obtained under certain conditions after approximation of a set of measured data. They are characterized by their simplicity and relatively high modeling accuracy, but only for the conditions under which they are defined. Over time, several classical models have been established and used for coverage prediction in radio communication systems: Okumura–Hata model [7], COST 231 Hata [8], COST Walfisch–Ikegami [9], Lee model [10], etc. For example, in the Lee model, a simplified loss model is applied to predict the attenuation as a function of distance. Additionally, correction factors for antenna heights, operating frequency, and antenna gains are included.

A current trend in radio coverage prediction is by using ML models trained with supervised, unsupervised, and reinforcement learning algorithms [11,12,13,14,15]. The application of ML in the models not only enhances the accuracy of coverage prediction, but also allows their direct implementation in the cognitive machine in cognitive radio networks in order to adapt the link, depending on the communication environment conditions.

Supervised learning algorithms find a special place in loss modeling because the task involves both classification and regression elements. The modeling of propagation losses is in most of the cases interpreted as a regression problem. Support vector machines, Gaussian process regression models, kernel approximation and classifiers, neural networks, naive Bayes classifiers, nearest neighbor classifiers, random forest ensemble learning algorithms, etc., are applied. As an example, a number of publications have not only performed comparative analysis, but also demonstrated multiple studies on the parameters of propagation loss models using ML algorithms [11,16,17,18,19,20,21].

In [16], a neural network ensemble learning technique with increased accuracy for path loss prediction is proposed. The measured results are taken from [22], which are for an 1800

M

Hz

frequency band in urban area propagation conditions. The neural network uses six input parameters (features): longitude; latitude; elevation; altitude; clutter height; and distance. Some of the used indicators of the model’s performance are root mean square error,

R M S E

[17]; mean absolute error,

M A E

[18]; and coefficient of determination,

R^{2}

[16]. The achieved results are as follows:

R M S E = 2.941

dB;

M A E = 1.2753

dB; and

R^{2} = 0.8951

.

Paper [17] proposes support vector regression (SVR) and radial basis function (RBF) models for path loss predictions in rural, suburban, and urban areas. The following environmental input parameters are used: elevation; clutter heights; distance; altitude; building-to-building distance; the street orientation angle; and base station antenna heights (fixed to 25

m

and 35

m

). The obtained

R M S E

values for the three types of areas are

1.378

dB;

1.452

dB; and

2.157

dB. The results show a very good regression fitting because the measured points are located approximately on three straight lines originating at the base station. Unfortunately, we did not find any information about the operating frequency.

A technique that combines artificial neural networks (ANNs) with Gaussian process (GP) variance analysis and principal component analysis is proposed in [11]. The multi-layer perceptron (MLP) is the core of the ANN architecture. The input parameters of the model are frequencies (450

M

Hz

, 1450

M

Hz

, and 2300

M

Hz

); elevation plus transmitting antenna height; elevation plus receiving antenna height; and the difference between these two heights. The data are collected for only one transmitting antenna height (15

m

) and for suburban propagation conditions (small town). The achieved quantities for the regression performance using the ANN for frequency 450

M

Hz

are as follows:

R M S E = 7.876

dB;

M A E = 5.896

dB; and

R^{2} = 0.3975

.

Three ML regression models are investigated and compared in [18]: ANN; support vector machine; and multi-linear regression. The measurements are made for one frequency (900

M

Hz

) and one transmitter antenna height (100

m

). The communication distance is limited within the range 100

m

to 800

m

, and a very few number of geographical points (<100) are considered. We believe this is the reason for the very good approximation results that are reported (RMSE = 0.008438 and

R^{2}

= 0.999675) when using ANNs.

An ensemble method consisting of three neural network models—conventional ANNs, long short-term memory-based recurrent neural networks, and convolutional neural networks—is analyzed in [19]. The prediction of path loss is for an indoor environment at three frequencies of 14

G

Hz

, 18

G

Hz

, and 22

G

Hz

. The data used in this research are collected in an indoor environment for line-of-sight (LOS) and non-line-of-sight (NLOS) scenarios. The input features are distance; frequency; angle of arrival; and transmitter antenna height. The distance ranges from 2 to 24

m

. Measurements at various frequencies are carried out for 865 points. The studied models demonstrate high accuracy in terms of the maximum value of

R M S E

being less than

0.3162

dB and the average value of

R^{2}

being as high as 0.9753.

Our analysis shows that many successful ML-based techniques for path loss prediction have been developed. In terms of their approximation accuracy, adequacy and generality, the choice of the method is of big concern. The proper selection of input parameters, training data size, and consideration of the propagation conditions of the electromagnetic wave are of key importance. Based on the physical processes of electromagnetic wave propagation, the most influential parameter is the distance between the transmitter and the receiver. The other parameters, which are related to the diffraction properties of the electromagnetic wave, are the heights of the antennas, the terrain profile, and the type and characteristics of the obstacles. The propagation medium influences the reflection and multi-path propagation, expressed as statistical parameters of the long-scale fading. Each different propagation area has specific dispersion. The operating frequency is also an important parameter because it directly influences the free space propagation losses. It also affects the additional attenuation caused by hydrometeors, vegetation, buildings, and air molecules. An additional influence on the accuracy of the approximation is the measurement error of the primary parameters. Most of the proposed models are limited by the conditions under which the training samples are obtained. They provide the relevant accuracy only under these conditions. For this reason, the comparison between different methods is sometimes incorrect.

In this study, a mobile measurement is performed using specially designed equipment and software. The values of the measured attenuation are for a frequency of 433

M

Hz

. The main reason for the choice of such an operating frequency is because it falls into the industrial, scientific, and medical (ISM) range, in which the number and type of operating wireless systems are increasing. These include IoT, machine-to-machine (M2M) mesh radio networks, power efficient communication over long distances such as Low-Power Wide-Area Network (LPWAN)—LoRa, etc. IoT is the basis of a number of information telecommunication systems, such as Smart Homes, Smart Cities, Smart Grid, Cyber Physical Systems, and others. The use of these systems re-proposes different coverage scenarios, diversity of propagation environments, and different antenna heights, especially of the end devices that are close to low-altitude obstacles. These are the prerequisites for our motivation to face this problem.

2. The Proposed Approach

2.1. Overview

This study proposes and investigates a path loss prediction ANN-based approach using combination of regression and classification models. This compound model is adequate for rural, suburban, and urban areas. The novelty here is that an additional classifier is applied, through which the model automatically estimates the type of coverage scenario—LOS or NLOS, based on indirect input parameters.

The proposed compound model for path loss prediction is composed of two regression models (named Model A and Model B), whose outputs can be combined by different manners (Figure 1).

Model A is adequate in the LOS scenario, whereas Model B is suitable for NLOS conditions. A third model (named Model P), which shares the same input parameters as Models A and B, serves as a binary classifier. Its outputs can be considered as posterior probabilities

P (A | x)

and

P (B | x)

for the input data,

x

, being adequate to each regression model, where A is for the LOS scenario and B is for NLOS. The outputs

{\hat{y}}_{A}

and

{\hat{y}}_{B}

of the regression models can be combined as a weighted average using

P (A | x)

and

P (B | x)

as coefficients (Figure 1a). This can be expressed as follows:

\hat{y} = P (A | x) {\hat{y}}_{A} + P (B | x) {\hat{y}}_{B},

(1)

where

\hat{y}

is the output of the compound model. This type of combination is referred as “soft” in the rest of the paper. A simpler variant is shown in Figure 1b. The output of the compound model is just the output of a particular regression model considering the maximal probability among

P (A | x)

and

P (B | x)

:

\hat{y} = \{\begin{matrix} {\hat{y}}_{A}, & P (A | x) \geq P (B | x) \\ {\hat{y}}_{B}, & P (A | x) < P (B | x) \end{matrix} .

(2)

This type of combination is referred as “hard”. Any of these probabilities can be routed as an additional output, so this can serve as an uncertainty indicator for the selection of a particular regression model.

In order to perform the regression and classification with the required accuracy, the training of the neural networks is performed with an ensemble of measured data. It consists of terrain parameters for the three types of areas, type and height of the buildings, and antenna heights.

2.2. Input and Output Parameters

Since Model P has to be trained using supervised learning, true labels have to be provided. They are denoted as

L O S

and report whether the profile has a direct or indirect line of sight. This binary quantity is determined automatically by a procedure written in MATLAB©. The input parameters of the procedure are the coordinates of the measured point, the coordinates of the stationary station, and the antenna heights. The geographic data of terrain relief and 3D buildings for specified regions of interest taken from [23] are passed for 3D visualization and profile tracking analysis. The constructed altitude profile with obstacles between a measured point and the stationary station and the straight line between the two antennas’ heights determine whether there is a direct or indirect line of sight.

The model is proposed to have five input parameters: the 2D distance between the receiver and the transmitter in logarithmic scale,

d_{l o g}

; the relative height,

h_{e f f}

, between the antennas of the stationary and mobile stations; the relative maximum terrain height,

H_{r, M A X}

, including obstacles; the relative average height of the terrain,

H_{r, A V}

; and the standard deviation of the profile of the propagation path,

H_{r, S T D}

. The chosen input parameters can be expresses as a 5-dimensional vector:

x = [\begin{matrix} x_{1} & x_{2} & x_{3} & x_{4} & x_{5} \end{matrix}] .

(3)

The first element is the distance d (in meters) in logarithmic scale:

x_{1} = d_{l o g} = 20 {log}_{10} d .

(4)

The attenuation is a linear function of distance expressed in logarithmic scale.

The second element is the effective antenna height in meters:

x_{2} = h_{e f f} = |(h_{S} + A_{S}) - (h_{M} + A_{M})|,

(5)

where

h_{S}

and

h_{M}

are the antenna heights relative to the terrain for the stationary and mobile transceivers, respectively.

A_{S}

and

A_{M}

are their corresponding altitudes. Instead of using fixed absolute values for antenna heights (as in many other models), the effective height here is the difference in the heights of the transmitter and receiver antennas, including altitudes of the installation positions. This choice extends the applicability of the model. This parameter is calculated as a absolute value because the link channel is assumed to be reversible concerning the transmitter–receiver direction.

The relative maximal height of the terrain,

H_{r, M A X}

, and obstacles in respect to antenna heights as well participate in the input parameters as the third element of

x

:

x_{3} = H_{r, M A X} = max_{\begin{matrix} j = 1 . . N_{k} \end{matrix}} (A_{j}) - min \{(h_{S} + A_{S}), (h_{M} + A_{M})\} + H_{B, A V},

(6)

where

A_{j}

is the elevation at position j of the altitude profile between the measurement point and the stationary station. The altitude profile is divided into

N_{k}

parts with a step

Δ d = 6 m

, which is the half of a street’s width for a small town. The calculation is made relative to the lowest height of the two antennas in order to account for the worst case in terms of shadowing. The input parameter includes the average height,

H_{B, A V}

, of buildings for the area of consideration.

The fourth input parameter is the average height of the altitude,

H_{r, A V}

, in the direction transmitter–receiver in respect to the antenna heights:

x_{4} = H_{r, A V} = \frac{1}{N_{k}} \sum_{j = 1}^{N_{k}} A_{j} - max \{(h_{S} + A_{S}), (h_{M} + A_{M})\} .

(7)

The last input parameter is the standard deviation,

H_{r, S T D}

, of the altitude profile in respect to the antenna heights:

x_{5} = H_{r, S T D} = \sqrt{\frac{1}{N_{k} - 1} \sum_{j = 1}^{N_{k}} {(A_{j} - \bar{A})}^{2}} - max \{(h_{S} + A_{S}), (h_{M} + A_{M_{k}})\},

(8)

where

\bar{A}

denotes the mean elevation value.

Finally, the

x

vector is a subject of linear transformation. The transformation is chosen so that when applied on the whole dataset matrix:

X = [\begin{matrix} x_{1} \\ x_{2} \\ ⋮ \\ x_{N} \end{matrix}],

the result will be a matrix whose columns have zero mean and unit variance. This data standardization enhances the performance of the models and improves the accuracy.

The input parameters are selected to help the model to account for the effects of electromagnetic wave diffraction and shadowing losses, and to distinguish between LOS and NLOS scenarios. The last three parameters (

x_{3}

,

x_{4}

, and

x_{5}

) are statistical quantities that refer to the path profile between transmitter and receiver. The parameter

H_{r, M A X}

takes into account the highest point of the line transmitter–receiver and the average height,

H_{B, A V}

, of the buildings for the area. This leads to an increase in its value without describing the actual situation. There may or may not be a building at the highest point of the profile. The actual height is not involved here, but the average height for the area is used instead. At first glance, this would increase the regression error if the model had only a regression algorithm. This disadvantage is minimized by the additionally included classifier in the compound model since it is trained with the true labels,

L O S

.

L O S

conveys the actual situation of direct or indirect line of sight. The use of the parameter

H_{B, A V}

(that is the mean height of the buildings for a given coverage area) brings the advantage in predicting the attenuation of the proposed approach. The average height of buildings can be easily found or determined without an accurate 3D Geographic Information System (GIS). Furthermore, instead of the exact statistics for a given altitude profile, the model can use the average estimates of the other two input parameters,

H_{r, A V}

and

H_{r, S T D}

, in order to predict the coverage. In this case, it can be included in the group of zonal models and it will suffer their disadvantage—lesser accuracy.

For brevity, the output parameter (path loss) is denoted as y, i.e.,

y = L

, where the true value for the path loss is calculated according to the radio link budget formula:

y = L = P_{T} [dBm] + G_{S} [dBi] + G_{M} [dBi] - P_{R} [dBm],

(9)

where

P_{T}

is the transmitter power,

P_{R}

is the received power, and

G_{S}

and

G_{M}

are the antenna gains.

2.3. Architectures of the Individual Models

In Figure 2, the architectures of the individual models used are presented. The regression models (Model A and Model B) share a similar structure (Figure 2a). It consists of three hidden fully connected layers (Dense) with 16 perceptrons [24] in each. A fully connected layer means every input of the input vector influences every output of the output vector (all possible connections layer-to-layer are present). The output shapes of each layer are denoted with pairs enclosed in parentheses, where “None” indicates a dummy dimension. The input shape of each layer is the output shape of the previous one, except for the first layer whose inputs are equal to the dimension of

x

. The used activation function for all layers except the last one is tanh [25]. The output layer has a linear activation function, which is well suited for regression problems.

Models A and B are prone to over-fitting. This can be prevented using kernel regularization by adding penalty factors to the layers. In our case, L2 kernel regularization is implemented [26] in each hidden layer by modifying the cost function, C [26]:

C = \sum_{i = 1}^{N} {(y_{i} - f (x_{i}))}^{2} + λ \sum_{j = 1}^{M} w_{j}^{2},

(10)

where

w_{j}

are the weights of the inputs

x_{j}

and f denotes the transfer function of the neuron. The value of the regularization factor

λ = 0.075

is same for each layer.

The classification model (Figure 2b) has similar structure of its hidden layers, but without any kernel regularization. The activation function of the output layer is of type softmax [25], which has the property to normalize the output of a network to a probability distribution over predicted output classes.

The complexity of the models (in terms of number of layers and number of units in each of them) is determined experimentally. Starting from a simple one, the complexity is gradually increased until no significant improvement is obtained.

3. Experimental Results

3.1. Experimental Dataset

The dataset for training the models and evaluating the performance of the proposed approach is provided by a dedicated measurement setup developed by the authors. The objective of the measurement is to create a dataset of primary parameters by which the attenuation can be determined as a function of transmitter and receiver locations, absolute altitudes of the transmitter and receiver antennas, and statistics of the profile transmitter–receiver. Because the operating frequency is in the free ISM range, LoRa modules for machine-to-machine communications are used as measurement transceivers. Each module consists of an ultra-low-power long-range transceiver of type SX1276 made by Semtech Corporation [27] and a microcontroller for control, processing, and interfacing via a universal asynchronous receiver–transmitter (UART). Another reason to use these modules is because the LoRa technology is tied to a high receiver sensitivity of −148

m

. This implies measurements of greater distances between the transmitter and the receiver, especially in urban areas. Doppler shift is compensated for a frequency of up to

31.5

k

Hz

at a bandwidth of 125

k

Hz

. The operating frequency of 433

M

Hz

allows ground mobile measurements with high offset rates. The software (version 1.0.0) for the embedded controller has the function to extract the received power value averaged over the received digital frame. In Figure 3 is depicted the proposed measurement concept as well, as the used equipment.

The stationary part,

T R X_{S}

, consists of a LoRa transceiver and J-pole antenna mounted at a suitable location on a building in the area under examination. The mobile part,

T R X_{M}

, also contains a LoRa transceiver and J-pole antenna, both of the same type as in the stationary subsystem. Additionally, there are a GPS receiver with a GPS antenna and a personal computer with installed and configured processing software. The LoRa transceiver and GPS receiver are connected to the computer via USB interface. The

T R X_{S}

transceiver is configured in repeater mode.

T R X_{M}

transmits a packet of data at a certain time interval.

T R X_{S}

relays the received packet back to

T R X_{M}

. The receiver at

T R X_{M}

processes the packet by measuring the received signal power,

P_{R, k}

. Here, k is the index associated with the k-th measurement point. For each measurement point k, the personal computer receives the measured power,

P_{R, k}

, from the LoRa module and associates it with the given GPS coordinates (latitude, longitude, and altitude), plus a timestamp. Such records are automatically saved to a database file.

The two nodes are equipped with identical J-pole antennas (Figure 4) with ground decoupling chokes [28,29].

The antennas operate on vertical polarization and provide omnidirectional patterns in the horizontal plane. They are made of copper pipes with a diameter of 6

m

m

. The ground decoupling choke provides cancellation of the currents in the mounting mast and the ground below (the car roof in the case of mobile applications), and therefore improves the pattern stability and noise temperature as well. An additional benefit of using the J-pole antenna (instead of the whip or monopole antennas, for example) is that the former cancels the unwanted out-of-band signals that can easily saturate the unprotected front-end of the LoRa receiver.

The operating parameters of the measurement system are transmitter output power

P_{T} = 20 dBm

; antenna gain

G_{S} = G_{M} = 3.8 dBi

; and operating frequency

f = 433 MHz

.

The areas for which the measurements are carried out are selected so that different relief types are presented. The selection includes rural, suburban, and urban areas with direct and indirect lines of sight at different heights of the stationary station antenna, geographical relief, and average height of buildings. The goal is to have a mixture of data through which the training model is adequate for each type of coverage. For example, Septemvri (Bulgaria) and Belogradchik (Bulgaria) are small towns with different terrain types. The measurement conditions and parameters for each selected area are summarized in Table 1.

Ground mobile measurements at a low vehicle speed of no more than 30

k

m

h^{- 1}

are carried out for the selected areas. The mobile antenna is mounted on the top of the vehicle at a height

h_{M} = 1.5

m. The total number of measurement records is 4490. In Figure 5 and Figure 6 are shown the maps of geographical points associated with measurement records for Septemvri and Belogradchik.

Septemvri and Belogradchik are small towns, but the terrain is quite different. For these towns, the selected geographical points are both inside and outside the town, in order to cover rural coverage scenarios. For Septemvri, the antenna of the stationary station is above the average level of the rooftop buildings, while for Belogradchik it is on the terrace of a brick building. Most buildings are made of brick masonry. In Figure 7 are shown the maps of geographical points associated with measurement records of two areas in Sofia city (Bulgaria). The first is the campus of the Technical University of Sofia (Figure 7a). Residential area Darvenitza (Figure 7b) is the second one.

The campus of the Technical University of Sofia is located at the foot of Vitosha Mountain. The terrain is mixed—flat, mountain and foothill. Here, the measurements are performed at different heights of the antenna of the stationary station. The same procedure is followed for residential area Darvenitza.

3.2. Training and Validation

The parameters of all individual models are optimized using the gradient descent approach. This is achieved with the well-known back-propagation method [24].

For training, validation, and testing of each model, the available dataset is split randomly into two subsets. The first one is intended for training and validation, whereas the second is used for testing and performance evaluation. The proportion is 50% for training plus validation and 50% for testing. This spit leads to imbalanced data since the samples that are annotated as adequate for Model B are several times more than those for Model A. Because Model A and Model B preform regression, this imbalance is not crucial for their proper training. However, such disproportion is a significant concern for Model P fitting since its purpose is to perform classification. To overcome this problem, the training data for the minority class are up-sampled using the approach referred as the Synthetic Minority Oversampling Technique (SMOTE) [30]. This method inserts synthetic samples along the line between two data points in the minority class. This procedure is repeated until balanced data is ensured. SMOTE does not cause data loss and generally does not introduce over-fitting of the model.

In the training validation procedure, the k-fold method is applied for cross-validation of each model. The number of folds is selected as five, thus in each iteration the validation is with 20% of the data. The “best” fold is selected according to the minimal value of the loss function for the validation data. After that, the model is retrained with the selected subset, and finally its structure and weights are stored.

For Models A and B, the optimization is based on minimization of the mean squared error between the true output and predicted one. As can be seen from Figure 8, the training process of Models A and B is associated with good convergence of the loss function.

The maximum number of epochs is set to 200, but early stopping is allowed when no significant improvement is obtained on two consecutive steps.

Model P is trained according to loss function minimization, based on binary cross-entropy [31]:

H_{p} (q) = - \frac{1}{N} \sum_{i = 1}^{N} [y_{i} log (p (y_{i})) + (1 - y_{i}) log (1 - p (y_{i}))] .

(11)

This can be regarded as a measure for dissimilarity between the true labels and the predicted probabilities of inputs being in the positive class, where y is the output label and

p (y)

is the predicted probability of the sample being 1 for all N samples. In Figure 9 is visualized the improvement of accuracy and loss functions during training of Model P.

As can be seen, the validation accuracy reaches nearly 85%.

3.3. Testing and Performance Evaluation

The performance of Model P is evaluated using the confusion matrix and Receiver Operating Curve (ROC), calculated when the model classifies the samples from the testing subset. ROC evaluates the performance of a model at all possible classification thresholds, showing the dependency between the true positive rate (

T P R

):

T P R = \frac{T P}{T P + F N}

(12)

and false positive rate (

F P R

), defined as:

F P R = \frac{F P}{F P + T N},

(13)

where

T P

is the true positives,

F N

is the false negatives,

F P

is the false positives, and

T N

denotes the true negatives of the classification. The area under the ROC curve,

A U C

, is a widely used estimate of the classification performance:

A U C = \int_{0}^{1} T P R (F P R) d F P R .

(14)

A perfect classifier has an

A U C

value equal to 1. In Figure 10 is shown the ROC curve of Model P (red line). The achieved value of the area under the ROC curve is

A U C = 0.889

. The so-called random chance line (the black dashed one) is also given in the figure. It corresponds to a complete classification uncertainty. By visual inspection, the “elbow” of the curve does not lie on the diagonal

(F P R = 0, T P R = 1) \to (F P R = 1, T P R = 0)

. This corresponds to a decision threshold different from 0.5 for binary classification. This is an indicator that the model is not perfect. Nevertheless, considering the corresponding

A U C

, the model has a good discrimination capacity.

The confusion matrix is an another way to represent the performance of a classification model. The entries correspond to the number of true positives, false negatives, false positives, and true negatives. Figure 11 presents the confusion matrix when Model P is used to classify the samples from the testing dataset.

Considering entries of the confusion matrix, the accuracy of a model can be calculated according to:

A c c = \frac{T P + T N}{T P + T N + F P + F N} .

(15)

For Model P, the achieved accuracy is

A c c = 80.3 %

.

Models A and B separately, as well as the compound one (with “soft” and “hard” combinations), are evaluated in terms of their performance using the error

e = y - \hat{y}

, where y is the true value and

\hat{y}

is the predicted one. The performance of Models A and B is evaluated using only those samples from the testing subset that are adequate for the particular model. This is achieved with separation, based on the true labels. In Figure 12, diagrams of type boxplot are shown, visualizing the statistics of the prediction error of Models A and B.

As can be seen, the error values of Model A are more tightly grouped than those associated with Model B. Similarly, the boxplot diagrams of the prediction error of the compound model are given in Figure 13 for “soft” and “hard” combination.

In the case of the compound model, the number of outliers is significantly higher than those presented for individual ones. The obvious reason for this is that the classification accuracy of Model P is far from perfect. In Figure 14, the corresponding histograms of the prediction error are shown.

No significant difference in terms of distribution type can be seen. The medians and interquartile ranges are also similar.

There are several numerical quantities by which the performance of a regression model can be evaluated. The first one is the coefficient of determination:

R^{2} = 1 - \frac{\sum_{i = 1}^{N} {(y_{i} - {\hat{y}}_{i})}^{2}}{\sum_{i = 1}^{N} {(y_{i} - \bar{y})}^{2}},

(16)

where

\bar{y} = \frac{1}{N} \sum_{i = 1}^{N} y_{i}

is the mean of the true values and N is the number of data points. The coefficient of determination is a statistical measure of how well the predictions approximate the real data points. If

R^{2}

is 1, this is an indication that the regression predictions perfectly fit the data. The second quantity is the well-known root mean square error:

R M S E = \sqrt{\frac{1}{N} \sum_{i = 1}^{N} {(y_{i} - {\hat{y}}_{i})}^{2}} .

(17)

The third widely adopted indicator is the mean absolute error:

M A E = \frac{1}{N} \sum_{i = 1}^{N} |y_{i} - {\hat{y}}_{i}| .

(18)

For completeness, the mean value,

μ_{e}

, and standard deviation,

σ_{e}

, of the error, e, are calculated as well. In Table 2 are summarized the quantities that characterize the performance of the compound model and individual ones.

As can be seen from Table 2, Model A is almost perfect. This can be expected since the path loss in LOS propagation conditions can be predicted with a high level of confidence. The compound model with a “soft” combination performs better than the “hard” type. Since the “hard” type has no advantages over the “soft” one, it can be rejected in future.

In Figure 15, the true and predicted outputs are visualized when using the compound model with the “soft” combination versus distance, d, and effective antenna height,

h_{e f f}

.

All available samples from the dataset are used to produce these predictions. These results confirm the achievement of good balance in terms of low prediction error and generalization capability.

4. Discussion

More accurate conclusions about the achieved results can be made when comparing the performance of the proposed approach with performances reported in other competitive studies. At the moment, there are no published studies that refer to the same conditions and limitations (operating frequency, area type, etc.). Such comparisons would not be entirely correct because it cannot be said that if a model has a low value of

R M S E

or

R^{2}

among those compared it is the best one. Accuracy depends on the choice of input parameters, the conditions under which they are measured, the accuracy of the measurements, and the regression algorithms used. The models that demonstrate high accuracy are valid only for a certain type of terrain, for a specific antenna height, and for certain propagation conditions—LOS/NLOS scenarios. The dispersion of the large-scale fading, which directly affects the accuracy of the model, depends on the type of terrain and the type of environment: rural, urban, sub-urban, etc. Using measured data from environments with a larger fading dispersion will also lead to larger model error, despite the applied machine learning algorithm. If, when comparing two models, one of them has a slightly worse performance, but it is evaluated under more severe conditions (area type, LOS/NLOS scenarios, etc.), then this model would have more applicability. Considering these stipulations, a conditional comparison in terms of achieved performance is given in Table 3. The goal here is not to make an exact performance comparison, but rather to evaluate how competitive the proposed approach is in terms of good balance between accuracy and generality.

As can be seen, the proposed approach has an

R M S E

that is comparable or even better than that reported in studies under similar conditions [11,32]. In terms of

R^{2}

and

M A E

, both [11,32] are outperformed. The approaches proposed in [16,17] have better values of

R M S E

and

R^{2}

, but they are intended for urban areas only. The

R M S E

achieved in [32] appears to be slightly better, but the operating frequency is quite different. On the other hand, the cited study is applicable for suburban areas only.

Analyzing the numerical quantities that indicate the accuracy of the prediction and the results given in Table 3, the following conclusions can be drawn regarding the proposed approach:

With five properly selected input parameters, the proposed compound model demonstrates satisfactory prediction performance ( $R M S E = 7.3$ dB and $R^{2} = 0.702$ ) for its practical application. This is valid for different antenna heights, various area types (rural, suburban, and urban), and for both LOS/NLOS scenarios;
With an appropriate combination of simplified ordinary neural structures with relatively small number of layers, a satisfactory prediction accuracy can be achieved that is comparable to the one reported in other similar studies;
The two regression models also have high prediction accuracy ( $R M S E$ of $3.8$ dB and $5.3$ dB). These values of the $R M S E$ are comparable to those reported in [33]. The models can be used separately when LOS/NLOS scenarios are predetermined;
The used input parameters are easy to obtain and calculate;
The achieved results are characterized with a high degree of confidence, considering the size and representativeness of the dataset (nearly 4500 measurement records for urban, suburban, and rural areas);
The binary classifier is the bottleneck of the compound model’s performance. If this classifier is refined, the predictive accuracy will approach that of the individual regression models.

The achieved accuracy of the regression models is entrusted within the limits of variation of the input parameters and the environment in which their values are measured. The neural networks are trained under these conditions also. The estimates of the accuracy of the compound model and the individual ones are guaranteed under the following restrictive conditions:

2D distance between antennas: $d > 10 m$ ;
Effective antenna height: $0 m \leq h_{e f f} \leq 237 m$ ;
Urban/suburban/rural environments;
Brick and reinforced concrete buildings in urban and suburban areas;
Flat/foothill/mountain/hilly relief types;
Areas with humid continental climate;
Operating frequency: $f = 433 MHz$ .

The proposed approach can be used to create other regression models of path loss prediction under other boundaries and conditions beyond those mentioned above. Its generalization consists in the fact that it can be used to create a model summarizing the different types of areas: rural/suburban/urban, and for LOS/NLOS propagation conditions. The model is not bound to specific values of antenna heights. From another point of view, its generalization is also expressed in the fact that the model can be easily transformed from a point-to-point into an area-to-area type by replacing the last two input parameters with their average values for a given terrain type.

5. Conclusions

A machine learning approach for path loss prediction is presented in this study. A compound architecture is proposed, by which low prediction error is achieved for various propagation conditions.

The selection of the input parameters of the model is performed not only on the basis of the physical processes for the propagation of the electromagnetic wave, but also on the indirect statistical characteristics related to the terrain morphology. Through them, the proposed model indirectly accounts for diffractive propagation properties and path loss from large-scale fading. The variety of measured input parameters at different antenna heights and types of areas allows an adequate model to be synthesized with automatic classification and consideration of direct and indirect line-of-sight scenarios, applicable to urban, suburban, and rural areas. Another advantage of the model is that, when terrain statistics are missing, they can be replaced with averages, typical for the given coverage area. If these are applied, then the model turns into zonal. The prediction accuracy will be lower in this case.

The obtained experimental results show excellent performance of the compound model in terms of a root mean square error of the prediction as low as

7.3

dB and a coefficient of determination as high as 0.702. This accuracy is fully satisfactory for its practical application. Moreover, the accuracy can be further increased if an improved version of the classification model is developed. This is because the two regression models are characterized by even better accuracy (root mean square error of the prediction of

3.8

dB for line-of-sight scenario and

5.3

dB for non-line-of-sight condition).

The proposed model can be trained and successfully applied for any operating frequency in the decimeter wavelength range and for other propagation environments and conditions.

Author Contributions

Conceptualization, I.I.; methodology, I.I., Y.V., B.B. and P.Z.P.; software, Y.V. and I.I.; validation, P.Z.P., B.B. and I.N.; formal analysis, P.Z.P. and B.B.; investigation, I.I., Y.V., B.B., P.Z.P., G.I. and I.N.; resources, I.I., Y.V., B.B., P.Z.P., G.I. and I.N.; data curation, I.I.; writing—original draft preparation, Y.V. and I.I.; writing—review and editing, P.Z.P., B.B. and I.N.; visualization, Y.V. and I.I.; supervision, P.Z.P.; project administration, P.Z.P.; funding acquisition, P.Z.P. All authors have read and agreed to the published version of the manuscript.

Funding

This study is funded by the European Union—NextGenerationEU, through the National Recovery and Resilience Plan of the Republic of Bulgaria, project № BG-RRP-2.004-0005.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The datasets containing the input data are publicly available at https://zenodo.org/uploads/13320273 (accessed on 5 September 2024), DOI:10.5281/zenodo.13320273. The source codes are available from the corresponding author upon reasonable request.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:

IoT	Internet of Things
IoE	Internet of Everything
ML	Machine learning
MAC	Media access control
RLC	Radio link control
SVR	Support vector regression
RBF	Radial basis function
ANN	Artificial neural network
MLP	Multi-layer perceptron
GP	Gaussian process
ISM	Industrial, scientific, and medical
M2M	Machine-to-machine
LPWAN	Low-Power Wide-Area Network
LOS	Line-of-sight
NLOS	Non-line-of-sight
GIS	Geographic Information System
UART	Universal asynchronous receiver–transmitter
SMOTE	Synthetic Minority Oversampling Technique
ROC	Receiver operating curve

References

Green, D.; Yun, Z.; Iskander, M.F. Path Loss Characteristics in Urban Environments Using Ray-Tracing Methods. IEEE Antennas Wirel. Propag. Lett. 2017, 16, 3063–3066. [Google Scholar] [CrossRef]
Qian, J.; Wu, Y.; Saleem, A.; Zheng, G. Path Loss Model for 3.5 GHz and 5.6 GHz Bands in Cascaded Tunnel Environments. Sensors 2022, 22, 4524. [Google Scholar] [CrossRef] [PubMed]
Longley, A.; Rice, P. Prediction of Tropospheric Radio Transmission Loss Over Irregular Terrain: A Computer Method-1968; ESSA Technical Report; Institute for Telecommunication Sciences: Boulder, CO, USA, 1968. [Google Scholar]
Iliev, I.G.; Bonev, B.G. Propagation Loss Model for Mobile Wireless Communication Systems. In Proceedings of the 2022 57th International Scientific Conference on Information, Communication and Energy Systems and Technologies (ICEST), Ohrid, North Macedonia, 16–18 June 2022; pp. 1–4. [Google Scholar] [CrossRef]
Faruk, N.; Popoola, S.I.; Surajudeen-Bakinde, N.T.; Oloyede, A.A.; Abdulkarim, A.; Olawoyin, L.A.; Ali, M.; Calafate, C.T.; Atayero, A.A. Path Loss Predictions in the VHF and UHF Bands Within Urban Environments: Experimental Investigation of Empirical, Heuristics and Geospatial Models. IEEE Access 2019, 7, 77293–77307. [Google Scholar] [CrossRef]
ETSI. Study on Channel Model for Frequencies from 0.5 to 100 GHz, Technical Report 3GPP TR 38.901 version 16.1.0 Release 16 ; European Telecommunications Standards Institute: Valbonne, France, 2020. [Google Scholar]
Rappaport, T. Wireless Communications: Principles and Practice, 2nd ed.; Prentice Hall PTR: Paramus, NJ, USA, 2001. [Google Scholar]
Dalela, C. Tuning Of Cost-231 Hata Model for Radio Wave Propagation Predictions. In Proceedings of the The Second International Conference on Computer Science, Engineering and Applications, Delhi, India, 25–27 May 2012; Volume 2, pp. 255–267. [Google Scholar] [CrossRef]
Har, D.; Watson, A.; Chadney, A. Comment on diffraction loss of rooftop-to-street in COST 231-Walfisch-Ikegami model. IEEE Trans. Veh. Technol. 1999, 48, 1451–1452. [Google Scholar] [CrossRef]
Lee, W.C.Y. Mobile Communications Design Fundamentals, 2nd ed.; John Wiley & Sons, Inc.: Hoboken, NJ, USA, 1992. [Google Scholar]
Jo, H.S.; Park, C.; Lee, E.; Choi, H.K.; Park, J. Path Loss Prediction Based on Machine Learning Techniques: Principal Component Analysis, Artificial Neural Network, and Gaussian Process. Sensors 2020, 20, 1927. [Google Scholar] [CrossRef] [PubMed]
Nguyen, C.; Cheema, A.A. A Deep Neural Network-Based Multi-Frequency Path Loss Prediction Model from 0.8 GHz to 70 GHz. Sensors 2021, 21, 5100. [Google Scholar] [CrossRef] [PubMed]
He, R.; Gong, Y.; Bai, W.; Li, Y.; Wang, X. Random Forests Based Path Loss Prediction in Mobile Communication Systems. In Proceedings of the 2020 IEEE 6th International Conference on Computer and Communications (ICCC), Chengdu, China, 11–14 December 2020; pp. 1246–1250. [Google Scholar] [CrossRef]
Tahat, A.; Edwan, T.; Al-Sawwaf, H.; Al-Baw, J.; Amayreh, M. Simplistic Machine Learning-Based Air-to-Ground Path Loss Modeling in an Urban Environment. In Proceedings of the 2020 Fifth International Conference on Fog and Mobile Edge Computing (FMEC), Paris, France, 20–23 April 2020; pp. 158–163. [Google Scholar] [CrossRef]
Saleh, T.; Petrov, D.; Tirkkonen, O.; Räisänen, V. Probabilistic Path Loss Predictors for mmWave Networks. In Proceedings of the 2021 IEEE 93rd Vehicular Technology Conference (VTC2021-Spring), Virtual, 25–28 April 2021; pp. 1–6. [Google Scholar] [CrossRef]
Kwon, B.; Son, H. Accurate Path Loss Prediction Using a Neural Network Ensemble Method. Sensors 2024, 24, 304. [Google Scholar] [CrossRef] [PubMed]
Ojo, S.; Sari, A.; Ojo, T. Path Loss Modeling: A Machine Learning Based Approach Using Support Vector Regression and Radial Basis Function Models. Open J. Appl. Sci. 2022, 12, 990–1010. [Google Scholar] [CrossRef]
George, G.; Idogho, J. Path loss prediction based on machine learning techniques: Support vector machine, artificial neural network, and multi linear regression model. J. Phys. Sci. 2022, 3, 1–22. [Google Scholar] [CrossRef]
Elmezughi, M.; Salih, O.; Afullo, T.; Duffy, K. Path loss modeling based on neural networks and ensemble method for future wireless networks. Heliyon 2023, 9, e19685. [Google Scholar] [CrossRef] [PubMed]
Elmezughi, M.K.; Salih, O.; Afullo, T.J.; Duffy, K.J. Comparative Analysis of Major Machine-Learning-Based Path Loss Models for Enclosed Indoor Channels. Sensors 2022, 22, 4967. [Google Scholar] [CrossRef] [PubMed]
Turan, B.; Coleri, S. Machine Learning Based Channel Modeling for Vehicular Visible Light Communication. IEEE Trans. Veh. Technol. 2021, 70, 9659–9672. [Google Scholar] [CrossRef]
Popoola, S.I.; Atayero, A.A.; Arausi, O.D.; Matthews, V.O. Path loss dataset for modeling radio wave propagation in smart campus environment. Data Brief 2018, 17, 1062–1073. [Google Scholar] [CrossRef] [PubMed]
OSM Buildings. 2024. Available online: https://osmbuildings.org (accessed on 28 August 2024).
Manca, V. Artificial Neural Network Learning, Attention, and Memory. Information 2024, 15, 387. [Google Scholar] [CrossRef]
Dubey, S.R.; Singh, S.K.; Chaudhuri, B.B. Activation Functions in Deep Learning: A Comprehensive Survey and Benchmark. arXiv 2022, arXiv:cs.LG/2109.14545. [Google Scholar] [CrossRef]
Cortes, C.; Mohri, M.; Rostamizadeh, A. L2 Regularization for Learning Kernels. In Proceedings of the 25th Conference on Uncertainty in Artificial Intelligence, UAI 2009, Montreal, QC, Canada, 18–21 June 2009. [Google Scholar]
Semtech Corporation. SX1276/77/78/79—137 MHz to 1020 MHz Low Power Long Range Transceiver, 2020. Rev. 7 – May 2020. Available online: https://semtech.my.salesforce.com/sfc/p/#E0000000JelG/a/2R0000001Rbr/6EfVZUorrpoKFfvaF_Fkpgp5kzjiNyiAbqcpqh9qSjE (accessed on 5 September 2024).
Huggins, J.S. Mast Mountable Antenna. U.S. Patent US20170201002A1, 13 July 2017. [Google Scholar]
Ellingson, S.W. The modified j-pole: A wideband end-fed half-wave dipole for a VHF reflector feed system. In Proceedings of the 2017 IEEE International Symposium on Antennas and Propagation & USNC/URSI National Radio Science Meeting, San Diego, CA, USA, 9–14 July 2017; pp. 773–775. [Google Scholar] [CrossRef]
Chawla, N.V.; Bowyer, K.W.; Hall, L.O.; Kegelmeyer, W.P. SMOTE: Synthetic Minority Over-sampling Technique. J. Artif. Intell. Res. 2002, 16, 321–357. [Google Scholar] [CrossRef]
Noel, M.M.; Banerjee, A.; D, G.B.A.; Muthiah-Nakarajan, V. Alternate Loss Functions for Classification and Robust Regression Can Improve the Accuracy of Artificial Neural Networks. arXiv 2023, arXiv:cs.NE/2303.09935. [Google Scholar]
Wu, D.; Zhu, G.; Ai, B. Application of artificial neural networks for path loss prediction in railway environments. In Proceedings of the 2010 5th International ICST Conference on Communications and Networking in China, Beijing, China, 25–27 August 2010; pp. 1–5. [Google Scholar]
Phillips, C.; Sicker, D.; Grunwald, D. Bounding the error of path loss models. In Proceedings of the 2011 IEEE International Symposium on Dynamic Spectrum Access Networks (DySPAN), Chennai, India, 28 February–3 March 2011; pp. 71–82. [Google Scholar] [CrossRef]

Figure 1. Architecture of the compound model for path loss prediction in two variants of combining the outputs of the two regression models (a,b).

Figure 2. Architectures of the proposed individual models (a,b).

Figure 3. The measurement setup through which the experimental dataset is created.

Figure 4. The used J-pole antenna. The length of each elements is given in the table.

Figure 5. Map of geographical points associated with measurement records for Septemvri town (Bulgaria), rural and suburban areas.

Figure 6. Map of geographical points associated with measurement records for Belogradchik town (Bulgaria), rural and suburban areas.

Figure 7. Map of geographical points associated with measurement records for two selected places in Sofia city (Bulgaria), urban and suburban areas.

Figure 8. Loss function minimization during training of Models A/B, (a,b).

Figure 9. Loss and accuracy (Acc) curves improvement during training and validation of Model P.

Figure 10. ROC curve of Model P (red line) with

A U C = 0.889

. The random chance line is the black dashed one.

Figure 10. ROC curve of Model P (red line) with

A U C = 0.889

. The random chance line is the black dashed one.

Figure 11. Confusion matrix of Model P (label “A” corresponds to LOS scenario, whereas “B” is for NLOS).

Figure 12. Boxplot diagrams of the prediction error of Models A and B.

Figure 13. Boxplot diagrams of the prediction error of the compound model with “soft” and “hard” combinations of the outputs.

Figure 14. Histograms of the prediction error of the compound model with (a) “soft” and (b) “hard” combinations of the outputs. A normalization is made in respect of the total number of elements.

Figure 15. True and predicted outputs of the compound model with the “soft” combination versus distance, d, and effective antenna height,

h_{e f f}

(all samples from the dataset are used).

Figure 15. True and predicted outputs of the compound model with the “soft” combination versus distance, d, and effective antenna height,

h_{e f f}

(all samples from the dataset are used).

Table 1. Measurement conditions and parameters for each area.

Area	${TRX}_{S}$ Coordinates	${TRX}_{S}$ Antenna Height $h_{S}$ $[m]$	Area Type	Relief Type	Average Buildings’ Height $[m]$	Maximum Terrain Unevenness $[m]$	Maximum Measurement Distance $[m]$
Septemvri town Bulgaria	lat. 42.20447 lng. 24.13740 alt. 240 m	8 (mounted on the roof of a brick building)	Rural, suburban	Flat	7	95	4824
Belogradchik town, Bulgaria	lat. 43.62802 lng. 22.68377 alt. 500 m	8 (mounted on the 3rd floor of a brick building)	Rural, suburban	Hilly	16	240	5150
Sofia city, Bulgaria, Campus of the Technical University of Sofia	lat. 43.65518 lng. 23.35418 alt. 596 m	8.5, 12, 15.5, 19, 25.5 (mounted on the roof)	Suburban, urban	Flat, mountain, foothill	12	28	1784
Sofia city, Bulgaria, res. area Darvenitza	lat. 42.65676 lng. 23.34264 alt. 595 m	6.25, 9.75, 30	Urban, suburban	Flat, mountain, foothill	24	19	1152

Table 2. Performance of individual and compound models.

Model	$R^{2}$	$RMSE$	$MAE$	$μ_{e}$	$σ_{e}$
Model A	0.930	$3.8$ dB (4.45%)	$3.0$ dB (3.33%)	$1.0$ dB (0.98%)	$3.7$ dB (4.34%)
Model B	0.739	$5.3$ dB (4.90%)	$4.2$ dB (3.77%)	$- 0.7$ dB (−0.87%)	$5.3$ dB (4.82%)
Compound
with “soft”
combination	0.702	$7.3$ dB (6.87%)	$5.2$ dB (4.81%)	$1.6$ dB (1.29%)	$7.1$ dB (6.75%)
Compound
with “hard”
combination	0.604	$8.4$ dB (7.89%)	$5.8$ dB (5.36%)	$1.8$ dB (1.14%)	$8.2$ dB (7.75%)

Table 3. Performance of the proposed approach compared with other benchmark methods/approaches (the original precision of the quantities is preserved).

Method/Approach	$R^{2}$	$RMSE$	$MAE$
ANN, ensemble learning ¹ [16]	0.8951	$2.9416$ dB	$1.2753$ dB
SVR ² [17]	0.8528	$2.1568$ dB	—
ANN-RBF ³ [17]	0.8965	$2.4967$ dB	—
ANN-MLP ⁴ [11]	0.3975	$7.876$ dB	$5.896$ dB
ANN ⁵ [32]	0.4168	$6.9344$ dB	$5.2745$ dB
Proposed in this study ⁶	0.702	$7.3$ dB	$5.2$ dB

¹ Urban area. Operating frequency 1800 MHz. ² Urban area. Operating frequency—not specified. ³ Urban area. Operating frequency—not specified. ⁴ Suburban area. Operating frequency 450 MHz. ⁵ Urban/suburban/rural area. Operating frequency 950 MHz. ⁶ Urban/suburban/rural area. Operating frequency 433 MHz.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Iliev, I.; Velchev, Y.; Petkov, P.Z.; Bonev, B.; Iliev, G.; Nachev, I. A Machine Learning Approach for Path Loss Prediction Using Combination of Regression and Classification Models. Sensors 2024, 24, 5855. https://doi.org/10.3390/s24175855

AMA Style

Iliev I, Velchev Y, Petkov PZ, Bonev B, Iliev G, Nachev I. A Machine Learning Approach for Path Loss Prediction Using Combination of Regression and Classification Models. Sensors. 2024; 24(17):5855. https://doi.org/10.3390/s24175855

Chicago/Turabian Style

Iliev, Ilia, Yuliyan Velchev, Peter Z. Petkov, Boncho Bonev, Georgi Iliev, and Ivaylo Nachev. 2024. "A Machine Learning Approach for Path Loss Prediction Using Combination of Regression and Classification Models" Sensors 24, no. 17: 5855. https://doi.org/10.3390/s24175855

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A Machine Learning Approach for Path Loss Prediction Using Combination of Regression and Classification Models

Abstract

1. Introduction

2. The Proposed Approach

2.1. Overview

2.2. Input and Output Parameters

2.3. Architectures of the Individual Models

3. Experimental Results

3.1. Experimental Dataset

3.2. Training and Validation

3.3. Testing and Performance Evaluation

4. Discussion

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI