Sensitive Parameter Analysis for Solar Irradiance Short-Term Forecasting: Application to LoRa-Based Monitoring Technology

Bueso, María C.; Paredes-Parra, José Miguel; Mateo-Aroca, Antonio; Molina-García, Angel

doi:10.3390/s22041499

Open AccessArticle

Sensitive Parameter Analysis for Solar Irradiance Short-Term Forecasting: Application to LoRa-Based Monitoring Technology

by

María C. Bueso

¹

,

José Miguel Paredes-Parra

²,

Antonio Mateo-Aroca

³

and

Angel Molina-García

^3,*

¹

Department of Applied Mathematics and Statistics, Universidad Politécnica de Cartagena, 30202 Cartagena, Spain

²

Technologic Center of Energy and Environment, 30202 Cartagena, Spain

³

Department of Automatic, Electrical Engineering and Electronic Technology, Universidad Politécnica de Cartagena, 30202 Cartagena, Spain

^*

Author to whom correspondence should be addressed.

Sensors 2022, 22(4), 1499; https://doi.org/10.3390/s22041499

Submission received: 25 December 2021 / Revised: 22 January 2022 / Accepted: 8 February 2022 / Published: 15 February 2022

(This article belongs to the Section Optical Sensors)

Download

Browse Figures

Versions Notes

Abstract

:

Due to the relevant penetration of solar PV power plants, an accurate power generation forecasting of these installations is crucial to provide both reliability and stability of current grids. At the same time, PV monitoring requirements are more and more demanded by different agents to provide reliable information regarding performances, efficiencies, and possible predictive maintenance tasks. Under this framework, this paper proposes a methodology to evaluate different LoRa-based PV monitoring architectures and node layouts in terms of short-term solar power generation forecasting. A random forest model is proposed as forecasting method, simplifying the forecasting problem especially when the time series exhibits heteroscedasticity, nonstationarity, and multiple seasonal cycles. This approach provides a sensitive analysis of LoRa parameters in terms of node layout, loss of data, spreading factor and short time intervals to evaluate their influence on PV forecasting accuracy. A case example located in the southeast of Spain is included in the paper to evaluate the proposed analysis. This methodology is applicable to other locations, as well as different LoRa configurations, parameters, and networks structures; providing detailed analysis regarding PV monitoring performances and short-term PV generation forecasting discrepancies.

Keywords:

LoRa technology; PV monitoring; sensitive parameter analysis

1. Introduction

The high integration of variable renewable energy sources (vRES) into current power systems, mainly wind and solar photovoltaic (PV) power plants, can be a key component of the resulting low-carbon power systems. However, their intermittency requires more flexibility from the rest of the power system to maintain certain grid stability and reliability levels [1]. Consequently, the increase in demand variability created by such intermittent sources presents new challenges to provide relevant system flexibility [2]. In addition to these challenges, accurate forecasting of renewable generation is also required by both transmission and distribution system operators in order to mitigate the negative impact on the grids of such variable and noncontrollable resources. According to Daliento et al., similar models are usually adopted for power forecasting for PV fields, which is important for both monitoring purposes and the management of the utility grid [3]. With this aim, different solar PV power generation forecasting solutions can be found in the specific literature [4]. The forecasting timeframe, also called horizon, is firstly defined according to the grid operation and under both spatial and temporal resolutions. Different forecast horizons can be then defined, varying from seconds to days or weeks ahead. Regarding spatial horizons, they can also be spanned from single site to regional forecasts [5]. In [6], a new model based on hourly measurements is proposed and evaluated. In [7], the application of neural networks for photovoltaic power generation forecasting purposes is explored by Oudjana et al. Other forecasting neural network-based solutions can be found in the specific literature [8,9,10,11,12]. Naveed Akhter et al. [13] present a critical and systematic review of photovoltaic (PV) power forecasting methods, mostly focused on machine learning and metaheuristic based-solutions. An extreme learning machine technique is used for PV power forecasting of a real time model [14]. A revision of solar irradiance and PV power forecasting, both topics combined as “solar forecasting”, using text mining is discussed in [15]. Barbieri et al. [16] conclude that cell/module temperature and irradiance can be considered as the best approaches for an accurate PV power forecasting; mainly under cloudy conditions with hardly predictable power generation fluctuations. A probabilistic forecast review focused on inherently erroneous of different forecasting strategies is discussed and quantified in [17]. Short-term photovoltaic power generation forecasting is also an important task in renewable energy power system planning and operating. In fact, Kaur et al. [18] affirm that short-term electricity trading to balance demand and generation offers a remarkable economic opportunity to integrate larger shares of vRES into future power grids. A novel multi-timescale data-driven forecast model to improve the accuracy of short-term PV power production is proposed by Yang et al. [19]. Dambreville et al. [20] propose a new approach of global horizontal irradiance (GHI) forecasting for very short term by using a spatio–temporal autoregressive model.

From the specific literature, observed weather data are commonly applied on the solar PV generation forecasting model [21]. Subsequently, the solar PV forecasting model performance is then evaluated by quantifying discrepancies between such forecasts and the weather measurements through the use of traditional statistical error metrics, such as the mean bias error or the root mean square error (RMSE) [22]. For example, the mean absolute percentage error (MAPE), and mean absolute error (MAE) indicators are used in [23] to evaluate the performance of day-ahead photovoltaic power forecasting models based on deep learning neural network. Normalized root mean square error (nRMSE) is used in [24] to evaluate the forecasting errors. In a similar way, Huang et al. describe a comparative study of solar PV power forecasting methods based on nRMSE [25]. An extensive comparison of simple forecasting methodologies with more sophisticated solutions over 32 photovoltaic (PV) plants of different sizes and technology over a whole year is carried out by Gigoni et al. [26]. However, there is a lack of contributions focused on evaluating possible forecasting errors and minor accurate results derived from the inherent communication network failures: packet losses, possible packet collisions, etc., as well as the short time period influence of such GHI forecasting accuracy. Indeed, possible communication failure can significantly affect both gathering data and forecasting results [27]. Subsequently, and by considering such missed contributions in the specific literature about this issue, the present paper analyzes the influence of the LoRa solution—identified by the literature as a promise and suitable technology—on the GHI short-term forecasting accuracy, by considering real communication architecture/layout and specific LoRa performance characteristics. In addition, current datasets available from satellite-based installations, ground-based installations, and solar PV power plants connected to the grid are also considered for evaluation. In this way, a case example located in the southeast of Spain is included in the paper to assess the suitability of the proposed methodology. The main contributions of this paper can be summarized as (i) a methodology to evaluate GHI short-term forecasting accuracy for different node layouts and LoRa parameters based on a random forest prediction model; (ii) a sensitive analysis of LoRa parameters and short-term intervals on the GHI forecasting values based on a variety of metrics; and (iii) a case study from 2019 GHI data (one-minute sample time), 400 km

^{2}

area, 289 potential nodes under consideration, and a total of 13,140 simulations. This methodology thus provides a preliminary extensive analysis of potential LoRa network characteristics and node layout in terms of data accuracy, packets, and GHI forecasting possibilities before the installation was completed.

The rest of the paper is structured as follows. Section 2 discusses PV power plant monitoring and the use of LoRa technology as a solution to be implemented in such installations. Section 3 describes the proposed methodology. A case example is presented in Section 4. Results and discussion are provided in Section 5. Finally, conclusions are given in Section 6.

2. Pv Monitoring: LoRa Communication Technology

2.1. General Overview

A variety of PV monitoring strategies based on the output PV power plants and their nature have been proposed in the literature; being performed remotely or locally on site. New advanced monitoring techniques are continuously under investigation; mainly due to the evolution and relevant integration of PV installations into power systems. A recent PV monitoring review is analyzed by Triki-Lahiani et al. [28]; discussing their differences, advantages, and limits. An impedance-based monitoring method for detection of distribution system current behavior is presented in [29]. This monitoring technique can be used for small variation of PV penetration level, and for some fast transient detection, such as the effect of cloud movement on a PV system.

2.2. Lora-Based Communication System

Although different wireless technologies—such as Bluetooth, Zigbee, Wi-Fi, GSM, Sigfox, or LoRa—have been evaluated for PV solar monitoring through wireless sensors networks, LoRa technology has been chosen as the wireless technology due to its long range and low power consumption [30]. Moreover, this technology has received significant attention in recent years from network operators and solution providers [31,32,33,34]. An impartial and fair overview regarding the capabilities and the limitations of LoRaWAN is discussed by Adelantado et al. [35] to clarify its comprehension and avoid inflated expectations. LoRa uses six spreading factors (SF07 to SF12) to adapt the data rate and range trade-off. It can be affirmed that a higher spreading factor (SF) allows longer range at the expense of lower data rate [36]. The LoRa data rate depends on the channel bandwidth and the SF, ranging from 0.3 kbps to 27 kbps [37]. According to Mikhaylov et al. [38], messages transmitted with different SFs can be received simultaneously by LoRa base stations, 243 bytes being the maximum payload length for each message.

A collision behavior model,

C (x, y)

, for LoRa network between node x and node y is proposed by Bor et al. [39]. A collision rate analysis with a single LoRa gateway was reported by Alenezi et al. [40]—depending on the number of nodes and the SF for one (20 byte) packet each hour for 24 h (125 kHz bandwidth). In [41], Zhang et al. propose an alternative low-power wide area network information monitoring approach based on LoRa and NB-IoT. In this case, the communication distance in a complex environment was up to 1.6 km, with a system communication packet loss rate of around 3%. Silva et al. [42] test LoRa (long range) technology and LoRaWAN protocol in a precision viticulture scenario using low-power data acquisition devices, distanced 400 m away from the nearest gateway. With regard to the communication area parameter, Liu et al. [43] present a low-power, real-time air quality monitoring system also based on the LoRa technology being able to reach to approximately 2 km. A range of 9.27 km was achieved with SF12 and a bandwidth of 125 kHz in [44], for module-level monitoring of solar PV plants. The LoRa communication usefulness for monitoring the climate information of PV power plants was tested by Jeong et al. [45]. From the test results, it can be affirmed that the communication range for the PV climate information transmission reaches 1.3 km. In general, it can be assumed that LoRa radio chipsets use a maximum of 100 mW when transmitting, with a range of 10 to 30 km in suburban areas. In [46], it is affirmed that LoRa performs much better in comparison to FSK in most scenarios. It also highlights the successful transmission ranges within suburban scenarios up to 10 km.

3. Methodology

3.1. Lora Parameter Modeling

In the specific literature, a review of LoRa simulation environments can be found in [47], where aspects such as the selection of LoRa parameters or the device energy consumption were revised and compared. For example, LoRaSim was described as a well-known custom-built discrete-event simulator implemented with SimPy [39]. It allows us to place a group of N LoRa-nodes and M LoRa-stations in a bidimensional space. A directional antennae is also considered for simulation purposes and identification of LoRa parameters. In this way, a comparison between the use of directional antennae facing multiple base stations as methods of dealing with LoRa internetwork interference is carried out in [48]. However, these solutions do not analyze the influence and sensitive of LoRa nodes on the PV forecasting accuracy, under different communication errors/discrepancies and a diversity of LoRa node layouts.

The transmission parameters define the LoRa node communication characteristics. As discussed in Section 2.1, LoRa provides three bandwidth (BW) settings—125, 250, or 500 kHz, and six different spreading factor (SF) values. According to Croce et al. [49], a larger bandwidth translates to a data rate increase and a receiver sensitivity deterioration. Conversely, higher SFs can be used to improve the link robustness at the cost of lower data rates. LoRa modulation is derived from chirp spread spectrum (CSS). LoRa CSS modulations with BW of 125 kHz are assumed in this work, 1% duty cycle, and a default radiated transmit power of 14 dBm. Croce et al. [50] identified six different SFs: from SF07 to SF12. In Europe, both 868 MHz and 433 MHz bands are allowed to be used. Transmitted power is limited to 14 dBm effective isotropic radiated power (EIRP), with a 1% duty cycle limit of on-air time, and the transmitted power limited to 14 dBm effective isotropic radiated power (EIRP). Table 1 summarizes the LoRa/LoRaWAN main characteristics [51]. According to the specific literature, there is pseudo orthogonality among SFs, having the advantage that multiple signal reception is possible [52]. Table 2 shows the estimated range for different SFs (from SF07 to SF10) that can be used for uplink messages on a 125 kHz channel depending on the terrain: longer distances can be achieved in a rural environment than in an urban environment [53].

For a specific SF, the narrower the bandwidth, the higher the receiver sensitivity [54]. Consequently, the data rate selection is then considered as a trade-off between message duration and communication range. Different tools allow us to estimate time on air and optimum bandwidth. For the present proposal, the time interval on air for a 51-byte payload for each specific SF is considered. The payload size is defined as the maximum payload length. For each transmission, the payload can range from 2 to 255 octets, reaching the data rate up to 50 kbps when channel aggregation is used [55] under the assumption that any packet arrivals follow a Poisson law—thus, considering a uniform distribution of the payloads, their lengths are between 1 and 51 bytes [56]. According to Centenaro et al. [57], it can be assumed that the data transmission in a LoRaWAN presents a typical 1% duty-cycle constraint; from the nodes to their corresponding gateways in a single hop allocated on different sub-bands. Indeed, the European regulations currently ask for adherence to 1% duty cycle per sub-band or applying any mechanism based on “listen-before-talk and adaptive frequency agility” [58]. Table 3 summarizes the corresponding time intervals between subsequent starting packets

(s)

for a 1% duty-cycle.

As previously described, LoRaWAN is built as a star-of-stars topology, where the devices located in the defined grid are able to send packets to a gateway which is then responsible for forwarding those packages to a network server [59]. It is assumed that each end-device selects a specific SF based on the data rate and the distance to the gateway. A radial equidistant distribution with homogeneous end-device density is thus considered, being the energy consumption of j-radial annulus proportional to the airtime. In order to give a more realistic simulation, the study of LPWAN modeling proposed by Georgiou and Raza [60] is used to analyze the capability of this technology to scale. This study also includes an outage probability model which occurs at the gateway, called outage condition [61]. Table 4 gives the outage of a desired signal in the uplink that can occur at the gateway, if the received signal to noise ratio (SNR) is below the SF specific threshold.

It can be then determined the packet delivery ratio, defined as the ratio between the client of packages originated by the “application layer” and the number of packages received by the sink at the final destination [62]. From this parameter, we obtain the probability function that any packet should be lost. We consider a Rayleigh channel, in line with Duda and Heusse [63]. The received signal power is affected by a multiplicative random variable with an exponential distribution of unit mean (and standard deviation). Consequently, the signal power depends on the Rayleigh fading gain and the distance, keeping the noise power constant for a 125 kHz wide band (

N = - 123

dBm). A maximum transmission power of

P = 14

dBm is considered for simulations; the successful transmission probability being with data rate

D R_{j}

and at distance

l_{j}

:

H (l_{j}) = e x p (\frac{N \cdot q_{j}}{P \cdot g (l_{j})}),

(1)

where

g (l_{j})

is the average channel gain at distance

l_{j}

, P is the transmission power (in dBm),

q_{j}

is the signal to noise ratio (SNR) threshold for

D R_{j}

, and N is the wide band (in dBm). The path loss attenuation is estimated by using the Radio Mobile software package [64]. It uses the terrain information and the mathematical model to calculate the coverage area from the fixed radiation point taken as mobile reference point [65]. The irregular terrain model (ITM) is used as a propagation model. It estimates radio propagation losses over irregular terrain in the range 0.020 to 20 GHz frequencies as a function of space and distance and the variability of signal in time [66]. The Okumura–Hata model [67] has also been recently proposed and assessed for path loss attenuation, mostly focused on comparing the performance LoRaWAN analysis in urban scenarios.

In the simulations, it is assumed that all transmitters (i) send packets with the same payload length; (ii) do not switch the SF from one packet to another during the same simulation test—despite that the adaptive data rate is one of the main strengths of LoRa [68]; (iii) do not change the transmit power from one packet to another during the same simulation test; and (iv) all of the transmitters have the same number of packets to send. An example of maximum communication ranges on ground (15 km) and on water (30 km) can be found in [69], including packet loss ratio, depending on the distance and assuming maximum signal SF—868 MHz ISM band using 14 dBm transmit power.

3.2. Spatio–Temporal PV Forecasting

As discussed in Section 1, different probabilistic models for spatio–temporal PV forecasting can be found in the specific literature. However, only a few spatio–temporal models for short-term probabilistic forecasting can be identified [70], which are based on regression trees [71], the vectorial autoregressive model and gradient boosting combination [72], the kNN method [73], multivariate predictive distributions [74], and Gaussian random fields [75]. A least absolute shrinkage and selection operator (LASSO) regression method was also presented by Yang et al. [76] for sub–5–min solar irradiance forecasting. A flexible spatio–temporal model to estimate PV production forecasts was recently proposed by Agoua et al. [77] for horizons up to 6 h ahead, evaluating the effect of different spatial and temporal data sources on the accuracy of the forecasts. According to Muhammad et al. [78], the ARX model is the simplest black box linear model, based on a structure that is known as the most common input–output model. In this work, the author selected the random forest (RF) approach, mainly due to its simplicity to deploy, low computational cost, and ease in interpreting the input interactions. The RF algorithm is then used to find an adequate predictor function f. The RF algorithm is an ensemble learner, proposing a set of decision trees that vote on a final result. Dudek [79] affirms that this model operates on patterns of the time series seasonal cycles, considerably simplifying the forecasting problem—mainly when a time series exhibits heteroscedasticity, trend, nonstationarity, and multiple seasonal cycles. To train and test the RF algorithm, different partitions of the data are used accordingly. Firstly, the parameter-tuning process of the RF learning is carried out through the training set. Subsequently, the test set is used to estimated the final metrics. An R package ranger to be used as a fast RF implementation for high-dimensional data is used in this work [80].

By considering the aim of this paper, a smart persistence and RF model is proposed for spatio–temporal PV forecasting, from the general expression given for time series [81],

{\hat{Y}}_{k} (t + h) = f (Y_{k} (t), Y_{k} (t - 1), Y_{k} (t - 2), \dots, Y_{k} (t - d)),

(2)

where

{\hat{Y}}_{k} (t + h)

is the

k^{t h}

sensor forecasting and time step

t + h

, h is the horizon (in minutes) for which the prediction is being made,

Y_{k} (t - l)

is the data past collected at the

l^{t h}

lag,

l = 0, \dots, d

. This general expression can be generalized and further extended to include other relevant information about radiation, weather, statistical measures, etc.

\begin{matrix} {\hat{Y}}_{k} (t + h) = f (Y_{1} (t), Y_{1} (t - 1), Y_{1} (t - 2), \dots, Y_{1} (t - d), \\ Y_{2} (t), Y_{2} (t - 1), Y_{2} (t - 2), \dots, Y_{2} (t - d), \\ \dots, \\ Y_{m} (t), Y_{m} (t - 1), Y_{m} (t - 2), \dots, Y_{m} (t - d)), \end{matrix}

(3)

being

Y_{j}

, for

j = 1, \dots, m

, the predictors to be used and

Y_{j} (t - l)

a single predictor (see Figure 1), and h the prediction horizon. This extended expression allows us to use additional information aside from the PV production data. Different measurement scenarios are then proposed and evaluated for GHI forecasting comparison purposes, as described in Section 3.3. GHI estimated data are thus forecast on short time horizons from 15 to 45 min. The time steps are set as 1 min, and d = 15 min. It means that, to forecast

{\hat{Y}}_{k} (t + h)

, as can be seen in Figure 1, the time interval data from t to

t - 15

is considered as input for prediction. The d parameter can be modified depending on the time step and the number of nodes (from 1 to m) included in each case study. The parameters are estimated according to the prediction training strategy depicted in Figure 2. The clear sky index

K_{t} (t)

is used in the forecasting model. Assuming to be stationary enough, it is defined as follows:

K_{t} (t) = \frac{G H I (t)}{G H I_{s c} (t)},

(4)

being, thus, the ratio of the measured GHI to GHI under clear sky conditions (GHI

_{s c}

). The training and forecasting processes are summarized in Figure 3, in line with recent studies also focused on machine learning forecast model analysis applied to solar power forecasting [82].

3.3. Proposed Global Methodology

This work aims to evaluate the influence of LoRa performance characteristics and its architecture/layout on the short-term PV forecasting, by considering the RF model described in Section 3.2. Firstly, a node selection and distribution based on LoRaWAN technology is carried out according to a predefined forecasting point of interest and a possible group of potential on-ground sites or satellite based-installations. From these specifications, a one-minute sample time database is defined on each node, as well as the forecasting point of interest. The RF algorithm is used to find a suitable predictor function f and, then, to forecast short-term solar values on such location by considering different forecasting time intervals—from 15 to 45 min. These predictions are estimated under a variety of scenarios: (i) assuming SF = SF09 for all nodes and 0% loss of data; (ii) assuming SF from SF09 to SF12 on each node and 0% loss of data; and (iii) assuming SF from SF09 to SF12 on each node and loss of data from 0% to 50%. Subsequently, the solar forecasting values corresponding to the different scenarios are then compared. Discrepancies and similarities are calculated, discussing the influence of the different realistic LoRa parameters on the solar forecasting process. As previously analyzed by the authors in [83], different metrics can be found in the specific literature to determine discrepancies. From this classification, normalized root mean square error (nRMSE), mean absolute percentage error (MAPE), and dynamic time warping (DTW) are determined to provide complementary information and characterize convenient discrepancies among the PV short-term forecasting data by considering SF09 and 0% loss of data and the rest of scenarios. Figure 4 summarizes the proposed methodology. In addition, a sensitive analysis based on SF parameter and loss of data variability is also included, determining the differences among discrepancies with respect to the forecasting PV values with SF09 and 0% loss of data. The methodology and simulations were implemented in the R–environment [84]. Different contribution software packages were used for methodology implementation purposes. In this case, data.table for fast and memory efficient data manipulation [85], ranger for a fast RF implementation [80], and dtw and dtwclust for the DTW metric estimation [86].

4. Case Example: Datasets Used

Nonnenmacher et al. [87] affirm that satellite images based on prediction methods are mostly used for intraday forecasts lower than four hours. Based on this assumption, a total area of 400 km

^{2}

located in the Region of Murcia (11,300 km

^{2}

, southeast of Spain) is selected as a case example. This region is a promising area to integrate solar PV power plants, with 5.2 kWh/m

^{2} \cdot

day as averaged annual global irradiation [88]. Figure 5 shows a general overview of this area, covering a

17 \times 17

grid portion with a total of 289 points under consideration and a forecasting point identified in the center of this grid that is selected for GHI estimated data—corresponding to

{\hat{Y}}_{k} (t + h)

according to expression (2). The considered spacing between all pairs of grid points is assumed as 2.5 km. Day-ahead GHI estimated data are downloaded from the Copernicus European Project servers [89]—from January to December 2019. In addition, ground data are also available based on the Network of the Agricultural Information System of the Region of Murcia (SIAM), giving additional ground-based GHI data. The SIAM network consists of 49 ground-based automatic stations geographically distributed; 32 stations are from the Regional Murcia Institute of Agricultural and Food Research and Development (IMIDA), 15 stations are from the Spanish Ministry of Agriculture, Food and Environment, one station is from the Universidad Politécnica de Cartagena (Murcia, Spain), and one more is from the City Council of Mazarrón (Murcia, Spain). These ground-based stations are financially supported by several European fund projects [90], see Figure 6. A ground-based station located in the center of the grid corresponds to the forecasting point considered for this GHI forecasting analysis.

According to the proposed methodology described in Section 3.3, the LoRa node distribution is then selected by considering the initial grid depicted in Figure 5. With this aim, Figure 7 gives a general overview of the distribution of potential nodes, including UTM coordinates. Duda and Heusse [63] affirm that a more realistic assumption is to consider that the node density decreases with the inverse square of the gateway distance: a specific intensity of physical quantity is inversely proportional to the distance square from the source. Nevertheless, one node by each circular crown is considered and, thus, a homogeneous and minimum density distribution is considered to be evaluated and compared for short-term PV forecasting purposes. With regard to the LoRa parameters, and as can be found in the specific literature, the lowest transfer rate ensures the highest level of collisions [91]. Indeed, this configuration is the most used one, as it ensures the largest communication range by using a high SF (e.g., SF = 12). Therefore, a trade-off is then determined between increasing the communication range and reducing the transfer rate. As described in Section 3.1, different SF values are also considered in the different conditions. Therefore, different scenarios are considered for each day from the initial grid depicted in Figure 7, with forecast horizons ranging from 15 to 45 min with one-minute time resolution, and under a variety of SF LoRa parameters and loss of data values (see Figure 4). A general comparison of GHI prediction results for the case study is following discussed in Section 5.

5. Results

From the data corresponding to 2019, with one-minute time resolution, and as described in Section 4, different short-term PV solar forecasting periods were considered for simulations. More specifically, three different time horizons were defined: 15, 30, and 45 min. Each day was then simulated by considering such different time horizons for forecasting purposes. In addition, and with the aim to compare the impact of both loss of data and the selected SF, simulations were carried out under such conditions: 0%, 25%, and 50% loss of data; as well as from SF12 to SF09. Indeed, and according to the discussion given in Section 3.1—see Table 2 and Table 3, the selected SF considerably affects the robustness at the cost of lower data rates. These results allow us to evaluate the impact of each parameter and give a preliminary analysis of the influence of these conditions and situations before implementing a real communication and sensoring network. Therefore, each day is simulated through a 3 × 4 × 3 matrix of possible loss of data, foresting time intervals and selected SF, as schematically depicted in Figure 8. Subsequently, 36 different conditions are considered for each day, including three different losses of data percentages, four different SF parameters, and three different forecasting time intervals. In line with the case example shown in Figure 5 and the initial node layout based on UTM coordinates and depicted in Figure 7, an arbitrary node site location is selected and given in Figure 9, including the distribution of selected points for the analysis and the distance to the forecasting point. Subsequently, and as previously discussed, one node is selected on each circular crown with the aim of forecasting GHI data in the center of the grid, corresponding to the

\hat{Y} (t + h)

—see expression (3). The selected nodes are labeled as 119, 170, 115, 60, and 254, respectively.

By considering the 2019 data for the methodology evaluation, the corresponding daily GHI curves are then forecast according to the preliminary selection of possible nodes and the different conditions. In summary, a global of 13,140 simulations were carried out by the authors. These estimated data allow us to analyze in detail the influence of each variable on the short-term forecasting accuracy and the different possibilities to implement a real sensoring network in terms of data gathering accuracy and reliability for forecasting purposes. As an example of the forecast curves for each day, Figure 10 shows the irradiance data corresponding to two arbitrary days—labeled as day 108 and 144, respectively, including clear sky GHI data—see dashed line. As can be seen, data corresponding to the selected nodes given in Figure 9 are plotted, as well as the forecasting point marked in black color being the GHI observed data—corresponding to

Y_{k} (t + h)

. From these initial data, Figure 11 compares the measured and forecast irradiance values for the two previous days—labeled as day 108 and 144—and considers the selected different conditions for each day: loss of data percentages, different SF parameters, and different forecasting time intervals. Subsequently, 36 different forecasting GHI results are determined for each day. In addition, as a complementary result, Figure 12 compares these curves, including the expected clear sky GHI values. These forecasting data are thus determined for 15, 30, and 45 min time horizons, varying the SF parameter from 09 to 12 and considering 0%, 25%, and 50% loss of data scenarios.

With the aim of estimating the influence of each parameters by considering all simulations along the 2019 data, a sensitive analysis was carried out determining discrepancies between the estimated daily GHI values for the selected node and the corresponding daily measured GHI values. Firstly, Figure 13 shows the global histograms and the truncated histograms of such discrepancies based on normalized root mean square error (nRMSE) among the measured GHI data and the forecasting GHI values for the selected node and by considering the different time horizon, SF, and loss of data scenarios. Therefore, 36 global and truncated histograms were determined from the 13,140 simulations. As can be seen, the truncated histograms retain more than 90% of such discrepancies and are considered suitable enough for this sensitive analysis. Secondly, and according to the variety of errors and differences available in the specific literature, as well as the comparison conducted by the authors in [83], the mean absolute percentage error (MAPE) and dynamic time warping (DTW) were also selected as metric estimations—see Section 3.3. Indeed, and as can be found in [92], DTW is considered as an appropriate technique to estimate and find an optimal alignment between two time-dependent sequences under a set of restrictions. DTW was initially used to compare different speech patterns, and also successfully applied in other fields, such as information retrieval and data mining. Additionally, DTW can be applied to detect and cope with different speeds and time deformations associated with time-dependent data. Recently, the R package IncDTW based on the DTW improved the possibilities to classify time series or clusters [93]. To analyze in detail the influence of each variable, Figure 14 shows the truncated histogram for these simulations using discrepancies based on MAPE and DTW metrics. As can be seen from these discrepancies, a higher loss of data range involves more relevant discrepancies, and these values present a secondary peak from 20% to 30% values. In terms of SF parameter, the vast values of discrepancies slightly shift from the [0, 10] interval (for SF09) to [10, 20] interval (for SF12). Therefore, a longer time interval between subsequent packets—see Table 3—implies higher discrepancy errors. These results are similar for the other analyzed forecasting short time intervals—30 and 45 min, respectively, as summarized in Figure 14.

As an alternative sensitive analysis, Figure 15 shows a boxplot diagram of the discrepancies based on nRMSE, MAPE, and DTW metrics classified according to different clear sky GHI ratio

{\bar{K}}_{t}

: [0, 0.65], (0.65, 0.85], and (0.85, 1], respectively. All simulations corresponding to 2019 data are considered to determine these global metrics; thus, 36 different conditions are assumed for each day depending on the defined 3 × 4 × 3 matrix of possible loss of data, SF parameter, and short time period values selected for this case study. From the results, larger time horizons address less-accurate GHI estimations, and higher clear sky GHI ratio values imply more accurate GHI estimations. Moreover, high

{\bar{K}}_{t}

GHI ratio days should be not considered for LoRa network performance evaluation, as they are not sensitive to loss of data nor SF parameter values. Subsequently, low

{\bar{K}}_{t}

GHI ratio days should be considered for estimating discrepancies and GHI forecasting accuracy, as well as potential errors allowed for short-term forecasting purposes depending on the corresponding loss of data, short time periods, and SF parameters.

Finally, an additional metric analysis is also provided to compare the difference evolution of SF09, 0% loss of data (corresponding to “Scenario 1” in Figure 4), and for each short time interval to the rest of the other daily possible conditions and under nRMSE, MAPE, and DTW metric estimations. With this aim, Figure 16 summarizes the boxplot results of differences among “Scenario 1” (for each short time interval) and the other daily conditions (SF values, loss of data, and short-term time intervals). As can be seen, differences are higher with larger loss of data for any metric—see boxplots for the same row and short time interval. In addition, discrepancies are increasing from SF09 to SF12, when loss of data and short time interval variables are kept constant. The results under different daily conditions are divided by considering clear sky GHI ratio intervals

{\bar{K}}_{t}

—[0, 0.65], (0.65, 0.85], and (0.85, 1]. The corresponding boxplot differences for SF09, 0% loss of data, and for each short time interval in comparison to the rest of daily conditions and for all metrics are determined and depicted in Figure 17. In general, differences are higher for lower

{\bar{K}}_{t}

values and, as previously affirmed, low

{\bar{K}}_{t}

GHI ratio days should be considered for evaluating GHI forecasting accuracy approaches and suitable node layouts. Consequently, and depending on the discrepancies range allowed in each case by the specific application, this methodology gives a preliminary analysis for the LoRa-based PV monitoring architectures and potential node layouts. Additionally, it gives an estimation of forecasting GHI estimations and the influence of SF and loss of data variables on the GHI value accuracy. Therefore, both reliability and robustness of the collected data to be used for forecasting purposes are able to be analyzed and evaluated.

6. Conclusions

A methodology to evaluate different LoRa-based PV monitoring configurations in terms of short-term GHI forecasting characterization of metrics is described and assessed. A location analysis of nodes based on potential loss of data, short-term time intervals, and SF ranges are considered to analyze their influence on such short-term GHI forecasting purposes. The methodology also allows us to evaluate different node layouts and forecasting approaches. A random forest model is proposed in this work as suitable forecasting method under nonstationary and multiple seasonal cycle data. A case study located in the southeast of Spain is included to evaluate the methodology. Satellite-based GHI data collected for 2019 covering a 17 × 17 grid portion with a total of 289 points under consideration are considered, with one-minute sample time data. The short-term GHI forecasting simulations for each day included different loss of data ranges, forecasting time intervals (15, 30, and 45 min), and SF values (from SF09 to SF12). A total of 36 different conditions for each were considered, running 13,140 simulations for the corresponding global 2019 GHI data. The results allow us to explore the influence of loss of data, SF values, and short-term time intervals on the corresponding GHI forecasting accuracy. In addition, different LoRa node layouts can be also evaluated in terms of data accuracy and GHI forecasting estimations. In general, higher clear sky GHI ratio values imply more accurate GHI estimations, being less sensitive to loss of data or SF parameter values. Subsequently, low

{\bar{K}}_{t}

GHI ratio days should be considered for estimating potential errors allowed for short-term forecasting purposes depending on the corresponding loss of data, short time periods, and SF parameters. A sensitive analysis is included in the work from complementary metrics: nRMSE, MAPE, and DTW. These results give additional information to characterize discrepancies among collected and forecast GHI data as a consequence of LoRa parameters and/or node layout. This methodology thus provides a preliminary analysis of potential LoRa network characteristics and sensoring in terms of data accuracy, packets, and GHI forecasting possibilities.

Author Contributions

Conceptualization, A.M.-G. and J.M.P.-P.; methodology, M.C.B.; software, M.C.B.; validation, A.M.-A. and J.M.P.-P.; formal analysis, M.C.B.; resources, A.M.-A.; data curation, M.C.B.; writing—original draft preparation, A.M.-G.; writing—review and editing, A.M.-G. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Fondo Europeo de Desarrollo Regional/Ministerio de Ciencia e Innovación–Agencia Estatal de Investigación (FEDER/MICINN-AEI), project RTI2018–099139–B–C21.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The datasets generated and/or analyzed during this study are available from the corresponding author on reasonable request.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:

BW	Bandwidth
CSS	Chirp spread spectrum
DTW	Dynamic time warping
GHI	Global horizontal irradiance
ITM	Irregular terrain model
LASSO	Least absolute shrinkage and selection operator
LoRa	Long range
MAE	Mean absolute error
MAPE	Mean absolute percentage error
nRMSE	Normalized root mean square error
PV	Photovoltaic
RF	Random forest
RMSE	Root mean square error
SF	Spreading factor
SNR	Signal to noise ratio
vRES	Variable renewable energy sources

References

Brouwer, A.S.; van den Broek, M.; Zappa, W.; Turkenburg, W.C.; Faaij, A. Least–cost options for integrating intermittent renewables in low-carbon power systems. Appl. Energy 2016, 161, 48–74. [Google Scholar] [CrossRef] [Green Version]
Eltawil, M.A.; Zhao, Z. Grid-connected photovoltaic power systems: Technical and potential problems—A review. Renew. Sustain. Energy Rev. 2010, 14, 112–129. [Google Scholar] [CrossRef]
Daliento, S.; Chouder, A.; Guerriero, P.; Pavan, A.M.; Mellit, A.; Moeini, R.; Tricoli, P. Monitoring, Diagnosis, and Power Forecasting for Photovoltaic Fields: A Review. Int. J. Photoenergy 2017, 2017, 1356851. [Google Scholar] [CrossRef]
Das, U.K.; Tey, K.S.; Seyedmahmoudian, M.; Mekhilef, S.; Idris, M.Y.I.; Deventer, W.V.; Horan, B.; Stojcevski, A. Forecasting of photovoltaic power generation and model optimization: A review. Renew. Sustain. Energy Rev. 2018, 81, 912–928. [Google Scholar] [CrossRef]
Antonanzas, J.; Osorio, N.; Escobar, R.; Urraca, R.; de Pison, F.M.; Antonanzas-Torres, F. Review of photovoltaic power forecasting. Sol. Energy 2016, 136, 78–111. [Google Scholar] [CrossRef]
Gandoman, F.H.; Raeisi, F.; Ahmadi, A. A literature review on estimating of PV–array hourly power under cloudy weather conditions. Renew. Sustain. Energy Rev. 2016, 63, 579–592. [Google Scholar] [CrossRef]
Hamid Oudjana, S.; Hellal, A.; Hadj Mahamed, I. Short term photovoltaic power generation forecasting using neural network. In Proceedings of the 2012 11th International Conference on Environment and Electrical Engineering, Venice, Italy, 18–25 May 2012; pp. 706–711. [Google Scholar]
Chen, C.; Duan, S.; Cai, T.; Liu, B. Online 24-h solar power forecasting based on weather type classification using artificial neural network. Sol. Energy 2011, 85, 2856–2870. [Google Scholar] [CrossRef]
Rana, M.; Koprinska, I.; Agelidis, V.G. Forecasting solar power generated by grid connected PV systems using ensembles of neural networks. In Proceedings of the 2015 International Joint Conference on Neural Networks (IJCNN), Killarney, Ireland, 12–17 July 2015; pp. 1–8. [Google Scholar]
Chu, Y.; Urquhart, B.; Gohari, S.M.; Pedro, H.T.; Kleissl, J.; Coimbra, C.F. Short-term reforecasting of power output from a 48 MWe solar PV plant. Sol. Energy 2015, 112, 68–77. [Google Scholar] [CrossRef]
Liu, L.; Liu, D.; Sun, Q.; Li, H.; Wennersten, R. Forecasting Power Output of Photovoltaic System Using a BP Network Method. Energy Procedia 2017, 142, 780–786. [Google Scholar] [CrossRef]
De Paiva, G.M.; Pimentel, S.P.; Marra, E.G.; de Alvarenga, B.P.; Mussetta, M.; Leva, S. Intra-day forecasting of building-integrated PV systems for power systems operation using ANN ensemble. In Proceedings of the 2019 IEEE Milan PowerTech, Milan, Italy, 23–27 June 2019; pp. 1–5. [Google Scholar]
Akhter, M.N.; Mekhilef, S.; Mokhlis, H.; Shah, N.M. Review on forecasting of photovoltaic power generation based on machine learning and metaheuristic techniques. IET Renew. Power Gener. 2019, 13, 1009–1023. [Google Scholar] [CrossRef] [Green Version]
Behera, M.K.; Majumder, I.; Nayak, N. Solar photovoltaic power forecasting using optimized modified extreme learning machine technique. Eng. Sci. Technol. Int. J. 2018, 21, 428–438. [Google Scholar] [CrossRef]
Yang, D.; Kleissl, J.; Gueymard, C.A.; Pedro, H.T.; Coimbra, C.F. History and trends in solar irradiance and PV power forecasting: A preliminary assessment and review using text mining. Sol. Energy 2018, 168, 60–101. [Google Scholar] [CrossRef]
Barbieri, F.; Rajakaruna, S.; Ghosh, A. Very short-term photovoltaic power forecasting with cloud modeling: A review. Renew. Sustain. Energy Rev. 2017, 75, 242–263. [Google Scholar] [CrossRef] [Green Version]
Van der Meer, D.; Widén, J.; Munkhammar, J. Review on probabilistic forecasting of photovoltaic power production and electricity consumption. Renew. Sustain. Energy Rev. 2018, 81, 1484–1512. [Google Scholar] [CrossRef]
Kaur, A.; Nonnenmacher, L.; Pedro, H.T.; Coimbra, C.F. Benefits of solar forecasting for energy imbalance markets. Renew. Energy 2016, 86, 819–830. [Google Scholar] [CrossRef]
Yang, C.; Thatte, A.A.; Xie, L. Multitime–Scale Data–Driven Spatio–Temporal Forecast of Photovoltaic Generation. IEEE Trans. Sustain. Energy 2015, 6, 104–112. [Google Scholar] [CrossRef]
Dambreville, R.; Blanc, P.; Chanussot, J.; Boldo, D. Very short term forecasting of the Global Horizontal Irradiance using a spatio–temporal autoregressive model. Renew. Energy 2014, 72, 291–300. [Google Scholar] [CrossRef]
Sangrody, H.; Sarailoo, M.; Zhou, N.; Tran, N.; Motalleb, M.; Foruzan, E. Weather forecasting error in solar energy forecasting. IET Renew. Power Gener. 2017, 11, 1274–1280. [Google Scholar] [CrossRef] [Green Version]
Gueymard, C.A. A review of validation methodologies and statistical performance indicators for modeled solar radiation data: Towards a better bankability of solar projects. Renew. Sustain. Energy Rev. 2014, 39, 1024–1034. [Google Scholar] [CrossRef]
Wang, K.; Qi, X.; Liu, H. A comparison of day-ahead photovoltaic power forecasting models based on deep learning neural network. Appl. Energy 2019, 251, 113315. [Google Scholar] [CrossRef]
Kardakos, E.G.; Alexiadis, M.C.; Vagropoulos, S.I.; Simoglou, C.K.; Biskas, P.N.; Bakirtzis, A.G. Application of time series and artificial neural network models in short-term forecasting of PV power generation. In Proceedings of the 2013 48th International Universities’ Power Engineering Conference (UPEC), Dublin, Ireland, 2–5 September 2013; pp. 1–6. [Google Scholar]
Huang, Y.; Lu, J.; Liu, C.; Xu, X.; Wang, W.; Zhou, X. Comparative study of power forecasting methods for PV stations. In Proceedings of the 2010 International Conference on Power System Technology, Zhejiang, China, 24–28 October 2010; pp. 1–6. [Google Scholar]
Gigoni, L.; Betti, A.; Crisostomi, E.; Franco, A.; Tucci, M.; Bizzarri, F.; Mucci, D. Day-Ahead Hourly Forecasting of Power Generation From Photovoltaic Plants. IEEE Trans. Sustain. Energy 2018, 9, 831–842. [Google Scholar] [CrossRef] [Green Version]
Chaojun, G.; Yang, D.; Jirutitijaroen, P.; Walsh, W.M.; Reindl, T. Spatial Load Forecasting With Communication Failure Using Time-Forward Kriging. IEEE Trans. Power Syst. 2014, 29, 2875–2882. [Google Scholar] [CrossRef]
Triki-Lahiani, A.; Abdelghani, A.B.B.; Slama-Belkhodja, I. Fault detection and monitoring systems for photovoltaic installations: A review. Renew. Sustain. Energy Rev. 2018, 82, 2680–2692. [Google Scholar] [CrossRef]
Mortazavi, H.; Mehrjerdi, H.; Saad, M.; Lefebvre, S.; Asber, D.; Lenoir, L. A Monitoring Technique for Reversed Power Flow Detection With High PV Penetration Level. IEEE Trans. Smart Grid 2015, 6, 2221–2232. [Google Scholar] [CrossRef]
Shuda, J.E.; Rix, A.J.; Booysen, M.J. Towards Module-Level Performance and Health Monitoring of Solar PV Plants Using LoRa Wireless Sensor Networks. In Proceedings of the 2018 IEEE PES/IAS PowerAfrica, Cape Town, South Africa, 28–29 June 2018; pp. 172–177. [Google Scholar]
Haxhibeqiri, J.; De Poorter, E.; Moerman, I.; Hoebeke, J. A Survey of LoRaWAN for IoT: From Technology to Application. Sensors 2018, 18, 3995. [Google Scholar] [CrossRef] [Green Version]
Sherazi, H.H.R.; Piro, G.; Grieco, L.A.; Boggia, G. When Renewable Energy Meets LoRa: A Feasibility Analysis on Cable-Less Deployments. IEEE Internet Things J. 2018, 5, 5097–5108. [Google Scholar] [CrossRef] [Green Version]
Behjati, M.; Mohd Noh, A.B.; Alobaidy, H.A.H.; Zulkifley, M.A.; Nordin, R.; Abdullah, N.F. LoRa Communications as an Enabler for Internet of Drones towards Large-Scale Livestock Monitoring in Rural Farms. Sensors 2021, 21, 5044. [Google Scholar] [CrossRef]
Zhou, Q.; Zheng, K.; Hou, L.; Xing, J.; Xu, R. Design and Implementation of Open LoRa for IoT. IEEE Access 2019, 7, 100649–100657. [Google Scholar] [CrossRef]
Adelantado, F.; Vilajosana, X.; Tuset-Peiro, P.; Martinez, B.; Melia-Segui, J.; Watteyne, T. Understanding the Limits of LoRaWAN. IEEE Commun. Mag. 2017, 55, 34–40. [Google Scholar] [CrossRef] [Green Version]
Mekki, K.; Bajic, E.; Chaxel, F.; Meyer, F. A comparative study of LPWAN technologies for large-scale IoT deployment. ICT Express 2019, 5, 1–7. [Google Scholar] [CrossRef]
Sornin, N.; Luis, M.; Eirich, T.; Kramp, T.; Hersent, O. Lorawan Specification; LoRa alliance. January 2015. Available online: https://lora-alliance.org/wp-content/uploads/2020/11/2015_-_lorawan_specification_1r0_611_1.pdf (accessed on 30 November 2021).
Mikhaylov, K.; Petaejaejaervi, J.; Haenninen, T. Analysis of Capacity and Scalability of the LoRa Low Power Wide Area Network Technology. In Proceedings of the European Wireless 2016, 22th European Wireless Conference, Oulu, Finland, 18–20 May 2016; pp. 1–6. [Google Scholar]
Bor, M.C.; Roedig, U.; Voigt, T.; Alonso, J.M. Do LoRa Low–Power Wide–Area Networks Scale? In Proceedings of the 19th ACM International Conference on Modeling, Analysis and Simulation of Wireless and Mobile Systems, Malta, Malta, 13–17 November 2016; pp. 59–67. [Google Scholar] [CrossRef] [Green Version]
Alenezi, M.; Chai, K.K.; Chen, Y.; Jimaa, S. Ultra–dense LoRaWAN: Reviews and challenges. IET Commun. 2020, 14, 1361–1371. [Google Scholar] [CrossRef]
Zhang, X.; Zhang, M.; Meng, F.; Qiao, Y.; Xu, S.; Hour, S. A Low-Power Wide-Area Network Information Monitoring System by Combining NB-IoT and LoRa. IEEE Internet Things J. 2019, 6, 590–598. [Google Scholar] [CrossRef]
Silva, N.; Mendes, J.; Silva, R.; dos Santos, F.N.; Mestre, P.; Serôdio, C.; Morais, R. Low-Cost IoT LoRa Solutions for Precision Agriculture Monitoring Practices. In Progress in Artificial Intelligence; Moura Oliveira, P., Novais, P., Reis, L.P., Eds.; Springer International Publishing: Cham, Switzerland, 2019; pp. 224–235. [Google Scholar]
Liu, S.; Xia, C.; Zhao, Z. A low-power real-time air quality monitoring system using LPWAN based on LoRa. In Proceedings of the 2016 13th IEEE International Conference on Solid-State and Integrated Circuit Technology (ICSICT), Hangzhou, China, 25–28 October 2016; pp. 379–381. [Google Scholar]
Shuda, J.; Rix, A.; Booysen, M.T. Module-Level Monitoring Of Solar PV Plants Using Wireless Sensor Networks. In Proceedings of the 26th Southern African Universities Power and Engineering Conference (SAUPEC 2018), Johannesburg, South Africa, 24–26 January 2018; pp. 1–6. [Google Scholar]
Jeong, J.; Shin, Y.; Lee, I. Long-Range Transmission of Photovoltaic Climate Information through the LoRa Radio. In Proceedings of the 2018 International Conference on Information and Communication Technology Convergence (ICTC), Jeju, Korea, 17–19 October 2018; pp. 956–959. [Google Scholar]
Erturk, M.A.; Aydın, M.A.; Buyukakkaşlar, M.T.; Evirgen, H. A Survey on LoRaWAN Architecture, Protocol and Technologies. Future Internet 2019, 11, 216. [Google Scholar] [CrossRef] [Green Version]
Bouras, C.; Gkamas, A.; Katsampiris Salgado, S.A.; Kokkinos, V. Comparison of LoRa Simulation Environments. In Advances on Broad-Band Wireless Computing, Communication and Applications; Barolli, L., Hellinckx, P., Enokido, T., Eds.; Springer International Publishing: Cham, Switzerland, 2020; pp. 374–385. [Google Scholar]
Voigt, T.; Bor, M.; Roedig, U.; Alonso, J. Mitigating Inter-network Interference in LoRa Networks. In Proceedings of the International Conference on Embedded Wireless Systems and Networks, Uppsala, Sweden, 20–22 February 2017; pp. 323–328. [Google Scholar]
Croce, D.; Gucciardo, M.; Mangione, S.; Santaromita, G.; Tinnirello, I. Impact of LoRa Imperfect Orthogonality: Analysis of Link-Level Performance. IEEE Commun. Lett. 2018, 22, 796–799. [Google Scholar] [CrossRef] [Green Version]
Croce, D.; Gucciardo, M.; Tinnirello, I.; Garlisi, D.; Mangione, S. Impact of Spreading Factor Imperfect Orthogonality in LoRa Communications. In Digital Communication. Towards a Smart and Secure Future Internet; Piva, A., Tinnirello, I., Morosi, S., Eds.; Springer International Publishing: Cham, Switzerland, 2017; pp. 165–179. [Google Scholar]
Froiz-Míguez, I.; Lopez-Iturri, P.; Fraga-Lamas, P.; Celaya-Echarri, M.; Blanco-Novoa, Ó.; Azpilicueta, L.; Falcone, F.; Fernández-Caramés, T.M. Design, Implementation, and Empirical Validation of an IoT Smart Irrigation System for Fog Computing Applications Based on LoRa and LoRaWAN Sensor Nodes. Sensors 2020, 20, 6865. [Google Scholar] [CrossRef]
Oh, J.; Lim, D.W.; Kang, K.M. The Impact of Imperfect Orthogonality of LoRa Communication in Multiple Drone Identification. In Proceedings of the 2021 International Conference on Information and Communication Technology Convergence (ICTC), Jeju, Korea, 20–22 October 2021; pp. 906–908. [Google Scholar] [CrossRef]
Seller, O. Predicting LoRaWAN Capacity; Technical Report; Semtech: Camarillo, CA, USA, 2020. [Google Scholar]
Cuomo, F.; Campo, M.; Caponi, A.; Bianchi, G.; Rossini, G.; Pisani, P. EXPLoRa: Extending the performance of LoRa by suitable spreading factor allocations. In Proceedings of the 2017 IEEE 13th International Conference on Wireless and Mobile Computing, Networking and Communications (WiMob), Rome, Italy, 9–11 October 2017; pp. 1–8. [Google Scholar] [CrossRef]
Sagala, A.; Siahaan, D.; Nadeak, T.; Sitorus, E. Low Power—Low Rate Vessel Tracking System (VTS) in Territorial Waters. In Proceedings of the 2019 International Conference of Computer Science and Information Technology (ICoSNIKOM), Medan, Indonesia, 28–29 November 2019; pp. 1–5. [Google Scholar] [CrossRef]
Augustin, A.; Yi, J.; Clausen, T.; Townsley, W.M. A Study of LoRa: Long Range & Low Power Networks for the Internet of Things. Sensors 2016, 16, 1466. [Google Scholar] [CrossRef]
Centenaro, M.; Vangelista, L.; Kohno, R. On the impact of downlink feedback on LoRa performance. In Proceedings of the 2017 IEEE 28th Annual International Symposium on Personal, Indoor, and Mobile Radio Communications (PIMRC), Montreal, QC, Canada, 8–13 October 2017; pp. 1–6. [Google Scholar]
Haxhibeqiri, J.; Karaagac, A.; Van den Abeele, F.; Joseph, W.; Moerman, I.; Hoebeke, J. LoRa indoor coverage and performance in an industrial environment: Case study. In Proceedings of the 2017 22nd IEEE International Conference on Emerging Technologies and Factory Automation (ETFA), Limassol, Cyprus, 12–15 September 2017; pp. 1–8. [Google Scholar] [CrossRef] [Green Version]
Bankov, D.; Khorov, E.; Lyakhov, A. On the Limits of LoRaWAN Channel Access. International Conference on Engineering and Telecommunication (EnT), Moscow, Russia, 29–30 November 2016; pp. 10–14. [Google Scholar] [CrossRef]
Georgiou, O.; Raza, U. Low Power Wide Area Network Analysis: Can LoRa Scale? IEEE Wirel. Commun. Lett. 2017, 6, 162–165. [Google Scholar] [CrossRef] [Green Version]
Latiff, N.A.; Ismail, I.S.; Yusoff, M.H. Scalability Performance for Low Power Wide Area Network Technology using Multiple Gateways. Int. J. Adv. Comput. Sci. Appl. 2020, 11, 212–218. [Google Scholar] [CrossRef]
Fazeldehkordi, E.; Amiri, I.S.; Akanbi, O.A. Chapter 2—Literature Review. In A Study of Black Hole Attack Solutions; Fazeldehkordi, E., Amiri, I.S., Akanbi, O.A., Eds.; Syngress: Rockland, MA, USA, 2016; pp. 7–57. [Google Scholar] [CrossRef]
Duda, A.; Heusse, M. Spatial Issues in Modeling LoRaWAN Capacity; HAL Open Science: Miami Beach, FL, USA, 2019; pp. 191–198. [Google Scholar] [CrossRef] [Green Version]
RF Propagation Simulation Software. 2020. Available online: http://radiomobile.pe1mew.nl/ (accessed on 30 November 2021).
Fujdiak, R.; Mlynek, P.; Misurec, J.; Strajt, M. Simulated Coverage Estimation of Single Gateway LoRaWAN Network. In Proceedings of the 2018 25th International Conference on Systems, Signals and Image Processing (IWSSIP), Maribor, Slovenia, 20–22 June 2018; pp. 1–4. [Google Scholar] [CrossRef]
Abdelraheem, A.M.; Abdalla, M.A. Prediction of WiMAX radio wave propagation over outdoor irregular terrains and spacing. In Proceedings of the 2014 31st National Radio Science Conference (NRSC), Cairo, Egypt, 28–30 April 2014; pp. 167–174. [Google Scholar] [CrossRef]
Harinda, E.; Hosseinzadeh, S.; Larijani, H.; Gibson, R.M. Comparative Performance Analysis of Empirical Propagation Models for LoRaWAN 868MHz in an Urban Scenario. In Proceedings of the 2019 IEEE 5th World Forum on Internet of Things (WF-IoT), Limerick, Ireland, 15–18 April 2019; pp. 154–159. [Google Scholar] [CrossRef] [Green Version]
Kufakunesu, R.; Hancke, G.P.; Abu-Mahfouz, A.M. A Survey on Adaptive Data Rate Optimization in LoRaWAN: Recent Solutions and Major Challenges. Sensors 2020, 20, 5044. [Google Scholar] [CrossRef]
Petajajarvi, J.; Mikhaylov, K.; Roivainen, A.; Hanninen, T.; Pettissalo, M. On the coverage of LPWANs: Range evaluation and channel attenuation model for LoRa technology. In Proceedings of the 2015 14th International Conference on ITS Telecommunications (ITST), Copenhagen, Denmark, 2–4 December 2015; pp. 55–59. [Google Scholar] [CrossRef]
Agoua, X.G.; Girard, R.; Kariniotakis, G. Probabilistic Models for Spatio—Temporal Photovoltaic Power Forecasting. IEEE Trans. Sustain. Energy 2019, 10, 780–789. [Google Scholar] [CrossRef] [Green Version]
Nagy, G.I.; Barta, G.; Kazi, S.; Borbély, G.; Simon, G. GEFCom2014: Probabilistic solar and wind power forecasting using a generalized additive tree ensemble approach. Int. J. Forecast. 2016, 32, 1087–1093. [Google Scholar] [CrossRef]
Bessa, R.J.; Trindade, A.; Silva, C.S.; Miranda, V. Probabilistic solar power forecasting in smart grids using distributed information. Int. J. Electr. Power Energy Syst. 2015, 72, 16–23. [Google Scholar] [CrossRef] [Green Version]
Huang, J.; Perry, M. A semi-empirical approach using gradient boosting and k–nearest neighbors regression for GEFCom2014 probabilistic solar power forecasting. Int. J. Forecast. 2016, 32, 1081–1086. [Google Scholar] [CrossRef]
Golestaneh, F.; Gooi, H.B.; Pinson, P. Generation and evaluation of space–time trajectories of photovoltaic power. Appl. Energy 2016, 176, 80–91. [Google Scholar] [CrossRef] [Green Version]
Zhang, B.; Dehghanian, P.; Kezunovic, M. Spatial-temporal solar power forecast through use of Gaussian conditional random fields. In Proceedings of the 2016 IEEE Power and Energy Society General Meeting (PESGM), Boston, MA, USA, 17–21 July 2016; pp. 1–5. [Google Scholar]
Yang, D.; Ye, Z.; Lim, L.H.I.; Dong, Z. Very short term irradiance forecasting using the lasso. Sol. Energy 2015, 114, 314–326. [Google Scholar] [CrossRef] [Green Version]
Agoua, X.G.; Girard, R.; Kariniotakis, G. Photovoltaic Power Forecasting: Assessment of the Impact of Multiple Sources of Spatio-Temporal Data on Forecast Accuracy. Energies 2021, 14, 1432. [Google Scholar] [CrossRef]
Muhammad, B.; Azmi, K.Z.M.; Ibrahim, Z.; bin Mohd Faudzi, A.A.; Pebrianti, D. Simultaneous computation of model order and parameter estimation for system identification based on opposition-based simulated Kalman filter. In Proceedings of the 2018 SICE International Symposium on Control Systems (SICE ISCS), Tokyo, Japan, 9–11 March 2018; pp. 105–112. [Google Scholar] [CrossRef]
Dudek, G. Short-Term Load Forecasting Using Random Forests. In Intelligent Systems’2014; Filev, D., Jabłkowski, J., Kacprzyk, J., Krawczak, M., Popchev, I., Rutkowski, L., Sgurev, V., Sotirova, E., Szynkarczyk, P., Zadrozny, S., Eds.; Springer International Publishing: Cham, Switzerland, 2015; pp. 821–828. [Google Scholar]
Wright, M.N.; Ziegler, A. Ranger: A Fast Implementation of Random Forests for High Dimensional Data in C++ and R. J. Stat. Softw. 2017, 77, 1–17. [Google Scholar] [CrossRef] [Green Version]
Huertas Tato, J.; Centeno Brito, M. Using Smart Persistence and Random Forests to Predict Photovoltaic Energy Production. Energies 2019, 12, 100. [Google Scholar] [CrossRef] [Green Version]
Duchaud, J.L.; Voyant, C.; Fouilloy, A.; Notton, G.; Nivet, M.L. Trade-Off between Precision and Resolution of a Solar Power Forecasting Algorithm for Micro-Grid Optimal Control. Energies 2020, 13, 3565. [Google Scholar] [CrossRef]
Bueso, M.C.; Paredes-Parra, J.M.; Mateo-Aroca, A.; Molina-García, A. A Characterization of Metrics for Comparing Satellite-Based and Ground-Measured Global Horizontal Irradiance Data: A Principal Component Analysis Application. Sustainability 2020, 12, 2454. [Google Scholar] [CrossRef] [Green Version]
R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2021; Available online: https://www.r-project.org/ (accessed on 30 November 2021).
Dowle, M.; Srinivasan, A. Data.table: Extension of ‘Data.frame’; R Package Version 1.14.2; September 2019; Available online: https://cran.r-project.org/web/packages/data.table/data.table.pdf (accessed on 30 November 2021).
Sardá-Espinosa, A. Comparing time-series clustering algorithms in r using the dtwclust package. R Package Vignette 2017, 12, 41. [Google Scholar]
Nonnenmacher, L.; Coimbra, C.F. Streamline-based method for intra-day solar forecasting through remote sensing. Sol. Energy 2014, 108, 447–459. [Google Scholar] [CrossRef]
Solar Irradiation Data, CETA-CIEMAT. 2020. Available online: http://www.adrase.com/en/ (accessed on 30 November 2021).
Copernicus European Union’s Earth Observation Programme. 2020. Available online: http://copernicus.eu (accessed on 30 November 2021).
Network of the Agricultural Information System of Murcia (SIAM). 2019. Available online: https://www.imida.es/siam/ (accessed on 30 November 2021).
Lavric, A.; Popa, V. Performance Evaluation of LoRaWAN Communication Scalability in Large-Scale Wireless Sensor Networks. Wirel. Commun. Mob. Comput. 2018, 2018, 6730719. [Google Scholar] [CrossRef] [Green Version]
Dynamic Time Warping. In Information Retrieval for Music and Motion; Springer: Berlin/Heidelberg, Germany, 2007; pp. 69–84. [CrossRef]
Leodolter, M.; Plant, C.; Brändle, N. IncDTW: An R Package for Incremental Calculation of Dynamic Time Warping. J. Stat. Softw. 2021, 99, 1–23. [Google Scholar] [CrossRef]

Figure 1. Smart persistence and RF model proposed for spatio–temporal forecasting.

Figure 2. Forecast training strategy for a prediction horizon (h).

Figure 3. Training and forecast processes from collected data by using the clear sky index (

K_{t}

).

Figure 3. Training and forecast processes from collected data by using the clear sky index (

K_{t}

).

Figure 4. Proposed methodology: general scheme.

Figure 5. Case example: general overview and datasets (Region of Murcia, Spain).

Figure 6. Case example: ground-based data available online (Region of Murcia, Spain) [90].

Figure 7. Case example: distribution of datasets (UTM coordinates).

Figure 8. Case example: simulation general scheme.

Figure 9. Case example: distribution of points selected and the forecasting point marked in black color (UTM coordinates).

Figure 10. Example of irradiance data for two days: curves used for short-term forecasting purposes—clear sky GHI data included in dashed line. Curve in the forecasting point is marked in black color.

Figure 11. Comparison of estimated and monitored GHI data for different time horizon, SF, and loss of data scenarios.

Figure 12. Comparison of estimated, monitored, and clear sky GHI data for different time horizon, SF, and loss of data scenarios.

Figure 13. Summary of discrepancies for forecasting and monitoring data based on nRMSE. Histogram (left) and truncated histogram (right).

Figure 14. Summary of discrepancies for forecasting and monitoring data based on MAPE and DTW. Truncated histograms.

Figure 15. Boxplot of the discrepancies based on nRMSE, MAPE, and DTW.

Figure 16. Boxplot of differences based on SF09 and 0% loss of data: nRMSE, MAPE, and DTW metrics.

Figure 17. Boxplot of differences based on SF09 and 0% loss of data: nRMSE, MAPE, and DTW metrics for different

K_{t}

clear sky GHI ratios.

Figure 17. Boxplot of differences based on SF09 and 0% loss of data: nRMSE, MAPE, and DTW metrics for different

K_{t}

clear sky GHI ratios.

Table 1. LoRa/LoRaWAN main characteristics based on the EU 863–870 MHz data rates.

0	SF12/125 kHz	250
1	SF11/125 kHz	440
2	SF10/125 kHz	980
3	SF09/125 kHz	1760
4	SF08/125 kHz	3125
5	SF07/125 kHz	5470
6	SF06/250 kHz	11,000
7	FSK: 50 kbps 440	50,000

Table 2. LoRa spreading factor (SF).

Spreading Factor (SF)	Range (Depending on the Terrain)
SF10	8 km
SF09	6 km
SF08	4 km
SF07	4 km

Table 3. Time interval between subsequent packets (1% duty-cycle).

Spreading Factor (SF)
SF12	SF11	SF10	SF09	SF08	SF07	SF06
Time Interval between Subsequent Packets (s)
214	115	62	33	18.5	10	6

Table 4. Signal to noise ratio (SNR) limits.

Spreading Factor (SF)	Signal to Noise Ratio (SNR) Limit $q (j)$
SF07	−7.5 dB
SF08	−10.0 dB
SF09	−12.5 dB
SF10	−15.0 dB
SF 11	−17.5 dB
SF 12	−20.0 dB

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Bueso, M.C.; Paredes-Parra, J.M.; Mateo-Aroca, A.; Molina-García, A. Sensitive Parameter Analysis for Solar Irradiance Short-Term Forecasting: Application to LoRa-Based Monitoring Technology. Sensors 2022, 22, 1499. https://doi.org/10.3390/s22041499

AMA Style

Bueso MC, Paredes-Parra JM, Mateo-Aroca A, Molina-García A. Sensitive Parameter Analysis for Solar Irradiance Short-Term Forecasting: Application to LoRa-Based Monitoring Technology. Sensors. 2022; 22(4):1499. https://doi.org/10.3390/s22041499

Chicago/Turabian Style

Bueso, María C., José Miguel Paredes-Parra, Antonio Mateo-Aroca, and Angel Molina-García. 2022. "Sensitive Parameter Analysis for Solar Irradiance Short-Term Forecasting: Application to LoRa-Based Monitoring Technology" Sensors 22, no. 4: 1499. https://doi.org/10.3390/s22041499

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Sensitive Parameter Analysis for Solar Irradiance Short-Term Forecasting: Application to LoRa-Based Monitoring Technology

Abstract

1. Introduction

2. Pv Monitoring: LoRa Communication Technology

2.1. General Overview

2.2. Lora-Based Communication System

3. Methodology

3.1. Lora Parameter Modeling

3.2. Spatio–Temporal PV Forecasting

3.3. Proposed Global Methodology

4. Case Example: Datasets Used

5. Results

6. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI