1. Introduction
Electricity is the most dynamic and efficient form of energy [
1]. It is one of the main drivers of scientific, technological, and economic development, and is key to the quality of life and economic development in poor countries [
2]. The United Nations Sustainable Development Goal 7 (SDG7) roadmap aims to provide affordable, reliable, sustainable, and modern energy services for all by 2030. To achieve this, accurate and timely estimates of electricity consumption and its spatial patterns are important for energy-efficient electricity planning in the region and for achieving SDG7.
When Earth is observed from above at night, the dazzling glow of electric lights in human settlements and economic zones can be easily ascertained. When the sky is cloudless, remote sensing satellites can capture various visible sources of radiation, such as town lights, fishing boat lights, and fires. Images of the Earth’s visible lights acquired at night are known as night-light images. From a historical point of view, the development of nighttime light imagery began in the 1970s with the Defense Meteorological Satellite Program (DMSP), which was carried out by the Operational Linescan System (OLS). This system was designed to enable effective weather forecasting using the night imaging capability of its embedded sensors. These sensors can effectively capture the weak moonlight reflected from clouds and thus obtain the cloud cover distribution at night. However, it was inadvertently discovered that in the absence of clouds, these images can clearly capture light emitted by human activity on the Earth’s surface. This prompted a new field of scientific research based on nighttime light data [
3].
The current mainstream nighttime light data originate from the Visible Infrared Imaging Radiometer Suite (VIIRS) Day/Night Band (DNB) night-lights imagery system and the DMSP/OLS [
4]. It is also interesting to note that the OLS sensors on DMSP satellites can capture a variety of light spectra of different intensities, such as small residential areas, traffic lights, fires, boat lights, etc. As a result, changes in the intensity of human economic and social activities can be estimated. However, the OLS sensor acquires night-light images with a digital number (DN) range of only 0–63, which leads to the oversaturation of light values in areas with high light levels. In addition, DMSP data are provided by satellites of different ages with different sensors, leading to inconsistent data that require continuity correction. The VIIRS/DNB, which is considered a second-generation night-light imagery system, offers significant improvements in terms of extracted data quality while inheriting the characteristics of DMSP data. Moreover, the spatial resolution of VIIRS/DNB imagery is much better than that of DMSP/OLS data, allowing for more accurate representations of the nighttime lights without oversaturation issues. Many studies have explored techniques for handling these two types of night-light images and rendering them more suitable for practical use [
5,
6,
7]. Unlike traditional remote sensing imagery, which is commonly used for land feature classification [
8,
9,
10], the night-light imagery approach is currently widely employed for various applications, including the estimation of gross domestic product [
11], populations [
12], poverty [
13], and carbon emissions [
14,
15]. Furthermore, it has been actively used to map urbanization processes [
16,
17,
18], monitor disasters [
19], monitor armed conflicts [
20,
21], examine ecological light pollution [
22], map fires [
23], monitor fisheries [
24], and monitor power outages [
25] and grid reliability [
26]. Night-light data have been also employed to determine electrification processes [
27,
28,
29].
Night-light images can reflect the density and usage of lighting infrastructure. Therefore, they can also be used as an index of electricity consumption in an area, which is a subject that has been studied extensively. Indeed, linear correlations between electricity consumption and light intensity (from DMSP/OLS night-light images) have been reported in a dozen Asian countries, providing preliminary evidence of the feasibility of modelling electricity consumption based on night-light data [
30]. Chand et al., simulated both the spatial and temporal patterns of total electricity consumption in India from 1993 to 2002 based on the correlation between light intensity from DMSP/OLS night-light images and electricity consumption [
31]. Researchers using night-light imagery for electricity consumption estimates often construct one-dimensional regression models, such as linear regression, power function regression, or exponential regression, to relate electricity consumption to the total night-light value of images. A researcher will choose the regression model with the highest
R2-value to estimate electricity consumption. Early studies have tended to use DMSP/OLS night-light imagery due to its availability, which covers the time period of 1992–2013. With the advent of better quality VIIRS/DNB night-light images in 2012, researchers have increasingly used these images for their studies. In another interesting work, Shi et al., studied the boundaries and electricity consumption of 12 cities in China using VIIRS/DNB night-light data. The results revealed that the VIIRS/DNB method is more accurate than the DMSP/OLS approach for estimating the total electricity consumption, based on simulations [
32]. It has been also been found that VIIRS/DNB images, with their higher quality data, are more accurate in simulating electricity consumption [
33,
34]. Both DMSP/OLS and VIIRS/DNB night-light images require denoising, the quality of which has a direct impact on subsequent analyses. However, previous simulations of electricity consumption based on night-light data have mainly focused on areas with low cloudiness. Associated denoising methods also focus on less cloudy areas, whereas few studies have been conducted on cloudy areas. In the denoising of cloudy areas, common denoising methods can mask useful information, such that the denoised images do not reflect the true night-light conditions.
In response to the various problems encountered in previous studies, this study proposes a new method for denoising and making annual image syntheses of night-light images in cloudy areas. The country of Cambodia is used as an example, and VIIRS/DNB night-light images with high accuracy were used to simulate electricity consumption. Multiple one-dimensional regression models were used to explore the relationship between annual electricity consumption and total night-light values in Cambodia. The best model was used to estimate the annual electricity consumption in Cambodia and the changes in its spatial distribution pattern. Hence, a new method of annual electricity consumption estimation and spatial distribution modelling in Cambodia is provided. This work also provides a methodological reference for using night-light imagery to estimate annual electricity consumption in cloudy areas. The results are anticipated to contribute to the planning of electricity infrastructure in Cambodia and the realisation of SDG7.
3. Methods
Cambodian electricity consumption data were used in this work in conjunction with VIIRS/DNB night-light remote sensing data. The data were first pre-processed by cropping the downloaded monthly VIIRS/DNB images from 2012 to 2019 to obtain images of the Cambodian region. Afterwards, they were denoised and projected as Albers equivalent cone projections and resampled to a 500 m spatial resolution. Then, three different methods were used—the annual average method, annual cloud-free average method, and month-specific substitution method—to obtain average night-light remote sensing images of Cambodia for each year from 2012 to 2019. Finally, electricity consumption data for Cambodia from 2012 to 2019 were fitted by functional models of total night-light data obtained from each of the three methods to determine the best model. This provided estimates of Cambodia’s electricity consumption and its spatial distribution. A flowchart of the research process is presented in
Figure 1.
3.1. Data Pre-Processing
The VIIRS/DNB is a primary data product in a split format that requires extensive pre-processing, such as cropping, reprojection, and denoising, to obtain images of the desired areas. A total of 92 monthly nighttime images from April 2012 to December 2019 were masked using a vector of the Cambodian border to obtain VIIRS/DNB nighttime images of the Cambodian region. To ensure minimal areal distortion, the data were projected as Albers’ cone of equivalence; the area of the region is proportional to the area of the same region on Earth, while the area of the Cambodian region is less distorted. Finally, the monthly night-light images were resampled to a 500 m spatial resolution.
Since the acquired VIIRS/DNB night-light images had not yet been stripped of auroras, fires, other transient light sources, and background noise, a large number of negative and very high values existed in the night-light images. Theoretically, it is not possible for image elements to have negative DN values. Nevertheless, negative DN values were caused by background noise and outliers during data processing. The maximum light value in an image should not be greater than the highest light value in the most developed area of the image. DN values greater than the highest light value in the developed area are called extreme values, which mainly originate from fires and data outliers. To reduce the impact of noise on image elements, the images needed to be denoised. The three most widely used methods for denoising VIIRS/DNB luminous images are: (1) stable nighttime lights extracted from VIIRS/DNB data using non-zero DMSP/OLS data as a mask [
35]; (2) selection of area-specific thresholds as maximum and minimum light thresholds as masks for rejecting background noise [
36]; (3) denoising night-light images by enforcing the neighbourhood algorithm [
32,
37]. All these methods can significantly enhance the quality of images by removing some of the noise and light from auroras, fires, and certain ephemeral light sources. Although the first denoising method can remove noise, it also removes much usable night-light information. This method is more suitable for studies of short time series and regions with little change in the size of the energised area. However, from 2012 to 2019, electricity consumption in Cambodia increased at a faster rate than before and a large number of unelectrified areas became connected to electricity. For this reason, we can draw the conclusion that this method is not suitable for denoising long time series of the Cambodian region. Furthermore, since Cambodia’s night-light images are seriously affected by cloud, the neighbourhood algorithm is also unsuitable. Accordingly, the threshold method was selected for image denoising in the present study. Phnom Penh is the most developed region in Cambodia, so was used to establish a theoretical maximum value of the image element DN for the 2012–2019 night-light images. Image element DN values in the annual average images that were greater than the theoretical maximum were replaced by the maximum value, while negative values were replaced with zero-values. This completed the first denoising process for the 2012–2019 night-light images.
The threshold denoising procedure removes both negative and extreme values from images; however, it is difficult to remove other sources of noise. The denoising method employed in this work is based on Cao et al.’s continuity correction method for night-light images in different years [
38]. (1) More specifically, when
DN = 0 in an image of the subsequent year, then the
DN at the same position in the previous year’s image should also equal 0. (2) On the other hand, when the
DN value of an image element in a later year’s image is not equal to 0, the
DN in the previous year’s image should be no greater than that. The formula for this denoising method is as follows:
where
and
denote the
DN values of image
i in years
n and
n + 1, respectively. The extracted night-light image, from which the negative and extreme values have been removed, is denoised a second time using this formula.
Figure 2 gives an example dataset showing the changes in the raster values of the night-light images for years
n − 1,
n and
n + 1 before and after denoising.
3.2. Three Methods for Obtaining Annual Night-Light Images
To synthesise a suitable annual night-light image for the Cambodian region, which will be heavily influenced by cloud cover, three methods were used: annual average, annual cloud-free average, and month-specific substitution. In a night sky without clouds, light emanating from the surface can be captured by satellites; if there are clouds in the atmosphere, satellites cannot capture nighttime lights beneath them. Daily data products based on VIIRS/DNB night-light imagery cannot be used to estimate electricity consumption, as they are heavily influenced by cloud cover. Monthly data products from VIIRS/DNB night-light imagery are synthesised from the daily data products and, when an area is obscured by clouds, that day’s data are replaced by the non-obscured days for that month. With this process, unobstructed monthly night-light images are available for most of the world; however, in low latitudes like the Cambodian region, there are still many months of the year when unobstructed monthly images are not available. In this study, months that were unaffected by cloud cover after daily data synthesis are called cloud-free months.
- (1)
Annual average method
Annual night-light images were obtained by averaging the night-light values of the twelve monthly images of the year, regardless of whether they were affected by cloud.
- (2)
Annual cloud-free average method
Cambodia experiences distinct dry and rainy seasons, such that night-light images are often heavily influenced by cloud cover in the rainy season. Using the annual cloud-free average method, the cloud-influenced months in
Table 2 were excluded and the night-light images from non-cloud-influenced months were averaged to obtain annual night-light images.
- (3)
Month-specific substitution method
December is part of the dry season in Cambodia and is the month least affected by cloud. As a result, the December night-light images for each year from 2012 to 2019 are not affected by cloud cover and are the most stable. Thus, this method used December night-light images as annual night-light images.
3.3. Coupled Model of Electricity Consumption and Total Night-Light Value
Denoised night-light images only contain electrical illumination, e.g., from streetlights. Using Cambodia’s electricity consumption from 2012 to 2019 as the independent variable and the total night-light value of annual night-light images data as the dependent variable, several one-dimensional regression models were established based on linear, power, and exponential functions. The regression models for this study were obtained via curve fitting using the
ordinary least squares (OLS) and
maximum likelihood estimation (MLE) methods [
39]. Linear optimization problems like linear regression models are obtained by OLS, and nonlinear optimization problems like exponential models are obtained by MLE. The equations for these three regression models are shown in
Table 3.
In the mathematical equation, x represents electricity consumption, y represents the total night-light value, e is a natural constant, a and b are the derived coefficients after performing the OLS regression.
The models were evaluated in terms of the adjusted coefficient of determination (adj-R2) and its significance (p). In regression analysis, the coefficient of determination (R2) is used to determine the explanatory power of a model. This process measures the proportion of variation in the dependent variable that can be explained by the independent variable. The adj-R2 can eliminate the effect of variation in the number of independent variables and is more appropriate for judging the merits of a regression model than R2; higher adj-R2 values indicate a better model. Significant differences in the regression analysis are an evaluation of the variability of the data. Significant differences are usually expressed as p > 0.05 for non-significant differences, 0.01 < p < 0.05 for significant differences, and p < 0.01 for very significant differences. In regression analysis, a regression model is often considered valid when p < 0.05. The smaller the p-value, the higher the correlation between the two sets of data and the better the regression model.
3.4. Error Analysis
Error analysis of the annual night-light images of Cambodia, pre-processed by denoising and other means, was conducted using version 1 of the 2015 Nightlight Image of the Year from the Payne Institute. This image has noise removed, including from auroras, fires, ships, and other temporary light sources. The results obtained in this work were compared with the total night-light values of the official processed data by using the relative error
(RE) to measure the quality of image processing, according to the following equation:
where
denotes the total night-light value of the 2015 Cambodian regional image provided by the Payne Institute and
denotes that of the 2015 Cambodian regional images processed by the present study. The smaller the
RE, the closer the image is to the Payne Institute’s data and the better the processing quality.
5. Discussion
5.1. Analysis of Annual Images Synthesised by Different Methods and Modelling Results
In this work, night-light images of Cambodia from 2012 to 2019 were obtained by processing with three methods. The annual night-light images obtained by the annual average method were heavily influenced by cloud cover and, theoretically, were the least accurate. Although cloud-free months were removed in the annual cloud-free average method, these months varied interannually, which could influence the comparability of images from one year to another. Furthermore, the December night-light images from different years were slightly affected by cloud cover. By taking into account that December is the last month of the year, the annual images obtained by month-specific substitution method possess better continuity between years and, theoretically, produce the best modelling results. Ranking the quality of different models according to the magnitude of adj-
R2-value and significance, the results confirm these suspicions. The only exception was with the month-specific substitution method, where the linear model had a slightly higher adj-
R2-value than that of the cloud-free averaging method. The adj-
R2 values and significance rankings for the other regression methods show that the annual average method < annual cloud-free average method < month-specific substitution method. This conclusion suggests that the three annual night-light image synthesis methods should be ranked as follows: annual average method < annual cloud-free average method < month-specific substitution method. The results of the regression models are shown in
Table 8.
Table 8 indicates that when using data processed by the annual average method, the quality of the regression models is ranked exponential < linear < power. Using data processed by the annual cloud-free average method, the model accuracy is ranked linear < exponential < power. Using data processed by the month-specific substitution method, the model accuracy is ranked linear < power < exponential. The overall quality of the annual images synthesized by different methods is annual average < annual cloud-free average < month-specific substitution. The exponential model was ranked third, second, and first using data from the annual averaging, annual cloud-free averaging, and month-specific substitution methods, respectively. This suggests that as the annual night-light image synthesis method becomes more reasonable, an exponential regression model provides a better fit. Therefore, the relationship between electricity consumption and total night-light values in Cambodia is best expressed using an exponential regression model.
5.2. Estimated Annual Electricity Consumption in Cambodia
This study demonstrates a strong correlation between electricity consumption and total annual night-light values in Cambodia, with an adj-
R2 of 0.97 using month-specific substitution data. This effect clearly demonstrates the feasibility of using night-light data to estimate annual electricity consumption. With the estimated electricity consumption in Cambodia from 2012 to 2019 using the exponential regression model,
Table 9 was obtained.
Table 9 shows that electricity consumption in Cambodia from 2012 to 2019 estimated from night-light data had
RE-values of 0.16–10.95% with a mean of 5.47%, which is a good overall result.
Electricity consumption and demand are influenced by several factors, which is a focus of research. Recent research on national electricity consumption and electricity demand prediction has focused on the development of complex models. Rivera-Gonzalez et al., modelled Ecuador’s electricity consumption and demand until 2040 using the Long-range Energy Alternatives Planning (LEAP) system [
40]. Pakistan’s electricity consumption and demand were investigated by Perwez et al., using the LEAP system [
41]. Shah et al., forecasted day-ahead electricity demand with nonparametric functional models [
42] and subsequently used a component-estimation technique to forecast short-term electricity demand [
43]. Other researchers have recently used deep learning for energy time series forecasting [
44]. With the development of night-light remote sensing technology, research on electricity consumption based on night-light data is gradually increasing. Unlike data obtained from traditional surveys, satellite imagery records data that are more objective, have wider coverage, and cost less to obtain. The use of night-light images for electricity consumption prediction only requires a simple one-dimensional regression model. The required data are easy to obtain, and the predictive accuracy is high, providing a novel and simple method of electricity consumption prediction.
5.3. Changing Spatial Patterns of Electricity Consumption in Cambodia
Timely and accurate access to data on the spatial distribution of electricity consumption is essential in the construction of electrical infrastructure and for assessing the achievement of SDG7. Total night-light data from 2012 to 2019 were used to obtain the spatial pattern of electricity consumption in Cambodia at a 500 m resolution (
Figure 6 shows three years as examples).
The analysis of the spatial pattern of electricity consumption in Cambodia discloses that in 2012, electricity was mainly consumed in the capital city of Phnom Penh, the southwestern port city of Sihanoukville, and the northwestern cities of Battambang and Siem Reap. Interestingly, these four cities are the most economically developed regions in Cambodia and have the highest electricity consumption. From 2012 to 2019, the electricity consumption in Cambodia has been spreading outwards from these four cities, demonstrating the phenomenon of urban sprawl. The rapid increases in electricity consumption in the vicinity of the two main transport routes along the western part of Cambodia indicate that transport has made a huge boost to Cambodia’s development. Additionally, the country’s electricity coverage has improved considerably in recent years, with its electricity consumption slowly spreading from the major cities to the vast rural areas. The most rapid growth in electricity consumption has occurred in the southwestern port city of Sihanoukville and the southeastern province of Svay Rieng, which are inextricably linked to Cambodia’s national development strategy and Chinese investment.
5.4. Limitations Analysis
(1) The proposed denoising method required high-quality night-light images from 2019. If the quality of the images from 2019 was poor, the overall denoising effect could be poor. In addition, recent years are not as good at denoising as previous years, e.g., 2017 is better at denoising than 2018 and 2018 is better than 2019.
(2) The data used in this work were only available until 2019, so the latest years were not included.
(3) As Cambodia is located in a cloudy region at low latitudes, many of its daily night-light images are heavily obscured by clouds and were unusable. Meanwhile, monthly images are only affected by cloud for some months of the year. Only December was cloud-free in all years. This means that the estimation of electricity consumption and its spatial distribution were only possible on an annual scale and not on daily or monthly scales.
(4) When using night-light images to estimate electricity consumption and its spatial pattern, it is particularly important to denoise the images, as this estimation method is ineffective if the noise cannot be effectively removed. Moreover, a set of electricity consumption data corresponding to the year of the night-light images is required for building regression models.