A Physics-Informed Machine Learning Approach utilizing Multiband Satellite Data for Solar Irradiance Estimation

Jun Sasaki    Maki Okada    Kenji Utsunomiya    Koji Yamaguchi
Abstract

Solar irradiance is fundamental data crucial for analyses related to weather and climate. High-precision estimation models are necessary to create areal data for solar irradiance. In this study, we developed a novel estimation model by utilizing machine learning and multiband data from meteorological satellite observations. Particularly under clear-sky and thin clouds, satellite observations can be influenced by surface reflections, which may lead to overfitting to ground observations. To make the model applicable at any location, we constructed the model incorporating prior information such as radiative transfer models and clear-sky probability, based on physical and meteorological knowledge. As a result, the estimation accuracy significantly improved at validation sites.

Machine Learning, ICML

1 Introduction

Solar irradiance data is essential for monitoring solar power, assessing potential output, climate simulations, and climate monitoring. These applications often require high-precision areal data reflecting regional characteristics. In principle, it is possible to estimate solar irradiance through radiative transfer calculations (Mueller et al., 2009; Xie et al., 2016; Zhang et al., 2018); however there are some challenges such as cloud modeling and computational costs. As a result, statistical models based on meteorological satellite data are widely used (Engerer et al., 2017; Saito et al., 2018; Hashimoto & Yoshimoto, 2023).

However, these standard models are simplistic, utilizing limited band information. For example, Saito et al. (2018) uses visible and infrared band data for weather classification with rule-based algorithms and piecewise linear regression for estimation. Actually, the relationship between solar irradiance and satellite observations is highly complex due to the diversity of clouds, suggesting that combining multiband observation data with machine learning models can lead to more precise estimations of solar irradiance.

In previous studies where machine learning was applied to solar irradiance estimation models, primarily limited elements such as the visible band and simpler architectures like MLPs (Multilayer Perceptrons) were used, indicating that there is potential for improvement in accuracy (Cornejo-Bueno et al., 2019; Jiang et al., 2019; Yeom et al., 2019; Palacios et al., 2022).

This study aims to develop a high-precision model applicable at any location, leveraging multiband satellite images and machine learning. To avoid overfitting to ground observations and ensure applicability at any site, we incorporated constraints based on physical and meteorological knowledge to enhance the model’s generalizability. we have quantitatively demonstrated the model’s superiority by comparing it with the Saito’s standard model (2018) under consistent conditions.

2 Data description

In this study, we used satellite image data from Japan’s Himawari-8, the imager on board of which has 16 observation bands ranging from 0.47μ0.47𝜇0.47\mu0.47 italic_μm to 13.3μ13.3𝜇13.3\mu13.3 italic_μm in wavelength. Six of these bands are in the shortwave spectrum (visible and near-infrared) and ten in the longwave spectrum (infrared), with the visible bands strongly correlating with solar irradiance due to their high spectral intensity. Near-infrared bands reflect the size and phase state (water or ice) of cloud particles, and infrared bands vary in vertical sensitivity according to their wavelengths, providing vertical structural information of clouds (Shimizu et al., 2017).

For model training, we used one-minute interval ground observations from 47 sites across Japan, provided by the Japan Meteorological Agency. For validation, we used solar irradiance data independently observed in this study (Ishii et al., 2013). The observation sites are Naganuma, Tsukuba, Gifu, Tosu, and Okinoerabu, referred to as ST1, ST2, ST3, ST4, and ST5, respectively. Each site has distinct solar irradiance characteristics and surface conditions, allowing for a comprehensive evaluation of the model’s accuracy and generality across diverse environmental conditions. The model was trained over four years, from 2016 to 2020, excluding 2018, with a specific test period set for 2018.

3 Methodology

In this study, we utilized a deep learning framework to model the relationship between satellite imagery and solar irradiance. Specifically, we input data from 16 satellite observation bands and trained the model using ground-measured solar irradiance as the truth values. However, under clear skies and thin clouds, the contribution of reflections and emissions from the surface to the satellite observations is significant. Due to the limited number of ground observation sites compared to the diversity of surface conditions (such as urban areas, forests, seas, and snow covers), there is a risk of the model overfitting to the characteristics of the observation sites. To address this, we incorporated a radiative transfer model and clear-sky probability to improve the model’s generalizability to unknown locations. The developed model consists of the following three sub-models:

  • Clear-Sky Solar Irradiance Estimation Model.

  • All-Sky Solar Irradiance Estimation Model.

  • Clear-Sky Probability Estimation Model.

3.1 Clear-Sky Solar Irradiance Estimation Model

This sub-model is based on the model by Kondo (1994) and is represented by the following equations:

Iclr=a0I0NcosZ(1+a1×10a2m)Cwsubscript𝐼clrsubscript𝑎0subscript𝐼0𝑁𝑍1subscript𝑎1superscript10subscript𝑎2𝑚subscript𝐶𝑤\displaystyle I_{\text{clr}}=a_{0}I_{0N}\cos Z(1+a_{1}\times 10^{-a_{2}m})C_{w}italic_I start_POSTSUBSCRIPT clr end_POSTSUBSCRIPT = italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_I start_POSTSUBSCRIPT 0 italic_N end_POSTSUBSCRIPT roman_cos italic_Z ( 1 + italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT × 10 start_POSTSUPERSCRIPT - italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_m end_POSTSUPERSCRIPT ) italic_C start_POSTSUBSCRIPT italic_w end_POSTSUBSCRIPT (1)
Cw=[1+a3log10w+a4log102w+a5mlog10w]subscript𝐶𝑤delimited-[]1subscript𝑎3subscript10𝑤subscript𝑎4subscriptsuperscript210𝑤subscript𝑎5𝑚subscript10𝑤\displaystyle C_{w}=[1+a_{3}\log_{10}w+a_{4}\log^{2}_{10}w+a_{5}m\log_{10}w]italic_C start_POSTSUBSCRIPT italic_w end_POSTSUBSCRIPT = [ 1 + italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT roman_log start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT italic_w + italic_a start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT roman_log start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT italic_w + italic_a start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT italic_m roman_log start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT italic_w ] (2)

Where Iclrsubscript𝐼clrI_{\text{clr}}italic_I start_POSTSUBSCRIPT clr end_POSTSUBSCRIPT is the solar irradiance under clear-sky conditions, I0Nsubscript𝐼0𝑁I_{0N}italic_I start_POSTSUBSCRIPT 0 italic_N end_POSTSUBSCRIPT is the extraterrestrial solar irradiance, Z𝑍Zitalic_Z is the solar zenith angle, m𝑚mitalic_m is the air mass, and w𝑤witalic_w is the precipitable water vapor. Cwsubscript𝐶𝑤C_{w}italic_C start_POSTSUBSCRIPT italic_w end_POSTSUBSCRIPT represents the term for seasonal variation of solar irradiance due to changes in water vapor amount. The precipitable water vapor w𝑤witalic_w is estimated based on the date and latitude of the target site. The parameters a0subscript𝑎0a_{0}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT to a5subscript𝑎5a_{5}italic_a start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT are optimized using ground-measured solar irradiance data. Since this model is determined only by the location and date/time and does not rely on satellite observations, it can be applied to any region.

3.2 All-Sky Solar Irradiance Estimation Model

Refer to caption


Figure 1: Architecture of the all-sky solar irradiance estimation model. Satellite images from multiple bands are cropped around the target site and used as input. Features of the spatial distribution of clouds are extracted using a CNN, and the transmittance and reflectance of clouds are predicted using an MLP. Finally, downward and upward shortwave radiation are calculated using radiative transfer equations. Additionally, upward shortwave radiation is directly calculated from satellite observation data.

This sub-model calculates the all-sky solar irradiance from 16-band satellite images. First, we formulated a radiative transfer model based on Liou (2002). When the flux corresponding to clear-sky irradiance, Iclrsubscript𝐼clrI_{\text{clr}}italic_I start_POSTSUBSCRIPT clr end_POSTSUBSCRIPT, is incident on a multilayer atmospheric system consisting of clouds, atmosphere, and surface, the radiative transfer equations, considering multiple reflections between clouds and the surface, are expressed as follows:

Iall=Tc1RsRcIclr,Iup=(Rc+Tc2Rs1RsRc)Iclrformulae-sequencesubscript𝐼allsubscript𝑇𝑐1subscript𝑅𝑠subscript𝑅𝑐subscript𝐼clrsubscript𝐼upsubscript𝑅𝑐superscriptsubscript𝑇𝑐2subscript𝑅𝑠1subscript𝑅𝑠subscript𝑅𝑐subscript𝐼clrI_{\text{all}}=\frac{T_{c}}{1-R_{s}R_{c}}I_{\text{clr}},I_{\text{up}}=\left(R_% {c}+\frac{T_{c}^{2}R_{s}}{1-R_{s}R_{c}}\right)I_{\text{clr}}italic_I start_POSTSUBSCRIPT all end_POSTSUBSCRIPT = divide start_ARG italic_T start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_R start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT italic_R start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT end_ARG italic_I start_POSTSUBSCRIPT clr end_POSTSUBSCRIPT , italic_I start_POSTSUBSCRIPT up end_POSTSUBSCRIPT = ( italic_R start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT + divide start_ARG italic_T start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_R start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_R start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT italic_R start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT end_ARG ) italic_I start_POSTSUBSCRIPT clr end_POSTSUBSCRIPT (3)

Here, Iallsubscript𝐼allI_{\text{all}}italic_I start_POSTSUBSCRIPT all end_POSTSUBSCRIPT represents the downward shortwave radiation at the surface, corresponding to the all-sky solar irradiance, and Iupsubscript𝐼upI_{\text{up}}italic_I start_POSTSUBSCRIPT up end_POSTSUBSCRIPT is the upward shortwave radiation at the top of the atmosphere. Tcsubscript𝑇𝑐T_{c}italic_T start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT and Rcsubscript𝑅𝑐R_{c}italic_R start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT are the transmittance and reflectance of clouds, respectively, and Rssubscript𝑅𝑠R_{s}italic_R start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT represents the surface albedo. Iclrsubscript𝐼clrI_{\text{clr}}italic_I start_POSTSUBSCRIPT clr end_POSTSUBSCRIPT is estimated using equation 1,2, and Rssubscript𝑅𝑠R_{s}italic_R start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT is estimated from the most recent visible band images under clear-sky conditions.

The transmittance Tcsubscript𝑇𝑐T_{c}italic_T start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT and reflectance Rcsubscript𝑅𝑐R_{c}italic_R start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT are calculated using a neural network with multiband satellite images as inputs. The architecture, as shown in Figure 1, consists of a CNN (Convolutional Neural Network) and an MLP. Here, the input data comprises satellite images of a rectangular area approximately 20 km around the target site. It is expected that the CNN can extract features of the cloud’s spatial structure, thereby improving the representation of scattered irradiance.

The calculated Tcsubscript𝑇𝑐T_{c}italic_T start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT and Rcsubscript𝑅𝑐R_{c}italic_R start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT are converted into the upward and downward shortwave radiations through equation 3. Here, Iallsubscript𝐼allI_{\text{all}}italic_I start_POSTSUBSCRIPT all end_POSTSUBSCRIPT is optimized using the ground-measured irradiance Is(o)superscriptsubscript𝐼𝑠oI_{s}^{(\text{o})}italic_I start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( o ) end_POSTSUPERSCRIPT, while Iupsubscript𝐼upI_{\text{up}}italic_I start_POSTSUBSCRIPT up end_POSTSUBSCRIPT corresponds to the shortwave radiation observed by the satellite, integrated over the wavelength region, denoted as Iupsuperscriptsubscript𝐼upI_{\text{up}}^{\prime}italic_I start_POSTSUBSCRIPT up end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT, which can approximately be expressed as:

Iup=I0N(j=16θjBj+θ0)superscriptsubscript𝐼upsubscript𝐼0𝑁superscriptsubscript𝑗16subscript𝜃𝑗subscript𝐵𝑗subscript𝜃0I_{\text{up}}^{\prime}=I_{0N}\left(\sum\nolimits_{j=1}^{6}\theta_{j}B_{j}+% \theta_{0}\right)italic_I start_POSTSUBSCRIPT up end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = italic_I start_POSTSUBSCRIPT 0 italic_N end_POSTSUBSCRIPT ( ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 6 end_POSTSUPERSCRIPT italic_θ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT + italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT )

Here, Bjsubscript𝐵𝑗B_{j}italic_B start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT represents the shortwave radiation observed by the satellite, and θjsubscript𝜃𝑗\theta_{j}italic_θ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT are the learning parameters. Assuming that absorption by the atmosphere, ice clouds, and water clouds is low (Hale & Querry, 1973; Warren, 1984), Rc+Tc1subscript𝑅𝑐subscript𝑇𝑐1R_{c}+T_{c}\approx 1italic_R start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT + italic_T start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT ≈ 1, the overall loss function can be expressed using the hyperparameters α𝛼\alphaitalic_α, β𝛽\betaitalic_β, γ𝛾\gammaitalic_γ as follows:

=αIallIs(o)2+βRc+Tc12+γIupIup2𝛼subscriptnormsubscript𝐼allsuperscriptsubscript𝐼𝑠o2𝛽subscriptnormsubscript𝑅𝑐subscript𝑇𝑐12𝛾subscriptnormsubscript𝐼upsuperscriptsubscript𝐼up2\mathcal{L}=\alpha\|I_{\text{all}}-I_{s}^{(\text{o})}\|_{2}+\beta\|R_{c}+T_{c}% -1\|_{2}+\gamma\|I_{\text{up}}-I_{\text{up}}^{\prime}\|_{2}caligraphic_L = italic_α ∥ italic_I start_POSTSUBSCRIPT all end_POSTSUBSCRIPT - italic_I start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( o ) end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT + italic_β ∥ italic_R start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT + italic_T start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT - 1 ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT + italic_γ ∥ italic_I start_POSTSUBSCRIPT up end_POSTSUBSCRIPT - italic_I start_POSTSUBSCRIPT up end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT

Through optimization with this loss function, the energy balance is maintained so that Iallsubscript𝐼allI_{\text{all}}italic_I start_POSTSUBSCRIPT all end_POSTSUBSCRIPT fits the ground observations while Iupsubscript𝐼upI_{\text{up}}italic_I start_POSTSUBSCRIPT up end_POSTSUBSCRIPT remains consistent with satellite observations, Iupsuperscriptsubscript𝐼upI_{\text{up}}^{\prime}italic_I start_POSTSUBSCRIPT up end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT, thus preventing overfitting to observation sites. In practice, optimizing without energy-related constraints resulted in outliers under complex conditions such as thin clouds over snow cover (Figure 2).

Refer to caption

Figure 2: Differences in estimated solar irradiance with and without radiative energy constraints. Black dotted, red, and blue lines indicate ground observations, estimation without constraints, and estimation with constraints, respectively. Time series for ST1, March 7, 2018.

3.3 Clear-Sky Probability Estimation Model

Refer to caption

Figure 3: Processing flow and example results of the clear-sky probability estimation model. Satellite images from multiple bands are used as inputs to U-net, which outputs the contribution maps for different radiation sources such as seas, forests, snow covers, upper clouds and middle clouds. Input data include images from various bands (B03, B04, B05, B06, B10, B13) as well as surface albedo from Band03 and Band06 (SfcB03, SfcB06), and surface brightness temperature from Band13 (SfcB13). The results shown here are for the Kanto region, spanning latitudes 35°N to 38°N and longitudes 137°E to 143°E, on January 15, 2018, at 14:00.

Especially under clear sky conditions, the influence of reflections and emissions from the surface is significant, and there are limitations in enhancing the generalizability of the all-sky solar irradiance estimation model to unknown locations. Therefore, we introduce the clear-sky probability, Pclrsubscript𝑃clrP_{\text{clr}}italic_P start_POSTSUBSCRIPT clr end_POSTSUBSCRIPT, defined as a random variable that takes the value 1 in clear-sky and 0 in cloudy-sky conditions. The final estimated solar irradiance, Issubscript𝐼𝑠I_{s}italic_I start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT, is calculated as the expected value based on the clear-sky probability:

Is=Pclr×Iclr+(1Pclr)×Iallsubscript𝐼𝑠subscript𝑃clrsubscript𝐼clr1subscript𝑃clrsubscript𝐼allI_{s}=P_{\text{clr}}\times I_{\text{clr}}+(1-P_{\text{clr}})\times I_{\text{% all}}italic_I start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT = italic_P start_POSTSUBSCRIPT clr end_POSTSUBSCRIPT × italic_I start_POSTSUBSCRIPT clr end_POSTSUBSCRIPT + ( 1 - italic_P start_POSTSUBSCRIPT clr end_POSTSUBSCRIPT ) × italic_I start_POSTSUBSCRIPT all end_POSTSUBSCRIPT

This approach reduces the influence of the surface and ensures accuracy for unknown locations by using a larger proportion of the clear-sky solar irradiance, Iclrsubscript𝐼clrI_{\text{clr}}italic_I start_POSTSUBSCRIPT clr end_POSTSUBSCRIPT, which is independent of satellite observations, under clear-sky conditions.

This sub-model is designed based on the model by Akimoto et al. (2020). First, we perform cluster analysis on states defined by combinations of multiband satellite observation values, Bisubscript𝐵𝑖B_{i}italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT (where i𝑖iitalic_i denotes the band type), such as (Bi1,Bi2,)subscript𝐵subscript𝑖1subscript𝐵subscript𝑖2(B_{i_{1}},B_{i_{2}},...)( italic_B start_POSTSUBSCRIPT italic_i start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , italic_B start_POSTSUBSCRIPT italic_i start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , … ). The centroid of each cluster, bjisubscript𝑏𝑗𝑖b_{ji}italic_b start_POSTSUBSCRIPT italic_j italic_i end_POSTSUBSCRIPT (where j𝑗jitalic_j is the cluster number), can be considered as representative states corresponding to different radiation sources. By estimating coefficients gjsubscript𝑔𝑗g_{j}italic_g start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT that reproduce the original satellite observation values Bisubscript𝐵𝑖B_{i}italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT through a linear combination of bjisubscript𝑏𝑗𝑖b_{ji}italic_b start_POSTSUBSCRIPT italic_j italic_i end_POSTSUBSCRIPT, gjsubscript𝑔𝑗g_{j}italic_g start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT can be considered as the contribution of each radiation source to the satellite observations. In practice, a U-net (Ronneberger et al., 2015) was optimized to output the contribution map gj(𝒙)subscript𝑔𝑗𝒙g_{j}(\bm{x})italic_g start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( bold_italic_x ) from the input satellite images Bi(𝒙)subscript𝐵𝑖𝒙B_{i}(\bm{x})italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( bold_italic_x ). The loss function was defined as follows:

=Bi(𝒙)jgj(𝒙)bji(𝒙)1s.t.jgj(𝒙)=1formulae-sequencesubscriptnormsubscript𝐵𝑖𝒙subscript𝑗subscript𝑔𝑗𝒙subscript𝑏𝑗𝑖𝒙1𝑠𝑡subscript𝑗subscript𝑔𝑗𝒙1\displaystyle\mathcal{L}=\left\|B_{i}(\bm{x})-\sum\nolimits_{j}g_{j}(\bm{x})b_% {ji}(\bm{x})\right\|_{1}s.t.\sum\nolimits_{j}g_{j}(\bm{x})=1caligraphic_L = ∥ italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( bold_italic_x ) - ∑ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_g start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( bold_italic_x ) italic_b start_POSTSUBSCRIPT italic_j italic_i end_POSTSUBSCRIPT ( bold_italic_x ) ∥ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_s . italic_t . ∑ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_g start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( bold_italic_x ) = 1

Here, 𝒙𝒙\bm{x}bold_italic_x represents the image coordinates.

The process flow and an example of the results are shown in Figure 3. Here, other information such as outputs from numerical weather predictions and temporal changes in satellite images was referenced to identify the radiation sources for each cluster j𝑗jitalic_j. By aggregating the contribution of radiation sources corresponding to the surface, such as seas, forests, and snow covers, the clear sky probability can be calculated:

Pclr(𝒙)=jSurfacegj(𝒙)subscript𝑃clr𝒙subscript𝑗Surfacesubscript𝑔𝑗𝒙\displaystyle P_{\text{clr}}(\bm{x})=\sum\nolimits_{j\in\text{Surface}}g_{j}(% \bm{x})italic_P start_POSTSUBSCRIPT clr end_POSTSUBSCRIPT ( bold_italic_x ) = ∑ start_POSTSUBSCRIPT italic_j ∈ Surface end_POSTSUBSCRIPT italic_g start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( bold_italic_x )

In an event shown in Figure 4, ST1 site is covered with snow under clear-sky. Although clouds in the vicinity cannot be distinguished from snow covers in visible images, the clear-sky probability calculated by this sub-model shows a value close to 1, confirming the model’s validity.

Refer to caption

Figure 4: Visible satellite image (left) and distribution of estimated clear-sky probability (right) for the Hokkaido region, spanning latitudes 41°N to 46°N and longitudes 138°E to 146°E, on December 3, 2018, at 12:00. The yellow arrow points to ST1, the ground observation site.

4 Results and Conclusions

In this paper, we demonstrated the effectiveness of utilizing machine learning and multiband data by comparing the developed model against the standard model across cases of thin clouds and snow covers.

For the thin clouds case, we refer to an event for ST2 on August 15, 2018. The presence of thin clouds in the area around ST2 reduced solar irradiance, but the standard model overestimated the solar irradiance around 10:00 (see Figure 5, left). This overestimation is likely due to surface reflections transmitted through thin clouds being observed in the visible band, leading to an underestimation of cloud thickness. In contrast, the developed model accurately captured the reduction in solar irradiance caused by the thin clouds. For this event, when only the visible band was used in the developed model, an overestimation similar to that of the standard model occurred. This experimental result indicate that by extracting information about the refractive index of ice particles and the vertical structure of upper-level clouds from the near-infrared and infrared band data, the cloud thickness could be accurately estimated.

For the snow case, we refer to an event for ST1 on December 3, 2018, at 12:00, shown in Figure 4. The solar irradiance time series for the day appears in Figure 5 (right). The standard model cannot distinguish snow cover from clouds, leading to an underestimation of solar irradiance. However, the developed model accurately estimates solar irradiance. This accuracy is achieved by correctly identifying snow cover based on the clear-sky probability. Snow, composed of ice particles and accumulated on the ground, can be distinguished by appropriately combining near-infrared and infrared bands, although the relationship is inherently complex. The standard model, which employs a rule-based classification method utilizing several bands, often misidentifies snow as clouds due to its simplistic approach. In contrast, the developed model uses six bands and leverages machine learning to accurately represent this complexity, thus achieving precise identification of snow and clouds.

Refer to caption

Figure 5: Comparison of solar irradiance time series. Black dotted, red, and blue lines indicate ground observations, the standard model, and the developed model, respectively. Left: ST2, August 15, 2018, at 10:00. Right: ST1, December 3, 2018.

To quantitatively compare the standard and developed models, we calculated the Root Mean Square Error (RMSE) for the year 2018. The calculations used one-minute average values for ground observations and instantaneous values estimated from satellite observation data. Table 1 shows the RMSE for each model by validation site. The developed model shows an improvement trend at all sites compared to the standard model, with more than a 10% improvement effect, particularly when excluding ST5, where localized clouds frequently occur that are not captured by satellites.

These results confirm that the developed model outperforms the standard model. The main factors for improvement include enhanced representation of thin clouds and convective clouds (not shown in this paper) through the use of multiband data and machine learning, and the introduction of radiative transfer models and clear-sky probability, which have improved the generalization performance for various surface conditions including snow covers. Additionally, the use of CNN has contributed by taking into account the spatial structure of clouds and improving the representation of scattered irradiance.

Table 1: Comparison of RMSE[W/m2]delimited-[]superscriptW/m2[\text{W/m}^{2}][ W/m start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ] between ths standard model and the developed model by validation site.
Model ST1 ST2 ST3 ST4 ST5
Standard 106.9 98.2 101.5 111.3 124.8
Developed 86.6 86.1 86.0 94.2 118.6

This research provides a versatile framework for constructing solar irradiance estimation models applicable to any region, based on limited ground observation data, not only in Japan but worldwide. This contributes to the creation of foundational data for meteorological and climatic analyses.

Impact Statement

This paper presents work whose goal is to advance the field of Machine Learning and Earth Science. There are many potential societal consequences of our work, none which we feel must be specifically highlighted here.

Acknowledgments

This study is based on results obtained by the project “development of solar radiation forecasting technologies for short-term forecast of photovoltaic power (Research on short-term forecast of solar radiation)”, funded by the New Energy and Industrial Technology Development Organization (NEDO).

References

  • Akimoto et al. (2020) Akimoto, N., Zhu, H., Jin, Y., and Aoki, Y. Fast soft color segmentation. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp.  8277–8286, 2020.
  • Cornejo-Bueno et al. (2019) Cornejo-Bueno, L., Casanova-Mateo, C., Sanz-Justo, J., and Salcedo-Sanz, S. Machine learning regressors for solar radiation estimation from satellite data. Solar Energy, 183:768–775, 2019.
  • Engerer et al. (2017) Engerer, N. A., Bright, J. M., and Killinger, S. Himawari-8 enabled real-time distributed pv simulations for distribution networks. In 2017 IEEE 44th Photovoltaic Specialist Conference (PVSC), pp.  1405–1410. IEEE, 2017.
  • Hale & Querry (1973) Hale, G. M. and Querry, M. R. Optical constants of water in the 200-nm to 200-μ𝜇\muitalic_μm wavelength region. Applied optics, 12(3):555–563, 1973.
  • Hashimoto & Yoshimoto (2023) Hashimoto, A. and Yoshimoto, K. Development of a short-term solar irradiance forecasting using satellite image in combination with numerical weather prediction model. Electrical Engineering in Japan, 216(3):e23432, 2023.
  • Ishii et al. (2013) Ishii, T., Otani, K., Itagaki, A., and Utsunomiya, K. A simplified methodology for estimating solar spectral influence on photovoltaic energy yield using average photon energy. Energy Science & Engineering, 1(1):18–26, 2013.
  • Jiang et al. (2019) Jiang, H., Lu, N., Qin, J., Tang, W., and Yao, L. A deep learning algorithm to estimate hourly global solar radiation from geostationary satellite data. Renewable and Sustainable Energy Reviews, 114:109327, 2019.
  • Kondo (1994) Kondo, J. Meteorology of water environment. pp.  55–91, 1994.
  • Liou (2002) Liou, K.-N. An introduction to atmospheric radiation, volume 84. Elsevier, 2002.
  • Mueller et al. (2009) Mueller, R., Matsoukas, C., Gratzki, A., Behr, H., and Hollmann, R. The cm-saf operational scheme for the satellite based retrieval of solar surface irradiance—a lut based eigenvector hybrid approach. Remote Sensing of Environment, 113(5):1012–1024, 2009.
  • Palacios et al. (2022) Palacios, L. E. O., Guerrero, V. B., and Ordoñez, H. Machine learning model to predict solar radiation, based on the integration of meteorological data and data obtained from satellite images. arXiv preprint arXiv:2204.04313, 2022.
  • Ronneberger et al. (2015) Ronneberger, O., Fischer, P., and Brox, T. U-net: Convolutional networks for biomedical image segmentation. In Medical image computing and computer-assisted intervention–MICCAI 2015: 18th international conference, Munich, Germany, October 5-9, 2015, proceedings, part III 18, pp.  234–241. Springer, 2015.
  • Saito et al. (2018) Saito, T., Sasaki, K., Itagaki, A., Utsunomiya, K., and Yamaguchi, K. Preliminary study on a cloud and snow classification of a solar radiation estimation using the meteorological satellite himawari-8 data. The transactions of the Institute of Electrical Engineers of Japan. B, A publication of Power and Energy Society, 138(6):460–465, 2018.
  • Shimizu et al. (2017) Shimizu, A., Koutarou, S., and Mikito, Y. Image characteristics of the 16 bands of himawari-8’s ahi. Meteorological Satellite Center technical note, (62):39–71, 2017.
  • Warren (1984) Warren, S. G. Optical constants of ice from the ultraviolet to the microwave. Applied optics, 23(8):1206–1225, 1984.
  • Xie et al. (2016) Xie, Y., Sengupta, M., and Dudhia, J. A fast all-sky radiation model for solar applications (farms): Algorithm and performance evaluation. Solar Energy, 135:435–445, 2016.
  • Yeom et al. (2019) Yeom, J.-M., Park, S., Chae, T., Kim, J.-Y., and Lee, C. S. Spatial assessment of solar radiation by machine learning and deep neural network models using data provided by the coms mi geostationary satellite: A case study in south korea. Sensors, 19(9):2082, 2019.
  • Zhang et al. (2018) Zhang, Y., He, T., Liang, S., Wang, D., and Yu, Y. Estimation of all-sky instantaneous surface incident shortwave radiation from moderate resolution imaging spectroradiometer data using optimization method. Remote sensing of environment, 209:468–479, 2018.