Very short term irradiance forecasting using the lasso

doi:10.1016/j.solener.2015.01.016

Solar Energy

Volume 114, April 2015, Pages 314-326

Highlights

• The lasso is applied to perform sub-5-min irradiance forecasting.
• Spatio-temporal neighbors are automatically selected using data from a monitoring network.
• The lasso outperforms the persistence, ARIMA, ETS and OLS models significantly.
• The lasso has good performance when training data are few and predictors are many.

Abstract

We find an application of the lasso (least absolute shrinkage and selection operator) in sub-5-min solar irradiance forecasting using a monitoring network. The lasso is a variable shrinkage and selection method for linear regression. In addition to minimizing the sum of squared errors, it considers the sum of $ℓ_{1}$-norms of the regression coefficients as a penalty. This bias–variance trade-off very often leads to better predictions.

One second irradiance time series data are collected using a dense monitoring network in Oahu, Hawaii. As clouds propagate over the network, highly correlated lagged time series can be observed among station pairs. Lasso is used to automatically shrink and select the most appropriate lagged time series for regression. Since only lagged time series are used as predictors, the regression provides true out-of-sample forecasts. It is found that the proposed model outperforms univariate time series models and ordinary least squares regression significantly, especially when training data are few and predictors are many. Very short-term irradiance forecasting is useful in managing the variability within a central PV power plant.

Introduction

Variability in solar irradiance reaching the ground is primarily caused by moving clouds. To accurately forecast the irradiance, cloud information must be directly or indirectly incorporated into the formulation. Due to the stochastic nature of the clouds, it is difficult to fully model their generation, propagation, and extinction using physical approaches. Statistical methods are therefore often used to extract cloud information from observations (e.g. Yang et al., 2015, Dong et al., 2014, Lonij et al., 2013).

We are particularly interested in very short term (sub-5-min) irradiance forecasting, as clouds are relatively persistent over a short time frame. Unlike forecasts with longer horizons, whose results are essential for electricity grid operations, very short term forecasts find their applications in large photovoltaic (PV) installations. Knowing the potential shading/unshading over a particular section of a PV system in advance may be advantageous to maximum power point tracking algorithms (Hohm and Ropp, 2000). Accurate sub-minute forecasts could also enable better control of ramp-absorbing ultracapacitors (Mahamadou et al., 2011, Teleke et al., 2010).

Inman et al. (2013) reviewed the state-of-the-art methods for very short term irradiance forecasting. The methods use either sky cameras (Nguyen and Kleissl, 2014, Yang et al., 2014c, Quesada-Ruiz et al., 2014) or a sensor network (Lipperheide et al., 2015, Bosch and Kleissl, 2013, Bosch et al., 2013). All of the listed references aim to derive the cloud motion explicitly and thereby forecast the irradiance. Besides the many assumptions that have to be made, such as a linear cloud edge, various types of error are embedded in the different phases of such methods, especially during the conversion from cloud condition to ground-level irradiance. It is therefore worth investigating alternative methods in which cloud information is considered indirectly.

Along-wind and cross-wind correlations observed between two irradiance time series have been studied intensively in the literature (e.g. Arias-Castro et al., 2014, Hinkelman, 2013, Lonij et al., 2013, Perez et al., 2012). If along-wind correlation between a pair of stations can be observed, we can use regression-based methods for forecasting. However, several problems have to be addressed before we describe our method:

• The discrepancy between the direction of a station pair and the direction of wind may result in a smaller correlation. How do we incorporate the strength of cross-correlation between monitoring sites into the forecasting model?
• When the wind speed changes from day to day or even within a day, the choices of lagged time series also need to be constantly updated. How do we then automatically select the most appropriate spatio-temporal neighbors for forecasting?
• When the correlation is unobserved, do we need to switch the spatio-temporal forecasting algorithm to a purely temporal algorithm in an ad hoc manner?
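The lagged correlation between a station pair that these questions revolve around can be illustrated with a minimal sketch. It is not the procedure used in the paper: the function name `lagged_correlation`, the synthetic series and the 3-step delay are all assumptions made for illustration.

```python
import numpy as np

def lagged_correlation(upwind, downwind, max_lag):
    """Pearson correlation between upwind[t - lag] and downwind[t], for lag = 0..max_lag."""
    upwind = np.asarray(upwind, dtype=float)
    downwind = np.asarray(downwind, dtype=float)
    n = len(upwind)
    corr = np.empty(max_lag + 1)
    for lag in range(max_lag + 1):
        # Pair each downwind sample with the upwind sample taken `lag` steps earlier.
        corr[lag] = np.corrcoef(upwind[: n - lag], downwind[lag:])[0, 1]
    return corr

# Synthetic (hypothetical) data: the downwind station sees the upwind signal
# three time steps later, plus a small amount of sensor noise.
rng = np.random.default_rng(0)
signal = rng.normal(size=500)
true_lag = 3
upwind = signal
downwind = np.roll(signal, true_lag) + 0.1 * rng.normal(size=500)

corr = lagged_correlation(upwind, downwind, max_lag=10)
best_lag = int(np.argmax(corr))
print(best_lag, round(float(corr[best_lag]), 3))
```

When the wind speed changes, the lag at which the correlation peaks changes with it, which is why a fixed choice of lag per station pair would need constant updating.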

With these questions in mind, we consider the lasso (least absolute shrinkage and selection operator) regression (Efron et al., 2004, Tibshirani, 2011, Tibshirani, 1996). The lasso is a variable shrinkage and selection method for linear regression. In our application, the predictors (regressors) are the time series collected at the neighboring stations at various time lags (autocorrelated time series may also be used); the responses (regressands) are the time series collected at the forecast station. Some advantages of the lasso over the ordinary least squares regression, ridge regression and subset selection methods are discussed in Section 2.
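One way to arrange lagged series from all stations into a regression design matrix is sketched below. The function `build_lagged_design`, its arguments and the toy input are hypothetical; the paper's exact choice of lags and stations is not given in this excerpt.

```python
import numpy as np

def build_lagged_design(series, target, horizon, lags):
    """
    Build predictors and responses for a lagged regression.

    series  : array of shape (n_times, n_stations), e.g. clearness index values
    target  : column index of the forecast station
    horizon : number of steps ahead to forecast
    lags    : non-negative lags; lag 0 is the latest observation at the forecast origin
    """
    series = np.asarray(series, dtype=float)
    n_times, n_stations = series.shape
    origins = np.arange(max(lags), n_times - horizon)  # forecast origin times t
    # One column per (station, lag) pair, holding series[t - lag, station].
    X = np.column_stack(
        [series[origins - lag, s] for s in range(n_stations) for lag in lags]
    )
    # Response: the forecast station observed `horizon` steps after the origin.
    y = series[origins + horizon, target]
    return X, y

# Toy input: 10 time steps, 2 stations, with series[t, s] = 2 * t + s.
toy = np.arange(20, dtype=float).reshape(10, 2)
X, y = build_lagged_design(toy, target=0, horizon=1, lags=(0, 1, 2))
print(X.shape, y.shape)
```

Every predictor in a row is observed at or before the forecast origin, so a model fitted on `X` and `y` only uses information available at forecast time.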

Data from a dense grid of irradiance sensors located on Oahu Island, Hawaii, are used in this work. The network was installed by the National Renewable Energy Laboratory (NREL) in March 2010. It consists of 17 radiometers, as shown in Fig. 1. The sampling rate of these stations is 1 s. Previously, Hinkelman (2013) showed the possibility of observing highly correlated time series from this network; data from 13 days dominated by broken clouds were used in that study. We therefore use the data from the exact same days (Hinkelman, 2014) to study the predictive performance of such a network configuration. The data are freely available at http://www.nrel.gov/midc/oahu_archive/.

Throughout the paper, the 1 s irradiance data are averaged over various intervals to evaluate the forecasts at different forecast horizons. As high frequency data often have local maxima and minima caused by noise rather than cloud effects (Bosch and Kleissl, 2013), the smallest aggregation interval is 10 s. Prior to any forecasting, the global horizontal irradiance (GHI) time series from these 17 stations are first transformed into clearness index time series. Such a transformation is commonly used in irradiance forecasting to stabilize the variance, i.e., to remove the diurnal trends in the GHI time series. We use the solar positioning algorithm developed by Reda and Andreas (2008) for extraterrestrial irradiance calculation. Finally, we include a zenith angle filter of <80°.
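The preprocessing described above can be sketched as follows. The sketch assumes the standard definition of the clearness index (GHI divided by the extraterrestrial irradiance on a horizontal surface) and takes the zenith angle and extraterrestrial normal irradiance as given inputs rather than computing them with a solar positioning algorithm. The function names, the sample values and the order of averaging and transformation are assumptions.

```python
import numpy as np

def block_average(x, window):
    """Average consecutive non-overlapping blocks of `window` samples (a trailing partial block is dropped)."""
    x = np.asarray(x, dtype=float)
    n_blocks = len(x) // window
    return x[: n_blocks * window].reshape(n_blocks, window).mean(axis=1)

def clearness_index(ghi, extraterrestrial_normal, zenith_deg, max_zenith_deg=80.0):
    """
    Clearness index kt = GHI / (E0n * cos(zenith)).
    Samples with zenith angle >= max_zenith_deg are set to NaN (zenith filter).
    """
    ghi = np.asarray(ghi, dtype=float)
    zenith_deg = np.asarray(zenith_deg, dtype=float)
    e0_horizontal = extraterrestrial_normal * np.cos(np.deg2rad(zenith_deg))
    kt = np.full(ghi.shape, np.nan)
    keep = zenith_deg < max_zenith_deg
    kt[keep] = ghi[keep] / e0_horizontal[keep]
    return kt

# Hypothetical 1 s GHI samples (W/m^2), averaged into 10 s intervals.
ghi_1s = np.tile([600.0, 800.0], 10)   # 20 samples
ghi_10s = block_average(ghi_1s, 10)    # two 10 s averages
# Hypothetical zenith angles (degrees) and extraterrestrial normal irradiance (W/m^2).
kt = clearness_index(ghi_10s, 1400.0, zenith_deg=[60.0, 85.0])
print(ghi_10s, kt)
```

The second sample is filtered out because its zenith angle exceeds 80°, where the cosine in the denominator becomes small and the index unstable.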

All the forecasting models in this paper are built using the clearness index time series, and the errors are evaluated using the GHI transformed back from the forecast clearness index. Two error metrics are used in this paper, namely, the normalized mean absolute error (nMAE) and the forecast skill (FS). The nMAE is given by:

$$\mathrm{nMAE} = \frac{\frac{1}{n} \sum_{i = 1}^{n} |\hat{G}_{i} - G_{i}|}{\frac{1}{n} \sum_{i = 1}^{n} G_{i}} \times 100\% \tag{1}$$

where $G_{i}$ denotes the GHI measured at the $i$th time step and $\hat{G}_{i}$ denotes the forecast produced. The forecast skill (Chu et al., 2015) is given by:

$$\mathrm{FS}(fh) = 1 - \frac{\mathrm{nRMSE}(fh)}{\mathrm{nRMSE}_{p}(fh)} \tag{2}$$

where $fh$ denotes the forecast horizon; $\mathrm{nRMSE}_{p}$ and $\mathrm{nRMSE}$ are the normalized root mean square errors of the persistence model and the proposed model respectively. A persistence model assumes that the forecast is equal to the current observation; it is often used as a naive benchmark. The nRMSE is given by:

$$\mathrm{nRMSE} = \frac{\sqrt{\frac{1}{n} \sum_{i = 1}^{n} (\hat{G}_{i} - G_{i})^{2}}}{\frac{1}{n} \sum_{i = 1}^{n} G_{i}} \times 100\% \tag{3}$$
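The three metrics translate directly into code. The sketch below follows the definitions above; the function names and the four GHI values are made up for illustration, and the persistence forecast is taken one step ahead.

```python
import numpy as np

def nmae(forecast, measured):
    """Normalized mean absolute error, in percent."""
    forecast = np.asarray(forecast, dtype=float)
    measured = np.asarray(measured, dtype=float)
    return 100.0 * np.mean(np.abs(forecast - measured)) / np.mean(measured)

def nrmse(forecast, measured):
    """Normalized root mean square error, in percent."""
    forecast = np.asarray(forecast, dtype=float)
    measured = np.asarray(measured, dtype=float)
    return 100.0 * np.sqrt(np.mean((forecast - measured) ** 2)) / np.mean(measured)

def forecast_skill(forecast, persistence_forecast, measured):
    """Forecast skill: 1 - nRMSE(model) / nRMSE(persistence)."""
    return 1.0 - nrmse(forecast, measured) / nrmse(persistence_forecast, measured)

# Hypothetical GHI measurements (W/m^2) at four consecutive time steps.
measured = np.array([500.0, 600.0, 400.0, 700.0])
target = measured[1:]        # values to be forecast
persistence = measured[:-1]  # one-step persistence: forecast equals the previous observation
model = np.array([580.0, 450.0, 650.0])  # hypothetical model forecasts

model_nmae = nmae(model, target)
model_fs = forecast_skill(model, persistence, target)
print(round(model_nmae, 2), round(model_fs, 3))
```

A forecast skill of 0 means the model is no better than persistence; positive values indicate an improvement, and negative values a deterioration.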

The nMAE is a form of mean absolute error (MAE), while the forecast skill is built on a form of mean square error (MSE). MAE and MSE both measure the average magnitude of the errors and are frequently used in forecasting applications. MAE is a linear score which weights individual errors equally. In the case of the MSE, the errors are squared before averaging, which gives higher weights to large errors. The MSE is therefore more useful when large errors are particularly undesirable, as in the case of solar power forecasting.
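A small numerical illustration of this difference, using two made-up error sequences with the same mean absolute error:

```python
import numpy as np

# Two hypothetical error sequences with the same total absolute error.
errors_even = np.array([2.0, 2.0, 2.0, 2.0])   # errors spread evenly
errors_spiky = np.array([0.0, 0.0, 0.0, 8.0])  # one large error

mae_even = np.mean(np.abs(errors_even))
mae_spiky = np.mean(np.abs(errors_spiky))
rmse_even = np.sqrt(np.mean(errors_even ** 2))
rmse_spiky = np.sqrt(np.mean(errors_spiky ** 2))

print(mae_even, mae_spiky)    # both sequences have the same MAE
print(rmse_even, rmse_spiky)  # the RMSE is larger for the sequence with one large error
```

The absolute-error score cannot tell the two sequences apart, whereas the squared-error score penalizes the single large miss.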

Section snippets

Method

Given data $(x^{i}, y_{i})$, $i = 1, \dots, n$, where $x^{i} = (x_{i1}, \dots, x_{ip})^{\top}$ are the $p$ predictor variables and $y_{i}$ are the responses, the linear regression model has the form:

$$y_{i} = β_{0} + \sum_{j = 1}^{p} β_{j} x_{ij} \tag{4}$$

where $β = (β_{0}, β_{1}, \dots, β_{p})^{\top}$ is the regression parameter. The lasso estimate of $β$ is defined by:

$$\hat{β} = \underset{β}{\operatorname{argmin}} \left\{ \sum_{i = 1}^{n} \left( y_{i} - β_{0} - \sum_{j = 1}^{p} β_{j} x_{ij} \right)^{2} \right\} \quad \text{s.t.} \quad \sum_{j = 1}^{p} |β_{j}| \leq t \tag{5}$$

where $t \geq 0$ is a tuning parameter which controls the amount of shrinkage. Eq. (5) is equivalent to the $ℓ_{1}$-penalized regression problem of finding:

$$\hat{β} = \underset{β}{\operatorname{argmin}} \left\{ \sum_{i = 1}^{n} \left( y_{i} - β_{0} - \sum_{j = 1}^{p} β_{j} x_{ij} \right)^{2} + λ \sum_{j = 1}^{p} |β_{j}| \right\} \tag{6}$$

where λ is
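The $ℓ_{1}$-penalized problem can be solved numerically in several ways. The sketch below uses cyclic coordinate descent with soft-thresholding, a common textbook solver chosen here for brevity; it is not necessarily the algorithm used in the paper. The function names, the synthetic data and the value of λ are assumptions. Because the objective above is the unscaled residual sum of squares plus $λ \sum_{j} |β_{j}|$, the soft-threshold level in each coordinate update is $λ/2$.

```python
import numpy as np

def soft_threshold(z, gamma):
    """Soft-thresholding operator: sign(z) * max(|z| - gamma, 0)."""
    return np.sign(z) * max(abs(z) - gamma, 0.0)

def lasso_coordinate_descent(X, y, lam, n_iter=500, tol=1e-10):
    """Minimize sum_i (y_i - b0 - x_i . b)^2 + lam * sum_j |b_j| by cyclic coordinate descent."""
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    # Centering X and y removes the unpenalized intercept from the inner problem.
    x_mean, y_mean = X.mean(axis=0), y.mean()
    Xc, yc = X - x_mean, y - y_mean
    p = Xc.shape[1]
    beta = np.zeros(p)
    col_sq = (Xc ** 2).sum(axis=0)
    residual = yc.copy()  # always equal to yc - Xc @ beta
    for _ in range(n_iter):
        max_change = 0.0
        for j in range(p):
            if col_sq[j] == 0.0:
                continue  # constant column: coefficient stays at zero
            # Inner product of column j with the residual that excludes column j.
            rho = Xc[:, j] @ residual + col_sq[j] * beta[j]
            new_beta = soft_threshold(rho, lam / 2.0) / col_sq[j]
            residual += Xc[:, j] * (beta[j] - new_beta)
            max_change = max(max_change, abs(new_beta - beta[j]))
            beta[j] = new_beta
        if max_change < tol:
            break
    beta0 = y_mean - x_mean @ beta
    return beta0, beta

# Synthetic (hypothetical) data: only predictors 0 and 2 carry signal.
rng = np.random.default_rng(1)
X = rng.normal(size=(200, 5))
y = 1.0 + 2.0 * X[:, 0] - 3.0 * X[:, 2] + 0.1 * rng.normal(size=200)

b0_ols, b_ols = lasso_coordinate_descent(X, y, lam=0.0)     # no penalty: least squares fit
b0_lasso, b_lasso = lasso_coordinate_descent(X, y, lam=50.0)
print(np.round(b_ols, 3))
print(np.round(b_lasso, 3))
```

With λ = 0 the solver reduces to ordinary least squares. As λ grows, coefficients are shrunk towards zero and weak predictors are dropped entirely, which is the shrinkage and selection behaviour described above.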

Results from a single day with a single forecast horizon

Throughout this section, only the 10 s averaged data from a single day, namely, 2010 July 31, are used. After applying the data filters described in Section 1.1, 4133 data points are obtained for each station. A total of 5 case studies are presented in this section.

Results from all 13 days with various forecast horizons

In the previous section, performance of the lasso along with several benchmarking models is evaluated at a forecast horizon of 10 s for 2010 July 31. In this section, additional forecasting results are shown using data from all 13 selected days with various forecast horizons.

Conclusions

A very short-term irradiance forecasting method is proposed. The lasso is used to shrink and select the spatio-temporal neighbors from lagged time series collected by a dense network of monitoring stations. Due to the presence of highly correlated data from the along-wind station pairs, the forecast results improve significantly over persistence and other univariate time series methods. The lasso also outperforms the ordinary least squares model. The advantage of the lasso over OLS is more

References (38)

E. Arias-Castro et al. A Poisson model for anisotropic solar ramp rate correlations. Sol. Energy (2014)
J. Bosch et al. Cloud motion vectors from a network of ground sensors in a solar power plant. Sol. Energy (2013)
J. Bosch et al. Deriving cloud velocity from an array of solar radiation measurements. Sol. Energy (2013)
Y. Chu et al. Short-term reforecasting of power output from a 48 MWe solar PV plant. Sol. Energy (2015)
Z. Dong et al. Short-term solar irradiance forecasting using exponential smoothing state space model. Energy (2013)
Z. Dong et al. Satellite image analysis and a hybrid ESSS/ANN model to forecast solar irradiance in the tropics. Energy Convers. Manage. (2014)
L.M. Hinkelman. Differences between along-wind and cross-wind solar irradiance variability on small spatial scales. Sol. Energy (2013)
R.H. Inman et al. Solar forecasting methods for renewable energy integration. Prog. Energy Combust. Sci. (2013)
M. Lipperheide et al. Embedded nowcasting method using cloud speed persistence for a photovoltaic power plant. Sol. Energy (2015)
V.P. Lonij et al. Intra-hour forecasts of solar power production using measurements from a network of irradiance sensors. Sol. Energy (2013)
D.A. Nguyen et al. Stereographic methods for cloud base height determination using two sky images. Sol. Energy (2014)
R. Perez et al. Short-term irradiance variability: preliminary estimation of station pair correlation as a function of distance. Sol. Energy (2012)
S. Quesada-Ruiz et al. Cloud-tracking methodology for intra-hour DNI forecasting. Sol. Energy (2014)
D. Yang et al. Hourly solar irradiance time series forecasting using cloud cover index. Sol. Energy (2012)
D. Yang et al. Evaluation of transposition and decomposition models for converting global solar irradiance from tilted surface to horizontal in tropical regions. Sol. Energy (2013)
D. Yang et al. Solar irradiance forecasting using spatial–temporal covariance structures and time-forward kriging. Renew. Energy (2013)
D. Yang et al. Solar irradiance forecasting using spatio-temporal empirical kriging and vector autoregressive models with parameter shrinkage. Sol. Energy (2014)
D. Yang et al. Bidirectional irradiance transposition based on the Perez model. Sol. Energy (2014)
H. Yang et al. Solar irradiance forecasting using a ground-based sky imager developed at UC San Diego. Sol. Energy (2014)

¹: Previously at: Solar Energy Research Institute of Singapore (SERIS), National University of Singapore, Singapore.
