Performance of state space and ARIMA models for consumer retail sales forecasting

https://doi.org/10.1016/j.rcim.2014.12.015Get rights and content

Highlights

  • We compare the forecasting performance of state space models and ARIMA models.

  • The forecasting performance is demonstrated through a case study of retail sales of women footwear.

  • The model with the minimum value of Akaike's Information Criteria for the in-sample period was selected from all admissible models for further evaluation in the out-of-sample.

  • The overall out-of-sample forecasting performance of state space and ARIMA models evaluated via RMSE, MAE and MAPE is quite similar on both one-step and multi-step forecasts.

  • Both state space and ARIMA produce coverage probabilities that are very close to the nominal rates.

Abstract

Forecasting future sales is one of the most important issues that is beyond all strategic and planning decisions in effective operations of retail businesses. For profitable retail businesses, accurate demand forecasting is crucial in organizing and planning production, purchasing, transportation and labor force. Retail sales series belong to a special type of time series that typically contain trend and seasonal patterns, presenting challenges in developing effective forecasting models. This work compares the forecasting performance of state space models and ARIMA models. The forecasting performance is demonstrated through a case study of retail sales of five different categories of women footwear: Boots, Booties, Flats, Sandals and Shoes. On both methodologies the model with the minimum value of Akaike's Information Criteria for the in-sample period was selected from all admissible models for further evaluation in the out-of-sample. Both one-step and multiple-step forecasts were produced. The results show that when an automatic algorithm the overall out-of-sample forecasting performance of state space and ARIMA models evaluated via RMSE, MAE and MAPE is quite similar on both one-step and multi-step forecasts. We also conclude that state space and ARIMA produce coverage probabilities that are close to the nominal rates for both one-step and multi-step forecasts.

Introduction

Sales forecasting is one of the most important issues that is beyond all strategic and planning decisions in any retail business. The importance of accurate sales forecasts to efficient inventory management at both disaggregate and aggregate levels has long been recognized [1]. Poor forecasts usually lead to either too much or too little inventory directly affecting the profitability and the competitive position of the company. At the organizational level, sales forecasting is very important to any retail business as its outcome is used by many functions in the organization: finance and accounting departments are able to project costs, profit levels and capital needs; sales department is able to get a good knowledge of the sales volume of each product; purchasing department is able to plan short- and long-term purchases; marketing department is able to plan its actions and assess the impact of different marketing strategies on sales volume; and finally logistics department is able to define specific logistic needs [2]. Accurate forecasts of sales have the potential to increase the profitability of retailers by improving the chain operations efficiency and minimizing wastes. Moreover, accurate forecasts of retail sales may improve portfolio investors’ ability to predict movements in the stock prices of retailing chains [3]. Aggregate retail sales time series are usually preferred because they contain both trend and seasonal patterns, providing a good testing ground for comparing forecasting methods, and because companies can benefit from more accurate forecasts.

Retail sales time series often exhibit strong trend and seasonal variations presenting challenges in developing effective forecasting models. How to effectively model retail sales series and how to improve the quality of forecasts are still outstanding questions. Exponential smoothing and Autoregressive Integrated Moving Average (ARIMA) models are the two most widely used approaches to time series forecasting, and provide complementary approaches to the problem. While exponential smoothing methods are based on a description of trend and seasonality in the data, ARIMA models aim to describe the autocorrelations in the data. The ARIMA framework to forecasting originally developed by Box et al. [4] involves an iterative three-stage process of model selection, parameter estimation and model checking. A statistical framework to exponential smoothing methods was recently developed based on state space models called ETS models [5].

Despite the investigator's efforts, the several existing studies have not led to a consensus about the relative forecasting performances of these two modeling frameworks when they are applied to retail sales data. Alon [6] concluded that the Winters exponential smoothing method' forecasts of aggregate retail sales were more accurate than the simple and Holt exponential smoothing methods' forecasts. Additionally, Alon et al. [3] compared out-of-sample forecasts of aggregated retail sales made using artificial neural networks (ANNs), Winters exponential smoothing, ARIMA and multiple regression via MAPE (mean absolute percentage error). Their results indicate that Winters exponential smoothing and ARIMA perform well when macroeconomic conditions are relatively stable. When economic conditions are volatile (supply push inflation, recessions, high interest rates and high unemployment) ANNs outperform the linear methods and multi-step forecasts may be preferred. Chu and Zhang [7] also conducted a comparative study of linear and nonlinear models for aggregate retail sales forecasting. The linear models studied were the ARIMA model, regression with dummy variables and regression with trigonometric variables. The nonlinear models studied were the ANNs for which the effect of seasonal adjustment and use of dummy or trigonometric variables was investigated. Using multiple cross-validation samples evaluated via the RMSE (root mean squared error), the MAE (mean absolute error) and MAPE, the authors concluded that no single forecasting model is the best for all situations under all circumstances. Their empirical results show that (1) prior seasonal adjustment of the data can significantly improve forecasting performance of the neural network model; (2) seasonal dummy variables can be useful in developing effective regression models (linear and nonlinear) but the performance of these dummy regression models may not always be robust; (3) trigonometric models are not useful in aggregate retail sales forecasting. Another interesting example is by Frank et al. [8] forecast women's apparel sales using single seasonal exponential smoothing (SSES), the Winters' three parameter model and ANNs. The performance of the models was tested by comparing the goodness-of-fit statistics R2 and by comparing actual sales with the forecasted sales of different types of garments. Their results indicated that the three parameter Winters' model outperformed SSES in terms of R2 and forecasting sales. ANN model performed best in terms of R2 (among three models) but correlations between actual and forecasted sales were not satisfactory. Zhang and Qi [9] and Kuvulmaz et al. [10] further investigated the use of ANNs in forecasting time series with strong trend and seasonality and conclude that the overall out-of-sample forecasting performance of ANNs, evaluated via RMSE, MAE and MAPE, is not better than ARIMA models in predicting retail sales without appropriate data preprocessing namely detrending and deseasonalization. Motivated by the particular advantages of ARIMA models and ANNs, Aburto and Weber [11] developed a hybrid intelligent system combining ARIMA type approaches and MLP-type neural networks for demand forecasting that showed improvements in forecasting accuracy. Encouraged by their results they proposed a replenishment system for a Chilean supermarket which led simultaneously to fewer sales failures and lower inventory levels. Motivated by the recent success of evolutionary computation Au et al. [12] studied the use of evolutionary neural networks (ENNs) for sales forecasting in fashion retailing. Their experiments show that when guided with the BIC (Bayesian Information Criterion) and the pre-search approach, the ENN can converge much faster and be more accurate in forecasting than the fully connected neural network. The authors also conclude that the performance of these algorithms is better than the performance of the ARIMA model only for products with features of low demand uncertainty and weak seasonal trends. Further, it is emphasized that the ENN approach for forecasting is a highly automatic one while the ARIMA modeling involves more human knowledge. Wong and Guo [13] propose a hybrid intelligent model using extreme learning machine (ELM) and a harmony search algorithm to forecast medium-term sales in fashion retail supply chains. The authors show that the proposed model exhibits superior out-of-sample forecasting performance over the ARIMA, ENN and ELM models when evaluated via RMSE, MAPE and MASE (mean absolute scaled error). However, they also observe that the performance of the proposed model deteriorated when the time series was irregular and random pointing that it may not work well with high irregularity and nonlinearity. Finally, Pan et al. [14] investigate the feasibility and potential of applying empirical mode decomposition (EMD) in forecasting aggregate retail sales. The hybrid forecasting method of integrating EMD and neural network models (EMD-NN) was compared with the direct NN model and the ARIMA model for aggregate retail sales forecasting. Data from two sampling periods with different macroeconomic conditions were studied. The out-of-sample forecasting results indicate that the performance of the hybrid NN model is more stable compared to direct NN model and ARIMA during volatile economy. However, during relatively stable economic activity, ARIMA performs consistently well. In summary, over the last few decades several methods such as Winters exponential smoothing, ARIMA model, multiple regression and ANNs have been proposed and widely used because of their ability to model trend and seasonal fluctuations present in aggregate retail sales. However, all these methods have shown difficulties and limitations being necessary to investigate further on how to improve the quality of forecasts. The purpose of this work is to compare the forecasting performance of state space models and ARIMA models when applied to a case study of retail sales of five different categories of women footwear from the Portuguese retailer Foreva. As far as we know it is the first time ETS models are tested for retail sales forecasting.

The remainder of the paper is organized as follows. The next section describes the datasets used in the study. Section 3 discusses the methodology used in the time series modeling and forecasting. The empirical results obtained in the research study are presented in Section 4. The last section offers the concluding remarks.

Section snippets

Data

The brand Foreva was born in September 1984. Since the beginning is characterized by offering a wide range of footwear for all seasons, the geographical coverage of Foreva shops in Portugal is presently vast; it has around 70 stores opened to the public most of them in Shopping Centers. In this study we analyze the monthly sales of the five categories of women footwear of the brand Foreva, Boots, Booties, Flats, Sandals and Shoes, from January 2007 to April 2012 (64 observations). These time

Forecast error measures

Denote the actual observation for time period t by yt and the forecasted value for the same period by y^t. To evaluate the out-of-sample forecast accuracy using an in-sample set of size m<n (where n is the total number of observations), the most commonly used scale-dependent statistics are the mean error (ME), the mean absolute error (MAE) and the root mean squared error (RMSE) defined as follows [17]:ME=1nmt=m+1n(yty^t)MAE=1nmt=m+1n|yty^t|RMSE=1nmt=m+1n(yty^t)2

When comparing the

State space model

An appropriate model can be selected among several candidates by minimizing an error measure such as RMSE, provided the errors are computed from a hold-out set that was not used to estimate the model parameters. However, since there are often few historical data available a procedure based on the in-sample fit is usually preferred. One approach can be is to use an information criterion which penalizes the likelihood of the model to compensate for the potential overfitting of the data. Akaike's

Point forecasts

The results of Table 2, Table 3, Table 4 show that the overall out-of-sample forecasting performance of ETS and ARIMA models evaluated via RMSE, MAE and MAPE is quite similar on both one-step and multi-step forecasts.

In one-step forecasts, ETS forecasts more accurately Flats and Shoes series than ARIMA, regardless of the forecast error measure considered. Improvements are of the order 20% or less. For Boots series the RMSE and MAE values of the ETS model are 33% and 40% smaller, respectively,

Conclusions and future work

Accurate retail sales forecasting can have a great impact on effective management of retail operations. Retail sales time series often exhibit strong trend and seasonal variations presenting challenges in developing effective forecasting models. How to effectively model these series and how to improve the quality of forecasts are still outstanding questions. Despite the investigator's efforts, the several existing studies have not led to a consensus about the relative forecasting performances

Acknowledgments

Project “NORTE-07-0124-FEDER-000057” is financed by the North Portugal Regional Operational Programme (ON.2 - O Novo Norte), under the National Strategic Reference Framework (NSRF), through the European Regional Development Fund (ERDF), and by national funds, through the Portuguese Funding Agency, Fundação para a Ciência e a Tecnologia (FCT).

References (36)

  • X. Zhao et al.

    Improving the supply chain performanceuse of forecasting models versus early order commitments

    Int. J. Prod. Res.

    (2001)
  • P. Doganis et al.

    A combined model predictive control and time series forecasting framework for production–inventory systems

    Int. J. Prod. Res.

    (2008)
  • G. Box et al.

    Time Series Analysis

    (2008)
  • R.J. Hyndman et al.

    Forecasting with Exponential SmoothingThe State Space Approach

    (2008)
  • I. Alon

    Forecasting aggregate retail salesthe winters' model revisited

  • C. Frank et al.

    Forecasting women's apparel sales using mathematical modeling

    Int. J. Cloth. Sci. Technol.

    (2013)
  • J. Kuvulmaz, S. Usanmaz, S.N. Engin, Time-series forecasting by means of linear and nonlinear models, in: Advances in...
  • L. Aburto et al.

    Improved supply chain management based on hybrid demand forecasts

    Appl. Soft Comput.

    (2007)
  • Cited by (159)

    View all citing articles on Scopus
    View full text