Performance of state space and ARIMA models for consumer retail sales forecasting
Introduction
Sales forecasting is one of the most important issues that is beyond all strategic and planning decisions in any retail business. The importance of accurate sales forecasts to efficient inventory management at both disaggregate and aggregate levels has long been recognized [1]. Poor forecasts usually lead to either too much or too little inventory directly affecting the profitability and the competitive position of the company. At the organizational level, sales forecasting is very important to any retail business as its outcome is used by many functions in the organization: finance and accounting departments are able to project costs, profit levels and capital needs; sales department is able to get a good knowledge of the sales volume of each product; purchasing department is able to plan short- and long-term purchases; marketing department is able to plan its actions and assess the impact of different marketing strategies on sales volume; and finally logistics department is able to define specific logistic needs [2]. Accurate forecasts of sales have the potential to increase the profitability of retailers by improving the chain operations efficiency and minimizing wastes. Moreover, accurate forecasts of retail sales may improve portfolio investors’ ability to predict movements in the stock prices of retailing chains [3]. Aggregate retail sales time series are usually preferred because they contain both trend and seasonal patterns, providing a good testing ground for comparing forecasting methods, and because companies can benefit from more accurate forecasts.
Retail sales time series often exhibit strong trend and seasonal variations presenting challenges in developing effective forecasting models. How to effectively model retail sales series and how to improve the quality of forecasts are still outstanding questions. Exponential smoothing and Autoregressive Integrated Moving Average (ARIMA) models are the two most widely used approaches to time series forecasting, and provide complementary approaches to the problem. While exponential smoothing methods are based on a description of trend and seasonality in the data, ARIMA models aim to describe the autocorrelations in the data. The ARIMA framework to forecasting originally developed by Box et al. [4] involves an iterative three-stage process of model selection, parameter estimation and model checking. A statistical framework to exponential smoothing methods was recently developed based on state space models called ETS models [5].
Despite the investigator's efforts, the several existing studies have not led to a consensus about the relative forecasting performances of these two modeling frameworks when they are applied to retail sales data. Alon [6] concluded that the Winters exponential smoothing method' forecasts of aggregate retail sales were more accurate than the simple and Holt exponential smoothing methods' forecasts. Additionally, Alon et al. [3] compared out-of-sample forecasts of aggregated retail sales made using artificial neural networks (ANNs), Winters exponential smoothing, ARIMA and multiple regression via MAPE (mean absolute percentage error). Their results indicate that Winters exponential smoothing and ARIMA perform well when macroeconomic conditions are relatively stable. When economic conditions are volatile (supply push inflation, recessions, high interest rates and high unemployment) ANNs outperform the linear methods and multi-step forecasts may be preferred. Chu and Zhang [7] also conducted a comparative study of linear and nonlinear models for aggregate retail sales forecasting. The linear models studied were the ARIMA model, regression with dummy variables and regression with trigonometric variables. The nonlinear models studied were the ANNs for which the effect of seasonal adjustment and use of dummy or trigonometric variables was investigated. Using multiple cross-validation samples evaluated via the RMSE (root mean squared error), the MAE (mean absolute error) and MAPE, the authors concluded that no single forecasting model is the best for all situations under all circumstances. Their empirical results show that (1) prior seasonal adjustment of the data can significantly improve forecasting performance of the neural network model; (2) seasonal dummy variables can be useful in developing effective regression models (linear and nonlinear) but the performance of these dummy regression models may not always be robust; (3) trigonometric models are not useful in aggregate retail sales forecasting. Another interesting example is by Frank et al. [8] forecast women's apparel sales using single seasonal exponential smoothing (SSES), the Winters' three parameter model and ANNs. The performance of the models was tested by comparing the goodness-of-fit statistics R2 and by comparing actual sales with the forecasted sales of different types of garments. Their results indicated that the three parameter Winters' model outperformed SSES in terms of R2 and forecasting sales. ANN model performed best in terms of R2 (among three models) but correlations between actual and forecasted sales were not satisfactory. Zhang and Qi [9] and Kuvulmaz et al. [10] further investigated the use of ANNs in forecasting time series with strong trend and seasonality and conclude that the overall out-of-sample forecasting performance of ANNs, evaluated via RMSE, MAE and MAPE, is not better than ARIMA models in predicting retail sales without appropriate data preprocessing namely detrending and deseasonalization. Motivated by the particular advantages of ARIMA models and ANNs, Aburto and Weber [11] developed a hybrid intelligent system combining ARIMA type approaches and MLP-type neural networks for demand forecasting that showed improvements in forecasting accuracy. Encouraged by their results they proposed a replenishment system for a Chilean supermarket which led simultaneously to fewer sales failures and lower inventory levels. Motivated by the recent success of evolutionary computation Au et al. [12] studied the use of evolutionary neural networks (ENNs) for sales forecasting in fashion retailing. Their experiments show that when guided with the BIC (Bayesian Information Criterion) and the pre-search approach, the ENN can converge much faster and be more accurate in forecasting than the fully connected neural network. The authors also conclude that the performance of these algorithms is better than the performance of the ARIMA model only for products with features of low demand uncertainty and weak seasonal trends. Further, it is emphasized that the ENN approach for forecasting is a highly automatic one while the ARIMA modeling involves more human knowledge. Wong and Guo [13] propose a hybrid intelligent model using extreme learning machine (ELM) and a harmony search algorithm to forecast medium-term sales in fashion retail supply chains. The authors show that the proposed model exhibits superior out-of-sample forecasting performance over the ARIMA, ENN and ELM models when evaluated via RMSE, MAPE and MASE (mean absolute scaled error). However, they also observe that the performance of the proposed model deteriorated when the time series was irregular and random pointing that it may not work well with high irregularity and nonlinearity. Finally, Pan et al. [14] investigate the feasibility and potential of applying empirical mode decomposition (EMD) in forecasting aggregate retail sales. The hybrid forecasting method of integrating EMD and neural network models (EMD-NN) was compared with the direct NN model and the ARIMA model for aggregate retail sales forecasting. Data from two sampling periods with different macroeconomic conditions were studied. The out-of-sample forecasting results indicate that the performance of the hybrid NN model is more stable compared to direct NN model and ARIMA during volatile economy. However, during relatively stable economic activity, ARIMA performs consistently well. In summary, over the last few decades several methods such as Winters exponential smoothing, ARIMA model, multiple regression and ANNs have been proposed and widely used because of their ability to model trend and seasonal fluctuations present in aggregate retail sales. However, all these methods have shown difficulties and limitations being necessary to investigate further on how to improve the quality of forecasts. The purpose of this work is to compare the forecasting performance of state space models and ARIMA models when applied to a case study of retail sales of five different categories of women footwear from the Portuguese retailer Foreva. As far as we know it is the first time ETS models are tested for retail sales forecasting.
The remainder of the paper is organized as follows. The next section describes the datasets used in the study. Section 3 discusses the methodology used in the time series modeling and forecasting. The empirical results obtained in the research study are presented in Section 4. The last section offers the concluding remarks.
Section snippets
Data
The brand Foreva was born in September 1984. Since the beginning is characterized by offering a wide range of footwear for all seasons, the geographical coverage of Foreva shops in Portugal is presently vast; it has around 70 stores opened to the public most of them in Shopping Centers. In this study we analyze the monthly sales of the five categories of women footwear of the brand Foreva, Boots, Booties, Flats, Sandals and Shoes, from January 2007 to April 2012 (64 observations). These time
Forecast error measures
Denote the actual observation for time period t by yt and the forecasted value for the same period by . To evaluate the out-of-sample forecast accuracy using an in-sample set of size (where n is the total number of observations), the most commonly used scale-dependent statistics are the mean error (ME), the mean absolute error (MAE) and the root mean squared error (RMSE) defined as follows [17]:
When comparing the
State space model
An appropriate model can be selected among several candidates by minimizing an error measure such as RMSE, provided the errors are computed from a hold-out set that was not used to estimate the model parameters. However, since there are often few historical data available a procedure based on the in-sample fit is usually preferred. One approach can be is to use an information criterion which penalizes the likelihood of the model to compensate for the potential overfitting of the data. Akaike's
Point forecasts
The results of Table 2, Table 3, Table 4 show that the overall out-of-sample forecasting performance of ETS and ARIMA models evaluated via RMSE, MAE and MAPE is quite similar on both one-step and multi-step forecasts.
In one-step forecasts, ETS forecasts more accurately Flats and Shoes series than ARIMA, regardless of the forecast error measure considered. Improvements are of the order 20% or less. For Boots series the RMSE and MAE values of the ETS model are 33% and 40% smaller, respectively,
Conclusions and future work
Accurate retail sales forecasting can have a great impact on effective management of retail operations. Retail sales time series often exhibit strong trend and seasonal variations presenting challenges in developing effective forecasting models. How to effectively model these series and how to improve the quality of forecasts are still outstanding questions. Despite the investigator's efforts, the several existing studies have not led to a consensus about the relative forecasting performances
Acknowledgments
Project “NORTE-07-0124-FEDER-000057” is financed by the North Portugal Regional Operational Programme (ON.2 - O Novo Norte), under the National Strategic Reference Framework (NSRF), through the European Regional Development Fund (ERDF), and by national funds, through the Portuguese Funding Agency, Fundação para a Ciência e a Tecnologia (FCT).
References (36)
- et al.
Forecasting aggregate retail salesa comparison of artificial neural networks and traditional method
J. Retail. Consum. Serv.
(2001) - et al.
A comparative study of linear and nonlinear models for aggregate retail sales forecasting
Int. J. Prod. Econ.
(2003) - et al.
Neural network forecasting for seasonal and trend time series
Eur. J. Oper. Res.
(2005) - et al.
Fashion retail forecasting by evolutionary neural networks
Int. J. Prod. Econ.
(2008) - et al.
A hybrid intelligent model for medium-term sales forecasting in fashion retail supply chains using extreme learning machine and harmony search algorithm
Int. J. Prod. Econ.
(2010) Exponential smoothingthe state of the art—Part II
Int. J. Forecast.
(2006)- et al.
Testing the null hypothesis of stationary against the alternative of a unit root
J. Econom.
(1992) - et al.
Design and operation of manufacturing networks for mass customisation
CIRP Ann. Manuf. Technol.
(2013) - et al.
A multi-criteria evaluation of centralized and decentralized production networks in a highly customer-driven environment
CIRP Ann. Manuf. Technol.
(2012) - et al.
Sku demand forecasting in the presence of promotions
Expert Syst. Appl.
(2009)
Improving the supply chain performanceuse of forecasting models versus early order commitments
Int. J. Prod. Res.
A combined model predictive control and time series forecasting framework for production–inventory systems
Int. J. Prod. Res.
Time Series Analysis
Forecasting with Exponential SmoothingThe State Space Approach
Forecasting aggregate retail salesthe winters' model revisited
Forecasting women's apparel sales using mathematical modeling
Int. J. Cloth. Sci. Technol.
Improved supply chain management based on hybrid demand forecasts
Appl. Soft Comput.
Cited by (159)
A data-driven construction method of aggregated value chain in three phases for manufacturing enterprises
2024, Computers and Industrial EngineeringA novel featurization methodology using JaGen algorithm for time series forecasting with deep learning techniques
2024, Expert Systems with ApplicationsPOST-BaLSTM: A Bagged LSTM forecasting ensemble embedded with a postponement framework to target the semiconductor shortage in the automotive industry
2023, Computers and Industrial EngineeringWhat do consumers want? A methodological framework to identify determinant product attributes from consumers’ online questions
2023, Journal of Retailing and Consumer ServicesWeekly Forecast Indonesian Currency During Ramadan
2024, AIP Conference Proceedings