A new class of hybrid models for time series forecasting

https://doi.org/10.1016/j.eswa.2011.09.157

Abstract

Applying quantitative models for forecasting and assisting investment decision making has become more indispensable in business practice than ever before. Improving forecasting accuracy, especially in time series forecasting, is an important yet often difficult task facing forecasters. Both theoretical and empirical findings indicate that integrating different models can be an effective way of improving their predictive performance, especially when the models in the ensemble are quite different. In the literature, several hybrid techniques have been proposed that combine different time series models in order to overcome the deficiencies of single models and yield more accurate hybrid models. In this paper, in contrast to traditional hybrid models, a new methodology is proposed for constructing a new class of hybrid models from a basis time series model and a classifier. Since classifiers cannot by themselves serve as forecasting models for continuous-valued problems, the first stage of the proposed model uses a forecasting model as the basis model. The estimated values of the basis model are then modified in the second stage, based on the distinguished trend of the residuals of the basis model and the optimum step length, which are calculated by a classifier model and a mathematical programming model, respectively. Empirical results with three well-known real data sets indicate that the proposed model can be an effective way to construct a hybrid model that is more accurate than its basis time series model. It can therefore be used as an appropriate alternative for forecasting tasks, especially when higher forecasting accuracy is needed.

Introduction

Applying quantitative methods for forecasting and assisting investment decision making has become more indispensable in business practices than ever before. Time series forecasting is one of the most important quantitative models in which historical observations of the same variable are collected and analyzed to develop a model that captures the underlying data generating process. Then the model is used to predict the future. This modeling approach is particularly useful when little knowledge is available on the underlying data generating process or when there is no satisfactory explanatory model that relates the prediction variable to other explanatory variables. Over the past several decades, much effort has been devoted to the development and improvement of time series forecasting models (Zhang, Patuwo, & Hu, 1998).

Combining several models, or using hybrid models, can be an effective way to overcome the limitations of each component model and improve forecasting performance. Theoretical as well as empirical evidence in the literature suggests that by using dissimilar models, or models that strongly disagree with each other, the hybrid model will have lower generalization variance or error. In combined models, the aim is to reduce the risk of failure from relying on a single inappropriate model and to obtain results that are more accurate (Hibon & Evgeniou, 2005). Typically, this is done because the underlying process cannot easily be determined. The motivation for using hybrid models comes from the assumption that either one cannot identify the true data generating process or that a single model may not be sufficient to capture all the characteristics of the time series (Terui & van Dijk, 2002).

In the literature, different combination techniques have been proposed in order to overcome the deficiencies of single models and to improve forecasting performance. The differences between these combination techniques can be described using terminology developed in the classification and neural network literature (Sharkey, 2002). Hybrid models can be homogeneous, such as ensembles of differently configured neural networks, or heterogeneous, such as combinations of both linear and nonlinear models (Taskaya & Casey, 2005). In a competitive architecture, the aim is to build appropriate modules to represent different parts of the time series and to switch control to the most appropriate module. For example, a time series may generally exhibit nonlinear behavior, but this may change to linearity depending on the input conditions. Early work on threshold autoregressive (TAR) models used two different linear AR processes, with control switching between them according to the input values (Tong, 1990). An alternative is the mixture density model, also known as the nonlinear gated expert, which comprises neural networks integrated with a feedforward gating network (Taskaya & Casey, 2005).
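The regime-switching idea behind a TAR model can be sketched as follows. The coefficients, threshold, and delay here are illustrative values for a two-regime TAR(1), not parameters from the paper:

```python
def tar_forecast(history, threshold=0.0, delay=1,
                 ar_low=(0.1, 0.8), ar_high=(-0.2, 0.5)):
    """One-step forecast from a two-regime TAR(1) model (sketch).

    The delayed observation y[t - delay] decides which linear AR(1)
    regime (intercept, coefficient) generates the forecast.
    """
    c, phi = ar_low if history[-delay] <= threshold else ar_high
    return c + phi * history[-1]
```

A series below the threshold is forecast by the first AR process, and one above it by the second, which is exactly the "switch control according to the input values" behavior described above.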

In a cooperative modular combination, the aim is to combine models to build a complete picture from a number of partial solutions (Sharkey, 2002). The assumption is that a single model may not be sufficient to represent the complete behavior of a time series; for example, if a time series exhibits both linear and nonlinear patterns during the same time interval, neither a linear model nor a nonlinear model alone is able to capture both components simultaneously. A good example is the family of models that fuse autoregressive integrated moving average (ARIMA) models with artificial neural networks. In such hybrids, the neural network deals with the nonlinearity, while the ARIMA model deals with the non-stationary linear component (Tseng et al., 2002, Valenzuela et al., 2008).
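The cooperative scheme can be sketched as an additive decomposition: a linear model captures the linear component, a second model is fit to its residuals, and the two forecasts are summed. The `linear_fit` and `nonlinear_fit` callables below are hypothetical stand-ins for the ARIMA and neural network stages:

```python
def hybrid_forecast(y, linear_fit, nonlinear_fit):
    """Additive cooperative hybrid (sketch).

    linear_fit(y)      -> in-sample predictions of the linear model
    nonlinear_fit(res) -> predictions of a second model fit on the
                          linear model's residuals
    The combined forecast is the sum of the two components.
    """
    linear_pred = linear_fit(y)
    residuals = [a - b for a, b in zip(y, linear_pred)]
    resid_pred = nonlinear_fit(residuals)
    return [l + n for l, n in zip(linear_pred, resid_pred)]
```

With a perfect residual model, the combined output recovers the original series, which illustrates why the second stage only has to explain what the linear stage leaves behind.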

Much effort has been devoted to developing and improving hybrid time series forecasting models since the early work of Reid (1968) and Bates and Granger (1969). In pioneering work on combined forecasts, Bates and Granger showed that a linear combination of forecasts would give a smaller error variance than any of the individual methods. Since then, studies on this topic have expanded dramatically. Makridakis et al. (1982) observed that using a hybrid model, or combining several models, has become common practice since the well-known M-competition, in which a combination of forecasts from more than one model often led to improved forecasting performance. Likewise, Pelikan et al. (1992) and Ginzburg and Horn (1994) proposed combining several feedforward neural networks to improve time series forecasting accuracy. Clemen (1989) provided a comprehensive review and annotated bibliography of this area.

In recent years, more hybrid forecasting models have been proposed and applied in many areas with good prediction performance. Pai and Lin (2005) proposed a hybrid methodology that exploits the unique strengths of autoregressive integrated moving average models and support vector machines (SVMs) for stock price forecasting. Chen and Wang (2007) constructed a combination model incorporating the seasonal autoregressive integrated moving average (SARIMA) model and support vector machines for seasonal time series forecasting. Zhou and Hu (2008) proposed a hybrid modeling and forecasting approach based on grey models and the Box–Jenkins autoregressive moving average models. Armano, Marchesi, and Murru (2005) presented a new hybrid approach that integrates artificial neural networks (ANNs) with genetic algorithms (GAs) for stock market forecasting. Yu, Wang, and Lai (2005) proposed a novel nonlinear ensemble forecasting model integrating generalized linear autoregression (GLAR) with artificial neural networks in order to obtain accurate predictions in the foreign exchange market. Khashei, Hejazi, and Bijari (2008) proposed a new hybrid model in order to overcome the data limitations of artificial neural networks and yield more accurate results than traditional neural networks in financial market forecasting. Lin and Cobourn (2007) combined the Takagi–Sugeno fuzzy system with a nonlinear regression (NLR) model for time series forecasting. Pai (2006) proposed the hybrid ellipsoidal fuzzy system for time series forecasting (HEFST) model to forecast regional electricity loads in Taiwan.

Kim and Shin (2007) investigated the effectiveness of a hybrid approach that couples neural networks suited to temporal data, such as adaptive time delay neural networks (ATNNs) and time delay neural networks (TDNNs), with genetic algorithms for detecting temporal patterns in stock market prediction tasks. Zhang (2003) presented a hybrid autoregressive integrated moving average and artificial neural networks approach for time series forecasting. Tseng, Tzeng, Yu, and Yuan (2001) proposed a hybrid model called FARIMA in order to exploit the advantages and overcome the limitations of the fuzzy regression and ARIMA models for time series forecasting. Ince and Trafalis (2006) proposed a two-stage hybrid model which combines parametric techniques, such as autoregressive integrated moving average, vector autoregressive (VAR) and co-integration techniques, with nonparametric techniques, such as support vector regression (SVR) and artificial neural networks, for exchange rate prediction. Chang, Liu, and Wang (2006) developed a hybrid model integrating a self-organizing map (SOM) neural network, genetic algorithms (GAs) and a fuzzy rule base (FRB) to forecast the future sales of a printed circuit board factory. Huarng and Yu (2006) described a combining methodology that uses neural networks to forecast fuzzy time series.

In this paper, classifier methods are applied to construct a new hybrid model on top of a basis time series model in order to yield more accurate results. In the proposed model, the residuals of the basis time series model are examined by a classifier in order to distinguish their trend. In the next stage, the optimum step length is calculated by a mathematical programming model using the trend distinguished in the previous stage. Then, the estimated values of the basis time series model are modified according to the optimum step length and the distinguished trend. In this paper, probabilistic neural networks (PNNs) are used as the classifier. Technically, a probabilistic neural network is a classifier that is able to deduce the class/group of a given input vector after the training process is completed. A number of appealing features justify the adoption of this type of neural network in this study. First, training of probabilistic neural networks is rapid, enabling a frequently updated training scheme: the network is re-trained each time the data set is updated, so the most current information can be reflected in estimation. Second, the logic of the probabilistic neural network attenuates the effects of outliers and questionable data points, thereby reducing the effort spent scrutinizing training data. Third, and most important, probabilistic neural networks are conceptually built on the Bayesian method of classification, which, given enough data, is capable of classifying a sample with the maximum probability of success (Wasserman, 1993).
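The second-stage correction described above can be sketched as follows. The encoding of the trend classes as {-1, 0, +1} and the constant `step_length` are our illustrative assumptions; in the paper the step length is the output of a mathematical programming model:

```python
def corrected_forecast(basis_pred, trend_class, step_length):
    """Second stage of the proposed scheme (sketch).

    basis_pred  -- forecasts from the basis time series model
    trend_class -- classifier output per point: -1 (down), 0 (flat),
                   +1 (up), the distinguished trend of the residuals
    step_length -- the optimum step length (here a given constant)

    Each basis forecast is shifted by the step length in the
    direction of the classified residual trend.
    """
    return [p + c * step_length for p, c in zip(basis_pred, trend_class)]
```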

Given the advantages of probabilistic neural networks, it is not surprising that this methodology has attracted overwhelming attention in prediction (Kim and Chun, 1998, Yang et al., 1999), identification (Gaganis et al., 2007, Sun et al., 2006), and especially classification tasks (Karthikeyan et al., 2005, Xue et al., 2005) in various areas. Chen, Leung, and Daouk (2003) used probabilistic neural networks to model and predict the direction of return on the market index of the Taiwan stock exchange. Axinte (2006) applied probabilistic neural networks to the automated classification of tool malfunctions in broaching. Hajmeer and Basheer (2002) proposed using probabilistic neural networks (PNNs) for classifying bacterial growth/no-growth data and modeling the probability of growth. Tam, Tong, Lau, and Chan (2004) used probabilistic neural networks for the diagnosis of prestressed concrete pile defects. Shan, Zhao, Xu, Liebich, and Zhang (2002) presented an application of the probabilistic neural network to the clinical diagnosis of cancers based on clinical chemistry data. Al-Omari and Al-Jarrah (2004) presented a system for recognizing handwritten Indian numerals using probabilistic neural networks. Kim, Kim, and Chang (2008) presented an application of the probabilistic neural network to the design of breakwater armor blocks. Srinivasan, Jin, and Cheu (2005) proposed and applied a constructive probabilistic neural network (CPNN) model for automatic incident detection on freeways. Shang, Huang, Du, and Zheng (2006) investigated palm print recognition using the fast ICA algorithm and a radial basis probabilistic neural network.

The rest of the paper is organized as follows. In the next section, the basic concepts of the autoregressive integrated moving average (ARIMA) model and artificial neural networks (ANNs), which are chosen as basis models for constructing the hybrid model, are briefly reviewed. In Section 3, probabilistic neural networks (PNNs), which are selected as the classifier method, are reviewed. In Section 4, the formulation of the proposed model is introduced. In Section 5, the proposed model is applied to forecasting three well-known real data sets (the Wolf's sunspot data, the Canadian lynx data, and the British pound/US dollar exchange rate data), and its performance is compared with those of other forecasting models in order to show the appropriateness and effectiveness of the proposed method. Section 6 contains the concluding remarks.

Section snippets

Time series forecasting models

There are several different approaches to time series forecasting, which are generally categorized as follows. Traditional statistical models, including moving average, exponential smoothing, and autoregressive integrated moving average models, are linear in that predictions of future values are constrained to be linear functions of past observations. The second category of time series models comprises nonlinear models. Several classes of nonlinear models have been proposed in the literature in order to
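As a minimal illustration of the linear family, an AR(1) model can be fit by ordinary least squares on the lagged series; this toy fit is our sketch, not a method from the paper:

```python
def fit_ar1(y):
    """Least-squares fit of y_t = c + phi * y_{t-1} + e_t (sketch).

    Regresses each observation on its predecessor and returns the
    intercept c and coefficient phi, the simplest member of the
    linear model family described above.
    """
    x, t = y[:-1], y[1:]
    n = len(x)
    mx, mt = sum(x) / n, sum(t) / n
    phi = (sum((a - mx) * (b - mt) for a, b in zip(x, t))
           / sum((a - mx) ** 2 for a in x))
    c = mt - phi * mx
    return c, phi
```

On a noiseless series generated by y_t = 1 + 0.5 y_{t-1}, the fit recovers the parameters exactly, which makes the "linear function of past observations" constraint concrete.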

Probabilistic neural networks (PNNs)

The probabilistic neural network (PNN) is a Bayes–Parzen classifier (Masters, 1995) that is often an excellent pattern classifier in practice. The foundations of the approach have been known since the 1960s; however, the method was not in widespread use until recently because of the lack of sufficient computational power. Probabilistic neural networks were first introduced by Donald Specht in 1990, who demonstrated how the Bayes–Parzen classifier could be broken up into a large number of
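The Bayes–Parzen idea can be sketched in a few lines: estimate each class density with Gaussian kernels centered at that class's training points, then pick the class with the highest density at the query point. The dictionary interface and the smoothing parameter `sigma` are our assumptions for the sketch:

```python
import math

def pnn_classify(train, x, sigma=0.5):
    """Minimal probabilistic neural network / Bayes-Parzen sketch.

    train -- {class_label: [training vectors]}
    x     -- query vector
    For each class, average Gaussian kernels centered at its
    training points (a Parzen density estimate) and return the
    class with the highest estimated density at x.
    """
    scores = {}
    for label, points in train.items():
        scores[label] = sum(
            math.exp(-sum((a - b) ** 2 for a, b in zip(p, x))
                     / (2 * sigma ** 2))
            for p in points
        ) / len(points)
    return max(scores, key=scores.get)
```

Because the "training" step only stores the patterns, re-training when new data arrives is essentially free, which matches the rapid-update property cited in the paper.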

Formulation of the proposed model

Despite the numerous time series models available, the accuracy of time series forecasting remains fundamental to many decision processes, and hence research into ways of improving the effectiveness of forecasting models has never stopped. Many studies in time series forecasting have argued that predictive performance improves in combined models (Taskaya & Casey, 2005). In the literature, different combination techniques have been proposed in order to overcome the deficiencies
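Based on the description of the two stages, the correction applied to the basis forecast can be written as follows; the notation is ours, not the paper's:

```latex
\hat{y}^{\,\mathrm{new}}_t = \hat{y}_t + d_t\,\ell, \qquad d_t \in \{-1,\, 0,\, +1\},
```

where $\hat{y}_t$ is the basis model's estimate, $d_t$ is the residual-trend class produced by the classifier, and $\ell$ is the optimum step length obtained from the mathematical programming model.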

Application of the proposed model to time series forecasting

In this section, the proposed model is applied to time series forecasting using the three well-known real data sets in order to demonstrate the appropriateness and effectiveness of the proposed model and its performance is compared with those of other forecasting models.

Conclusions

Improving forecasting accuracy, especially in time series forecasting, is an important yet often difficult task facing forecasters. Despite the numerous time series models available, research into improving the effectiveness of forecasting models has never stopped. Several large-scale forecasting competitions covering a large number of commonly used time series forecasting models conclude that combining forecasts from more than one model often leads to improved performance, especially when the models

Acknowledgements

The authors wish to express their gratitude to Seyed Reza Hejazi, assistant professor of industrial engineering, Isfahan University of Technology, for his insightful and constructive comments, which helped to improve the paper greatly.

References (86)

  • C. Gaganis et al. Probabilistic neural networks for the identification of qualified audit opinions. Expert Systems with Applications (2007)
  • M. Hajmeer et al. A probabilistic neural network approach for modeling and classification of bacterial growth/no-growth data. Journal of Microbiological Methods (2002)
  • M. Haseyama et al. An ARMA order selection method with fuzzy reasoning. Signal Processing (2001)
  • M. Hibon et al. To combine or not to combine: Selecting among forecasts and their combinations. International Journal of Forecasting (2005)
  • H. Hosseini et al. The comparison of different feed forward neural network architectures for ECG signal diagnosis. Medical Engineering & Physics (2006)
  • K. Huarng et al. The application of neural networks to forecast fuzzy time series. Physica A (2006)
  • H. Ince et al. A hybrid model for exchange rate prediction. Decision Support Systems (2006)
  • X. Jiang et al. Constructing and training feed-forward neural networks for pattern classification. Pattern Recognition (2003)
  • B. Karthikeyan et al. Conception of complex probabilistic neural network system for classification of partial discharge patterns using multifarious inputs. Expert Systems with Applications (2005)
  • M. Khashei et al. A new hybrid artificial neural networks and fuzzy regression model for time series forecasting. Fuzzy Sets and Systems (2008)
  • S.H. Kim et al. Graded forecasting using an array of bipolar predictions: Application of probabilistic neural networks to a stock market index. International Journal of Forecasting (1998)
  • D. Kim et al. Application of probabilistic neural network to design breakwater armor blocks. Ocean Engineering (2008)
  • H. Kim et al. A hybrid approach based on neural networks and genetic algorithms for detecting temporal patterns in stock markets. Applied Soft Computing (2007)
  • J. Lee et al. GA based meta-modeling of BPN architecture for constrained approximate optimization. International Journal of Solids and Structures (2007)
  • J. Leski et al. A new artificial neural network based fuzzy inference system with moving consequents in if-then rules and selected applications. Fuzzy Sets and Systems (1999)
  • Y. Lin et al. Fuzzy system models combined with nonlinear regression for daily ground-level ozone predictions. Atmospheric Environment (2007)
  • L. Ma et al. A new strategy for adaptively constructing multilayer feed-forward neural networks. Neurocomputing (2003)
  • R.A. Meese et al. Empirical exchange rate models of the seventies: Do they fit out of sample? Journal of International Economics (1983)
  • C. Ong et al. Model identification of ARIMA family using genetic algorithms. Applied Mathematics and Computation (2005)
  • P.F. Pai. Hybrid ellipsoidal fuzzy systems in forecasting regional electricity loads. Energy Conversion and Management (2006)
  • P.F. Pai et al. A hybrid ARIMA and support vector machines model in stock price forecasting. Omega (2005)
  • Y. Shan et al. Application of probabilistic neural network in the clinical diagnosis of cancers based on clinical chemistry data. Analytica Chimica Acta (2002)
  • L. Shang et al. Palm print recognition using fast ICA algorithm and radial basis probabilistic neural network. Neurocomputing (2006)
  • D. Specht. Probabilistic neural networks. Neural Networks (1990)
  • D. Srinivasan et al. Adaptive neural network models for automatic incident detection on freeways. Neurocomputing (2005)
  • L. Stone et al. Chaotic oscillations and cycles in multi-trophic ecological systems. Journal of Theoretical Biology (2007)
  • G. Sun et al. Tumor tissue identification based on gene expression data using DWT feature extraction and PNN classifier. Neurocomputing (2006)
  • C.M. Tam et al. Diagnosis of prestressed concrete pile defects using probabilistic neural networks. Engineering Structures (2004)
  • Y. Tang et al. A consistent nonparametric Bayesian procedure for estimating autoregressive conditional densities. Computational Statistics & Data Analysis (2007)
  • N. Terui et al. Combined forecasts from linear and nonlinear time series models. International Journal of Forecasting (2002)
  • F.M. Tseng et al. Fuzzy ARIMA model for forecasting the foreign exchange market. Fuzzy Sets and Systems (2001)
  • F.M. Tseng et al. Combining neural network model with seasonal time series ARIMA model. Technological Forecasting & Social Change (2002)
  • C. Xue et al. Study of probabilistic neural networks to classify the active compounds in medicinal plants. Journal of Pharmaceutical and Biomedical Analysis (2005)