Elsevier

Neurocomputing

Volume 72, Issues 10–12, June 2009, Pages 2581-2594
Neurocomputing

Rule induction for forecasting method selection: Meta-learning the characteristics of univariate time series

https://doi.org/10.1016/j.neucom.2008.10.017Get rights and content

Abstract

For univariate forecasting, there are various statistical models and computational algorithms available. In real-world exercises, too many choices can create difficulties in selecting the most appropriate technique, especially for users lacking sufficient knowledge of forecasting. This study focuses on rule induction for forecasting method selection by understanding the nature of historical forecasting data. A novel approach for selecting a forecasting method for univariate time series based on measurable data characteristics is presented that combines elements of data mining, meta-learning, clustering, classification and statistical measurement. We conducted a large-scale empirical study of over 300 time series using four of the most popular forecasting methods. To provide a rich portrait of the global characteristics of univariate time series, we extracted measures from a comprehensive set of features such as trend, seasonality, periodicity, serial correlation, skewness, kurtosis, nonlinearity, self-similarity, and chaos. Both supervised and unsupervised learning methods are used to learn the relationship between the characteristics of the time series and the forecasting method suitability, providing both recommendation rules, as well as visualizations in the feature space. A derived weighting schema based on the rule induction is also used to improve forecasting accuracy based on combined forecasting models.

Introduction

Time series forecasting has been a traditional research area for decades, and various statistical models and advanced computational algorithms have been developed to improve forecasting accuracy. With the continuous emergence of more methods, forecasters have been given more choices. However, more options could also create potential problems in practice especially when forecasts are based on a trial-and-error procedure with little understanding of the conditions under which certain forecasting methods perform well. Certainly the ‘no free lunch theorem’ [1] informs us that there is never likely to be a single method that fits all situations. In the forecasting context, therefore, recommendation rules on how to select a suitable forecasting method for a given type of time series have attracted attention.

From more than a decade ago, the research on forecasting methods selection attracted many attempts to find recommendations and rules [2], [3], [4], [5] mostly based on expert systems approaches. Certainly there are obvious limitations for systems based on human judgment, with the strongest concern being that expert system based rules are not dynamic and therefore require significant rework and validation prior to updating. This is not a trivial problem especially when the forecasting domain or situation changes. In this study, we aim to develop an automated rule induction system that couples forecasting methods performance with time series data characteristics. Metrics to characterize a time series are developed to provide a rich portrait of the time series including its trend, seasonality, serial correlation, nonlinearity, skewness, kurtosis, self-similarity, chaos, and periodicity. Self-organizing maps (SOMs) and decision trees (DTs) are used to induce rules explaining the relationships between these characteristics and forecasting method performance. The induced rules from such a system are envisaged to provide recommendations to forecasters on how to select forecasting methods. In the proposed system, a data-driven approach based on a meta-learning framework and machine learning algorithms are employed, reducing the dependence on expert knowledge. Such an automated system is more flexible, adaptive and efficient when situations change and rules are required to be revised.

After outlining related work in forecasting method selection, as well as relevant cross-disciplinary work in Section 2, we explain the detailed components and procedures of the proposed meta-learning based system in Section 3. Then the background knowledge on four forecasting methods which were used as candidates in our empirical study is provided in Section 4. Section 5 then follows in which each identified characteristic for univariate time series data in our study are introduced including algorithms used to extract descriptive metric for each characteristic. Three machine learning techniques used for learning the relationship between time series characteristics and forecasting method performance are discussed in Section 6. Our empirical study and experimental results including induced rules are demonstrated in Section 7. Future research directions are discussed and conclusions drawn in Section 8.

Section snippets

Related work

In the literature on forecasting method selection, there are two common approaches: (1) comparing the track record of various approaches and using expert knowledge to provide guidelines to select forecasting methods and (2) using the results of large empirical studies to estimate a relationship between data features and model performance. The first approach has been developed over many decades. To select a forecasting method, some general guidelines consisting of many factors—convenience,

Meta-learning based system for rule induction

Meta-learning was proposed to support data mining tasks and to understand the conditions under which a given learning strategy is most appropriate for a given task. Meta-learning involves a process of studying the relationships between learning strategies and tasks [15]. The central property of the meta-learning approach is to understand the nature of data, and to learn to select the method which performs best for certain types of data.

We adapt a meta-learning architecture from Vilalta's

Background: forecasting methods

Forecasting is designed to predict possible future alternatives and helps current planning and decision making. For example, the forecasting of annual student enrollment is critical information for a university to determine financial plans and design strategies. Time series analysis provides foundations for forecasting model construction and selection based on historical data. Modeling the time series is a complex problem, because the difference in characteristics of time series data can make

Time series characteristics extraction

In this study, we investigated various data characteristics from diverse perspectives related to univariate time series structure-based characteristic identification and feature extraction. We selected the nine most informative, representative and easily-measurable characteristics to summarize the time series structure. Based on these identified characteristics, corresponding metrics are calculated. The extracted data characteristics and corresponding metrics are mapped to forecasting

Machine learning techniques

After global characteristics and corresponding metrics have been defined, we then can use this finite set of descriptors to characterize or analyze time series data using appropriate machine learning techniques such as clustering algorithms and DTs. The mining of time series data has attracted great attention in the data mining community in recent years and many clustering algorithms have been applied to search for the similarity between series. k-means clustering is the most commonly used

Data sets

In our empirical study, we included various types of data sets consisting of synthetic and real-world time series from different domains such as economics, medical, and engineering. We included 46 data sets from the UCR time series data mining archive [54] which covers data sets of time series from diverse fields, including finance, medicine, biometrics, chemistry, astronomy, robotics, and networking industry. These data sets have the complete spectrum of stationary, non-stationary, noisy,

Future research and conclusions

In this research, we have focused on analyzing the nature of the time series data and developing a novel approach to generate recommendation rules for selection of forecasting methods based on data characteristics of the time series. The research work presented in this paper has not only extended the study on forecasting rules generation with a wider range of forecasting methods and algorithms, but has also deepened the research into a more specific or quantitative manner rather than merely

Xiaozhe Wang is a lecturer at School of Management, LaTrobe University. Prior to joining LaTrobe University, she obtained a Ph.D. from Monash University, and was a Research Fellow at both Monash University and the University of Melbourne, Australia. Dr. Wang also worked as senior statistician in industry after finished her Ph.D. Her research interests are data mining, machine learning, meta-learning and time series forecasting. Her research have been published in journals, book chapter and

References (59)

  • N. Meade

    Evidence for the selection of forecasting methods

    International Journal of Forecasting

    (2000)
  • J.S. Armstrong

    Research needs in forecasting

    International Journal of Forecasting

    (1988)
  • J.C. Chambers et al.

    How to choose the right forecasting technique

    Harvard Business Review

    (1971)
  • D.M. Georgoff et al.

    Manager's guide to forecasting

    Harvard Business Review

    (1986)
  • L. Moutinho et al.

    Expert systems: a new tool in marketing

    Qualitative Review in Marketing

    (1988)
  • D.J. Reid, A comparison of forecasting techniques on economic time series, in: Forecasting in Action, OR Society,...
  • K.A. Smith-Miles, Cross-disciplinary perspectives on meta-learning for algorithm selection, ACM Computing Surveys 41...
  • R. Vilalta et al.

    Using meta-learning to support data-mining

    International Journal of Computer Science Applications I

    (2004)
  • J.S. Armstrong, (Ed.), Principles of Forecasting: A Handbook for Researchers and Practitioners, Kluwer Academic...
  • S. Makridakis et al.

    Forecasting Methods and Applications

    (1998)
  • A.R. Ganguly, Hybrid statistical and data mining approaches for forecasting complex systems, in: Proceedings of the...
  • C.C. Pegels

    Exponential forecasting: some new variations

    Management Science

    (1969)
  • E.S. Gardner

    Exponential smoothing: the state of the art

    International Journal of Forecasting

    (1985)
  • G.E.P. Box, G.M. Jenkins, Time Series Analysis: Forecasting and Control, Holden-Day, San Fransisco, CA,...
  • G.E.P. Box et al.

    Time Series Analysis: Forecasting and Control

    (1994)
  • S.C. Ahalt, P. Chen, C.T. Chou, Proceedings of the Second International IEEE Conference on Tools for Artificial...
  • Forecasting-Principles, Forecasting with Artificial Neural Networks, Special Interest Group,...
  • P.J. Werbos, Beyong regression: new tools for prediction and analysis in the behavioral sciences, Ph.D. Thesis, Harvard...
  • J.M. Zurada

    An Introduction to Artificial Neural Systems

    (1992)
  • Cited by (153)

    • Forecast combinations: An over 50-year review

      2023, International Journal of Forecasting
    • Model combinations through revised base rates

      2023, International Journal of Forecasting
    View all citing articles on Scopus

    Xiaozhe Wang is a lecturer at School of Management, LaTrobe University. Prior to joining LaTrobe University, she obtained a Ph.D. from Monash University, and was a Research Fellow at both Monash University and the University of Melbourne, Australia. Dr. Wang also worked as senior statistician in industry after finished her Ph.D. Her research interests are data mining, machine learning, meta-learning and time series forecasting. Her research have been published in journals, book chapter and conference proceedings since 2002.

    Kate Smith-Miles is a Professor and Head of the School of Mathematical Sciences at Monash University in Australia. She obtained a B.Sc.(Hons.) in Mathematics and a Ph.D. in Electrical Engineering, both from the University of Melbourne, Australia. Kate has published two books on neural networks and data mining applications, and over 175 refereed journal and international conference papers in the areas of neural networks, combinatorial optimization, intelligent systems and data mining. She has been awarded over AUD $1.75 million in competitive grants, including eight Australian Research Council grants and industry awards. She is on the editorial board of several international journals, including IEEE Transactions on Neural Networks, has been program chair for several international conferences (e.g. HIS’03, CIDM’09) and has chaired the IEEE Computational Intelligence Society's Technical Committee on Data Mining (2007–2008). She is a frequent reviewer of international research activities including grant applications in Canada, UK, Finland, Singapore and Australia, refereeing for international research journals, and Ph.D. examinations. In addition to her academic activities, she also regularly acts as a consultant to industry in the areas of optimization, data mining and intelligent systems.

    Rob Hyndman is Professor of Statistics at Monash University, Australia, and holds a Ph.D. in Statistics from the University of Melbourne. He is the Editor-in-Chief of the International Journal of Forecasting and Director of the Business and Economic Forecasting Unit, Monash University, one of the leading forecasting research groups in the world. He is currently supervising seven Ph.D. students on forecasting-related projects. Rob is also an experienced consultant and has worked with over 200 clients during the last 20 years, on projects covering all areas of applied statistics from forecasting to the ecology of lemmings. He is co-author of the well-known textbook Forecasting: Methods and Applications (Wiley, 1998) with Makridakis and Wheelwright, and has had more than 50 published papers in many journals.

    View full text