Skip to main content
Erschienen in: Progress in Artificial Intelligence 2/2018

07.11.2017 | Regular Paper

A learning system based on lazy metareasoning

verfasst von: Tor Gunnar Houeland, Agnar Aamodt

Erschienen in: Progress in Artificial Intelligence | Ausgabe 2/2018

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Metareasoning has been widely studied in the literature, with a wide variety of algorithms and partially overlapping methodological approaches. However, these methods are typically either not targeted toward practical machine learning systems or alternatively are focused on achieving the best possible performance for a particular domain, with extensive human tuning and research, and vast computing resources. In this paper, our goal is to create systems that perform sustained autonomous learning, with automatically determined domain-specific optimizations for any given domain, and without requiring human assistance. We present Alma, a metareasoning architecture that creates and selects reasoning methods based on empirically observed performance. This is achieved by using lazy learning at the metalevel, and automatically training and combining reasoning methods at run-time. In experiments across diverse data sets, we demonstrate the ability of Alma to successfully reason about learner performance in different domains and achieve a better overall result than any of the individual reasoning methods, even with limited computing time available.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Fußnoten
1
Algorithms with stochastic behavior are modeled as different random number generator states being different functions.
 
2
Standard UCB1 can provably achieve asymptotic zero-regret for the basic multi-armed bandit problem, while this has not been shown for UCB1TUNED. However, the zero-regret proof doesn’t apply to the harder metareasoning problem Alma is addressing, and for our components, we are primarily interested in practical performance.
 
Literatur
1.
Zurück zum Zitat Aha, D.W. (ed.): Lazy Learning. Kluwer, Norwell (1997)MATH Aha, D.W. (ed.): Lazy Learning. Kluwer, Norwell (1997)MATH
3.
Zurück zum Zitat Bates, J.M., Granger, C.W.: The combination of forecasts. J. Oper. Res. Soc. 20(4), 451–468 (1969)CrossRef Bates, J.M., Granger, C.W.: The combination of forecasts. J. Oper. Res. Soc. 20(4), 451–468 (1969)CrossRef
4.
Zurück zum Zitat Bennett, J., Lanning, S., et al.: The Netflix prize. In: Proceedings of the KDD Cup and Workshop, vol. 2007, p. 35. New York, NY, USA (2007) Bennett, J., Lanning, S., et al.: The Netflix prize. In: Proceedings of the KDD Cup and Workshop, vol. 2007, p. 35. New York, NY, USA (2007)
5.
Zurück zum Zitat Bonissone, P.P.: Lazy meta-learning: creating customized model ensembles on demand. In: IEEE World Congress on Computational Intelligence, pp. 1–23. Springer, Berlin (2012) Bonissone, P.P.: Lazy meta-learning: creating customized model ensembles on demand. In: IEEE World Congress on Computational Intelligence, pp. 1–23. Springer, Berlin (2012)
6.
Zurück zum Zitat Brügmann, B.: Monte Carlo Go. In: AAAI Fall Symposium on Games: Playing, Planning, and Learning (1993) Brügmann, B.: Monte Carlo Go. In: AAAI Fall Symposium on Games: Playing, Planning, and Learning (1993)
7.
Zurück zum Zitat Caruana, R., Niculescu-Mizil, A., Crew, G., Ksikes, A.: Ensemble selection from libraries of models. In: Proceedings of the Twenty-First International Conference on Machine Learning, p. 18. ACM (2004) Caruana, R., Niculescu-Mizil, A., Crew, G., Ksikes, A.: Ensemble selection from libraries of models. In: Proceedings of the Twenty-First International Conference on Machine Learning, p. 18. ACM (2004)
8.
Zurück zum Zitat Cesa-Bianchi, N., Lugosi, G.: Prediction, Learning, and Games. Cambridge University Press, Cambridge (2006)CrossRefMATH Cesa-Bianchi, N., Lugosi, G.: Prediction, Learning, and Games. Cambridge University Press, Cambridge (2006)CrossRefMATH
9.
Zurück zum Zitat Chakrabarti, D., Kumar, R., Radlinski, F., Upfal, E.: Mortal multi-armed bandits. In: Koller, D., Schuurmans, D., Bengio, Y., Bottou, L. (eds.) Advances in Neural Information Processing Systems 21, pp. 273–280. Curran Associates, Inc. (2009) Chakrabarti, D., Kumar, R., Radlinski, F., Upfal, E.: Mortal multi-armed bandits. In: Koller, D., Schuurmans, D., Bengio, Y., Bottou, L. (eds.) Advances in Neural Information Processing Systems 21, pp. 273–280. Curran Associates, Inc. (2009)
10.
Zurück zum Zitat Chan, P.K., Stolfo, S.J.: Experiments on multistrategy learning by meta-learning. In: Proceedings of the Second International Conference on Information and Knowledge Management, pp. 314–323. ACM (1993) Chan, P.K., Stolfo, S.J.: Experiments on multistrategy learning by meta-learning. In: Proceedings of the Second International Conference on Information and Knowledge Management, pp. 314–323. ACM (1993)
11.
Zurück zum Zitat Clemen, R.T.: Combining forecasts: a review and annotated bibliography. Int. J. Forecast. 5(4), 559–583 (1989)CrossRef Clemen, R.T.: Combining forecasts: a review and annotated bibliography. Int. J. Forecast. 5(4), 559–583 (1989)CrossRef
12.
Zurück zum Zitat Cox, M.: Introspective Multistrategy Learning: Constructing a Learning Strategy Under Reasoning Failure. Ph.D. Thesis, College of Computing, Georgia Institute of Technology (1996) Cox, M.: Introspective Multistrategy Learning: Constructing a Learning Strategy Under Reasoning Failure. Ph.D. Thesis, College of Computing, Georgia Institute of Technology (1996)
13.
Zurück zum Zitat Cox, M.T., Eiselt, K., Kolodner, J., Nersessian, N., Recker, M., Simon, T.: Introspective multistrategy learning: on the construction of learning strategies. Artif. Intell. 112, 1–55 (1999)CrossRef Cox, M.T., Eiselt, K., Kolodner, J., Nersessian, N., Recker, M., Simon, T.: Introspective multistrategy learning: on the construction of learning strategies. Artif. Intell. 112, 1–55 (1999)CrossRef
16.
Zurück zum Zitat Francis, A.G., Ram, A.: The utility problem in case-based reasoning. In: AAAICBR-93, Proceedings of the 1993 Case-Based Reasoning Workshop (1993) Francis, A.G., Ram, A.: The utility problem in case-based reasoning. In: AAAICBR-93, Proceedings of the 1993 Case-Based Reasoning Workshop (1993)
17.
Zurück zum Zitat Gelly, S., Wang, Y.: Exploration Exploitation in Go: UCT for Monte-Carlo Go. In: Twentieth Annual Conference on Neural Information Processing Systems (NIPS 2006) (2006) Gelly, S., Wang, Y.: Exploration Exploitation in Go: UCT for Monte-Carlo Go. In: Twentieth Annual Conference on Neural Information Processing Systems (NIPS 2006) (2006)
18.
Zurück zum Zitat Guyon, I., Chaabane, I., Escalante, H.J., Escalera, S., Jajetic, D., Lloyd, J.R., Macià, N., Ray, B., Romaszko, L., Sebag, M., Statnikov, A., Treguer, S., Viegas, E.: A brief review of the ChaLearn AutoML challenge: any-time any-dataset learning without human intervention. In: Hutter, F., Kotthoff, L., Vanschoren, J. (eds.) Proceedings of the Workshop on Automatic Machine Learning, Proceedings of the Machine Learning Research, vol. 64, pp. 21–30. PMLR, New York, NY, USA (2016). http://proceedings.mlr.press/v64/guyon_review_2016.html Guyon, I., Chaabane, I., Escalante, H.J., Escalera, S., Jajetic, D., Lloyd, J.R., Macià, N., Ray, B., Romaszko, L., Sebag, M., Statnikov, A., Treguer, S., Viegas, E.: A brief review of the ChaLearn AutoML challenge: any-time any-dataset learning without human intervention. In: Hutter, F., Kotthoff, L., Vanschoren, J. (eds.) Proceedings of the Workshop on Automatic Machine Learning, Proceedings of the Machine Learning Research, vol. 64, pp. 21–30. PMLR, New York, NY, USA (2016). http://​proceedings.​mlr.​press/​v64/​guyon_​review_​2016.​html
19.
Zurück zum Zitat Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.H.: The WEKA data mining software: an update. SIGKDD Explor. Newsl. 11(1), 10–18 (2009)CrossRef Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.H.: The WEKA data mining software: an update. SIGKDD Explor. Newsl. 11(1), 10–18 (2009)CrossRef
20.
Zurück zum Zitat Holte, R.: Very simple classification rules perform well on most commonly used datasets. Mach. Learn. 11, 63–91 (1993)CrossRefMATH Holte, R.: Very simple classification rules perform well on most commonly used datasets. Mach. Learn. 11, 63–91 (1993)CrossRefMATH
21.
Zurück zum Zitat Houeland, T.G., Aamodt, A.: Towards an introspective architecture for meta-level reasoning in clinical decision support systems. In: ICCBR 2009, 7th Workshop on CBR in the Health Sciences (2009) Houeland, T.G., Aamodt, A.: Towards an introspective architecture for meta-level reasoning in clinical decision support systems. In: ICCBR 2009, 7th Workshop on CBR in the Health Sciences (2009)
23.
Zurück zum Zitat Houeland, T.G., Bruland, T., Aamodt, A., Langseth, H.: Extended abstract: combining CBR and BN using metareasoning. In: Kofod-Petersen, A., Heintz, F., Langseth, H. (eds.) SCAI, Frontiers in Artificial Intelligence and Applications, vol. 227, pp. 189–190. IOS Press (2011). https://doi.org/10.3233/978-1-60750-754-3-189 Houeland, T.G., Bruland, T., Aamodt, A., Langseth, H.: Extended abstract: combining CBR and BN using metareasoning. In: Kofod-Petersen, A., Heintz, F., Langseth, H. (eds.) SCAI, Frontiers in Artificial Intelligence and Applications, vol. 227, pp. 189–190. IOS Press (2011). https://​doi.​org/​10.​3233/​978-1-60750-754-3-189
24.
Zurück zum Zitat Kocsis, L., Szepesvári, C.: Bandit based monte-carlo planning. ECML-06. Number 4212 in LNCS, pp. 282–293. Springer, Berlin (2006) Kocsis, L., Szepesvári, C.: Bandit based monte-carlo planning. ECML-06. Number 4212 in LNCS, pp. 282–293. Springer, Berlin (2006)
25.
Zurück zum Zitat Kotthoff, L., Thornton, C., Hoos, H.H., Hutter, F., Leyton-Brown, K.: Auto-weka 2.0: automatic model selection and hyperparameter optimization in weka. J. Mach. Learn. Res. 17, 1–5 (2016)MATH Kotthoff, L., Thornton, C., Hoos, H.H., Hutter, F., Leyton-Brown, K.: Auto-weka 2.0: automatic model selection and hyperparameter optimization in weka. J. Mach. Learn. Res. 17, 1–5 (2016)MATH
26.
Zurück zum Zitat Krawczyk, B., Minku, L.L., Gama, J., Stefanowski, J., Woźniak, M.: Ensemble learning for data stream analysis: a survey. Inf. Fusion 37, 132–156 (2017)CrossRef Krawczyk, B., Minku, L.L., Gama, J., Stefanowski, J., Woźniak, M.: Ensemble learning for data stream analysis: a survey. Inf. Fusion 37, 132–156 (2017)CrossRef
27.
Zurück zum Zitat Laird, J.: The Soar Cognitive Architecture. MIT Press, Cambridge (2012) Laird, J.: The Soar Cognitive Architecture. MIT Press, Cambridge (2012)
28.
Zurück zum Zitat Lemke, C., Budka, M., Gabrys, B.: Metalearning: a survey of trends and technologies. Artif. Intell. Rev. 44(1), 117–130 (2015)CrossRef Lemke, C., Budka, M., Gabrys, B.: Metalearning: a survey of trends and technologies. Artif. Intell. Rev. 44(1), 117–130 (2015)CrossRef
29.
Zurück zum Zitat Lenz, M.: CABATA: a hybrid CBR system. In: Richter, M.M., Wess, S., Althoff, K.D., Maurer, F. (eds.) First European Workshop on Case-Based Reasoning (EWCBR-93): Posters and Presentations (volume I), pp. 204–209 (1993) Lenz, M.: CABATA: a hybrid CBR system. In: Richter, M.M., Wess, S., Althoff, K.D., Maurer, F. (eds.) First European Workshop on Case-Based Reasoning (EWCBR-93): Posters and Presentations (volume I), pp. 204–209 (1993)
33.
Zurück zum Zitat Mendes-Moreira, J., Soares, C., Jorge, A.M., Sousa, J.F.D.: Ensemble approaches for regression: a survey. ACM Comput. Surv. (CSUR) 45(1), 10 (2012)CrossRefMATH Mendes-Moreira, J., Soares, C., Jorge, A.M., Sousa, J.F.D.: Ensemble approaches for regression: a survey. ACM Comput. Surv. (CSUR) 45(1), 10 (2012)CrossRefMATH
34.
Zurück zum Zitat Mitchell, T.M.: The Need for Biases in Learning Generalizations. Tech. rep (1980) Mitchell, T.M.: The Need for Biases in Learning Generalizations. Tech. rep (1980)
35.
Zurück zum Zitat Nascimento, D.S., Canuto, A.M., Coelho, A.L.: An empirical analysis of meta-learning for the automatic choice of architecture and components in ensemble systems. In: 2014 Brazilian Conference on Intelligent Systems (BRACIS), pp. 1–6. IEEE (2014) Nascimento, D.S., Canuto, A.M., Coelho, A.L.: An empirical analysis of meta-learning for the automatic choice of architecture and components in ensemble systems. In: 2014 Brazilian Conference on Intelligent Systems (BRACIS), pp. 1–6. IEEE (2014)
38.
Zurück zum Zitat Rubin, J., Watson, I.: On combining decisions from multiple expert imitators for performance. In: Walsh, T. (ed.) IJCAI, pp. 344–349. IJCAI/AAAI (2011) Rubin, J., Watson, I.: On combining decisions from multiple expert imitators for performance. In: Walsh, T. (ed.) IJCAI, pp. 344–349. IJCAI/AAAI (2011)
42.
Zurück zum Zitat Silver, D., Huang, A., Maddison, C.J., Guez, A., Sifre, L., van den Driessche, G., Schrittwieser, J., Antonoglou, I., Panneershelvam, V., Lanctot, M., Dieleman, S., Grewe, D., Nham, J., Kalchbrenner, N., Sutskever, I., Lillicrap, T., Leach, M., Kavukcuoglu, K., Graepel, T., Hassabis, D.: Mastering the game of Go with deep neural networks and tree search. Nature 529(7587), 484–489 (2016). https://doi.org/10.1038/nature16961 CrossRef Silver, D., Huang, A., Maddison, C.J., Guez, A., Sifre, L., van den Driessche, G., Schrittwieser, J., Antonoglou, I., Panneershelvam, V., Lanctot, M., Dieleman, S., Grewe, D., Nham, J., Kalchbrenner, N., Sutskever, I., Lillicrap, T., Leach, M., Kavukcuoglu, K., Graepel, T., Hassabis, D.: Mastering the game of Go with deep neural networks and tree search. Nature 529(7587), 484–489 (2016). https://​doi.​org/​10.​1038/​nature16961 CrossRef
44.
Zurück zum Zitat Thompson, W.R.: On the likelihood that one unknown probability exceeds another in view of the evidence of two samples. Biometrika 25(3–4), 285–294 (1933)CrossRefMATH Thompson, W.R.: On the likelihood that one unknown probability exceeds another in view of the evidence of two samples. Biometrika 25(3–4), 285–294 (1933)CrossRefMATH
45.
Zurück zum Zitat Thornton, C., Hutter, F., Hoos, H.H., Leyton-Brown, K.: Auto-WEKA: Combined selection and hyperparameter optimization of classification algorithms. In: Proceedings of the KDD-2013, pp. 847–855 (2013) Thornton, C., Hutter, F., Hoos, H.H., Leyton-Brown, K.: Auto-WEKA: Combined selection and hyperparameter optimization of classification algorithms. In: Proceedings of the KDD-2013, pp. 847–855 (2013)
48.
Zurück zum Zitat Watson, I.: A case study of maintenance of a commercially fielded case-based reasoning system. Comput. Intell. 17, 387–398 (2001)CrossRef Watson, I.: A case study of maintenance of a commercially fielded case-based reasoning system. Comput. Intell. 17, 387–398 (2001)CrossRef
50.
Zurück zum Zitat Woźniak, M., Graña, M., Corchado, E.: A survey of multiple classifier systems as hybrid systems. Inf. Fusion 16, 3–17 (2014)CrossRef Woźniak, M., Graña, M., Corchado, E.: A survey of multiple classifier systems as hybrid systems. Inf. Fusion 16, 3–17 (2014)CrossRef
51.
Zurück zum Zitat Zhang, X.S., Shrestha, B., Yoon, S., Kambhampati, S., et al.: An ensemble architecture for learning complex problem-solving techniques from demonstration. ACM Trans. Intell. Syst. Technol. 3(4), 75:1–75:38 (2012)CrossRef Zhang, X.S., Shrestha, B., Yoon, S., Kambhampati, S., et al.: An ensemble architecture for learning complex problem-solving techniques from demonstration. ACM Trans. Intell. Syst. Technol. 3(4), 75:1–75:38 (2012)CrossRef
52.
Zurück zum Zitat Zhang, X.S., Yoon, S., DiBona, P., Appling, D.S., Ding, L., et al.: An ensemble learning and problem solving architecture for airspace management. In: Haigh, K.Z., Rychtyckyj, N. (eds.) Proceedings of Twenty-First Annual Conference on Innovative Applications of Artificial Intelligence (IAAI-09). AAAI (2009) Zhang, X.S., Yoon, S., DiBona, P., Appling, D.S., Ding, L., et al.: An ensemble learning and problem solving architecture for airspace management. In: Haigh, K.Z., Rychtyckyj, N. (eds.) Proceedings of Twenty-First Annual Conference on Innovative Applications of Artificial Intelligence (IAAI-09). AAAI (2009)
Metadaten
Titel
A learning system based on lazy metareasoning
verfasst von
Tor Gunnar Houeland
Agnar Aamodt
Publikationsdatum
07.11.2017
Verlag
Springer Berlin Heidelberg
Erschienen in
Progress in Artificial Intelligence / Ausgabe 2/2018
Print ISSN: 2192-6352
Elektronische ISSN: 2192-6360
DOI
https://doi.org/10.1007/s13748-017-0138-0

Weitere Artikel der Ausgabe 2/2018

Progress in Artificial Intelligence 2/2018 Zur Ausgabe