Skip to main content

2020 | OriginalPaper | Buchkapitel

Optimizing Ensemble Weights for Machine Learning Models: A Case Study for Housing Price Prediction

verfasst von : Mohsen Shahhosseini, Guiping Hu, Hieu Pham

Erschienen in: Smart Service Systems, Operations Management, and Analytics

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Designing ensemble learners has been recognized as one of the significant trends in the field of data knowledge, especially, in data science competitions. Building models that are able to outperform all individual models in terms of bias, which is the error due to the difference in the average model predictions and actual values, and variance, which is the variability of model predictions, has been the main goal of the studies in this area. An optimization model has been proposed in this paper to design ensembles that try to minimize bias and variance of predictions. Focusing on service sciences, two well-known housing datasets have been selected as case studies: Boston housing and Ames housing. The results demonstrate that our designed ensembles can be very competitive in predicting the house prices in both Boston and Ames datasets.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat World development indicators. World Bank (1978) World development indicators. World Bank (1978)
2.
Zurück zum Zitat B. Hefley, W. Murphy, Service Science, Management and Engineering: Education for the 21st Century. Springer Science & Business Media (2008) B. Hefley, W. Murphy, Service Science, Management and Engineering: Education for the 21st Century. Springer Science & Business Media (2008)
3.
Zurück zum Zitat H. Katzan, Foundations of service science concepts and facilities. J. Serv. Sci. 1(1) (2008)CrossRef H. Katzan, Foundations of service science concepts and facilities. J. Serv. Sci. 1(1) (2008)CrossRef
4.
Zurück zum Zitat G. Xiong, Z. Liu, X. Liu, F. Zhu, D. Shen, Service Science, Management, and Engineering: Theory and Applications. Academic (2012) G. Xiong, Z. Liu, X. Liu, F. Zhu, D. Shen, Service Science, Management, and Engineering: Theory and Applications. Academic (2012)
5.
Zurück zum Zitat L. Breiman, Statistical modeling: the two cultures (with comments and a rejoinder by the author). Stat. Sci. 16(3), 199–231 (2001)CrossRef L. Breiman, Statistical modeling: the two cultures (with comments and a rejoinder by the author). Stat. Sci. 16(3), 199–231 (2001)CrossRef
6.
Zurück zum Zitat B. Park, J.K. Bae, Using machine learning algorithms for housing price prediction: the case of Fairfax County, Virginia housing data. Expert Syst. Appl. 42(6), 2928–2934 (2015)CrossRef B. Park, J.K. Bae, Using machine learning algorithms for housing price prediction: the case of Fairfax County, Virginia housing data. Expert Syst. Appl. 42(6), 2928–2934 (2015)CrossRef
7.
Zurück zum Zitat J. Gu, M. Zhu, L. Jiang, Housing price forecasting based on genetic algorithm and support vector machine. Expert Syst. Appl. 38(4), 3383–3386 (2011)CrossRef J. Gu, M. Zhu, L. Jiang, Housing price forecasting based on genetic algorithm and support vector machine. Expert Syst. Appl. 38(4), 3383–3386 (2011)CrossRef
8.
Zurück zum Zitat X. Wang, J. Wen, Y. Zhang, Y. Wang, Real estate price forecasting based on SVM optimized by PSO. Opt.-Int. J. Light Electron Opt. 125(3), 1439–1443 (2014)CrossRef X. Wang, J. Wen, Y. Zhang, Y. Wang, Real estate price forecasting based on SVM optimized by PSO. Opt.-Int. J. Light Electron Opt. 125(3), 1439–1443 (2014)CrossRef
9.
Zurück zum Zitat H. Selim, Determinants of house prices in Turkey: hedonic regression versus artificial neural network. Expert Syst. Appl. 36(2), 2843–2852 (2009)CrossRef H. Selim, Determinants of house prices in Turkey: hedonic regression versus artificial neural network. Expert Syst. Appl. 36(2), 2843–2852 (2009)CrossRef
10.
Zurück zum Zitat E.A. Antipov, E.B. Pokryshevskaya, Mass appraisal of residential apartments: an application of random forest for valuation and a CART-based approach for model diagnostics. Expert Syst. Appl. 39(2), 1772–1778 (2012)CrossRef E.A. Antipov, E.B. Pokryshevskaya, Mass appraisal of residential apartments: an application of random forest for valuation and a CART-based approach for model diagnostics. Expert Syst. Appl. 39(2), 1772–1778 (2012)CrossRef
11.
Zurück zum Zitat P.-N. Tan, M. Steinbach, V. Kumar, Introduction to data mining, addison, ed. by M.A. Boston, USA: Wesley Longman, Publishing Co., Inc (2005) P.-N. Tan, M. Steinbach, V. Kumar, Introduction to data mining, addison, ed. by M.A. Boston, USA: Wesley Longman, Publishing Co., Inc (2005)
12.
Zurück zum Zitat D. Talia, P. Trunfio, F. Marozzo, Data analysis in the cloud: models, techniques and applications. Elsevier (2015) D. Talia, P. Trunfio, F. Marozzo, Data analysis in the cloud: models, techniques and applications. Elsevier (2015)
13.
Zurück zum Zitat M. Sugiyama, Introduction to statistical machine learning. Morgan Kaufmann (2015) M. Sugiyama, Introduction to statistical machine learning. Morgan Kaufmann (2015)
14.
Zurück zum Zitat G. Wang, J. Hao, J. Ma, H. Jiang, A comparative assessment of ensemble learning for credit scoring. Expert Syst. Appl. 38(1), 223–230 (2011)CrossRef G. Wang, J. Hao, J. Ma, H. Jiang, A comparative assessment of ensemble learning for credit scoring. Expert Syst. Appl. 38(1), 223–230 (2011)CrossRef
15.
Zurück zum Zitat L. Nanni, A. Lumini, An experimental comparison of ensemble of classifiers for bankruptcy prediction and credit scoring. Expert Syst. Appl. 36(2), 3028–3033 (2009)CrossRef L. Nanni, A. Lumini, An experimental comparison of ensemble of classifiers for bankruptcy prediction and credit scoring. Expert Syst. Appl. 36(2), 3028–3033 (2009)CrossRef
16.
Zurück zum Zitat J. Friedman, T. Hastie, R. Tibshirani, The Elements of Statistical Learning (no. 10). Springer series in statistics New York (2001) J. Friedman, T. Hastie, R. Tibshirani, The Elements of Statistical Learning (no. 10). Springer series in statistics New York (2001)
17.
Zurück zum Zitat L. Breiman, Bias, variance, and arcing classifiers (1996) L. Breiman, Bias, variance, and arcing classifiers (1996)
18.
Zurück zum Zitat H. Pham, S. Olafsson, Bagged ensembles with tunable parameters. Comput. Intell. 35(1), 184–203 (2019)CrossRef H. Pham, S. Olafsson, Bagged ensembles with tunable parameters. Comput. Intell. 35(1), 184–203 (2019)CrossRef
19.
Zurück zum Zitat H. Pham, S. Olafsson, On Cesaro averages for weighted trees in the random forest. J. Classif. (2019) H. Pham, S. Olafsson, On Cesaro averages for weighted trees in the random forest. J. Classif. (2019)
20.
Zurück zum Zitat S. Boyd, L. Vandenberghe, Convex Optimization. Cambridge university press, Cambridge (2004) S. Boyd, L. Vandenberghe, Convex Optimization. Cambridge university press, Cambridge (2004)
22.
Zurück zum Zitat D. Kraft, A software package for sequential quadratic programming, Forschungsbericht- Deutsche Forschungs- und Versuchsanstalt fur Luft- und Raumfahrt (1988) D. Kraft, A software package for sequential quadratic programming, Forschungsbericht- Deutsche Forschungs- und Versuchsanstalt fur Luft- und Raumfahrt (1988)
23.
Zurück zum Zitat A. Wendorff, E. Botero, J.J. Alonso, Comparing different off-the-shelf optimizers’ performance in conceptual aircraft design, in 17th AIAA/ISSMO Multidisciplinary Analysis and Optimization Conference (2016), p. 336 A. Wendorff, E. Botero, J.J. Alonso, Comparing different off-the-shelf optimizers’ performance in conceptual aircraft design, in 17th AIAA/ISSMO Multidisciplinary Analysis and Optimization Conference (2016), p. 336
24.
Zurück zum Zitat D. Harrison Jr., D.L. Rubinfeld, Hedonic housing prices and the demand for clean air. J. Environ. Econ. Manag. 5(1), 81–102 (1978)CrossRef D. Harrison Jr., D.L. Rubinfeld, Hedonic housing prices and the demand for clean air. J. Environ. Econ. Manag. 5(1), 81–102 (1978)CrossRef
Metadaten
Titel
Optimizing Ensemble Weights for Machine Learning Models: A Case Study for Housing Price Prediction
verfasst von
Mohsen Shahhosseini
Guiping Hu
Hieu Pham
Copyright-Jahr
2020
DOI
https://doi.org/10.1007/978-3-030-30967-1_9

Premium Partner