Skip to main content
Erschienen in: Neural Computing and Applications 7/2020

07.01.2019 | Original Article

Estimation of missing prices in real-estate market agent-based simulations with machine learning and dimensionality reduction methods

verfasst von: Iván García-Magariño, Carlos Medrano, Jorge Delgado

Erschienen in: Neural Computing and Applications | Ausgabe 7/2020

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The opacity of real-estate market involves some challenges in their agent-based simulation. While some real-estate Web sites provide the prices of a great amount of houses publicly, the prices of the rest are not available. The estimation of these prices is necessary for simulating their evolution from a complete initial set of houses. Additionally, this estimation could also be useful for other purposes such as appraising houses, letting buyers know which are the best offered prices (i.e., the lowest ones compared to the appraisals) and recommending the buyers to set an initial price. This work proposes combining dimensionality reduction methods with machine learning techniques to obtain the estimated prices. In particular, this work analyzes the use of nonnegative factorization, recursive feature elimination and feature selection with a variance threshold, as dimensionality reduction methods. It compares the application of linear regression, support vector regression, the k-nearest neighbors and a multilayer perceptron neural network, as machine learning techniques. This work has applied a tenfold cross-validation for comparing the estimations and errors and assessing the improvement over a basic estimator commonly used in the beginning of simulations. The developed software and the used dataset are freely available from a data research repository for the sake of reproducibility and the support to other researchers.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Fußnoten
1
https://​www.​idealista.​com (last accessed October 22, 2018).
 
2
http://​data.​gov.​uk/​ (last accessed September 16, 2017).
 
3
https://​www.​fotocasa.​es/​es/​ (last accessed October 22, 2018).
 
Literatur
1.
Zurück zum Zitat Anya O, Moore B, Kieliszewski C, Maglio P, Anderson L (2015) Understanding the practice of discovery in enterprise big data science: an agent-based approach. Procedia Manuf 3:882–889CrossRef Anya O, Moore B, Kieliszewski C, Maglio P, Anderson L (2015) Understanding the practice of discovery in enterprise big data science: an agent-based approach. Procedia Manuf 3:882–889CrossRef
2.
Zurück zum Zitat Bárcena Ruiz MJ, Menéndez P, Palacios MB, Tusell Palmer FJ (2011) Measuring the effect of the real estate bubble: a house price index for Bilbao. Biltoki 5463. http://hdl.handle.net/10810/5463. Last accessed 19 July 2017 Bárcena Ruiz MJ, Menéndez P, Palacios MB, Tusell Palmer FJ (2011) Measuring the effect of the real estate bubble: a house price index for Bilbao. Biltoki 5463. http://​hdl.​handle.​net/​10810/​5463. Last accessed 19 July 2017
3.
Zurück zum Zitat Becker T, Illigen C, McKelvey B, Hülsmann M, Windt K (2016) Using an agent-based neural-network computational model to improve product routing in a logistics facility. Int J Prod Econ 174:156–167CrossRef Becker T, Illigen C, McKelvey B, Hülsmann M, Windt K (2016) Using an agent-based neural-network computational model to improve product routing in a logistics facility. Int J Prod Econ 174:156–167CrossRef
4.
Zurück zum Zitat Bishop CM (2006) Pattern recognition and machine learning. Springer, New YorkMATH Bishop CM (2006) Pattern recognition and machine learning. Springer, New YorkMATH
5.
Zurück zum Zitat Borges F, Gutierrez-Milla A, Luque E, Suppi R (2017) Care HPS: a high performance simulation tool for parallel and distributed agent-based modeling. Future Gener Comput Syst 68:59–73CrossRef Borges F, Gutierrez-Milla A, Luque E, Suppi R (2017) Care HPS: a high performance simulation tool for parallel and distributed agent-based modeling. Future Gener Comput Syst 68:59–73CrossRef
6.
Zurück zum Zitat Bosch M, Carnero MA, Farré L (2015) Rental housing discrimination and the persistence of ethnic enclaves. SERIEs 6(2):129–152CrossRef Bosch M, Carnero MA, Farré L (2015) Rental housing discrimination and the persistence of ethnic enclaves. SERIEs 6(2):129–152CrossRef
7.
Zurück zum Zitat Brown JM, Phelps JJ, Barkwith A, Hurst MD, Ellis MA, Plater AJ (2016) The effectiveness of beach mega-nourishment, assessed over three management epochs. J Environ Manag 184:400–408CrossRef Brown JM, Phelps JJ, Barkwith A, Hurst MD, Ellis MA, Plater AJ (2016) The effectiveness of beach mega-nourishment, assessed over three management epochs. J Environ Manag 184:400–408CrossRef
9.
Zurück zum Zitat Chang CC, Chao CH, Yeh JH (2016) The role of buy-side anchoring bias: evidence from the real estate market. Pacific-Basin Finance J 38:34–58CrossRef Chang CC, Chao CH, Yeh JH (2016) The role of buy-side anchoring bias: evidence from the real estate market. Pacific-Basin Finance J 38:34–58CrossRef
10.
Zurück zum Zitat Chasco Yrigoyen C, Le Gallo J (2012) Hierarchy and spatial autocorrelation effects in hedonic models. Econ Bull 32(2):1474–1480 Chasco Yrigoyen C, Le Gallo J (2012) Hierarchy and spatial autocorrelation effects in hedonic models. Econ Bull 32(2):1474–1480
11.
Zurück zum Zitat Chen J, Feng S, Liu J (2014) Topic sense induction from social tags based on non-negative matrix factorization. Inf Sci 280:16–25CrossRefMathSciNet Chen J, Feng S, Liu J (2014) Topic sense induction from social tags based on non-negative matrix factorization. Inf Sci 280:16–25CrossRefMathSciNet
13.
Zurück zum Zitat Chung H, Badeau R, Plourde E, Champagne B (2018) Training and compensation of class-conditioned nmf bases for speech enhancement. Neurocomputing 284:107–118CrossRef Chung H, Badeau R, Plourde E, Champagne B (2018) Training and compensation of class-conditioned nmf bases for speech enhancement. Neurocomputing 284:107–118CrossRef
14.
Zurück zum Zitat Cicirelli F, Furfaro A, Giordano A, Nigro L (2011) HLA\_ACTOR\_REPAST: an approach to distributing RePast models for high-performance simulations. Simul Modell Pract Theory 19(1):283–300CrossRef Cicirelli F, Furfaro A, Giordano A, Nigro L (2011) HLA\_ACTOR\_REPAST: an approach to distributing RePast models for high-performance simulations. Simul Modell Pract Theory 19(1):283–300CrossRef
15.
Zurück zum Zitat Cui G, Zhuang G, Lu J (2016) Neural-network-based distributed adaptive synchronization for nonlinear multi-agent systems in pure-feedback form. Neurocomputing 218:234–241CrossRef Cui G, Zhuang G, Lu J (2016) Neural-network-based distributed adaptive synchronization for nonlinear multi-agent systems in pure-feedback form. Neurocomputing 218:234–241CrossRef
16.
Zurück zum Zitat Davidsson P (2002) Agent based social simulation: a computer science view. J Artif Soc Soc Simul 5(1):1–7 Davidsson P (2002) Agent based social simulation: a computer science view. J Artif Soc Soc Simul 5(1):1–7
17.
Zurück zum Zitat Dismuke C, Lindrooth R (2006) Ordinary least squares. In: Chumney E, Simpson NK (eds) Methods and designs for outcomes research. American Society of Health-System Pharmacists, Bethesda, pp 93–104 Dismuke C, Lindrooth R (2006) Ordinary least squares. In: Chumney E, Simpson NK (eds) Methods and designs for outcomes research. American Society of Health-System Pharmacists, Bethesda, pp 93–104
18.
Zurück zum Zitat Duda RO, Hart PE, Stork DG (2012) Pattern classification. Wiley, HobokenMATH Duda RO, Hart PE, Stork DG (2012) Pattern classification. Wiley, HobokenMATH
19.
Zurück zum Zitat Faul F, Erdfelder E, Lang AG, Buchner A (2007) G* power 3: a flexible statistical power analysis program for the social, behavioral, and biomedical sciences. Behav Res Methods 39(2):175–191CrossRef Faul F, Erdfelder E, Lang AG, Buchner A (2007) G* power 3: a flexible statistical power analysis program for the social, behavioral, and biomedical sciences. Behav Res Methods 39(2):175–191CrossRef
20.
Zurück zum Zitat Galey M (2005) System and method of online real estate listing and advertisement. US Patent App. 10/896,331 Galey M (2005) System and method of online real estate listing and advertisement. US Patent App. 10/896,331
22.
Zurück zum Zitat García M (2010) The breakdown of the spanish urban growth model: social and territorial effects of the global crisis. Int J Urban Reg Res 34(4):967–980CrossRef García M (2010) The breakdown of the spanish urban growth model: social and territorial effects of the global crisis. Int J Urban Reg Res 34(4):967–980CrossRef
23.
Zurück zum Zitat García-Magariño I, Lacuesta R (2017) Agent-based simulation of real-estate transactions. J Comput Sci 21:60–76CrossRef García-Magariño I, Lacuesta R (2017) Agent-based simulation of real-estate transactions. J Comput Sci 21:60–76CrossRef
24.
Zurück zum Zitat García-Magariño I, Plaza I (2017) ABS-MindHeart: an agent based simulator of the influence of mindfulness programs on heart rate variability. J Comput Sci 19:11–20CrossRef García-Magariño I, Plaza I (2017) ABS-MindHeart: an agent based simulator of the influence of mindfulness programs on heart rate variability. J Comput Sci 19:11–20CrossRef
25.
Zurück zum Zitat García-Magariño I, Gómez-Rodríguez A, González-Moreno JC, Palacios-Navarro G (2015) PEABS: a process for developing efficient agent-based simulators. Eng Appl Artif Intell 46:104–112CrossRef García-Magariño I, Gómez-Rodríguez A, González-Moreno JC, Palacios-Navarro G (2015) PEABS: a process for developing efficient agent-based simulators. Eng Appl Artif Intell 46:104–112CrossRef
27.
Zurück zum Zitat Gilbert N, Terna P (2000) How to build and use agent-based models in social science. Mind Soc 1(1):57–72CrossRef Gilbert N, Terna P (2000) How to build and use agent-based models in social science. Mind Soc 1(1):57–72CrossRef
28.
Zurück zum Zitat Gómez-Sanz JJ, Fernández CR, Arroyo J (2010) Model driven development and simulations with the INGENIAS agent framework. Simul Model Pract Theory 18(10):1468–1482CrossRef Gómez-Sanz JJ, Fernández CR, Arroyo J (2010) Model driven development and simulations with the INGENIAS agent framework. Simul Model Pract Theory 18(10):1468–1482CrossRef
29.
Zurück zum Zitat Hassan S, Garmendia L, Pavón J (2010) Introducing uncertainty into social simulation: using fuzzy logic for agent-based modelling. Int J Reasoning-based Intell Syst 2(2):118–124 Hassan S, Garmendia L, Pavón J (2010) Introducing uncertainty into social simulation: using fuzzy logic for agent-based modelling. Int J Reasoning-based Intell Syst 2(2):118–124
30.
Zurück zum Zitat Houari R, Bounceur A, Kechadi MT, Tari AK, Euler R (2016) Dimensionality reduction in data mining: a copula approach. Expert Syst Appl 64:247–260CrossRef Houari R, Bounceur A, Kechadi MT, Tari AK, Euler R (2016) Dimensionality reduction in data mining: a copula approach. Expert Syst Appl 64:247–260CrossRef
31.
Zurück zum Zitat Jalalimanesh A, Haghighi HS, Ahmadi A, Soltani M (2017) Simulation-based optimization of radiotherapy: agent-based modeling and reinforcement learning. Math Comput Simul 133:235–248CrossRefMathSciNet Jalalimanesh A, Haghighi HS, Ahmadi A, Soltani M (2017) Simulation-based optimization of radiotherapy: agent-based modeling and reinforcement learning. Math Comput Simul 133:235–248CrossRefMathSciNet
32.
Zurück zum Zitat Jayaram D, Manrai AK, Manrai LA (2015) Effective use of marketing technology in Eastern Europe: web analytics, social media, customer analytics, digital campaigns and mobile applications. J Econ Finance Adm Sci 20(39):118–132 Jayaram D, Manrai AK, Manrai LA (2015) Effective use of marketing technology in Eastern Europe: web analytics, social media, customer analytics, digital campaigns and mobile applications. J Econ Finance Adm Sci 20(39):118–132
33.
Zurück zum Zitat Jiang GM, Hu ZP, Jin JY (2007) Quantitative evaluation of real estate’s risk based on AHP and simulation. Syst Eng Theory Pract 27(9):77–81CrossRef Jiang GM, Hu ZP, Jin JY (2007) Quantitative evaluation of real estate’s risk based on AHP and simulation. Syst Eng Theory Pract 27(9):77–81CrossRef
34.
Zurück zum Zitat Khalil KM, Abdel-Aziz M, Nazmy TT, Salem ABM (2015) MLIMAS: a framework for machine learning in interactive multi-agent systems. Procedia Comput Sci 65:827–835CrossRef Khalil KM, Abdel-Aziz M, Nazmy TT, Salem ABM (2015) MLIMAS: a framework for machine learning in interactive multi-agent systems. Procedia Comput Sci 65:827–835CrossRef
35.
Zurück zum Zitat Lee D, Seung H (1999) Learning the parts of objects by non-negative matrix factorization. Nature 401:788–791CrossRefMATH Lee D, Seung H (1999) Learning the parts of objects by non-negative matrix factorization. Nature 401:788–791CrossRefMATH
36.
Zurück zum Zitat Lee D, Seung H (2001) Algorithms for non-negative matrix factorization. Adv Neural Inf Process Syst 13:556–562 Lee D, Seung H (2001) Algorithms for non-negative matrix factorization. Adv Neural Inf Process Syst 13:556–562
38.
Zurück zum Zitat Maltamo M, Kangas A (1998) Methods based on k-nearest neighbor regression in the prediction of basal area diameter distribution. Can J For Res 28(8):1107–1115CrossRef Maltamo M, Kangas A (1998) Methods based on k-nearest neighbor regression in the prediction of basal area diameter distribution. Can J For Res 28(8):1107–1115CrossRef
39.
Zurück zum Zitat Maruyama R, Maeda K, Moroda H, Kato I, Inoue M, Miyakawa H, Aonishi T (2014) Detecting cells using non-negative matrix factorization on calcium imaging data. Neural Netw 55:11–19CrossRef Maruyama R, Maeda K, Moroda H, Kato I, Inoue M, Miyakawa H, Aonishi T (2014) Detecting cells using non-negative matrix factorization on calcium imaging data. Neural Netw 55:11–19CrossRef
40.
Zurück zum Zitat Nguyen N, Cripps A (2001) Predicting housing value: a comparison of multiple regression analysis and artificial neural networks. J Real Estate Res 22(3):313–336 Nguyen N, Cripps A (2001) Predicting housing value: a comparison of multiple regression analysis and artificial neural networks. J Real Estate Res 22(3):313–336
41.
Zurück zum Zitat North MJ, Collier NT, Ozik J, Tatara ER, Macal CM, Bragen M, Sydelko P (2013) Complex adaptive systems modeling with Repast Simphony. Complex Adapt Syst Model 1(1):1CrossRef North MJ, Collier NT, Ozik J, Tatara ER, Macal CM, Bragen M, Sydelko P (2013) Complex adaptive systems modeling with Repast Simphony. Complex Adapt Syst Model 1(1):1CrossRef
42.
Zurück zum Zitat Paatero P, Tapper U (1994) Positive matrix factorization: a non-negative factor model with optimal utilization of error estimates of data values. Environmetrics 5:111–126CrossRef Paatero P, Tapper U (1994) Positive matrix factorization: a non-negative factor model with optimal utilization of error estimates of data values. Environmetrics 5:111–126CrossRef
44.
Zurück zum Zitat Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M, Prettenhofer P, Weiss R, Dubourg V et al (2011) Scikit-learn: machine learning in python. J Mach Learn Res 12(Oct):2825–2830MathSciNetMATH Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M, Prettenhofer P, Weiss R, Dubourg V et al (2011) Scikit-learn: machine learning in python. J Mach Learn Res 12(Oct):2825–2830MathSciNetMATH
45.
Zurück zum Zitat Provost F, Fawcett T (2013) Data science and its relationship to big data and data-driven decision making. Big Data 1(1):51–59CrossRef Provost F, Fawcett T (2013) Data science and its relationship to big data and data-driven decision making. Big Data 1(1):51–59CrossRef
46.
Zurück zum Zitat Pyhrr SA (1973) A computer simulation model to measure the risk in real estate investment. Real Estate Econ 1(1):48–78CrossRef Pyhrr SA (1973) A computer simulation model to measure the risk in real estate investment. Real Estate Econ 1(1):48–78CrossRef
47.
Zurück zum Zitat Reiser L, Mueller LA, Rhee SY (2002) Surviving in a sea of data: a survey of plant genome data resources and issues in building data management systems. Functional genomics. Springer, Berlin, pp 59–74CrossRef Reiser L, Mueller LA, Rhee SY (2002) Surviving in a sea of data: a survey of plant genome data resources and issues in building data management systems. Functional genomics. Springer, Berlin, pp 59–74CrossRef
48.
Zurück zum Zitat Sabarina K, Priya N (2015) Lowering data dimensionality in big data for the benefit of precision agriculture. Procedia Comput Sci 48:548–554CrossRef Sabarina K, Priya N (2015) Lowering data dimensionality in big data for the benefit of precision agriculture. Procedia Comput Sci 48:548–554CrossRef
49.
Zurück zum Zitat Simovici D (2012) Linear algebra tools for data mining. World Scientific Publishing, SingaporeCrossRefMATH Simovici D (2012) Linear algebra tools for data mining. World Scientific Publishing, SingaporeCrossRefMATH
50.
Zurück zum Zitat Sun Y, Wen G (2017) Cognitive facial expression recognition with constrained dimensionality reduction. Neurocomputing 230:397–408CrossRef Sun Y, Wen G (2017) Cognitive facial expression recognition with constrained dimensionality reduction. Neurocomputing 230:397–408CrossRef
51.
Zurück zum Zitat Symeonidis S, Effrosynidis D, Arampatzis A (2018) A comparative evaluation of pre-processing techniques and their interactions for twitter sentiment analysis. Expert Syst Appl 110:298–310CrossRef Symeonidis S, Effrosynidis D, Arampatzis A (2018) A comparative evaluation of pre-processing techniques and their interactions for twitter sentiment analysis. Expert Syst Appl 110:298–310CrossRef
52.
Zurück zum Zitat Tratalos J, Haines-Young R, Potschin M, Fish R, Church A (2016) Cultural ecosystem services in the UK: lessons on designing indicators to inform management and policy. Ecol Indic 61:63–73CrossRef Tratalos J, Haines-Young R, Potschin M, Fish R, Church A (2016) Cultural ecosystem services in the UK: lessons on designing indicators to inform management and policy. Ecol Indic 61:63–73CrossRef
53.
Zurück zum Zitat Urbanavičiene V, Kaklauskas A, Zavadskas EK (2009) The conceptual model of construction and real estate negotiation. Int J Strateg Prop Manag 13(1):53–70CrossRef Urbanavičiene V, Kaklauskas A, Zavadskas EK (2009) The conceptual model of construction and real estate negotiation. Int J Strateg Prop Manag 13(1):53–70CrossRef
54.
Zurück zum Zitat Wang R, Hou J, He X (2017) Real estate price and heterogeneous investment behavior in China. Econ Model 60:271–280CrossRef Wang R, Hou J, He X (2017) Real estate price and heterogeneous investment behavior in China. Econ Model 60:271–280CrossRef
55.
Zurück zum Zitat Wang S, Wan J, Zhang D, Li D, Zhang C (2016) Towards smart factory for industry 4.0: a self-organized multi-agent system with big data based feedback and coordination. Comput Netw 101:158–168CrossRef Wang S, Wan J, Zhang D, Li D, Zhang C (2016) Towards smart factory for industry 4.0: a self-organized multi-agent system with big data based feedback and coordination. Comput Netw 101:158–168CrossRef
56.
Zurück zum Zitat Wojtusiak J, Warden T, Herzog O (2012) Machine learning in agent-based stochastic simulation: inferential theory and evaluation in transportation logistics. Comput Math Appl 64(12):3658–3665CrossRefMATH Wojtusiak J, Warden T, Herzog O (2012) Machine learning in agent-based stochastic simulation: inferential theory and evaluation in transportation logistics. Comput Math Appl 64(12):3658–3665CrossRefMATH
57.
Zurück zum Zitat Yaqoob I, Hashem IAT, Gani A, Mokhtar S, Ahmed E, Anuar NB, Vasilakos AV (2016) Big data: from beginning to future. Int J Inf Manag 36(6):1231–1247CrossRef Yaqoob I, Hashem IAT, Gani A, Mokhtar S, Ahmed E, Anuar NB, Vasilakos AV (2016) Big data: from beginning to future. Int J Inf Manag 36(6):1231–1247CrossRef
58.
59.
Zurück zum Zitat Zhuge C, Shao C, Gao J, Dong C, Zhang H (2016) Agent-based joint model of residential location choice and real estate price for land use and transport model. Comput Environ Urban Syst 57:93–105CrossRef Zhuge C, Shao C, Gao J, Dong C, Zhang H (2016) Agent-based joint model of residential location choice and real estate price for land use and transport model. Comput Environ Urban Syst 57:93–105CrossRef
60.
Zurück zum Zitat Žibert J, Cedilnik J, Pražnikar J (2016) Particulate matter (pm10) patterns in Europe: an exploratory data analysis using non-negative matrix factorization. Atmos Environ 132:217–228CrossRef Žibert J, Cedilnik J, Pražnikar J (2016) Particulate matter (pm10) patterns in Europe: an exploratory data analysis using non-negative matrix factorization. Atmos Environ 132:217–228CrossRef
Metadaten
Titel
Estimation of missing prices in real-estate market agent-based simulations with machine learning and dimensionality reduction methods
verfasst von
Iván García-Magariño
Carlos Medrano
Jorge Delgado
Publikationsdatum
07.01.2019
Verlag
Springer London
Erschienen in
Neural Computing and Applications / Ausgabe 7/2020
Print ISSN: 0941-0643
Elektronische ISSN: 1433-3058
DOI
https://doi.org/10.1007/s00521-018-3938-7

Weitere Artikel der Ausgabe 7/2020

Neural Computing and Applications 7/2020 Zur Ausgabe

Deep Learning & Neural Computing for Intelligent Sensing and Control

Recognition and prediction of ground vibration signal based on machine learning algorithm

Premium Partner