Skip to main content
Erschienen in: Annals of Data Science 1/2019

01.03.2019

NBA Game Result Prediction Using Feature Analysis and Machine Learning

verfasst von: Fadi Thabtah, Li Zhang, Neda Abdelhamid

Erschienen in: Annals of Data Science | Ausgabe 1/2019

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In the recent years, sports outcome prediction has gained popularity, as demonstrated by massive financial transactions in sports betting. One of the world’s popular sports that lures betting and attracts millions of fans worldwide is basketball, particularly the National Basketball Association (NBA) of the United States. This paper proposes a new intelligent machine learning framework for predicting the results of games played at the NBA by aiming to discover the influential features set that affects the outcomes of NBA games. We would like to identify whether machine learning methods are applicable to forecasting the outcome of an NBA game using historical data (previous games played), and what are the significant factors that affect the outcome of games. To achieve the objectives, several machine learning methods that utilise different learning schemes to derive the models, including Naïve Bayes, artificial neural network, and Decision Tree, are selected. By comparing the performance and the models derived against different features sets related to basketball games, we can discover the key features that contribute to better performance such as accuracy and efficiency of the prediction model. Based on the results analysis, the DRB (defensive rebounds) feature was chosen and was deemed as the most significant factor influencing the results of an NBA game. Furthermore, others crucial factors such as TPP (three-point percentage), FT (free throws made), and TRB (total rebounds) were also selected, which subsequently increased the model’s prediction accuracy rate by 2–4%.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Abdelhamid N, Thabtah F, Abdel-jaber H (2017) Phishing detection: a recent intelligent machine learning comparison based on models content and features. In: Proceedings of the 2017 IEEE international conference on intelligence and security informatics (ISI). Beijing Abdelhamid N, Thabtah F, Abdel-jaber H (2017) Phishing detection: a recent intelligent machine learning comparison based on models content and features. In: Proceedings of the 2017 IEEE international conference on intelligence and security informatics (ISI). Beijing
2.
Zurück zum Zitat AlShboul R, Thabtah F, Abdelhamid N, Al-diabat M (2018) A visualization cybersecurity method based on features’ dissimilarity. Comput Secur 77:289–303CrossRef AlShboul R, Thabtah F, Abdelhamid N, Al-diabat M (2018) A visualization cybersecurity method based on features’ dissimilarity. Comput Secur 77:289–303CrossRef
5.
Zurück zum Zitat Burges C (1998) Tutorial on support vector machines for pattern recognition. Data Min Knowl Discov 2:121–167CrossRef Burges C (1998) Tutorial on support vector machines for pattern recognition. Data Min Knowl Discov 2:121–167CrossRef
8.
Zurück zum Zitat Cohen W (1995) Fast effective rule induction. Proceedings of the 12th International Conference on Machine Learning 115–123 Cohen W (1995) Fast effective rule induction. Proceedings of the 12th International Conference on Machine Learning 115–123
9.
Zurück zum Zitat Haghighat M, Rastegari H, Nourafza N (2013) A review of data mining techniques for result prediction in sports. In: Advances in computer science, pp 2322–5157 Haghighat M, Rastegari H, Nourafza N (2013) A review of data mining techniques for result prediction in sports. In: Advances in computer science, pp 2322–5157
10.
Zurück zum Zitat Hall M (1999) Correlation-based feature selection for machine learning. Doctoral dissertation, University of Waikato, Dept. of Computer Science Hall M (1999) Correlation-based feature selection for machine learning. Doctoral dissertation, University of Waikato, Dept. of Computer Science
11.
Zurück zum Zitat Hall M, Frank E, Holmes G, Pfahringer B, Reutemann P, Witten I (2009) The WEKA Data Mining Software: An Update. SIGKDD Explor 11(1) Hall M, Frank E, Holmes G, Pfahringer B, Reutemann P, Witten I (2009) The WEKA Data Mining Software: An Update. SIGKDD Explor 11(1)
12.
Zurück zum Zitat Higgins J (2005) Introduction to multiple regression, Chapt 4, pp 111–115. Accessed 9 Feb 2018 Higgins J (2005) Introduction to multiple regression, Chapt 4, pp 111–115. Accessed 9 Feb 2018
13.
Zurück zum Zitat Hosmer D, Lemeshow S (2000) Applied logistic regression. Wiley, New York, pp 236–269CrossRef Hosmer D, Lemeshow S (2000) Applied logistic regression. Wiley, New York, pp 236–269CrossRef
15.
Zurück zum Zitat Keller JM, Gray MR, Givens JA (1985) A fuzzy K-nearest neighbour algorithm. IEEE Trans Syst Man Cyberne 580(4):580–585CrossRef Keller JM, Gray MR, Givens JA (1985) A fuzzy K-nearest neighbour algorithm. IEEE Trans Syst Man Cyberne 580(4):580–585CrossRef
17.
Zurück zum Zitat Landwehr N, Hall M, Frank E (2005) Logistic model trees. Mach Learn 95(1–2):161–205CrossRef Landwehr N, Hall M, Frank E (2005) Logistic model trees. Mach Learn 95(1–2):161–205CrossRef
18.
Zurück zum Zitat Langley P, Iba W, Thompson K (1992) An analysis of Bayesian classifiers. In: The tenth national conference on artificial intelligence, vol. 24. AAAI Press, San Jose, pp 399–406 Langley P, Iba W, Thompson K (1992) An analysis of Bayesian classifiers. In: The tenth national conference on artificial intelligence, vol. 24. AAAI Press, San Jose, pp 399–406
20.
Zurück zum Zitat Lewis D (1998) Naive (Bayes) at forty: the independence assumption in information retrieval. In: European conference on machine learning, pp 4–15 Lewis D (1998) Naive (Bayes) at forty: the independence assumption in information retrieval. In: European conference on machine learning, pp 4–15
22.
Zurück zum Zitat Loeffelholz B, Bednar E, Bauer KW (2009) Predicting NBA games using neural networks. J Quant Anal Sports 5(1):1156 Loeffelholz B, Bednar E, Bauer KW (2009) Predicting NBA games using neural networks. J Quant Anal Sports 5(1):1156
24.
Zurück zum Zitat Meyera D, Leischa F, Hornik K (2003) The support vector machine under test. Neurocomputing 55:169–186CrossRef Meyera D, Leischa F, Hornik K (2003) The support vector machine under test. Neurocomputing 55:169–186CrossRef
25.
Zurück zum Zitat Miljkovic D, Gajic L, Kovacevic A, Konjovic Z (2010) The use of data mining for basketball matches outcomes prediction. In: IEEE 8th international symposium on intelligent systems and informatics. SISY, Subotica, pp 10–11 Miljkovic D, Gajic L, Kovacevic A, Konjovic Z (2010) The use of data mining for basketball matches outcomes prediction. In: IEEE 8th international symposium on intelligent systems and informatics. SISY, Subotica, pp 10–11
28.
Zurück zum Zitat Schalkoff RJ (1997) Artificial neural networks. International ed. McGraw-Hill, New York Schalkoff RJ (1997) Artificial neural networks. International ed. McGraw-Hill, New York
30.
Zurück zum Zitat Thabtah F (2017) Autism spectrum disorder screening: machine learning adaptation and DSM-5 fulfillment. In: Proceedings of the 1st international conference on medical and health informatics. ACM, Taichung City, pp 1–6 Thabtah F (2017) Autism spectrum disorder screening: machine learning adaptation and DSM-5 fulfillment. In: Proceedings of the 1st international conference on medical and health informatics. ACM, Taichung City, pp 1–6
31.
Zurück zum Zitat Thabtah F, Abdelhamid N (2016) Deriving correlated sets of website features for phishing detection: a computational intelligence approach. J Inform Knowl Manag 15(04):1650042CrossRef Thabtah F, Abdelhamid N (2016) Deriving correlated sets of website features for phishing detection: a computational intelligence approach. J Inform Knowl Manag 15(04):1650042CrossRef
32.
Zurück zum Zitat Thabtah F, Kamalov F, Rajab K (2018) A new computational intelligence approach to detect autistic features for autism screening. Int J Med Inform 117:112–124CrossRef Thabtah F, Kamalov F, Rajab K (2018) A new computational intelligence approach to detect autistic features for autism screening. Int J Med Inform 117:112–124CrossRef
Metadaten
Titel
NBA Game Result Prediction Using Feature Analysis and Machine Learning
verfasst von
Fadi Thabtah
Li Zhang
Neda Abdelhamid
Publikationsdatum
01.03.2019
Verlag
Springer Berlin Heidelberg
Erschienen in
Annals of Data Science / Ausgabe 1/2019
Print ISSN: 2198-5804
Elektronische ISSN: 2198-5812
DOI
https://doi.org/10.1007/s40745-018-00189-x

Weitere Artikel der Ausgabe 1/2019

Annals of Data Science 1/2019 Zur Ausgabe

Premium Partner