Skip to main content

2018 | OriginalPaper | Buchkapitel

The Classification of Music by the Genre Using the KNN Classifier

Aktivieren Sie unsere intelligente Suche um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The article presents the possibility of classifying music tracks according to their musical genre. This issue is interesting because it is difficult to find solutions that look for similarity between songs based on their waveforms, as in this work. This article shows that such a classification is possible. For this process, the KNN classifier was used, for which it is possible to apply different metrics (metric spaces). The article shows the validity of testing different distance measures in the classification process. The analysis of music tracks and assignment to the appropriate genre is carried out, on the basis of attributes describing the music track. These attributes are obtained using the jAudio library. The development of further research in this area may allow finding other suitable music not only on the basis of historical data about the user (what he was listening to along with the music track) but also directly on the basis of the genre of the given song.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Agrawal, R., Imielinski, T., Swami, A.: Database mining: a performance perspective. IEEE Trans. Knowl. Data Eng. 5(6), 914–925 (1993)CrossRef Agrawal, R., Imielinski, T., Swami, A.: Database mining: a performance perspective. IEEE Trans. Knowl. Data Eng. 5(6), 914–925 (1993)CrossRef
2.
Zurück zum Zitat Aha, D.W., Kibler, D., Albert, M.K.: Instance-based learning algorithms. Mach. Learn. 6(1), 37–66 (1991) Aha, D.W., Kibler, D., Albert, M.K.: Instance-based learning algorithms. Mach. Learn. 6(1), 37–66 (1991)
3.
Zurück zum Zitat Aksoy, S., Haralick, R.M.: Feature normalization and likelihood-based similarity measures for image retrieval. Pattern Recognit. Lett. 22(5), 563–582 (2001)CrossRef Aksoy, S., Haralick, R.M.: Feature normalization and likelihood-based similarity measures for image retrieval. Pattern Recognit. Lett. 22(5), 563–582 (2001)CrossRef
5.
Zurück zum Zitat Bach, M., Werner, A., Żywiec, J., Pluskiewicz, W.: The study of under-and over-sampling methods utility in analysis of highly imbalanced data on osteoporosis. Inf. Sci. 384, 174–190 (2017)CrossRef Bach, M., Werner, A., Żywiec, J., Pluskiewicz, W.: The study of under-and over-sampling methods utility in analysis of highly imbalanced data on osteoporosis. Inf. Sci. 384, 174–190 (2017)CrossRef
6.
Zurück zum Zitat Basili, R., Serafini, A., Stellato, A.: Classification of musical genre: a machine learning approach. In: ISMIR (2004) Basili, R., Serafini, A., Stellato, A.: Classification of musical genre: a machine learning approach. In: ISMIR (2004)
7.
Zurück zum Zitat Ben-David, A.: Comparison of classification accuracy using Cohens Weighted Kappa. Exp. Syst. Appl. 34(2), 825–832 (2008)CrossRef Ben-David, A.: Comparison of classification accuracy using Cohens Weighted Kappa. Exp. Syst. Appl. 34(2), 825–832 (2008)CrossRef
8.
Zurück zum Zitat Costa, E., Lorena, A., Carvalho, A., Freitas, A.: A review of performance evaluation measures for hierarchical classifiers. In: Evaluation Methods for Machine Learning II: Papers from the AAAI-2007 Workshop, pp. 1–6 (2007) Costa, E., Lorena, A., Carvalho, A., Freitas, A.: A review of performance evaluation measures for hierarchical classifiers. In: Evaluation Methods for Machine Learning II: Papers from the AAAI-2007 Workshop, pp. 1–6 (2007)
9.
Zurück zum Zitat Hamel, P., Eck, D.: Learning features from music audio with deep belief networks. In: ISMIR, vol. 10, Utrecht, The Netherlands, pp. 339–344 (2010) Hamel, P., Eck, D.: Learning features from music audio with deep belief networks. In: ISMIR, vol. 10, Utrecht, The Netherlands, pp. 339–344 (2010)
10.
Zurück zum Zitat Henaff, M., Jarrett, K., Kavukcuoglu, K., LeCun, Y.: Unsupervised learning of sparse features for scalable audio classification. In: ISMIR, vol. 11. Citeseer (2011) Henaff, M., Jarrett, K., Kavukcuoglu, K., LeCun, Y.: Unsupervised learning of sparse features for scalable audio classification. In: ISMIR, vol. 11. Citeseer (2011)
13.
Zurück zum Zitat Kostrzewa, D., Brzeski, R.: The data dimensionality reduction in the classification process through greedy backward feature elimination. In: Gruca, A., Czachórski, T., Harezlak, K., Kozielski, S., Piotrowska, A. (eds.) ICMMI 2017. AISC, vol. 659, pp. 397–407. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-67792-7_39CrossRef Kostrzewa, D., Brzeski, R.: The data dimensionality reduction in the classification process through greedy backward feature elimination. In: Gruca, A., Czachórski, T., Harezlak, K., Kozielski, S., Piotrowska, A. (eds.) ICMMI 2017. AISC, vol. 659, pp. 397–407. Springer, Cham (2018). https://​doi.​org/​10.​1007/​978-3-319-67792-7_​39CrossRef
15.
Zurück zum Zitat Kubanski, M.: Znajdowanie utworow podobnych metoda najblizszego sasiada (in polish): bachelor thesis. SUT, Gliwice, Poland (2018) Kubanski, M.: Znajdowanie utworow podobnych metoda najblizszego sasiada (in polish): bachelor thesis. SUT, Gliwice, Poland (2018)
16.
Zurück zum Zitat Lee, C.H., Shih, J.L., Yu, K.M., Su, J.M.: Automatic music genre classification using modulation spectral contrast feature. In: 2007 IEEE International Conference on Multimedia and Expo, pp. 204–207. IEEE (2007) Lee, C.H., Shih, J.L., Yu, K.M., Su, J.M.: Automatic music genre classification using modulation spectral contrast feature. In: 2007 IEEE International Conference on Multimedia and Expo, pp. 204–207. IEEE (2007)
17.
Zurück zum Zitat Li, M., Sleep, R.: Genre classification via an LZ78-based string kernel. In: ISMIR, pp. 252–259 (2005) Li, M., Sleep, R.: Genre classification via an LZ78-based string kernel. In: ISMIR, pp. 252–259 (2005)
19.
Zurück zum Zitat McEnnis, D., McKay, C., Fujinaga, I.: Overview of on-demand metadata extraction network (OMEN). In: Proceedings of the Seventh International Conference on Music Information Retrieval (ISMIR 2006) (2006) McEnnis, D., McKay, C., Fujinaga, I.: Overview of on-demand metadata extraction network (OMEN). In: Proceedings of the Seventh International Conference on Music Information Retrieval (ISMIR 2006) (2006)
20.
Zurück zum Zitat McEnnis, D., McKay, C., Fujinaga, I., Depalle, P.: jAudio: Additions and improvements. In: ISMIR, pp. 385–386 (2006) McEnnis, D., McKay, C., Fujinaga, I., Depalle, P.: jAudio: Additions and improvements. In: ISMIR, pp. 385–386 (2006)
21.
Zurück zum Zitat McKay, C., Fujinaga, I., Depalle, P.: jAudio: a feature extraction library. In: Proceedings of the International Conference on Music Information Retrieval, pp. 600–603 (2005) McKay, C., Fujinaga, I., Depalle, P.: jAudio: a feature extraction library. In: Proceedings of the International Conference on Music Information Retrieval, pp. 600–603 (2005)
22.
Zurück zum Zitat Mehra, N., Gupta, S.: Survey on multiclass classification methods (2013) Mehra, N., Gupta, S.: Survey on multiclass classification methods (2013)
23.
Zurück zum Zitat Pampalk, E., Flexer, A., Widmer, G., et al.: Improvements of audio-based music similarity and genre classificaton. In: ISMIR, vol. 5, London, UK, pp. 634–637 (2005) Pampalk, E., Flexer, A., Widmer, G., et al.: Improvements of audio-based music similarity and genre classificaton. In: ISMIR, vol. 5, London, UK, pp. 634–637 (2005)
24.
Zurück zum Zitat Powers, D.M.: Evaluation: from precision, recall and F-measure to roc, informedness, markedness and correlation (2011) Powers, D.M.: Evaluation: from precision, recall and F-measure to roc, informedness, markedness and correlation (2011)
25.
Zurück zum Zitat Ricci, F., Avesani, P.: Data compression and local metrics for nearest neighbor classification. IEEE Trans. Pattern Anal. Mach. Intell. 21(4), 380–384 (1999)CrossRef Ricci, F., Avesani, P.: Data compression and local metrics for nearest neighbor classification. IEEE Trans. Pattern Anal. Mach. Intell. 21(4), 380–384 (1999)CrossRef
26.
Zurück zum Zitat Scaringella, N., Zoia, G., Mlynek, D.: Automatic genre classification of music content: a survey. IEEE Sig. Process. Mag. 23(2), 133–141 (2006)CrossRef Scaringella, N., Zoia, G., Mlynek, D.: Automatic genre classification of music content: a survey. IEEE Sig. Process. Mag. 23(2), 133–141 (2006)CrossRef
28.
Zurück zum Zitat Sigtia, S., Dixon, S.: Improved music feature learning with deep neural networks. In: 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 6959–6963. IEEE (2014) Sigtia, S., Dixon, S.: Improved music feature learning with deep neural networks. In: 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 6959–6963. IEEE (2014)
29.
Zurück zum Zitat Silla, C.N., Koerich, A.L., Kaestner, C.A.: A machine learning approach to automatic music genre classification. J. Braz. Comput. Soc. 14(3), 7–18 (2008)CrossRef Silla, C.N., Koerich, A.L., Kaestner, C.A.: A machine learning approach to automatic music genre classification. J. Braz. Comput. Soc. 14(3), 7–18 (2008)CrossRef
30.
Zurück zum Zitat Simsekli, U.: Automatic music genre classification using bass lines. In: 2010 20th International Conference on Pattern Recognition (ICPR), pp. 4137–4140. IEEE (2010) Simsekli, U.: Automatic music genre classification using bass lines. In: 2010 20th International Conference on Pattern Recognition (ICPR), pp. 4137–4140. IEEE (2010)
31.
Zurück zum Zitat Tzanetakis, G., Cook, P.: Musical genre classification of audio signals. IEEE Trans. Speech Audio Process. 10(5), 293–302 (2002)CrossRef Tzanetakis, G., Cook, P.: Musical genre classification of audio signals. IEEE Trans. Speech Audio Process. 10(5), 293–302 (2002)CrossRef
32.
33.
Zurück zum Zitat Weinberger, K.Q., Saul, L.K.: Distance metric learning for large margin nearest neighbor classification. J. Mach. Learn. Res. 10(Feb), 207–244 (2009) Weinberger, K.Q., Saul, L.K.: Distance metric learning for large margin nearest neighbor classification. J. Mach. Learn. Res. 10(Feb), 207–244 (2009)
35.
Zurück zum Zitat Werenski, S.: Topologia (in Polish). Politechnika Radomska, Wydawnictwo (2008) Werenski, S.: Topologia (in Polish). Politechnika Radomska, Wydawnictwo (2008)
36.
Zurück zum Zitat West, K., Cox, S.: Features and classifiers for the automatic classification of musical audio signals. In: ISMIR (2004) West, K., Cox, S.: Features and classifiers for the automatic classification of musical audio signals. In: ISMIR (2004)
37.
Zurück zum Zitat Wu, X., et al.: Top 10 algorithms in data mining. Knowl. Inf. Syst. 14(1), 1–37 (2008)CrossRef Wu, X., et al.: Top 10 algorithms in data mining. Knowl. Inf. Syst. 14(1), 1–37 (2008)CrossRef
38.
Zurück zum Zitat Xu, C., Maddage, N.C., Shao, X., Cao, F., Tian, Q.: Musical genre classification using support vector machines. In: Proceedings of the 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2003, vol. 5, pp. V–429. IEEE (2003) Xu, C., Maddage, N.C., Shao, X., Cao, F., Tian, Q.: Musical genre classification using support vector machines. In: Proceedings of the 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2003, vol. 5, pp. V–429. IEEE (2003)
39.
Zurück zum Zitat Zheng, J., Oussalah, M.: Automatic system for music genre classification. ntM 1, 1 (2006) Zheng, J., Oussalah, M.: Automatic system for music genre classification. ntM 1, 1 (2006)
40.
Zurück zum Zitat Zyt, J., Klosgen, W., Zytkow, J.: Handbook of Data Mining and Knowledge Discovery (2002) Zyt, J., Klosgen, W., Zytkow, J.: Handbook of Data Mining and Knowledge Discovery (2002)
Metadaten
Titel
The Classification of Music by the Genre Using the KNN Classifier
verfasst von
Daniel Kostrzewa
Robert Brzeski
Maciej Kubanski
Copyright-Jahr
2018
DOI
https://doi.org/10.1007/978-3-319-99987-6_18