Skip to main content

2018 | OriginalPaper | Buchkapitel

Emotion Based Categorization of Music Using Low Level Features and Agglomerative Clustering

verfasst von : Rajib Sarkar, Saikat Dutta, Aneek Roy, Sanjoy Kumar Saha

Erschienen in: Computer Vision, Pattern Recognition, Image Processing, and Graphics

Verlag: Springer Singapore

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Music emotion recognition (MER) has become an eminent field of interest in music information retrieval (MIR) group with the objective to provide more flexibility in content based music retrieval. It is quite important to categorize the music according to the emotional characteristics as it enables the users to retrieve the music according to their cognitive state. In this work, we have considered low level time-domain and spectral features extracted from the music signal. Instead of considering a wide range of features, they are judiciously considered based on our perception about the particular emotion. For classification, unsupervised approach based on K-means and Agglomerative clustering are considered. Experiment is carried out on a benchmark dataset. Performance comparison with existing work reflects the superiority of our proposed work.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Russell, J.: A circumspect model of affect, 1980. J. Pers. Soc. Psychol. 39(6), 1161–1178 (1980)CrossRef Russell, J.: A circumspect model of affect, 1980. J. Pers. Soc. Psychol. 39(6), 1161–1178 (1980)CrossRef
2.
Zurück zum Zitat Sammler, D., Grigutsch, M., Fritz, T., Koelsch, S.: Music and emotion: electrophysiological correlates of the processing of pleasant and unpleasant music. Psychophysiology 44(2), 293–304 (2007)CrossRef Sammler, D., Grigutsch, M., Fritz, T., Koelsch, S.: Music and emotion: electrophysiological correlates of the processing of pleasant and unpleasant music. Psychophysiology 44(2), 293–304 (2007)CrossRef
3.
Zurück zum Zitat Krumhansl, C.L.: An exploratory study of musical emotions and psychophysiology. Can. J. Exp. Psychol./Rev. canadienne de psychologie expérimentale 51(4), 336 (1997)CrossRef Krumhansl, C.L.: An exploratory study of musical emotions and psychophysiology. Can. J. Exp. Psychol./Rev. canadienne de psychologie expérimentale 51(4), 336 (1997)CrossRef
4.
Zurück zum Zitat Miyashita, H., Nishimoto, K.: What is the essence of music? A case study on a Japanese audience. In: Proceedings of the Sound and Music Computing (2005) Miyashita, H., Nishimoto, K.: What is the essence of music? A case study on a Japanese audience. In: Proceedings of the Sound and Music Computing (2005)
5.
Zurück zum Zitat Kim, Y.E., Schmidt, E.M., Migneco, R., Morton, B.G., Richardson, P., Scott, J., Speck, J.A., Turnbull, D.: Music emotion recognition: a state of the art review. In: Proceedings of the ISMIR, pp. 255–266 (2010) Kim, Y.E., Schmidt, E.M., Migneco, R., Morton, B.G., Richardson, P., Scott, J., Speck, J.A., Turnbull, D.: Music emotion recognition: a state of the art review. In: Proceedings of the ISMIR, pp. 255–266 (2010)
6.
Zurück zum Zitat Yang, Y.H., Chen, H.H.: Machine recognition of music emotion: a review. ACM Trans. Intell. Syst. Technol. 3(3), 40:1–40:30 (2012)CrossRef Yang, Y.H., Chen, H.H.: Machine recognition of music emotion: a review. ACM Trans. Intell. Syst. Technol. 3(3), 40:1–40:30 (2012)CrossRef
7.
Zurück zum Zitat Han, B., Rho, S., Jun, S., Hwang, E.: Music emotion classification and context-based music recommendation. Multimedia Tools Appl. 47(3), 433–460 (2010)CrossRef Han, B., Rho, S., Jun, S., Hwang, E.: Music emotion classification and context-based music recommendation. Multimedia Tools Appl. 47(3), 433–460 (2010)CrossRef
8.
Zurück zum Zitat Thayer, R.E.: The Biopsychology of Mood and Arousal. Oxford University Press, Oxford (1990) Thayer, R.E.: The Biopsychology of Mood and Arousal. Oxford University Press, Oxford (1990)
9.
Zurück zum Zitat Gómez, L.M., Cáceres, M.N.: Applying data mining for sentiment analysis in music. In: Proceedings of the PAAMS, pp. 198–205 (2017) Gómez, L.M., Cáceres, M.N.: Applying data mining for sentiment analysis in music. In: Proceedings of the PAAMS, pp. 198–205 (2017)
10.
Zurück zum Zitat Chen, Y.A., Wang, J.C., Yang, Y.H., Chen, H.: Component tying for mixture model adaptation in personalization of music emotion recognition. IEEE/ACM Trans. Audio. Speech Lang. Process. 25(7), 1409–1420 (2017)CrossRef Chen, Y.A., Wang, J.C., Yang, Y.H., Chen, H.: Component tying for mixture model adaptation in personalization of music emotion recognition. IEEE/ACM Trans. Audio. Speech Lang. Process. 25(7), 1409–1420 (2017)CrossRef
11.
Zurück zum Zitat Koch, C., Krupii, G., Hausheer, D.: Proactive caching of music videos based on audio features, mood, and genre. In: Proceedings of the MMSys, pp. 100–111 (2017) Koch, C., Krupii, G., Hausheer, D.: Proactive caching of music videos based on audio features, mood, and genre. In: Proceedings of the MMSys, pp. 100–111 (2017)
12.
Zurück zum Zitat Lartillot, O., Toiviainen, P.: A matlab toolbox for musical feature extraction from audio. In: Proceedings of the DAFx, pp. 237–244 (2007) Lartillot, O., Toiviainen, P.: A matlab toolbox for musical feature extraction from audio. In: Proceedings of the DAFx, pp. 237–244 (2007)
13.
Zurück zum Zitat Zhang, F., Meng, H., Li, M.: Emotion extraction and recognition from music. In: Proceedings of the ICNC-FSKD, pp. 1728–1733 (2016) Zhang, F., Meng, H., Li, M.: Emotion extraction and recognition from music. In: Proceedings of the ICNC-FSKD, pp. 1728–1733 (2016)
14.
Zurück zum Zitat Soleymani, M., Aljanaki, A., Yang, Y.H., Caro, M.N., Eyben, F., Markov, K., Schuller, B.W., Veltkamp, R., Weninger, F., Wiering, F.: Emotional analysis of music: a comparison of methods. In: Proceedings of the 22nd ACM International Conference on Multimedia, pp. 1161–1164. ACM (2014) Soleymani, M., Aljanaki, A., Yang, Y.H., Caro, M.N., Eyben, F., Markov, K., Schuller, B.W., Veltkamp, R., Weninger, F., Wiering, F.: Emotional analysis of music: a comparison of methods. In: Proceedings of the 22nd ACM International Conference on Multimedia, pp. 1161–1164. ACM (2014)
15.
Zurück zum Zitat Weninger, F., Eyben, F., Schuller, B.W.: The TUM approach to the mediaeval music emotion task using generic affective audio features. In: MediaEval (2013) Weninger, F., Eyben, F., Schuller, B.W.: The TUM approach to the mediaeval music emotion task using generic affective audio features. In: MediaEval (2013)
16.
Zurück zum Zitat Saari, P., Eerola, T., Lartillot, O.: Generalizability and simplicity as criteria in feature selection: application to mood classification in music. IEEE Trans. Audio Speech Lang. Process. 19(6), 1802–1812 (2011)CrossRef Saari, P., Eerola, T., Lartillot, O.: Generalizability and simplicity as criteria in feature selection: application to mood classification in music. IEEE Trans. Audio Speech Lang. Process. 19(6), 1802–1812 (2011)CrossRef
17.
Zurück zum Zitat Lerch, A.: An Introduction to Audio Content Analysis: Applications in Signal Processing and Music Informatics. Wiley, Hoboken (2012)CrossRef Lerch, A.: An Introduction to Audio Content Analysis: Applications in Signal Processing and Music Informatics. Wiley, Hoboken (2012)CrossRef
18.
Zurück zum Zitat Huang, X., Acero, A., Hon, H.W., Foreword By-Reddy, R.: Spoken Language Processing: A Guide to Theory, Algorithm, and System Development. Prentice Hall PTR, Upper Saddle River (2001) Huang, X., Acero, A., Hon, H.W., Foreword By-Reddy, R.: Spoken Language Processing: A Guide to Theory, Algorithm, and System Development. Prentice Hall PTR, Upper Saddle River (2001)
19.
Zurück zum Zitat Welch, P.: The use of fast fourier transform for the estimation of power spectra: a method based on time averaging over short, modified periodograms. IEEE Trans. Audio Electroacoust. 15(2), 70–73 (1967)CrossRef Welch, P.: The use of fast fourier transform for the estimation of power spectra: a method based on time averaging over short, modified periodograms. IEEE Trans. Audio Electroacoust. 15(2), 70–73 (1967)CrossRef
20.
Zurück zum Zitat Zhang, T., Kuo, C.C.J.: Audio content analysis for online audiovisual data segmentation and classification. IEEE Trans. Speech Audio Process. 9(4), 441–457 (2001)CrossRef Zhang, T., Kuo, C.C.J.: Audio content analysis for online audiovisual data segmentation and classification. IEEE Trans. Speech Audio Process. 9(4), 441–457 (2001)CrossRef
21.
Zurück zum Zitat Gray, A., Markel, J.: A spectral-flatness measure for studying the autocorrelation method of linear prediction of speech analysis. IEEE Trans. Acoust. Speech Sig. Process. 22(3), 207–217 (1974)CrossRef Gray, A., Markel, J.: A spectral-flatness measure for studying the autocorrelation method of linear prediction of speech analysis. IEEE Trans. Acoust. Speech Sig. Process. 22(3), 207–217 (1974)CrossRef
22.
Zurück zum Zitat Hartigan, J.A., Wong, M.A.: Algorithm as 136: a k-means clustering algorithm. J. Roy. Stat. Soc.: Ser. C (Appl. Stat.) 28(1), 100–108 (1979)MATH Hartigan, J.A., Wong, M.A.: Algorithm as 136: a k-means clustering algorithm. J. Roy. Stat. Soc.: Ser. C (Appl. Stat.) 28(1), 100–108 (1979)MATH
23.
Zurück zum Zitat Beeferman, D., Berger, A.: Agglomerative clustering of a search engine query log. In: Proceeding of the SIGKDD, pp. 407–416 (2000) Beeferman, D., Berger, A.: Agglomerative clustering of a search engine query log. In: Proceeding of the SIGKDD, pp. 407–416 (2000)
24.
Zurück zum Zitat Kuhn, H.W.: The Hungarian method for the assignment problem. Nav. Res. Logist. (NRL) 2(1–2), 83–97 (1955)MathSciNetCrossRef Kuhn, H.W.: The Hungarian method for the assignment problem. Nav. Res. Logist. (NRL) 2(1–2), 83–97 (1955)MathSciNetCrossRef
25.
Zurück zum Zitat Eerola, T., Vuoskoski, J.K.: A comparison of the discrete and dimensional models of emotion in music. Psychol. Music 39(1), 18–49 (2011)CrossRef Eerola, T., Vuoskoski, J.K.: A comparison of the discrete and dimensional models of emotion in music. Psychol. Music 39(1), 18–49 (2011)CrossRef
26.
Zurück zum Zitat Cronbach, L.J.: Coefficient alpha and the internal structure of tests. Psychometrika 16(3), 297–334 (1951)CrossRef Cronbach, L.J.: Coefficient alpha and the internal structure of tests. Psychometrika 16(3), 297–334 (1951)CrossRef
27.
Zurück zum Zitat Tzanetakis, G., Cook, P.: MARSYAS: a framework for audio analysis. Organ. Sound 4(3), 169–175 (2000)CrossRef Tzanetakis, G., Cook, P.: MARSYAS: a framework for audio analysis. Organ. Sound 4(3), 169–175 (2000)CrossRef
28.
Zurück zum Zitat Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., et al.: Scikit-learn: machine learning in python. J. Mach. Learn. Res. 12(Oct), 2825–2830 (2011)MathSciNetMATH Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., et al.: Scikit-learn: machine learning in python. J. Mach. Learn. Res. 12(Oct), 2825–2830 (2011)MathSciNetMATH
Metadaten
Titel
Emotion Based Categorization of Music Using Low Level Features and Agglomerative Clustering
verfasst von
Rajib Sarkar
Saikat Dutta
Aneek Roy
Sanjoy Kumar Saha
Copyright-Jahr
2018
Verlag
Springer Singapore
DOI
https://doi.org/10.1007/978-981-13-0020-2_44