Skip to main content
Top

2017 | OriginalPaper | Chapter

Learning Music Emotions via Quantum Convolutional Neural Network

Authors : Gong Chen, Yan Liu, Jiannong Cao, Shenghua Zhong, Yang Liu, Yuexian Hou, Peng Zhang

Published in: Brain Informatics

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Music can convey and evoke powerful emotions. But it is very challenging to recognize the music emotions accurately by computational models. The difficulty of the problem can exponentially increase when the music segments delivery multiple and complex emotions. This paper proposes a novel quantum convolutional neural network (QCNN) to learn music emotions. Inheriting the distinguished abstraction ability from deep learning, QCNN automatically extracts the music features that benefit emotion classification. The main contribution of this paper is that we utilize measurement postulate to simulate the human emotion awareness in music appreciation. Statistical experiments on the standard dataset shows that QCNN outperforms the classical algorithms as well as the state-of-the-art in the task of music emotion classification. Moreover, we provide demonstration experiment to explain the good performance of the proposed technique from the perspective of physics and psychology.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
2.
go back to reference Juslin, P.N., Västfjäll, D.: Emotional responses to music: the need to consider underlying mechanisms. Behav. Brain Sci. 31(05), 559–575 (2008) Juslin, P.N., Västfjäll, D.: Emotional responses to music: the need to consider underlying mechanisms. Behav. Brain Sci. 31(05), 559–575 (2008)
3.
go back to reference Zentner, M., Grandjean, D., Scherer, K.R.: Emotions evoked by the sound of music: characterization, classification, and measurement. Emotion 8(4), 494 (2008)CrossRef Zentner, M., Grandjean, D., Scherer, K.R.: Emotions evoked by the sound of music: characterization, classification, and measurement. Emotion 8(4), 494 (2008)CrossRef
4.
go back to reference Eerola, T., Vuoskoski, J.K.: A comparison of the discrete and dimensional models of emotion in music. Psychol. Mus. 39, 18–49 (2010)CrossRef Eerola, T., Vuoskoski, J.K.: A comparison of the discrete and dimensional models of emotion in music. Psychol. Mus. 39, 18–49 (2010)CrossRef
5.
go back to reference Drever, J.: A Dictionary of Psychology. Oxford University Press, Oxford (1952) Drever, J.: A Dictionary of Psychology. Oxford University Press, Oxford (1952)
6.
go back to reference Ekman, P.: Expression and the nature of emotion. In: Approaches to Emotion, vol. 3, pp. 19–344 (1984) Ekman, P.: Expression and the nature of emotion. In: Approaches to Emotion, vol. 3, pp. 19–344 (1984)
7.
go back to reference Turnbull, D., Barrington, L., Torres, D., Lanckriet, G.: Semantic annotation and retrieval of music and sound effects. IEEE Trans. Audio Speech Lang. Process. 16(2), 467–476 (2008)CrossRef Turnbull, D., Barrington, L., Torres, D., Lanckriet, G.: Semantic annotation and retrieval of music and sound effects. IEEE Trans. Audio Speech Lang. Process. 16(2), 467–476 (2008)CrossRef
8.
go back to reference Russell, J.A.: A circumplex model of affect. J. Pers. Soc. Psychol. 39(6), 1161–1178 (1980)CrossRef Russell, J.A.: A circumplex model of affect. J. Pers. Soc. Psychol. 39(6), 1161–1178 (1980)CrossRef
9.
go back to reference Thayer, R.E.: The Biopsychology of Mood and Arousal. Oxford University Press, Oxford (1990) Thayer, R.E.: The Biopsychology of Mood and Arousal. Oxford University Press, Oxford (1990)
10.
go back to reference Lu, L., Liu, D., Zhang, H.J.: Automatic mood detection and tracking of music audio signals. IEEE Trans. Audio Speech Lang. Process. 14(1), 5–18 (2006)CrossRef Lu, L., Liu, D., Zhang, H.J.: Automatic mood detection and tracking of music audio signals. IEEE Trans. Audio Speech Lang. Process. 14(1), 5–18 (2006)CrossRef
11.
go back to reference Yang, Y.H., Lin, Y.C., Su, Y.F., Chen, H.H.: A regression approach to music emotion recognition. IEEE Trans. Audio Speech Lang. Process. 16(2), 448–457 (2008)CrossRef Yang, Y.H., Lin, Y.C., Su, Y.F., Chen, H.H.: A regression approach to music emotion recognition. IEEE Trans. Audio Speech Lang. Process. 16(2), 448–457 (2008)CrossRef
12.
go back to reference Li, T., Ogihara, M.: Content-based music similarity search and emotion detection. In: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Proceedings, vol. 5, p. V-705. IEEE (2004) Li, T., Ogihara, M.: Content-based music similarity search and emotion detection. In: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Proceedings, vol. 5, p. V-705. IEEE (2004)
13.
go back to reference Wu, B., Zhong, E., Horner, A., Yang, Q.: Music emotion recognition by multi-label multi-layer multi-instance multi-view learning. In: Proceedings of the 22nd ACM International Conference on Multimedia, pp. 117–126. ACM (2014) Wu, B., Zhong, E., Horner, A., Yang, Q.: Music emotion recognition by multi-label multi-layer multi-instance multi-view learning. In: Proceedings of the 22nd ACM International Conference on Multimedia, pp. 117–126. ACM (2014)
14.
go back to reference Panagakis, Y., Kotropoulos, C.: Automatic music tagging via parafac2. In: 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 481–484. IEEE (2011) Panagakis, Y., Kotropoulos, C.: Automatic music tagging via parafac2. In: 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 481–484. IEEE (2011)
15.
go back to reference Liu, Y., Liu, Y., Zhao, Y., Hua, K.A.: What strikes the strings of your heart? Feature mining for music emotion analysis. IEEE Trans. Affect. Comput. 6(3), 247–260 (2015)CrossRef Liu, Y., Liu, Y., Zhao, Y., Hua, K.A.: What strikes the strings of your heart? Feature mining for music emotion analysis. IEEE Trans. Affect. Comput. 6(3), 247–260 (2015)CrossRef
16.
go back to reference Chen, Y.A., Wang, J.C., Yang, Y.H., Chen, H.: Linear regression-based adaptation of music emotion recognition models for personalization. In: 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 2149–2153. IEEE (2014) Chen, Y.A., Wang, J.C., Yang, Y.H., Chen, H.: Linear regression-based adaptation of music emotion recognition models for personalization. In: 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 2149–2153. IEEE (2014)
17.
go back to reference Weninger, F., Eyben, F., Schuller, B.: On-line continuous-time music mood regression with deep recurrent neural networks. In: 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 5412–5416. IEEE (2014) Weninger, F., Eyben, F., Schuller, B.: On-line continuous-time music mood regression with deep recurrent neural networks. In: 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 5412–5416. IEEE (2014)
18.
go back to reference Wang, X., Rosenblum, D., Wang, Y.: Context-aware mobile music recommendation for daily activities. In: Proceedings of the 20th ACM International Conference on Multimedia, pp. 99–108. ACM (2012) Wang, X., Rosenblum, D., Wang, Y.: Context-aware mobile music recommendation for daily activities. In: Proceedings of the 20th ACM International Conference on Multimedia, pp. 99–108. ACM (2012)
19.
go back to reference Lin, J.C., Wei, W.L., Wang, H.M.: Demv-matchmaker: emotional temporal course representation and deep similarity matching for automatic music video generation. In: 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 2772–2776. IEEE (2016) Lin, J.C., Wei, W.L., Wang, H.M.: Demv-matchmaker: emotional temporal course representation and deep similarity matching for automatic music video generation. In: 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 2772–2776. IEEE (2016)
20.
go back to reference Wang, S.Y., Wang, J.C., Yang, Y.H., Wang, H.M.: Towards time-varying music auto-tagging based on cal500 expansion. In: 2014 IEEE International Conference on Multimedia and Expo (ICME), pp. 1–6. IEEE (2014) Wang, S.Y., Wang, J.C., Yang, Y.H., Wang, H.M.: Towards time-varying music auto-tagging based on cal500 expansion. In: 2014 IEEE International Conference on Multimedia and Expo (ICME), pp. 1–6. IEEE (2014)
21.
go back to reference Van Rijsbergen, C.J.: The Geometry of Information Retrieval. Cambridge University Press, Cambridge (2004)CrossRefMATH Van Rijsbergen, C.J.: The Geometry of Information Retrieval. Cambridge University Press, Cambridge (2004)CrossRefMATH
24.
go back to reference Zhu, S., Wang, B., Liu, Y.: Using non-parametric quantum theory to rank images. In: 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 1049–1052. IEEE (2012) Zhu, S., Wang, B., Liu, Y.: Using non-parametric quantum theory to rank images. In: 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 1049–1052. IEEE (2012)
25.
go back to reference Sordoni, A., Nie, J.Y., Bengio, Y.: Modeling term dependencies with quantum language models for IR. In: Proceedings of the 36th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 653–662. ACM (2013) Sordoni, A., Nie, J.Y., Bengio, Y.: Modeling term dependencies with quantum language models for IR. In: Proceedings of the 36th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 653–662. ACM (2013)
26.
go back to reference Xie, M., Hou, Y., Zhang, P., Li, J., Li, W., Song, D.: Modeling quantum entanglements in quantum language models. In: Proceedings of the 24th International Joint Conference on Artificial Intelligence, pp. 1362–1368. AAAI Press (2015) Xie, M., Hou, Y., Zhang, P., Li, J., Li, W., Song, D.: Modeling quantum entanglements in quantum language models. In: Proceedings of the 24th International Joint Conference on Artificial Intelligence, pp. 1362–1368. AAAI Press (2015)
27.
go back to reference Bruza, P.D., Wang, Z., Busemeyer, J.R.: Quantum cognition: a new theoretical approach to psychology. Trends Cognit. Sci. 19(7), 383–393 (2015)CrossRef Bruza, P.D., Wang, Z., Busemeyer, J.R.: Quantum cognition: a new theoretical approach to psychology. Trends Cognit. Sci. 19(7), 383–393 (2015)CrossRef
28.
go back to reference Sakurai, J.J., Napolitano, J.: Modern Quantum Mechanics. Addison-Wesley, Reading (2011) Sakurai, J.J., Napolitano, J.: Modern Quantum Mechanics. Addison-Wesley, Reading (2011)
29.
go back to reference Bohr, N., et al.: The Quantum Postulate and the Recent Development of Atomic Theory, vol. 3. Printed in Great Britain by R. & R. Clarke, Limited (1928) Bohr, N., et al.: The Quantum Postulate and the Recent Development of Atomic Theory, vol. 3. Printed in Great Britain by R. & R. Clarke, Limited (1928)
30.
go back to reference Elisseeff, A., Weston, J.: A kernel method for multi-labelled classification. In: Advances in neural Information Processing Systems, pp. 681–687 (2001) Elisseeff, A., Weston, J.: A kernel method for multi-labelled classification. In: Advances in neural Information Processing Systems, pp. 681–687 (2001)
31.
go back to reference Zhang, M.L., Zhou, Z.H.: Multilabel neural networks with applications to functional genomics and text categorization. IEEE Trans. Knowl. Data Eng. 18(10), 1338–1351 (2006)CrossRef Zhang, M.L., Zhou, Z.H.: Multilabel neural networks with applications to functional genomics and text categorization. IEEE Trans. Knowl. Data Eng. 18(10), 1338–1351 (2006)CrossRef
32.
go back to reference Lo, H.Y., Wang, J.C., Wang, H.M., Lin, S.D.: Cost-sensitive stacking for audio tag annotation and retrieval. In: 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 2308–2311. IEEE (2011) Lo, H.Y., Wang, J.C., Wang, H.M., Lin, S.D.: Cost-sensitive stacking for audio tag annotation and retrieval. In: 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 2308–2311. IEEE (2011)
Metadata
Title
Learning Music Emotions via Quantum Convolutional Neural Network
Authors
Gong Chen
Yan Liu
Jiannong Cao
Shenghua Zhong
Yang Liu
Yuexian Hou
Peng Zhang
Copyright Year
2017
DOI
https://doi.org/10.1007/978-3-319-70772-3_5

Premium Partner