Skip to main content

2021 | OriginalPaper | Buchkapitel

Music Genre Classification ChatBot

verfasst von : Rishit Jain, Ritik Sharma, Preeti Nagrath, Rachna Jain

Erschienen in: Proceedings of Second International Conference on Computing, Communications, and Cyber-Security

Verlag: Springer Singapore

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Classification of music on the basis of genre is a sub-domain of the multidisciplinary field of music information retrieval (MIR) that is gaining traction among researchers and data scientists. Even though this problem has been extensively researched and tested, the problem still lies in the foundations, as the true definition of genre still lies to the mercy of human subjectivity. In this paper, we have proposed a classification model which employs a convolutional neural network (CNN) to differentiate between audio files by assessing the visual representations of their timbral features [1]. The music genre classification model is outlined by a ChatBot model built using NLTK, which can simulate an intelligent conversation with a user, and it employs a feature that enables it to recognize and process the audio file based on the input from the user. The GTZAN dataset [2] was used for training the music genre classification model, and the so trained model for this purpose yielded an accuracy of nearly 68.9%. The accuracy so obtained is relatively better than several other classification models that we had researched. Through extensive research and constant trials, we can state, with some certainty, that such a system can be extensively used alongside several music streaming services, as it would facilitate the process of automation of the classification of songs.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Rabiner L, Juang B (1993) Fundamentals of speech recognition. Prentice-Hall, NJ Rabiner L, Juang B (1993) Fundamentals of speech recognition. Prentice-Hall, NJ
3.
Zurück zum Zitat MUSIC type classification by spectral contrast feature. Department of Computer Science and Technology, Tsinghua University, China {llu, hjzhang}@ microsoft.com. Database, pp 0–3 MUSIC type classification by spectral contrast feature. Department of Computer Science and Technology, Tsinghua University, China {llu, hjzhang}@ microsoft.com. Database, pp 0–3
4.
Zurück zum Zitat Caclin A, McAdams S, Smith BK, Winsberg S (2005) Acoustic correlates of timbre space dimensions: a confirmatory study using synthetic tones. J Acoust Soc Am 118:471CrossRef Caclin A, McAdams S, Smith BK, Winsberg S (2005) Acoustic correlates of timbre space dimensions: a confirmatory study using synthetic tones. J Acoust Soc Am 118:471CrossRef
6.
Zurück zum Zitat Li D, Sethi IK, Dimitrova N, McGee T (2001) Classification of general audio data for content-based retrieval. Pattern Recogn Lett 22(5):533–544CrossRef Li D, Sethi IK, Dimitrova N, McGee T (2001) Classification of general audio data for content-based retrieval. Pattern Recogn Lett 22(5):533–544CrossRef
7.
Zurück zum Zitat Lambrou T, Kudumakis P, Speller R, Sandler M, Linney A (1998) Classification of audio signals using statistical features on time and wavelet transform domains. In: Proceedings of the 1998 IEEE international conference on acoustics, speech and signal processing, ICASSP’98 (Cat. No. 98CH36181), vol 6, pp 3621–3624 Lambrou T, Kudumakis P, Speller R, Sandler M, Linney A (1998) Classification of audio signals using statistical features on time and wavelet transform domains. In: Proceedings of the 1998 IEEE international conference on acoustics, speech and signal processing, ICASSP’98 (Cat. No. 98CH36181), vol 6, pp 3621–3624
8.
Zurück zum Zitat Deshpande H, Singh R, Nam U (2001) Classification of music signals in the visual domain. In Proceedings of the COST-G6 conference on digital audio effects Deshpande H, Singh R, Nam U (2001) Classification of music signals in the visual domain. In Proceedings of the COST-G6 conference on digital audio effects
15.
Zurück zum Zitat Elhadad M (2010) Natural language processing with python Steven Bird, Ewan Klein, and Edward Loper. University of Melbourne, University of Edinburgh, and BBN Technologies) O’Reilly Media, Sebastopol, CA, xx + 482 pp; paperbound, ISBN 978-0-596-51649-9, $44.99; on-line free of charge at nltk.org/book. Comput Linguist 36:767–771. https://doi.org/10.1162/coli_r_00022 Elhadad M (2010) Natural language processing with python Steven Bird, Ewan Klein, and Edward Loper. University of Melbourne, University of Edinburgh, and BBN Technologies) O’Reilly Media, Sebastopol, CA, xx + 482 pp; paperbound, ISBN 978-0-596-51649-9, $44.99; on-line free of charge at nltk.org/book. Comput Linguist 36:767–771. https://​doi.​org/​10.​1162/​coli_​r_​00022
18.
Zurück zum Zitat Liang Y, Zhou Y, Wan T, Shu X, (2019) Deep neural networks with depthwise separable convolution for music genre classification. In: IEEE 2nd international conference on information communication and signal processing, ICICSP 2019, pp 267–270. 10.1109/ICICSP48821.2019.8958603 Liang Y, Zhou Y, Wan T, Shu X, (2019) Deep neural networks with depthwise separable convolution for music genre classification. In: IEEE 2nd international conference on information communication and signal processing, ICICSP 2019, pp 267–270. 10.1109/ICICSP48821.2019.8958603
19.
Zurück zum Zitat Costa YMG, Oliveria LS, Koerich AL et al (2011) Music genre recognition using spectrograms. In: 2011 18th International conference on systems, signals and image processing, pp 1–4 Costa YMG, Oliveria LS, Koerich AL et al (2011) Music genre recognition using spectrograms. In: 2011 18th International conference on systems, signals and image processing, pp 1–4
20.
Zurück zum Zitat Ng AY (2004) Feature selection, L 1 vs. L 2 regularization, and rotational invariance. In: Proceedings of the twenty-first international conference on machine learning, p 78 Ng AY (2004) Feature selection, L 1 vs. L 2 regularization, and rotational invariance. In: Proceedings of the twenty-first international conference on machine learning, p 78
21.
Zurück zum Zitat Bahuleyan H (2018) Music genre classification using machine learning techniques Bahuleyan H (2018) Music genre classification using machine learning techniques
22.
Zurück zum Zitat Choi K, Joo D, Kim J (2017) Kapre: on-GPU audio preprocessing layers for a quick implementation of deep neural network models with Keras Choi K, Joo D, Kim J (2017) Kapre: on-GPU audio preprocessing layers for a quick implementation of deep neural network models with Keras
23.
Zurück zum Zitat Chillara S, Kavitha AS, Neginhal SA, Haldia S, Vidyullatha KS (2019) Music genre classification using machine learning algorithms: a comparison. Int Res J Eng Technol 6(5):851–858 Chillara S, Kavitha AS, Neginhal SA, Haldia S, Vidyullatha KS (2019) Music genre classification using machine learning algorithms: a comparison. Int Res J Eng Technol 6(5):851–858
24.
Zurück zum Zitat Avinash SV (2017) Understanding activation functions in neural networks. Medium 4(12):1–10 Avinash SV (2017) Understanding activation functions in neural networks. Medium 4(12):1–10
25.
Zurück zum Zitat Zhang Z (2018) Improved adam optimizer for deep neural networks. In: 2018 IEEE/ACM 26th international symposium on quality of service (IWQoS), Banff, AB, Canada, pp 1–2. 10.1109/IWQoS.2018.8624183 Zhang Z (2018) Improved adam optimizer for deep neural networks. In: 2018 IEEE/ACM 26th international symposium on quality of service (IWQoS), Banff, AB, Canada, pp 1–2. 10.1109/IWQoS.2018.8624183
27.
Zurück zum Zitat Tang Y (2016) TF.Learn: TensorFlow’s high-level module for distributed machine learning, pp 1–7 Tang Y (2016) TF.Learn: TensorFlow’s high-level module for distributed machine learning, pp 1–7
28.
Zurück zum Zitat Bertin-Mahieux T, Ellis DPW, Whitman B, Lamere P (2011) The million song dataset. In: Proceedings of the 12th international society for music information retrieval conference ISMIR 2011, pp 591–596 Bertin-Mahieux T, Ellis DPW, Whitman B, Lamere P (2011) The million song dataset. In: Proceedings of the 12th international society for music information retrieval conference ISMIR 2011, pp 591–596
Metadaten
Titel
Music Genre Classification ChatBot
verfasst von
Rishit Jain
Ritik Sharma
Preeti Nagrath
Rachna Jain
Copyright-Jahr
2021
Verlag
Springer Singapore
DOI
https://doi.org/10.1007/978-981-16-0733-2_27

Neuer Inhalt