nach oben

Erschienen in:

2021 | OriginalPaper | Buchkapitel

Music Genre Classification ChatBot

verfasst von : Rishit Jain, Ritik Sharma, Preeti Nagrath, Rachna Jain

Erschienen in: Proceedings of Second International Conference on Computing, Communications, and Cyber-Security

Verlag: Springer Singapore

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config

KI-gestützte Suche

Aus

Abstract

Classification of music on the basis of genre is a sub-domain of the multidisciplinary field of music information retrieval (MIR) that is gaining traction among researchers and data scientists. Even though this problem has been extensively researched and tested, the problem still lies in the foundations, as the true definition of genre still lies to the mercy of human subjectivity. In this paper, we have proposed a classification model which employs a convolutional neural network (CNN) to differentiate between audio files by assessing the visual representations of their timbral features [1]. The music genre classification model is outlined by a ChatBot model built using NLTK, which can simulate an intelligent conversation with a user, and it employs a feature that enables it to recognize and process the audio file based on the input from the user. The GTZAN dataset [2] was used for training the music genre classification model, and the so trained model for this purpose yielded an accuracy of nearly 68.9%. The accuracy so obtained is relatively better than several other classification models that we had researched. Through extensive research and constant trials, we can state, with some certainty, that such a system can be extensively used alongside several music streaming services, as it would facilitate the process of automation of the classification of songs.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Vorheriges Kapitel Machine Learning Applications for Computer-Aided Medical Diagnostics

Nächstes Kapitel Detection of COVID-19 by X-rays Using Machine Learning and Deep Learning Models

Rabiner L, Juang B (1993) Fundamentals of speech recognition. Prentice-Hall, NJ

Tzanetakis G, Cook P (2002) Musical genre classification of audio signals. IEEE Trans Speech Audio Process 10(5):293–302. https://doi.org/10.1109/TSA.2002.800560CrossRef

MUSIC type classification by spectral contrast feature. Department of Computer Science and Technology, Tsinghua University, China {llu, hjzhang}@ microsoft.com. Database, pp 0–3

Caclin A, McAdams S, Smith BK, Winsberg S (2005) Acoustic correlates of timbre space dimensions: a confirmatory study using synthetic tones. J Acoust Soc Am 118:471CrossRef

Følstad, Brandtzaeg PB (2017) Chatbots and the new world of HCI. Interactions 24(4):38–42. https://doi.org/10.1145/3085558

Li D, Sethi IK, Dimitrova N, McGee T (2001) Classification of general audio data for content-based retrieval. Pattern Recogn Lett 22(5):533–544CrossRef

Lambrou T, Kudumakis P, Speller R, Sandler M, Linney A (1998) Classification of audio signals using statistical features on time and wavelet transform domains. In: Proceedings of the 1998 IEEE international conference on acoustics, speech and signal processing, ICASSP’98 (Cat. No. 98CH36181), vol 6, pp 3621–3624

Deshpande H, Singh R, Nam U (2001) Classification of music signals in the visual domain. In Proceedings of the COST-G6 conference on digital audio effects

Soltau H, Schultz T, Westphal M, Waibel A (1998) Recognition of music types. In: Proceedings of the 1998 IEEE international conference on acoustics, speech and signal processing ICASSP, vol 2, pp 1137–1140. https://doi.org/10.1109/icassp.1998.675470

10.

Vyas G, Dutta MK (2014) Automatic mood detection of indian music using mfccs and k-means algorithm. In: 2014 7th International conference on contemporary computing IC3 2014, pp 117–122. https://doi.org/10.1109/ic3.2014.6897159

11.

Asim M, Ahmed Z (2017) Automatic music genres classification using machine learning. Int J Adv Comput Sci Appl 8(8):337–344. https://doi.org/10.14569/ijacsa.2017.080844CrossRef

12.

Ramírez J, Flores MJ (2019) Machine learning for music genre: multifaceted review and experimentation with audioset. J Intell Inf Syst. https://doi.org/10.1007/s10844-019-00582-9CrossRef

13.

Liu C, Feng L, Liu G, Wang H, Liu S (2019) Bottom-up broadcast neural network for music genre classification, pp 1–7. [Online]. Available: http://arxiv.org/abs/1901.08928

14.

Dokania S, Singh V (2019) Graph representation learning for audio & music genre classification, no. 2017. [Online]. Available: http://arxiv.org/abs/1910.11117

15.

Elhadad M (2010) Natural language processing with python Steven Bird, Ewan Klein, and Edward Loper. University of Melbourne, University of Edinburgh, and BBN Technologies) O’Reilly Media, Sebastopol, CA, xx + 482 pp; paperbound, ISBN 978-0-596-51649-9, $44.99; on-line free of charge at nltk.org/book. Comput Linguist 36:767–771. https://doi.org/10.1162/coli_r_00022

16.

Loper E, Bird S (2002) NLTK: the natural language toolkit. [Online]. Available: http://arxiv.org/abs/cs/0205028

17.

Salamon J, Bello JP (2017) Deep convolutional neural networks and data augmentation for environmental sound classification. IEEE Signal Process Lett 24(3):279–283. https://doi.org/10.1109/LSP.2017.2657381CrossRef

18.

Liang Y, Zhou Y, Wan T, Shu X, (2019) Deep neural networks with depthwise separable convolution for music genre classification. In: IEEE 2nd international conference on information communication and signal processing, ICICSP 2019, pp 267–270. 10.1109/ICICSP48821.2019.8958603

19.

Costa YMG, Oliveria LS, Koerich AL et al (2011) Music genre recognition using spectrograms. In: 2011 18th International conference on systems, signals and image processing, pp 1–4

20.

Ng AY (2004) Feature selection, L 1 vs. L 2 regularization, and rotational invariance. In: Proceedings of the twenty-first international conference on machine learning, p 78

21.

Bahuleyan H (2018) Music genre classification using machine learning techniques

22.

Choi K, Joo D, Kim J (2017) Kapre: on-GPU audio preprocessing layers for a quick implementation of deep neural network models with Keras

23.

Chillara S, Kavitha AS, Neginhal SA, Haldia S, Vidyullatha KS (2019) Music genre classification using machine learning algorithms: a comparison. Int Res J Eng Technol 6(5):851–858

24.

Avinash SV (2017) Understanding activation functions in neural networks. Medium 4(12):1–10

25.

Zhang Z (2018) Improved adam optimizer for deep neural networks. In: 2018 IEEE/ACM 26th international symposium on quality of service (IWQoS), Banff, AB, Canada, pp 1–2. 10.1109/IWQoS.2018.8624183

26.

Abdul-Kader S, John D (2015) Survey on Chatbot design techniques in speech conversation systems. Int J Adv Comput Sci Appl 6(7):72–80. https://doi.org/10.14569/ijacsa.2015.060712

27.

Tang Y (2016) TF.Learn: TensorFlow’s high-level module for distributed machine learning, pp 1–7

28.

Bertin-Mahieux T, Ellis DPW, Whitman B, Lamere P (2011) The million song dataset. In: Proceedings of the 12th international society for music information retrieval conference ISMIR 2011, pp 591–596

Titel: Music Genre Classification ChatBot
verfasst von: Rishit Jain
Ritik Sharma
Preeti Nagrath
Rachna Jain
Verlag: Springer Singapore
Buch: Proceedings of Second International Conference on Computing, Communications, and Cyber-Security
Print ISBN: 978-981-16-0732-5

Electronic ISBN: 978-981-16-0733-2

Copyright-Jahr: 2021
DOI: https://doi.org/10.1007/978-981-16-0733-2_27

Neuer Inhalt

Bildnachweise

Smart-Manufacturing Dashboard Banner/© AdobeStock_583269095, VDI-Icon, Profil Icon, inhalt2, Springer Professional Modul/© Springer Fachmedien Wiesbaden GmbH, Sustainability in Finance_2024/© Robert Kneschke / stock.adobe.com, Search Icon, Banner Hanser, Dirk Wolters/© Netec GmbH, NPL Kreditzweitmarktgesetz/© Good_Stock / Getty Images / iStock, Verbrennungsmotor/© OkFoto.it / stock.adobe.com, Zeitschrift Wissensmanagement Cover, PatentFit-Logo/© Springer Fachmedien Wiesbaden GmbH, Sustainibility Finance/© Robert Kneschke / stock.adobe.com / Springer Fachmedien Wiesbaden GmbH, Zukunftswerkstatt Sales Excellence 2024/© AndreyPopov / Getty Images / iStock, 2023_Antrieb/© supervisuell

Springer Professional

Abstract

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Springer Professional "Technik"

Springer Professional "Wirtschaft"

Neuer Inhalt

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.