Skip to main content
Top

2020 | OriginalPaper | Chapter

Automatic Classification of Carnatic Music Instruments Using MFCC and LPC

Authors : Surendra Shetty, Sarika Hegde

Published in: Data Management, Analytics and Innovation

Publisher: Springer Singapore

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

With a large collection of digital music in recent days, the challenge is to organize and access the music efficiently. Research in the field of Music Information Retrieval (MIR) focuses on these challenges. In this paper, we develop a system which automatically identifies the instrument in a given Carnatic music on ten different types of instruments. We extract the well-known features namely, MFCC and LPC, and analyze the capability of these features in distinguishing different instruments. Then, we apply, the classification techniques like, Artificial Neural Network, Support Vector Machine, and Bayesian classifiers on those features. We compare the performances of those algorithms along with different features for Carnatic music instruments identification.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Aucouturier, J.J., Pachet, F.: Improving timbre similarity: how high’s the sky? J. Negat. Results Speech Audio Sci. 1(1), 1–13 (2004) Aucouturier, J.J., Pachet, F.: Improving timbre similarity: how high’s the sky? J. Negat. Results Speech Audio Sci. 1(1), 1–13 (2004)
2.
go back to reference Banerjee, A., Ghosh, A., Palit, S., Ballester, M.A.F.: A novel approach to string instrument recognition. In: International Conference on Image and Signal Processing, pp. 165–175. Springer, Cham (2018) Banerjee, A., Ghosh, A., Palit, S., Ballester, M.A.F.: A novel approach to string instrument recognition. In: International Conference on Image and Signal Processing, pp. 165–175. Springer, Cham (2018)
3.
go back to reference Benetos, E., Kotti, M., Kotropoulos, C.: Large scale musical instrument identification. In: Proceedings SMC’07, 4th Sound and Music Computing Conference, Greece (2007) Benetos, E., Kotti, M., Kotropoulos, C.: Large scale musical instrument identification. In: Proceedings SMC’07, 4th Sound and Music Computing Conference, Greece (2007)
4.
go back to reference Chetry, N., Sandier, M.: Linear predictive model for musical instrument detection. In: IEEE Conference on Acoustics, Speech and Signal Processing, vol. 5 (2006) Chetry, N., Sandier, M.: Linear predictive model for musical instrument detection. In: IEEE Conference on Acoustics, Speech and Signal Processing, vol. 5 (2006)
5.
go back to reference Diment, A., Heittola, T., Virtanen, T.: Semi-supervised learning for musical instrument recognition. In: 2013 Proceedings of the 21st European Signal Processing Conference (EUSIPCO), pp. 1–5. IEEE (2013) Diment, A., Heittola, T., Virtanen, T.: Semi-supervised learning for musical instrument recognition. In: 2013 Proceedings of the 21st European Signal Processing Conference (EUSIPCO), pp. 1–5. IEEE (2013)
6.
go back to reference Duda, R.O., Hart, P.E., Stork, D.G.: Pattern Classification. Wiley Publications, Hoboken (2006)MATH Duda, R.O., Hart, P.E., Stork, D.G.: Pattern Classification. Wiley Publications, Hoboken (2006)MATH
7.
go back to reference Giannoulis, D., Klapuri, A.: Musical instrument recognition in polyphonic audio using missing feature approach. IEEE Trans. Audio Speech Lang. Process. 21(9), 1805–1817 (2013)CrossRef Giannoulis, D., Klapuri, A.: Musical instrument recognition in polyphonic audio using missing feature approach. IEEE Trans. Audio Speech Lang. Process. 21(9), 1805–1817 (2013)CrossRef
8.
go back to reference Giannoulis, D., Benetos, E., Klapuri, A., Plumbley, M.D.: Improving instrument recognition in polyphonic music through system integration. In: 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 5222–5226. IEEE (2014) Giannoulis, D., Benetos, E., Klapuri, A., Plumbley, M.D.: Improving instrument recognition in polyphonic music through system integration. In: 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 5222–5226. IEEE (2014)
9.
go back to reference Gold, B., Morgan, N.: Speech and Audio Signal Processing—Processing and Perception of Speech and Music. Wiley India Pvt. Ltd., ISBN: 81–265-0822-1 (2006) Gold, B., Morgan, N.: Speech and Audio Signal Processing—Processing and Perception of Speech and Music. Wiley India Pvt. Ltd., ISBN: 81–265-0822-1 (2006)
10.
go back to reference He, X., Zhou, X.: Audio classification by hybrid support vector machine/hidden Markov model. UK World J. Model. Simul. 1(1), 56–59 (2005)MathSciNet He, X., Zhou, X.: Audio classification by hybrid support vector machine/hidden Markov model. UK World J. Model. Simul. 1(1), 56–59 (2005)MathSciNet
11.
go back to reference Hg, R., Sreenivas, T.V.: Multi-instrument detection in polyphonic music using Gaussian Mixture based factorial HMM. In: 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 191–195. IEEE (2015) Hg, R., Sreenivas, T.V.: Multi-instrument detection in polyphonic music using Gaussian Mixture based factorial HMM. In: 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 191–195. IEEE (2015)
12.
go back to reference Kim, Y.K., Brian, Y.: Singer identification in popular music recordings using voice coding features. In: Proceedings of ISMIR (2002) Kim, Y.K., Brian, Y.: Singer identification in popular music recordings using voice coding features. In: Proceedings of ISMIR (2002)
13.
go back to reference Martin, K.D., Kim, Y.E.: 2pMU9: musical instrument identification: a pattern-recognition approach. In: 136th Meeting of the ASA (1998) Martin, K.D., Kim, Y.E.: 2pMU9: musical instrument identification: a pattern-recognition approach. In: 136th Meeting of the ASA (1998)
14.
go back to reference Martinez, W.L., Martinez, A.R.: Computational Statistics Handbook with Matlab. Chapman & Hall/CRC Publications, ISBN: 1-58488-566-1 (2008) Martinez, W.L., Martinez, A.R.: Computational Statistics Handbook with Matlab. Chapman & Hall/CRC Publications, ISBN: 1-58488-566-1 (2008)
15.
go back to reference Mesaros, A., Astola, J.: The mel-frequency cepstral coefficients in the context of singer identification. In: Proceedings of the International Conference on Music Information Retrieval (2005) Mesaros, A., Astola, J.: The mel-frequency cepstral coefficients in the context of singer identification. In: Proceedings of the International Conference on Music Information Retrieval (2005)
16.
go back to reference Mukherjee, H., Obaidullah, S.M., Phadikar, S., Roy, K.: MISNA—a musical instrument segregation system from noisy audio with LPCC-S features and extreme learning. Multimed. Tools Appl. 77, 27997–28022 (2018)CrossRef Mukherjee, H., Obaidullah, S.M., Phadikar, S., Roy, K.: MISNA—a musical instrument segregation system from noisy audio with LPCC-S features and extreme learning. Multimed. Tools Appl. 77, 27997–28022 (2018)CrossRef
17.
go back to reference Murthy, Y.V., Koolagudi, S.G.: Content-based music information retrieval (CB-MIR) and its applications toward the music industry: a review. ACM Comput. Surv. (CSUR) 51(3), 45 (2018)CrossRef Murthy, Y.V., Koolagudi, S.G.: Content-based music information retrieval (CB-MIR) and its applications toward the music industry: a review. ACM Comput. Surv. (CSUR) 51(3), 45 (2018)CrossRef
18.
go back to reference Peltonen, V., Tuomi, J., Klapuri, A, Huopaniemi, J., Sorsa, T.: Computational Auditory Scene Recognition. In: IEEE International Conference on Acoustics Speech and Signal Processing, vol. 2 (2002) Peltonen, V., Tuomi, J., Klapuri, A, Huopaniemi, J., Sorsa, T.: Computational Auditory Scene Recognition. In: IEEE International Conference on Acoustics Speech and Signal Processing, vol. 2 (2002)
19.
go back to reference Rabiner, L., Jaung, B.: Fundamentals of Speech Recognition. Prentice Hall, ISBN:10:013051572 (1993) Rabiner, L., Jaung, B.: Fundamentals of Speech Recognition. Prentice Hall, ISBN:10:013051572 (1993)
20.
go back to reference Slaney, M.: Auditory toolbox: a MATLAB Toolbox for auditory modeling work. Technical Report 1998-010, Interval Research Corporation, Palo Alto, CA, USA, 1998, Version 2 (1998) Slaney, M.: Auditory toolbox: a MATLAB Toolbox for auditory modeling work. Technical Report 1998-010, Interval Research Corporation, Palo Alto, CA, USA, 1998, Version 2 (1998)
21.
go back to reference Sturm, B.L., Morvidone, M., Daudet, L.: Musical instrument identification using multiscale mel-frequency cepstral coefficients. In: 18th European Signal Processing Conference (EUSIPCO-2010) (2010) Sturm, B.L., Morvidone, M., Daudet, L.: Musical instrument identification using multiscale mel-frequency cepstral coefficients. In: 18th European Signal Processing Conference (EUSIPCO-2010) (2010)
22.
go back to reference Tan, P., Steinbach, M., Kumar, V.: Introduction to Data Mining. Pearson Addison Wesley, ISBN: 978-81-317-1472-0 (2006) Tan, P., Steinbach, M., Kumar, V.: Introduction to Data Mining. Pearson Addison Wesley, ISBN: 978-81-317-1472-0 (2006)
23.
go back to reference Theodoridis, S., Koutroumbas, K.: Pattern Recognition, 4th edn. Academic Press, London (2009)MATH Theodoridis, S., Koutroumbas, K.: Pattern Recognition, 4th edn. Academic Press, London (2009)MATH
24.
go back to reference Uhlich, S., Giron, F., Mitsufuji, Y.: Deep neural network based instrument extraction from music. In: ICASSP, pp. 2135–2139 (2015) Uhlich, S., Giron, F., Mitsufuji, Y.: Deep neural network based instrument extraction from music. In: ICASSP, pp. 2135–2139 (2015)
25.
go back to reference Zhang, T.: System and method for automatic singer identification. In: Proceedings of the IEEE Conference on Multimedia and Expo, vol. 1, pp. 33–36 (2003) Zhang, T.: System and method for automatic singer identification. In: Proceedings of the IEEE Conference on Multimedia and Expo, vol. 1, pp. 33–36 (2003)
26.
go back to reference Zlatintsi, A., Maragos, P.: Multiscale fractal analysis of musical instrument signals with application to recognition. IEEE Trans. Audio Speech Lang. Process. 21(4), 737–748 (2013)CrossRef Zlatintsi, A., Maragos, P.: Multiscale fractal analysis of musical instrument signals with application to recognition. IEEE Trans. Audio Speech Lang. Process. 21(4), 737–748 (2013)CrossRef
Metadata
Title
Automatic Classification of Carnatic Music Instruments Using MFCC and LPC
Authors
Surendra Shetty
Sarika Hegde
Copyright Year
2020
Publisher
Springer Singapore
DOI
https://doi.org/10.1007/978-981-32-9949-8_32