Skip to main content

2017 | OriginalPaper | Buchkapitel

Generalisation Performance of Western Instrument Recognition Models in Polyphonic Mixtures with Ethnic Samples

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Instrument recognition in polyphonic audio recordings is a very complex task. Most research studies until now were focussed on the recognition of Western instruments in Western classical and popular music, but also an increasing number of recent works addressed the classification of ethnic/world recordings. However, such studies are typically restricted to one kind of music and do not measure the bias of “Western” effect, i.e., the danger of overfitting towards Western music when the classification models are optimised only for such tracks. In this paper, we analyse the performance of several instrument classification models which are trained and optimised on polyphonic mixtures of Western instruments, but independently validated on mixtures created with randomly added ethnic samples. The conducted experiments include evolutionary multi-objective feature selection from a large set of audio signal descriptors and the estimation of individual feature relevance.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
4
In this study, the reference point is (1,1): a theoretical solution which uses all features and leads to the classification error \(e=1\).
 
5
For all applied tests in this paper, we use a standard value of 5% for the significance level.
 
6
The statistical observations are shortened for simplicity reasons and should be interpreted with certain restrictions. Obviously, they hold only for tested instruments, mixtures, features, feature processing, and feature selection method.
 
Literatur
1.
Zurück zum Zitat Abdoli, S.: Iranian traditional music dastgah classification. In: Proceedings of the 12th International Society for Music Information Retrieval Conference (ISMIR), pp. 275–280 (2011) Abdoli, S.: Iranian traditional music dastgah classification. In: Proceedings of the 12th International Society for Music Information Retrieval Conference (ISMIR), pp. 275–280 (2011)
2.
Zurück zum Zitat Agarwal, P., Karnick, H., Raj, B.: A comparative study of Indian and western music forms. In: Proceedings of the 14th International Society for Music Information Retrieval Conference (ISMIR), pp. 29–34 (2013) Agarwal, P., Karnick, H., Raj, B.: A comparative study of Indian and western music forms. In: Proceedings of the 14th International Society for Music Information Retrieval Conference (ISMIR), pp. 29–34 (2013)
3.
Zurück zum Zitat Benetos, E., Holzapfel, A.: Automatic transcription of Turkish makam music. In: Proceedings of the 14th International Society for Music Information Retrieval Conference (ISMIR), pp. 355–360 (2013) Benetos, E., Holzapfel, A.: Automatic transcription of Turkish makam music. In: Proceedings of the 14th International Society for Music Information Retrieval Conference (ISMIR), pp. 355–360 (2013)
4.
Zurück zum Zitat Brown, J.C., Houix, O., Mcadams, S.: Feature dependence in the automatic identification of musical woodwind instruments. J. Acoust. Soc. Am. 109(3), 1064–1072 (2001)CrossRef Brown, J.C., Houix, O., Mcadams, S.: Feature dependence in the automatic identification of musical woodwind instruments. J. Acoust. Soc. Am. 109(3), 1064–1072 (2001)CrossRef
5.
Zurück zum Zitat Ding, C.H.Q., Peng, H.: Minimum redundancy feature selection from microarray gene expression data. J. Bioinform. Comput. Biol. 3(2), 185–205 (2005)CrossRef Ding, C.H.Q., Peng, H.: Minimum redundancy feature selection from microarray gene expression data. J. Bioinform. Comput. Biol. 3(2), 185–205 (2005)CrossRef
6.
Zurück zum Zitat Eerola, T.: Are the emotions expressed in music genre-specific? An audio-based evaluation of datasets spanning classical, film, pop and mixed genres. J. New Music Res. 40(3), 349–366 (2011)CrossRef Eerola, T.: Are the emotions expressed in music genre-specific? An audio-based evaluation of datasets spanning classical, film, pop and mixed genres. J. New Music Res. 40(3), 349–366 (2011)CrossRef
7.
Zurück zum Zitat Eerola, T., Ferrer, R.: Instrument library (MUMS) revised. Music Percept. 25(3), 253–255 (2008)CrossRef Eerola, T., Ferrer, R.: Instrument library (MUMS) revised. Music Percept. 25(3), 253–255 (2008)CrossRef
8.
Zurück zum Zitat Eggink, J., Brown, G.J.: Instrument recognition in accompanied sonatas and concertos. In: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 217–220 (2004) Eggink, J., Brown, G.J.: Instrument recognition in accompanied sonatas and concertos. In: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 217–220 (2004)
9.
Zurück zum Zitat Emmerich, M., Beume, N., Naujoks, B.: An EMO algorithm using the hypervolume measure as selection criterion. In: Coello Coello, C.A., Hernández Aguirre, A., Zitzler, E. (eds.) EMO 2005. LNCS, vol. 3410, pp. 62–76. Springer, Heidelberg (2005). doi:10.1007/978-3-540-31880-4_5 CrossRef Emmerich, M., Beume, N., Naujoks, B.: An EMO algorithm using the hypervolume measure as selection criterion. In: Coello Coello, C.A., Hernández Aguirre, A., Zitzler, E. (eds.) EMO 2005. LNCS, vol. 3410, pp. 62–76. Springer, Heidelberg (2005). doi:10.​1007/​978-3-540-31880-4_​5 CrossRef
10.
Zurück zum Zitat Eronen, A.J., Klapuri, A.: Musical instrument recognition using cepstral coefficients and temporal features. In: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 753–756 (2000) Eronen, A.J., Klapuri, A.: Musical instrument recognition using cepstral coefficients and temporal features. In: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 753–756 (2000)
11.
Zurück zum Zitat Essid, S., Richard, G., David, B.: Instrument recognition in polyphonic music based on automatic taxonomies. IEEE Trans. Audio Speech Lang. Process. 14(1), 68–80 (2006)CrossRef Essid, S., Richard, G., David, B.: Instrument recognition in polyphonic music based on automatic taxonomies. IEEE Trans. Audio Speech Lang. Process. 14(1), 68–80 (2006)CrossRef
12.
Zurück zum Zitat Fuhrmann, F.: Automatic musical instrument recognition from polyphonic music audio signals. Ph.D. thesis, Universitat Pompeu Fabra (2012) Fuhrmann, F.: Automatic musical instrument recognition from polyphonic music audio signals. Ph.D. thesis, Universitat Pompeu Fabra (2012)
13.
Zurück zum Zitat Gaikwad, S., Chitre, A.V., Dandawate, Y.H.: Classification of Indian classical instruments using spectral and principal component analysis based cepstrum features. In: Proceedings of the 2014 International Conference on Electronic Systems, Signal Processing and Computing Technologies (ICESC), pp. 276–279 (2014) Gaikwad, S., Chitre, A.V., Dandawate, Y.H.: Classification of Indian classical instruments using spectral and principal component analysis based cepstrum features. In: Proceedings of the 2014 International Conference on Electronic Systems, Signal Processing and Computing Technologies (ICESC), pp. 276–279 (2014)
14.
Zurück zum Zitat Goto, M., Hashiguchi, H., Nishimura, T., Oka, R.: RWC music database: Music genre database and musical instrument sound database. In: Proceedings of the 4th International Conference on Music Information Retrieval (ISMIR), pp. 229–230 (2003) Goto, M., Hashiguchi, H., Nishimura, T., Oka, R.: RWC music database: Music genre database and musical instrument sound database. In: Proceedings of the 4th International Conference on Music Information Retrieval (ISMIR), pp. 229–230 (2003)
15.
Zurück zum Zitat Gunasekaran, S., Revathy, K.: Fractal dimension analysis of audio signals for Indian musical instrument recognition. In: Proceedings of the International Conference on Audio, Language and Image Processing (ICALIP), pp. 257–261 (2008) Gunasekaran, S., Revathy, K.: Fractal dimension analysis of audio signals for Indian musical instrument recognition. In: Proceedings of the International Conference on Audio, Language and Image Processing (ICALIP), pp. 257–261 (2008)
16.
Zurück zum Zitat Guyon, I., Nikravesh, M., Gunn, S., Zadeh, L.A.: Feature Extraction: Foundations and Applications. Springer, Heidelberg (2006)CrossRefMATH Guyon, I., Nikravesh, M., Gunn, S., Zadeh, L.A.: Feature Extraction: Foundations and Applications. Springer, Heidelberg (2006)CrossRefMATH
17.
Zurück zum Zitat Hastie, T., Tibshirani, R., Friedman, J.: The Elements of Statistical Learning. Springer, New York (2009)CrossRefMATH Hastie, T., Tibshirani, R., Friedman, J.: The Elements of Statistical Learning. Springer, New York (2009)CrossRefMATH
18.
Zurück zum Zitat Heittola, T., Klapuri, A., Virtanen, T.: Musical instrument recognition in polyphonic audio using source-filter model for sound separation. In: Proceedings of the 10th International Society for Music Information Retrieval Conference (ISMIR), pp. 327–332 (2009) Heittola, T., Klapuri, A., Virtanen, T.: Musical instrument recognition in polyphonic audio using source-filter model for sound separation. In: Proceedings of the 10th International Society for Music Information Retrieval Conference (ISMIR), pp. 327–332 (2009)
19.
Zurück zum Zitat Koduri, G.K., Miron, M., Serrà, J., Serra, X.: Computational approaches for the understanding of melody in carnatic music. In: Proceedings of the 12th International Society for Music Information Retrieval Conference (ISMIR), pp. 263–268 (2011) Koduri, G.K., Miron, M., Serrà, J., Serra, X.: Computational approaches for the understanding of melody in carnatic music. In: Proceedings of the 12th International Society for Music Information Retrieval Conference (ISMIR), pp. 263–268 (2011)
20.
Zurück zum Zitat Lartillot, O., Toiviainen, P.: MIR in Matlab (II): A toolbox for musical feature extraction from audio. In: Proceedings of the 8th International Conference on Music Information Retrieval (ISMIR), pp. 127–130 (2007) Lartillot, O., Toiviainen, P.: MIR in Matlab (II): A toolbox for musical feature extraction from audio. In: Proceedings of the 8th International Conference on Music Information Retrieval (ISMIR), pp. 127–130 (2007)
21.
Zurück zum Zitat Lashari, S.A., Ibrahim, R., Senan, N.: Soft set theory for automatic classification of traditional pakistani musical instruments sounds. In: Proceedings of the International Conference on Computer Information Science (ICCIS), pp. 94–99 (2012) Lashari, S.A., Ibrahim, R., Senan, N.: Soft set theory for automatic classification of traditional pakistani musical instruments sounds. In: Proceedings of the International Conference on Computer Information Science (ICCIS), pp. 94–99 (2012)
22.
Zurück zum Zitat Lidy, T., Silla Jr., C.N., Cornelis, O., Gouyon, F., Rauber, A., Kaestner, C.A.A., Koerich, A.L.: On the suitability of state-of-the-art music information retrieval methods for analyzing, categorizing and accessing non-Western and ethnic music collections. Signal Process. 90(4), 1032–1048 (2010)CrossRefMATH Lidy, T., Silla Jr., C.N., Cornelis, O., Gouyon, F., Rauber, A., Kaestner, C.A.A., Koerich, A.L.: On the suitability of state-of-the-art music information retrieval methods for analyzing, categorizing and accessing non-Western and ethnic music collections. Signal Process. 90(4), 1032–1048 (2010)CrossRefMATH
23.
Zurück zum Zitat Livshin, A., Rodet, X.: The significance of the non-harmonic “noise” versis the harmonic series for musical instrument recognition. In: Proceedings of the 7th International Conference on Music Information Retrieval (ISMIR), pp. 95–100 (2006) Livshin, A., Rodet, X.: The significance of the non-harmonic “noise” versis the harmonic series for musical instrument recognition. In: Proceedings of the 7th International Conference on Music Information Retrieval (ISMIR), pp. 95–100 (2006)
24.
Zurück zum Zitat McEnnis, D., McKay, C., Fujinaga, I.: jAudio: Additions and improvements. In: Proceedings of the 7th International Conference on Music Information Retrieval (ISMIR), pp. 385–386 (2006) McEnnis, D., McKay, C., Fujinaga, I.: jAudio: Additions and improvements. In: Proceedings of the 7th International Conference on Music Information Retrieval (ISMIR), pp. 385–386 (2006)
25.
Zurück zum Zitat Mierswa, I., Morik, K.: Automatic feature extraction for classifying audio data. Mach. Learn. J. 58(2–3), 127–149 (2005)CrossRefMATH Mierswa, I., Morik, K.: Automatic feature extraction for classifying audio data. Mach. Learn. J. 58(2–3), 127–149 (2005)CrossRefMATH
26.
Zurück zum Zitat Müller, M.: Information Retrieval for Music and Motion. Springer, Heidelberg (2007)CrossRef Müller, M.: Information Retrieval for Music and Motion. Springer, Heidelberg (2007)CrossRef
27.
Zurück zum Zitat Müller, M., Ewert, S.: Chroma toolbox: MATLAB implementations for extracting variants of chroma-based audio features. In: Proceedings of the 12th International Society for Music Information Retrieval Conference (ISMIR), pp. 215–220 (2011) Müller, M., Ewert, S.: Chroma toolbox: MATLAB implementations for extracting variants of chroma-based audio features. In: Proceedings of the 12th International Society for Music Information Retrieval Conference (ISMIR), pp. 215–220 (2011)
28.
Zurück zum Zitat Newton, M., Smith, L.: A neurally inspired musical instrument classification system based upon the sound onset. J. Acoust. Soc. Am. 131(6), 4785–4798 (2012)CrossRef Newton, M., Smith, L.: A neurally inspired musical instrument classification system based upon the sound onset. J. Acoust. Soc. Am. 131(6), 4785–4798 (2012)CrossRef
29.
Zurück zum Zitat Sandrock, T.: Multi-label feature selection with application to musical instrument recognition. Ph.D. thesis, Stellenbosch University (2013) Sandrock, T.: Multi-label feature selection with application to musical instrument recognition. Ph.D. thesis, Stellenbosch University (2013)
30.
Zurück zum Zitat Srinivasamurthy, A., Holzapfel, A., Serra, X.: In search of automatic rhythm analysis methods for Turkish and Indian art music. J. New Music Res. 43, 94–114 (2014)CrossRef Srinivasamurthy, A., Holzapfel, A., Serra, X.: In search of automatic rhythm analysis methods for Turkish and Indian art music. J. New Music Res. 43, 94–114 (2014)CrossRef
31.
Zurück zum Zitat Sturm, B.: Evaluating music emotion recognition: Lessons from music genre recognition? In: Proceedings of the IEEE International Conference on Multimedia and Expo (ICME), pp. 1–6 (2013) Sturm, B.: Evaluating music emotion recognition: Lessons from music genre recognition? In: Proceedings of the IEEE International Conference on Multimedia and Expo (ICME), pp. 1–6 (2013)
32.
Zurück zum Zitat Tzanetakis, G., Cook, P.: Musical genre classification of audio signals. IEEE Trans. Speech Audio Process. 10(5), 293–302 (2002)CrossRef Tzanetakis, G., Cook, P.: Musical genre classification of audio signals. IEEE Trans. Speech Audio Process. 10(5), 293–302 (2002)CrossRef
33.
Zurück zum Zitat Vatolkin, I., Preuß, M., Rudolph, G., Eichhoff, M., Weihs, C.: Multi-objective evolutionary feature selection for instrument recognition in polyphonic audio mixtures. Soft Comput. Fusion Found. Methodologies Appl. 16(12), 2027–2047 (2012) Vatolkin, I., Preuß, M., Rudolph, G., Eichhoff, M., Weihs, C.: Multi-objective evolutionary feature selection for instrument recognition in polyphonic audio mixtures. Soft Comput. Fusion Found. Methodologies Appl. 16(12), 2027–2047 (2012)
34.
Zurück zum Zitat Vatolkin, I., Rudolph, G., Weihs, C.: Evaluation of album effect for feature selection in music genre recognition. In: Proceedings of the 16th International Society for Music Information Retrieval Conference (ISMIR), pp. 169–175 (2015) Vatolkin, I., Rudolph, G., Weihs, C.: Evaluation of album effect for feature selection in music genre recognition. In: Proceedings of the 16th International Society for Music Information Retrieval Conference (ISMIR), pp. 169–175 (2015)
35.
Zurück zum Zitat Zitzler, E., Thiele, L.: Multiobjective optimization using evolutionary algorithms - A comparative case study. In: Proceedings of the 5th International Conference on Parallel Problem Solving from Nature (PPSN), pp. 292–304 (1998) Zitzler, E., Thiele, L.: Multiobjective optimization using evolutionary algorithms - A comparative case study. In: Proceedings of the 5th International Conference on Parallel Problem Solving from Nature (PPSN), pp. 292–304 (1998)
Metadaten
Titel
Generalisation Performance of Western Instrument Recognition Models in Polyphonic Mixtures with Ethnic Samples
verfasst von
Igor Vatolkin
Copyright-Jahr
2017
DOI
https://doi.org/10.1007/978-3-319-55750-2_21

Premium Partner