2013 | OriginalPaper | Book chapter

Better Than MFCC Audio Classification Features

Written by: Ruben Gonzalez

Published in: The Era of Interactive Media

Publisher: Springer New York


Abstract

Mel-Frequency Cepstral Coefficients (MFCCs) are generally the features of choice for both audio classification and content-based retrieval because of their proven performance. This paper presents alternative feature sets that not only consistently outperform MFCC features but are also simpler to calculate.
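To make the "simpler to calculate" comparison concrete, the following is a minimal sketch of the usual textbook MFCC pipeline for a single frame: window, power spectrum, triangular mel filterbank, log, and DCT-II. It is not taken from the paper and does not show the paper's alternative features, which the abstract does not describe. All function names, the 2595·log10 mel formula, the filter count (20), the coefficient count (13), and the test signal are assumptions chosen for illustration.

```python
import cmath
import math


def hz_to_mel(hz):
    """Convert Hz to mel (one common formula; assumed here)."""
    return 2595.0 * math.log10(1.0 + hz / 700.0)


def mel_to_hz(mel):
    """Inverse of hz_to_mel."""
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


def power_spectrum(frame):
    """Hamming-window the frame, then return |DFT|^2 for bins 0..N/2.

    Uses a naive O(N^2) DFT to stay dependency-free; a real system
    would use an FFT.
    """
    n = len(frame)
    windowed = [s * (0.54 - 0.46 * math.cos(2.0 * math.pi * i / (n - 1)))
                for i, s in enumerate(frame)]
    spectrum = []
    for k in range(n // 2 + 1):
        acc = sum(windowed[t] * cmath.exp(-2j * math.pi * k * t / n)
                  for t in range(n))
        spectrum.append(abs(acc) ** 2)
    return spectrum


def mel_filterbank(num_filters, n_fft, sample_rate):
    """Triangular filters whose edges are equally spaced on the mel scale."""
    top = hz_to_mel(sample_rate / 2.0)
    edges_hz = [mel_to_hz(top * i / (num_filters + 1))
                for i in range(num_filters + 2)]
    bins = [int(math.floor((n_fft + 1) * f / sample_rate)) for f in edges_hz]
    bank = []
    for m in range(1, num_filters + 1):
        lo, mid, hi = bins[m - 1], bins[m], bins[m + 1]
        row = [0.0] * (n_fft // 2 + 1)
        for k in range(lo, mid):      # rising slope
            row[k] = (k - lo) / (mid - lo)
        for k in range(mid, hi):      # falling slope
            row[k] = (hi - k) / (hi - mid)
        bank.append(row)
    return bank


def mfcc_frame(frame, sample_rate, num_filters=20, num_coeffs=13):
    """Return the first num_coeffs MFCCs of one frame (unnormalised DCT-II)."""
    power = power_spectrum(frame)
    bank = mel_filterbank(num_filters, len(frame), sample_rate)
    # Floor the filter energies so the logarithm is always defined.
    log_energies = [
        math.log(max(sum(w * p for w, p in zip(row, power)), 1e-10))
        for row in bank
    ]
    return [
        sum(e * math.cos(math.pi * c * (m + 0.5) / num_filters)
            for m, e in enumerate(log_energies))
        for c in range(num_coeffs)
    ]


# Usage: one 256-sample frame of a 440 Hz sine sampled at 8 kHz.
sample_rate = 8000
frame = [math.sin(2.0 * math.pi * 440.0 * t / sample_rate) for t in range(256)]
coeffs = mfcc_frame(frame, sample_rate)
print(len(coeffs))  # 13
```

Even in this reduced form, each frame needs a spectral transform, a filterbank projection, a logarithm, and a second transform, which is the cost any simpler feature set would be measured against.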


DOI: https://doi.org/10.1007/978-1-4614-3501-3_24