Published in: International Journal of Machine Learning and Cybernetics 3/2011

01-09-2011 | Original Article

Online music tracking with global alignment

Authors: Antonio Camarena-Ibarrola, Edgar Chávez


Abstract

State-of-the-art audio tracking relies on local alignment algorithms between the target audio and the musical performance. The working hypothesis of a local alignment tool is that the alignment is correct up to the current position; an error is therefore dragged forward and accumulates into subsequent errors, from which the tracker does not recover unless elaborate heuristics are used. Our approach uses a non-local alignment scheme: short audio segments taken from the musical performance are searched for in the entire target audio to obtain the k nearest audio segments (proximity is determined using an audio fingerprint based on entropy signatures). The current audio segment of the performance is then matched with the nearest (in time) among the k audio segments previously selected from the target audio. To our knowledge, this is the first algorithm able to start from an arbitrary point in the target audio, for example when the musical performance had already begun by the time the monitoring system was switched on. We complemented the global strategy with a simple heuristic: the candidates are ignored when they are all too far in time from the last position reported by the system. We tested our method with 62 musical pieces, mostly classical music along with some pop. For every piece we have two performances; we use one as the target audio and the other as the performance to be monitored. We obtained excellent results.
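The matching step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. Everything concrete in it is an assumption made for illustration: fingerprints are modelled as short binary lists compared by Hamming distance, "nearest in time" is read as nearest to the last reported position, and the names `track_step`, `k_nearest` and `max_jump` are hypothetical. The abstract does not say what the system reports when all candidates are ignored or how the first position is chosen; here the last position is kept in the first case and the closest fingerprint is taken in the second.

```python
from dataclasses import dataclass
from typing import List, Optional, Sequence, Tuple


@dataclass
class Candidate:
    time: float    # position of the segment in the target audio (seconds)
    distance: int  # fingerprint distance to the query segment


def hamming(a: Sequence[int], b: Sequence[int]) -> int:
    """Number of positions where two equal-length fingerprints differ.
    (Assumed distance; the abstract only says 'entropy signatures'.)"""
    return sum(x != y for x, y in zip(a, b))


def k_nearest(query: Sequence[int],
              target_index: List[Tuple[float, Sequence[int]]],
              k: int) -> List[Candidate]:
    """Search the ENTIRE target audio and keep the k closest segments."""
    scored = [Candidate(t, hamming(query, fp)) for t, fp in target_index]
    # sorted() is stable, so ties keep their order in target_index.
    return sorted(scored, key=lambda c: c.distance)[:k]


def track_step(query: Sequence[int],
               target_index: List[Tuple[float, Sequence[int]]],
               last_pos: Optional[float],
               k: int = 3,
               max_jump: float = 5.0) -> float:
    """Return the estimated position in the target audio for one segment."""
    candidates = k_nearest(query, target_index, k)
    if last_pos is None:
        # Cold start (assumption): no history, take the closest fingerprint.
        return candidates[0].time
    # Among the k candidates, prefer the one nearest in time to last_pos.
    best = min(candidates, key=lambda c: abs(c.time - last_pos))
    if abs(best.time - last_pos) > max_jump:
        # All candidates are too far away: ignore them (assumption: the
        # previously reported position is kept).
        return last_pos
    return best.time


# Toy target audio: (time, fingerprint) pairs with a repeated passage.
target_index = [
    (0.0, [0, 0, 0, 0]),
    (1.0, [1, 1, 0, 0]),
    (2.0, [1, 1, 1, 1]),
    (10.0, [1, 1, 0, 0]),
    (11.0, [0, 1, 0, 1]),
]
query = [1, 1, 0, 0]
early = track_step(query, target_index, last_pos=0.5, k=2)
late = track_step(query, target_index, last_pos=9.0, k=2)
```

In this toy index the query matches two identical passages, at 1.0 s and 10.0 s. The time-proximity rule resolves the ambiguity from the last reported position, and because every step searches the whole target, a wrong estimate at one step does not constrain the next one.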

Metadata
Title
Online music tracking with global alignment
Authors
Antonio Camarena-Ibarrola
Edgar Chávez
Publication date
01-09-2011
Publisher
Springer-Verlag
Published in
International Journal of Machine Learning and Cybernetics / Issue 3/2011
Print ISSN: 1868-8071
Electronic ISSN: 1868-808X
DOI
https://doi.org/10.1007/s13042-011-0025-0
