Skip to main content
Top
Published in: The Journal of Supercomputing 11/2020

01-02-2020

A score identification parallel system based on audio-to-score alignment

Authors: A. J. Muñoz-Montoro, R. Cortina, S. García-Galán, E. F. Combarro, J. Ranilla

Published in: The Journal of Supercomputing | Issue 11/2020

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

This paper presents a parallel system for searching a digital score of classical music in a personal library. The application scenario of the system is for a musician who wants to search for a specific score in its own device by playing an excerpt of a few seconds of the composition. We propose a solution, based on audio-to-score alignment, which allows to identify the correct score in a database of musical pieces in real time. This is a challenging task because we focus on a real-time system targeted for handheld devices characterized by both mobility and low power consumption. Experimental results show that it is possible to achieve real-time execution in the tested scenarios using parallel computing techniques with ARM processors.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literature
3.
go back to reference Arzt A (2016) Flexible and robust music tracking. Ph.D. thesis, Johannes Kepler University Linz Arzt A (2016) Flexible and robust music tracking. Ph.D. thesis, Johannes Kepler University Linz
5.
go back to reference Carabias-Orti J.J, Rodriguez-Serrano F, Vera-Candeas P, Ruiz-Reyes N, Canadas-Quesada F.J (2015) An audio to score alignment framework using spectral factorization and dynamic time warping. In: ISMIR: Proceedings of the International Conference of Music Information Retrieval, pp 742–748 Carabias-Orti J.J, Rodriguez-Serrano F, Vera-Candeas P, Ruiz-Reyes N, Canadas-Quesada F.J (2015) An audio to score alignment framework using spectral factorization and dynamic time warping. In: ISMIR: Proceedings of the International Conference of Music Information Retrieval, pp 742–748
7.
go back to reference Cont A (2006) Realtime audio to score alignment for polyphonic music instruments, using sparse non-negative constraints and hierarchical HMMS. In: Proceedings of 2006 IEEE International Conference on Acoustics Speed and Signal Processing, vol 5. IEEE, pp V–245–V–248. https://doi.org/10.1109/ICASSP.2006.1661258 Cont A (2006) Realtime audio to score alignment for polyphonic music instruments, using sparse non-negative constraints and hierarchical HMMS. In: Proceedings of 2006 IEEE International Conference on Acoustics Speed and Signal Processing, vol 5. IEEE, pp V–245–V–248. https://​doi.​org/​10.​1109/​ICASSP.​2006.​1661258
8.
go back to reference Dixon S (2005) Live tracking of musical performances using on-line time warping. In: DAFx, pp. 1727–1728 Dixon S (2005) Live tracking of musical performances using on-line time warping. In: DAFx, pp. 1727–1728
10.
go back to reference Févotte C, Idier J (2011) Algorithms for nonnegative matrix factorization with the \(\beta\)-divergence. Neural Comput 23(9):2421–2456MathSciNetCrossRef Févotte C, Idier J (2011) Algorithms for nonnegative matrix factorization with the \(\beta\)-divergence. Neural Comput 23(9):2421–2456MathSciNetCrossRef
12.
15.
go back to reference Orio N, Schwarz D (2001) Alignment of monophonic and polyphonic music to a score. In: Proceedings of the International Computer Music Conference, pp 155–158 Orio N, Schwarz D (2001) Alignment of monophonic and polyphonic music to a score. In: Proceedings of the International Computer Music Conference, pp 155–158
16.
go back to reference Otsuka T, Takahashi T, Okuno H.G, Komatani K, Ogata T, Murata K, Nakadai K (2009) Incremental polyphonic audio to score alignment using beat tracking for singer robots. In: 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems. IEEE, pp 2289–2296. https://doi.org/10.1109/IROS.2009.5354637 Otsuka T, Takahashi T, Okuno H.G, Komatani K, Ogata T, Murata K, Nakadai K (2009) Incremental polyphonic audio to score alignment using beat tracking for singer robots. In: 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems. IEEE, pp 2289–2296. https://​doi.​org/​10.​1109/​IROS.​2009.​5354637
17.
go back to reference Raffel C, Ellis DPW (2015) Large-scale content-based matching of midi and audio files. In: Proceedings of the International Society for Music Information Retrieval Conference Raffel C, Ellis DPW (2015) Large-scale content-based matching of midi and audio files. In: Proceedings of the International Society for Music Information Retrieval Conference
18.
go back to reference Raffel C, Ellis DPW (2016) Extracting ground truth information from MIDI files: A Midifesto. In: Proceedings of 17th International Society for Music Information Retrieval Conference, pp 796–802 Raffel C, Ellis DPW (2016) Extracting ground truth information from MIDI files: A Midifesto. In: Proceedings of 17th International Society for Music Information Retrieval Conference, pp 796–802
19.
go back to reference Raphael C (2010) Music plus one and machine learning. In: Proceedings of the 27th International Conference on Machine Learning, pp 21–28 Raphael C (2010) Music plus one and machine learning. In: Proceedings of the 27th International Conference on Machine Learning, pp 21–28
20.
go back to reference Rodríguez-Serrano FJ, Carabias-Orti JJ, Vera-Candeas P, Canadas-Quesada FJ, Ruiz-Reyes N (2014) Monophonic constrained non-negative sparse coding using instrument models for audio separation and transcription of monophonic source-based polyphonic mixtures. Multimed Tools Appl 72(1):925–949. https://doi.org/10.1007/s11042-013-1398-8CrossRef Rodríguez-Serrano FJ, Carabias-Orti JJ, Vera-Candeas P, Canadas-Quesada FJ, Ruiz-Reyes N (2014) Monophonic constrained non-negative sparse coding using instrument models for audio separation and transcription of monophonic source-based polyphonic mixtures. Multimed Tools Appl 72(1):925–949. https://​doi.​org/​10.​1007/​s11042-013-1398-8CrossRef
22.
go back to reference Thickstun J, Harchaoui Z, Kakade S (2017) Learning features of music from scratch. In: ICLR, pp 1–14 Thickstun J, Harchaoui Z, Kakade S (2017) Learning features of music from scratch. In: ICLR, pp 1–14
23.
go back to reference Turetsky R, Ellis D (2003) Ground-truth transcriptions of real music from force-aligned MIDI syntheses. In: Proceedings of the 4th international symposium on music information retrieval, pp 135–141. https://doi.org/10.7916/D8S472CZ Turetsky R, Ellis D (2003) Ground-truth transcriptions of real music from force-aligned MIDI syntheses. In: Proceedings of the 4th international symposium on music information retrieval, pp 135–141. https://​doi.​org/​10.​7916/​D8S472CZ
24.
Metadata
Title
A score identification parallel system based on audio-to-score alignment
Authors
A. J. Muñoz-Montoro
R. Cortina
S. García-Galán
E. F. Combarro
J. Ranilla
Publication date
01-02-2020
Publisher
Springer US
Published in
The Journal of Supercomputing / Issue 11/2020
Print ISSN: 0920-8542
Electronic ISSN: 1573-0484
DOI
https://doi.org/10.1007/s11227-020-03185-2

Other articles of this Issue 11/2020

The Journal of Supercomputing 11/2020 Go to the issue

Premium Partner