Published in: The Journal of Supercomputing, Issue 1/2021

22 April 2020

Parallel multichannel music source separation system

Authors: A. J. Muñoz-Montoro, D. Suarez-Dou, J. J. Carabias-Orti, F. J. Canadas-Quesada, J. Ranilla

Abstract

This paper presents a parallel low-latency multichannel source separation system designed to recover the original signals of the instruments that make up a multichannel music recording. Our approach is suited to applications built on interactive live broadcasts of classical music, where online users can tolerate a latency of a few seconds. The results show that real-time operation is achievable in the tested scenarios when a low latency is accepted and multi-core architectures are combined with parallel and high-performance techniques.

Metadata
Publisher
Springer US
Print ISSN: 0920-8542
Electronic ISSN: 1573-0484
DOI
https://doi.org/10.1007/s11227-020-03282-2
