Skip to main content
Top

2018 | OriginalPaper | Chapter

45. Enabling Interactive and Interoperable Semantic Music Applications

Authors : Jesús Corral García, Panos Kudumakis, Isabel Barbancho, Lorenzo J. Tardón, Mark Sandler

Published in: Springer Handbook of Systematic Musicology

Publisher: Springer Berlin Heidelberg

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

New interactive music services have emerged, but many of them use proprietary file formats. In order to enable interoperability among these services, the International Organization for Standardization (ISO)/International Electrotechnical Commission (IEC) Moving Picture Experts Group (MPEG) issued a new standard, the so-called MPEG-A: Interactive Music Application Format (IM AF).
The purpose of this chapter is to review the IM AF standard and its features, and also to provide a detailed description of the design and implementation of an IM AF codec and its integration into a popular open source analysis, annotation and visualization audio tool known as Sonic Visualiser. This is followed by a discussion highlighting the benefits of their combined features, such as automatic chords or melody extraction time-aligned with the song's lyrics. Furthermore, this integration provides the semantic music research community with a testbed enabling further development and comparison of new Sonic Visualiser plug-ins, e. g., from singing voice-to-text conversion with automatic lyrics highlighting for karaoke applications, to source separation-based music instrument extraction from a mixed song.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literature
45.1
go back to reference P. Kudumakis: MP3: Something’s gotta change!, Audio! 1(3), 6 (2011) P. Kudumakis: MP3: Something’s gotta change!, Audio! 1(3), 6 (2011)
45.2
go back to reference I. Jang, P. Kudumakis, M. Sandler, K. Kang: The MPEG interactive music application format standard, IEEE Sig. Process. Mag. 28(1), 150–154 (2011)CrossRef I. Jang, P. Kudumakis, M. Sandler, K. Kang: The MPEG interactive music application format standard, IEEE Sig. Process. Mag. 28(1), 150–154 (2011)CrossRef
45.6
go back to reference ISO/IEC 23000-12:2010 – Information technology – Multimedia application format (MPEG-A) – Part 12: Interactive music application format ISO/IEC 23000-12:2010 – Information technology – Multimedia application format (MPEG-A) – Part 12: Interactive music application format
45.7
go back to reference ISO/IEC 23000-12:2010/Amd.2:2012 – Information technology – Multimedia application format (MPEG-A) – Part 12: Interactive music application format, AMENDMENT 2: Compact representation of dynamic volume change and audio equalization ISO/IEC 23000-12:2010/Amd.2:2012 – Information technology – Multimedia application format (MPEG-A) – Part 12: Interactive music application format, AMENDMENT 2: Compact representation of dynamic volume change and audio equalization
45.8
go back to reference J.C. Garcia, C. Taglialatela, P. Kudumakis, L.J. Tardon, I. Barbancho, M. Sandler: Interactive music applications by MPEG-A support in Sonic Visualiser. In: AES 53rd Int. Conf. Semant. Audio, London (2014) J.C. Garcia, C. Taglialatela, P. Kudumakis, L.J. Tardon, I. Barbancho, M. Sandler: Interactive music applications by MPEG-A support in Sonic Visualiser. In: AES 53rd Int. Conf. Semant. Audio, London (2014)
45.9
go back to reference C. Cannam, C. Landone, M. Sandler: Sonic Visualiser: An open source application for viewing, analysing, and annotating music audio files. In: Proc. ACM Multimedia Int. Conf. (2010) C. Cannam, C. Landone, M. Sandler: Sonic Visualiser: An open source application for viewing, analysing, and annotating music audio files. In: Proc. ACM Multimedia Int. Conf. (2010)
45.10
go back to reference M. Mauch, S. Dixon: Approximate note transcription for the improved identification of difficult chords. In: Proc. Int. Symp. Music Inf. Retriev. (2010) pp. 135–140 M. Mauch, S. Dixon: Approximate note transcription for the improved identification of difficult chords. In: Proc. Int. Symp. Music Inf. Retriev. (2010) pp. 135–140
45.11
go back to reference J. Salamon, E. Gómez: Melody extraction from polyphonic music signals using pitch contour characteristics, IEEE Trans. Audio Speech Lang. Proc. 20(6), 1759–1770 (2012)CrossRef J. Salamon, E. Gómez: Melody extraction from polyphonic music signals using pitch contour characteristics, IEEE Trans. Audio Speech Lang. Proc. 20(6), 1759–1770 (2012)CrossRef
45.12
go back to reference ISO/IEC 23003-2:2010 – Information technology – MPEG audio technologies – Part 2: Spatial Audio Object Coding (SAOC) ISO/IEC 23003-2:2010 – Information technology – MPEG audio technologies – Part 2: Spatial Audio Object Coding (SAOC)
45.13
go back to reference ISO/IEC 10918-1:1994 – Information technology – Digital compression and coding of continuous-tone still images (JPEG) ISO/IEC 10918-1:1994 – Information technology – Digital compression and coding of continuous-tone still images (JPEG)
45.14
go back to reference ETS 3GPP TS 26.245-2004 – Transparent end-to-end Packet switched Streaming Service (PSS); Timed text format ETS 3GPP TS 26.245-2004 – Transparent end-to-end Packet switched Streaming Service (PSS); Timed text format
45.15
go back to reference ISO/IEC 15938-5:2003 – Information technology – Multimedia content description interface – Part 5: Multimedia description schemes ISO/IEC 15938-5:2003 – Information technology – Multimedia content description interface – Part 5: Multimedia description schemes
45.16
go back to reference ISO/IEC 14496-12:2008 – Information technology – Coding of audio-visual objects – Part 12: ISO base media file format ISO/IEC 14496-12:2008 – Information technology – Coding of audio-visual objects – Part 12: ISO base media file format
45.17
go back to reference C. Taglialatela: MPEG IM AF encoder: Features development, BSc Thesis (Seconda Università degli Studi di Napoli, Napoli 2013) C. Taglialatela: MPEG IM AF encoder: Features development, BSc Thesis (Seconda Università degli Studi di Napoli, Napoli 2013)
45.19
go back to reference T. Hosoya, M. Suzuki, A. Ito, S. Makino: Lyrics recognition from a singing voice based on finite state automation for music information retrieval. In: Proc. Int. Symp. Music Inf. Retriev. (2005) pp. 532–535 T. Hosoya, M. Suzuki, A. Ito, S. Makino: Lyrics recognition from a singing voice based on finite state automation for music information retrieval. In: Proc. Int. Symp. Music Inf. Retriev. (2005) pp. 532–535
45.21
go back to reference G. Herrero, P. Kudumakis, L.J. Tardon, I. Barbancho, M. Sandler: An HTML5 interactive (MPEG-A IM AF) music player. In: 10th Int. Symp. Comput. Music Multidiscip. Res. (CMMR), Marseille (2013) G. Herrero, P. Kudumakis, L.J. Tardon, I. Barbancho, M. Sandler: An HTML5 interactive (MPEG-A IM AF) music player. In: 10th Int. Symp. Comput. Music Multidiscip. Res. (CMMR), Marseille (2013)
Metadata
Title
Enabling Interactive and Interoperable Semantic Music Applications
Authors
Jesús Corral García
Panos Kudumakis
Isabel Barbancho
Lorenzo J. Tardón
Mark Sandler
Copyright Year
2018
Publisher
Springer Berlin Heidelberg
DOI
https://doi.org/10.1007/978-3-662-55004-5_45

Premium Partners