Skip to main content

2019 | OriginalPaper | Buchkapitel

Music Summary Detection with State Space Embedding and Recurrence Plot

verfasst von : Yongwei Gao, Yichun Shen, Xulong Zhang, Shuai Yu, Wei Li

Erschienen in: Proceedings of the 6th Conference on Sound and Music Technology (CSMT)

Verlag: Springer Singapore

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Automatic music summary detection is a task that identifies the most representative part of a song, facilitating users to retrieve the desired songs. In this paper, we propose a novel method based on state space embedding and recurrence plot. Firstly, an extended audio feature with state space embedding is extracted to construct a similarity matrix. Compared with the raw audio features, this extended feature is more robust against noise. Then recurrence plot based on global strategy is adopted to detect similar segment pairs within a song. Finally, we proposed to extract the most repeated part as a summary by selecting and merging the stripes containing the lowest distance in the similarity matrix under the constraints of slope and duration. Experimental results show that the performance of the proposed algorithm is more powerful than the other two competitive baseline methods.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Gao S, Li H (2015) Popular song summarization using chorus section detection from audio signal. In: Proceedings of the 17th international workshop on multimedia signal processing (MMSP), pp 1–6. IEEE, Xiamen, China Gao S, Li H (2015) Popular song summarization using chorus section detection from audio signal. In: Proceedings of the 17th international workshop on multimedia signal processing (MMSP), pp 1–6. IEEE, Xiamen, China
2.
Zurück zum Zitat Maddage NC, Xu C, Kankanhalli MS et al (2004) Content-based music structure analysis with applications to music semantics understanding. In: Proceedings of the 12th ACM international conference on multimedia (MM), pp 112–119. ACM, New York, USA Maddage NC, Xu C, Kankanhalli MS et al (2004) Content-based music structure analysis with applications to music semantics understanding. In: Proceedings of the 12th ACM international conference on multimedia (MM), pp 112–119. ACM, New York, USA
3.
Zurück zum Zitat Matthew C, Jonathan F (2002) Automatic music summarization via similarity analysis. In: Proceedings of the 3rd international society for music information retrieval (ISMIR), pp 122–127. Paris, France Matthew C, Jonathan F (2002) Automatic music summarization via similarity analysis. In: Proceedings of the 3rd international society for music information retrieval (ISMIR), pp 122–127. Paris, France
4.
Zurück zum Zitat Bartsch MA, Wakefield GH (2005) Audio thumbnailing of popular music using chroma-based representations. IEEE Trans Multimedia (MM) 7(1):96–104CrossRef Bartsch MA, Wakefield GH (2005) Audio thumbnailing of popular music using chroma-based representations. IEEE Trans Multimedia (MM) 7(1):96–104CrossRef
5.
Zurück zum Zitat Lu L, Zhang HJ (2003) Automated extraction of music snippets. In: Proceedings of the 11th ACM international conference on multimedia (MM), pp 140–147. ACM, CA, USA Lu L, Zhang HJ (2003) Automated extraction of music snippets. In: Proceedings of the 11th ACM international conference on multimedia (MM), pp 140–147. ACM, CA, USA
6.
Zurück zum Zitat Chai W (2006) Semantic segmentation and summarization of music: methods based on tonality and recurrent structure. IEEE Signal Process Mag 23(2):124–132MathSciNetCrossRef Chai W (2006) Semantic segmentation and summarization of music: methods based on tonality and recurrent structure. IEEE Signal Process Mag 23(2):124–132MathSciNetCrossRef
7.
Zurück zum Zitat Nieto O, Humphrey EJ, Bello JP (2012) Compressing music recordings into audio summaries. In: Proceedings of 13th international society for music information retrieval (ISMIR), pp 313–318, Porto, Portugal (2012) Nieto O, Humphrey EJ, Bello JP (2012) Compressing music recordings into audio summaries. In: Proceedings of 13th international society for music information retrieval (ISMIR), pp 313–318, Porto, Portugal (2012)
8.
Zurück zum Zitat Xu C, Maddage MC, Shao X (2005) Automatic music classification and summarization. IEEE Trans Speech Audio Process (TASLP) 13(3):441–450CrossRef Xu C, Maddage MC, Shao X (2005) Automatic music classification and summarization. IEEE Trans Speech Audio Process (TASLP) 13(3):441–450CrossRef
9.
Zurück zum Zitat Xu C, Zhu Y, Tian Q (20025) Automatic music summarization based on temporal, spectral and cepstral features. In: Proceedings of international conference on multimedia and expo, pp 117–120, Lausanne, Switzerland Xu C, Zhu Y, Tian Q (20025) Automatic music summarization based on temporal, spectral and cepstral features. In: Proceedings of international conference on multimedia and expo, pp 117–120, Lausanne, Switzerland
10.
Zurück zum Zitat Zlatintsi A, Maragos P, Potamianos A (2012) A saliency-based approach to audio event detection and summarization. In: Proceedings of the 20th European signal processing conference (EUSIPCO), pp 1294–1298, Bucharest, Romania Zlatintsi A, Maragos P, Potamianos A (2012) A saliency-based approach to audio event detection and summarization. In: Proceedings of the 20th European signal processing conference (EUSIPCO), pp 1294–1298, Bucharest, Romania
11.
Zurück zum Zitat Logan B, Chu S (2000) Music summarization using key phrases. In: Proceedings of the IEEE international conference on acoustics, speech, and signal processing (ICASSP), pp 749–752. Istanbul, Turkey Logan B, Chu S (2000) Music summarization using key phrases. In: Proceedings of the IEEE international conference on acoustics, speech, and signal processing (ICASSP), pp 749–752. Istanbul, Turkey
12.
Zurück zum Zitat Müller M, Ewert S (2010) Towards timbre-invariant audio features for harmony-based music. IEEE Trans Audio Speech Lang Process (TASLP) 18(3):649–662CrossRef Müller M, Ewert S (2010) Towards timbre-invariant audio features for harmony-based music. IEEE Trans Audio Speech Lang Process (TASLP) 18(3):649–662CrossRef
13.
Zurück zum Zitat Müller M, Ewert S (2011) Chroma Toolbox: MATLAB implementations for extracting variants of chroma-based audio features. In: Proceedings of the 12th international conference on music information retrieval (ISMIR), pp 215–220, Miami, Florida Müller M, Ewert S (2011) Chroma Toolbox: MATLAB implementations for extracting variants of chroma-based audio features. In: Proceedings of the 12th international conference on music information retrieval (ISMIR), pp 215–220, Miami, Florida
14.
Zurück zum Zitat Kantz H, Schreiber T (2004) Nonlinear time series analysis. Cambridge University Press, Cambridge, United Kingdom Kantz H, Schreiber T (2004) Nonlinear time series analysis. Cambridge University Press, Cambridge, United Kingdom
15.
Zurück zum Zitat Bello JP (2011) Measuring structural similarity in music. IEEE Trans Audio Speech Lang Process (TASLP) 19(7):2013–2025CrossRef Bello JP (2011) Measuring structural similarity in music. IEEE Trans Audio Speech Lang Process (TASLP) 19(7):2013–2025CrossRef
16.
Zurück zum Zitat Serrà J, Serra X, Andrzejak RG (2009) Cross recurrence quantification for cover song identification. New J Phys 11(9):093017CrossRef Serrà J, Serra X, Andrzejak RG (2009) Cross recurrence quantification for cover song identification. New J Phys 11(9):093017CrossRef
17.
Zurück zum Zitat Cho T, Bello JP (2011) A feature smoothing method for chord recognition using recurrence plots. In: Proceedings of the 12th international society for music information retrieval (ISMIR), pp 651–656, Miami, Florida Cho T, Bello JP (2011) A feature smoothing method for chord recognition using recurrence plots. In: Proceedings of the 12th international society for music information retrieval (ISMIR), pp 651–656, Miami, Florida
18.
Zurück zum Zitat Bertin-Mahieux T, Ellis DPW (2011) Large-scale cover song recognition using hashed chroma landmarks. In: Proceedings of IEEE workshop on applications of signal processing to audio and acoustics (WASPAA), pp 117–120, New York, USA Bertin-Mahieux T, Ellis DPW (2011) Large-scale cover song recognition using hashed chroma landmarks. In: Proceedings of IEEE workshop on applications of signal processing to audio and acoustics (WASPAA), pp 117–120, New York, USA
19.
Zurück zum Zitat Egorov A, Linetsky G (2008) Cover song identification with IF-F0 pitch class profiles. MIREX extended abstract Egorov A, Linetsky G (2008) Cover song identification with IF-F0 pitch class profiles. MIREX extended abstract
20.
Zurück zum Zitat Matthew C, Jonathan F (2003) Summarizing popular music via structural similarity analysis. In: Proceedings of IEEE workshop on applications of signal processing to audio and acoustics (WASPAA), pp 1159–1170, New York, USA (2003) Matthew C, Jonathan F (2003) Summarizing popular music via structural similarity analysis. In: Proceedings of IEEE workshop on applications of signal processing to audio and acoustics (WASPAA), pp 1159–1170, New York, USA (2003)
Metadaten
Titel
Music Summary Detection with State Space Embedding and Recurrence Plot
verfasst von
Yongwei Gao
Yichun Shen
Xulong Zhang
Shuai Yu
Wei Li
Copyright-Jahr
2019
Verlag
Springer Singapore
DOI
https://doi.org/10.1007/978-981-13-8707-4_4

Neuer Inhalt