2015 | OriginalPaper | Chapter

6. Application of SHAZAM-Based Audio Fingerprinting for Multilingual Indian Song Retrieval

Authors: S. Sri Ranjani, V. Abdulkareem, K. Karthik, P. K. Bora

Published in: Advances in Communication and Computing

Publisher: Springer India

Abstract

Extracting film songs from a multilingual database based on a query clip is a challenging task. The challenge stems from subtle variations in pitch and rhythm that accompany changes in the singer’s voice, style, orchestration, language, and even gender. The fingerprinting algorithm must therefore capture the base tune of the composition rather than its adaptations (variations that include lyrical modifications and changes in the singer’s voice). The SHAZAM system was developed for capturing cover audio pieces from millions of Western songs stored in a database, with the objective of tapping into the melodic construct of the song (devoid of other forms of embellishment). When applied to the Indian database, the system was found to be less effective because of subtle changes in both rhythm and melody, mainly attributable to the semiclassical nature of Indian film songs. The retrieval accuracy was found to be 85%. Potential reasons for the failure of the SHAZAM system are discussed with examples.
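
The abstract does not spell out the fingerprinting procedure. As a rough illustration only, the sketch below shows the landmark-style matching usually associated with SHAZAM-type systems: pairs of spectral peaks are hashed as (frequency, frequency, time gap), and a query is matched by counting hashes that agree on a single time offset. It is not the authors' implementation. All function names, the `fan_out` and `max_dt` parameters, and the toy peak lists are assumptions made for this example, and the spectral peaks are assumed to have been extracted already.

```python
from collections import Counter, defaultdict


def landmark_hashes(peaks, fan_out=3, max_dt=50):
    """Pair each peak with up to `fan_out` later peaks.

    peaks: list of (time_frame, freq_bin) tuples (assumed precomputed).
    Returns a list of ((f1, f2, dt), t1) entries.
    """
    peaks = sorted(peaks)
    hashes = []
    for i, (t1, f1) in enumerate(peaks):
        paired = 0
        for t2, f2 in peaks[i + 1:]:
            dt = t2 - t1
            if dt > max_dt:
                break  # later peaks are even further away in time
            if dt <= 0:
                continue  # skip peaks in the same frame
            hashes.append(((f1, f2, dt), t1))
            paired += 1
            if paired == fan_out:
                break
    return hashes


def build_index(songs):
    """Map each hash key to the (song_id, time) pairs where it occurs."""
    index = defaultdict(list)
    for song_id, peaks in songs.items():
        for key, t in landmark_hashes(peaks):
            index[key].append((song_id, t))
    return index


def best_match(index, query_peaks):
    """Vote on (song, time offset); return the winning song and its vote count."""
    votes = Counter()
    for key, t_query in landmark_hashes(query_peaks):
        for song_id, t_song in index.get(key, ()):
            votes[(song_id, t_song - t_query)] += 1
    if not votes:
        return None, 0
    (song_id, _offset), score = votes.most_common(1)[0]
    return song_id, score


# Toy data: two "songs" described only by their spectral peaks.
songs = {
    "A": [(0, 10), (5, 20), (9, 15), (14, 30), (20, 12)],
    "B": [(0, 40), (4, 22), (8, 35), (13, 18), (19, 44)],
}
index = build_index(songs)

# An exact excerpt of song A, starting 5 frames in.
excerpt = [(0, 20), (4, 15), (9, 30), (15, 12)]
# The same excerpt with every peak shifted up by 2 frequency bins.
transposed = [(t, f + 2) for t, f in excerpt]

print(best_match(index, excerpt))
print(best_match(index, transposed))
```

In this toy setup the exact excerpt is matched to song A, while the transposed version produces no matching hashes at all, because every hash key depends on exact frequency bins and exact time gaps. This is consistent with the abstract's observation that subtle changes in melody and rhythm reduce the system's effectiveness, although the real causes examined in the chapter may differ.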

Metadata
Title
Application of SHAZAM-Based Audio Fingerprinting for Multilingual Indian Song Retrieval
Authors
S. Sri Ranjani
V. Abdulkareem
K. Karthik
P. K. Bora
Copyright Year
2015
Publisher
Springer India
DOI
https://doi.org/10.1007/978-81-322-2464-8_6