Published in: International Journal of Multimedia Information Retrieval 1/2015

1 March 2015 | Regular Paper

A novel framework for CBCD using integrated color and acoustic features

Author: R. Roopalakshmi

Abstract

Most studies in content-based video copy detection (CBCD) concentrate on visual signatures, while very few attempt to exploit audio features. Audio data, when present, is an essential component of a video; hence, integrating visual and acoustic fingerprints significantly improves copy detection performance. On this basis, we propose a new framework that jointly employs color-based visual features and audio fingerprints to detect duplicate videos. The proposed framework comprises three stages: first, a novel visual fingerprint based on spatio-temporal dominant color features is generated; second, mel-frequency cepstral coefficients are extracted and compactly represented as acoustic signatures; third, the resulting multimodal signatures are jointly used for the CBCD task by employing combination rule and weighting strategies. Experimental results on the TRECVID 2008 and 2009 datasets demonstrate the improved efficiency of the proposed framework over the reference methods against a wide range of video transformations.
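The third stage, in which visual and acoustic signatures are combined through weighting, can be illustrated with a minimal sketch. The abstract does not specify the combination rule or the weights, so the weighted sum below is only one common choice and is an assumption, as are the names `fuse_scores`, `rank_candidates`, `w_visual`, and `threshold`, and the premise that each modality already yields a similarity score in [0, 1]. Because the abstract notes that audio may be absent, the sketch falls back to the visual score alone when a video has no audio score.

```python
from typing import Dict, List, Tuple


def fuse_scores(visual: float, audio: float, w_visual: float = 0.5) -> float:
    """Weighted-sum fusion of two similarity scores in [0, 1] (assumed rule)."""
    if not 0.0 <= w_visual <= 1.0:
        raise ValueError("w_visual must lie in [0, 1]")
    return w_visual * visual + (1.0 - w_visual) * audio


def rank_candidates(
    visual_scores: Dict[str, float],
    audio_scores: Dict[str, float],
    w_visual: float = 0.5,
    threshold: float = 0.7,
) -> List[Tuple[str, float]]:
    """Return (video_id, fused_score) pairs at or above threshold, best first."""
    fused = []
    for video_id, v in visual_scores.items():
        # Hypothetical fallback: with no audio score, use the visual score alone.
        a = audio_scores.get(video_id)
        score = v if a is None else fuse_scores(v, a, w_visual)
        if score >= threshold:
            fused.append((video_id, score))
    # Highest fused similarity first.
    return sorted(fused, key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    visual = {"a": 0.9, "b": 0.4, "c": 0.8}
    audio = {"a": 0.8, "b": 0.9}  # video "c" has no audio track
    for video_id, score in rank_candidates(visual, audio):
        print(video_id, round(score, 3))
```

In this toy run, video "b" is rejected because its weak visual match pulls the fused score below the threshold, while "c" is judged on its visual score only. Other combination rules (product, max, or learned weights) would slot into `fuse_scores` without changing the ranking step.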

Metadata
Title
A novel framework for CBCD using integrated color and acoustic features
Author
R. Roopalakshmi
Publication date
1 March 2015
Publisher
Springer London
Published in
International Journal of Multimedia Information Retrieval / Issue 1/2015
Print ISSN: 2192-6611
Electronic ISSN: 2192-662X
DOI
https://doi.org/10.1007/s13735-014-0062-z
