Skip to main content
Top
Published in: The Journal of Supercomputing 5/2016

01-05-2016

Simulated smart phone recordings for audio identification

Authors: Shingchern D. You, Yu-Chu Lin

Published in: The Journal of Supercomputing | Issue 5/2016

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

This paper studies the use of simulated recordings to perform audio identification experiments. In contrast to use actual recordings in the experiments, we use the measured room impulse response to generate simulated recordings. Doing so greatly reduces the burden of manually recording audio items for experiments. By comparing the correlations between actual and simulated recordings, we conclude that this approach is highly possible. The audio identification experiments are conducted based on the moving picture expert group audio signature descriptors to represent the simulated recordings. We also add environmental noises, provided by European Telecommunications Standards Institute, to the simulated recordings to study the performance degradation. Finally, we study if performing filtering in the descriptor domain can improve the accuracy. The experimental results show that filtering in the frequency direction yields higher accuracy for signal to noise ratio of 10 dB items.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Cho H, Choi M (2014) Personal mobile album/diary application development. J Converg 5(1):32–37 Cho H, Choi M (2014) Personal mobile album/diary application development. J Converg 5(1):32–37
2.
go back to reference Oh J-S, Park C-U, Lee S-B (2014) NFC-based mobile payment service adoption and diffusion. J Converg 5(2):8–14 Oh J-S, Park C-U, Lee S-B (2014) NFC-based mobile payment service adoption and diffusion. J Converg 5(2):8–14
3.
go back to reference Feese S, Burscher MJ, Jonas K, Troster G (2014) Sensing spatial and temporal coordination in teams using the smartphone. Hum-Centric Comput Inf Sci 4(15):1–18 Feese S, Burscher MJ, Jonas K, Troster G (2014) Sensing spatial and temporal coordination in teams using the smartphone. Hum-Centric Comput Inf Sci 4(15):1–18
6.
go back to reference Stan G-B, Embrechts J-J, Archambeau D (2002) Comparison of different impulse response measurement techniques. J Audio Eng Soc 50(4):249–262 Stan G-B, Embrechts J-J, Archambeau D (2002) Comparison of different impulse response measurement techniques. J Audio Eng Soc 50(4):249–262
7.
go back to reference ETSI, Speech and Multimedia Transmission Quality (STQ) (2012) Speech quality performance in the presence of background noise; part 1: background noise simulation technique and background noise database. ETSI ES202 396-1, pp 45–47 ETSI, Speech and Multimedia Transmission Quality (STQ) (2012) Speech quality performance in the presence of background noise; part 1: background noise simulation technique and background noise database. ETSI ES202 396-1, pp 45–47
9.
go back to reference Wang AL-C (2003) An industrial-strength audio search algorithm. In: Proc. of international conference on music information retrieval (ISMIR), Baltimore, pp 7–13 Wang AL-C (2003) An industrial-strength audio search algorithm. In: Proc. of international conference on music information retrieval (ISMIR), Baltimore, pp 7–13
10.
go back to reference ISO/IEC (2002) Information technology—multimedia content description interface-part 4: audio. IS 15938-4 ISO/IEC (2002) Information technology—multimedia content description interface-part 4: audio. IS 15938-4
11.
go back to reference Cano P, Battle E, Kalker T, Haitsma J (2005) A review of audio fingerprinting. J VLSI Signal Process 41(3):271–284 Cano P, Battle E, Kalker T, Haitsma J (2005) A review of audio fingerprinting. J VLSI Signal Process 41(3):271–284
12.
go back to reference Haitsma J, Kalker T (2002) A highly robust audio fingerprinting system. In: Proc. int’l. conf. on music information retrieval. IRCAM, France, pp 107–115 Haitsma J, Kalker T (2002) A highly robust audio fingerprinting system. In: Proc. int’l. conf. on music information retrieval. IRCAM, France, pp 107–115
13.
go back to reference Baluja S, Covell M (2007) Audio fingerprinting: combining computer vision and data stream processing. In: Proceedings of IEEE intl conf on acoustics, speech and signal processing. IEEE Press, Piscataway, pp II-213–II-216 Baluja S, Covell M (2007) Audio fingerprinting: combining computer vision and data stream processing. In: Proceedings of IEEE intl conf on acoustics, speech and signal processing. IEEE Press, Piscataway, pp II-213–II-216
14.
go back to reference Burges CJC, Platt JC, Jana S (2003) Distortion discriminant analysis for audio fingerprinting. IEEE Trans Speech Audio Process 11(3):165–174 Burges CJC, Platt JC, Jana S (2003) Distortion discriminant analysis for audio fingerprinting. IEEE Trans Speech Audio Process 11(3):165–174
15.
go back to reference Ramona M, Peeters G (2013) Audioprint: an efficient audio fingerprint system based on a novel cost-less synchronization scheme. In: Proceedings of the international conference on acoustics, speech and signal processing (ICASSP’13), pp 818–822 Ramona M, Peeters G (2013) Audioprint: an efficient audio fingerprint system based on a novel cost-less synchronization scheme. In: Proceedings of the international conference on acoustics, speech and signal processing (ICASSP’13), pp 818–822
16.
go back to reference You SD, Pu Y-H (2015) Using paired distances of signal peaks in stereo channels as fingerprints for copy identification. ACM Trans Multimed Comput Commun Appl 12(1):1–22, Art No 1 You SD, Pu Y-H (2015) Using paired distances of signal peaks in stereo channels as fingerprints for copy identification. ACM Trans Multimed Comput Commun Appl 12(1):1–22, Art No 1
18.
go back to reference You SD, Chen W-H (2015) Comparative study of methods for reducing dimensionality of MPEG-7 audio signature descriptors. Multimed Tools Appl 74(10):3579–3598CrossRef You SD, Chen W-H (2015) Comparative study of methods for reducing dimensionality of MPEG-7 audio signature descriptors. Multimed Tools Appl 74(10):3579–3598CrossRef
19.
go back to reference Park M, Kim H-R, Yang SH (2006) Frequency-temporal filtering for a robust audio fingerprinting scheme in real-noise environments. ETRI J 28(4):509–512 Park M, Kim H-R, Yang SH (2006) Frequency-temporal filtering for a robust audio fingerprinting scheme in real-noise environments. ETRI J 28(4):509–512
20.
go back to reference Storn R (1996) Echo cancellation techniques for multimedia applications: a survey. International Computer Science Institute-Publications-TR, Berkeley, USA Storn R (1996) Echo cancellation techniques for multimedia applications: a survey. International Computer Science Institute-Publications-TR, Berkeley, USA
Metadata
Title
Simulated smart phone recordings for audio identification
Authors
Shingchern D. You
Yu-Chu Lin
Publication date
01-05-2016
Publisher
Springer US
Published in
The Journal of Supercomputing / Issue 5/2016
Print ISSN: 0920-8542
Electronic ISSN: 1573-0484
DOI
https://doi.org/10.1007/s11227-015-1533-6

Other articles of this Issue 5/2016

The Journal of Supercomputing 5/2016 Go to the issue

Premium Partner