nach oben

Acoustical Physics

Erschienen in:

01.08.2023 | ACOUSTIC SIGNALS PROCESSING. COMPUTER SIMULATION

Distant Speech Detection

verfasst von: V. N. Sorokin

Erschienen in: Acoustical Physics | Ausgabe 4/2023

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config

KI-gestützte Suche

Aus

Abstract

The article studies the amplitude and phase responses of speech signals recorded at different distances from the speaker by various types of microphones in free space and in a closed room. The ratios of the average energy of the amplitude spectrum in different frequency ranges and the average slope of the linear phase component show differences for a syllable recorded near a microphone and the same syllable recorded distantly and again reproduced near the microphone. The greatest difference is observed in the average-energy ratios in the frequency ranges of 0–1 and 1–8, as well as 3–4 and 4–6 kHz. The slope of the linear component is calculated in the 4–8 kHz range. The degree of differentiation depends on the vowel sound.

Vorheriger Artikel An Information-Statistical Approach to Analyzing Acoustic Emission Signals

Nächster Artikel Ultrasonic Identification of Polycrystalline Metal Materials based on Linear Prediction Analysis

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Z. Wu, N. Evans, T. Kinnunen, J. Yamagishi, F. Alegre, and H. Li, Speech Commun. 66, 130 (2015).CrossRef

T. Kinnunen, M. Sahidullah, H. Delgado, M. Todisco, N. Evans, J. Yamagishi, and K. A. Lee, in Proc. InterSpeech 2017 (Stockholm, 2017).

M. Sahidullah, H. Delgado, M. Todisco, T. Kinnunen, N. Evans, J. Yamagishi, and K. A. Lee, in Handbook of Biometric Anti-Spoofing (Springer, Cham, 2019), p. 321.

K. A. Lee, O. Sadjadi, H. Li, and D. Reynolds, Comput. Speech Lang. 61, 101058 (2020).CrossRef

M. R. Kamble, H. B. Sailor, H. A. Patil, and H. Li, APSIPA Trans. Signal Inf. Process. 9 (1), e2 (2020). https://doi.org/10.1017/ATSIP.2019.21CrossRef

Y. W. Lau, M. Wagner, and D. Tran, in Proc. IEEE Int. Symp. on Intelligent Multimedia, Video and Speech (Hong Kong, 2004), p. 145.

J. P. Campbell, Proc. IEEE 85, 1437 (1997).CrossRef

A. Khodabakhsh, A. Mohammadi, and C. Demiroglu, Comput. Speech Lang. 42, 20 (2017).CrossRef

B. Sisman, J. Yamagishi, S. King, and H. Li, IEEE/ACM Trans. Audio, Speech Lang. Proc. 29, 132 (2021).

10.

J. Lindberg and M. Blomberg, in Proc. European Conf. on Speech Communication and Technology (Eurospeech) (Budapest, 1999), p. 1211.

11.

J. Villalba and E. Lleida, in Proc. IEEE Int. Carnahan Conf. on Security Technology (ICCST) (Barcelona, 2011). https://doi.org/10.1109/CCST.2011.6095943

12.

Z. F. Wang, G. Wei, and Q. H. He, in Proc. IEEE Int. Conf. Machine Learning and Cybernetics (ICMLC) (Singapore, 2011), p. 1708.

13.

J. Galka, M. Grzywacz, and R. Samborski, Speech Commun. 67, 143 (2015).CrossRef

14.

A. J. Kolarik, B. C. J. Moore, P. Zahori, S. Cirstea, and S. Pardhan, Atten., Percept. Psychophys. 2 (78), 373 (2016).CrossRef

15.

E. Skudrzyk, The Foundations of Acoustics (Springer-Verlag, Wien 1971; Inostrannaya literatura, Moscow, 1959), Vol. 2.

16.

N. Kopco and B. G. Shinn-Cunningham, J. Acoust. Soc. Am. 130 (3), 1530 (2011).ADSCrossRef

17.

L. Prud’homme and M. Lavandier, J. Acoust. Soc. Am. 148 (3), 614 (2020).CrossRef

18.

E. Georganti, T. May, S. V. D. Par, A. Harma, and J. Mourjopoulos, IEEE Trans. Audio Speech Lang. Process. 19, 1949 (2011). https://doi.org/10.1109/TASL.2011.2104953CrossRef

19.

I. Spiousas, P. E. Etchemendy, M. C. Eguia, E. R. Calcagno, E. Abregú, and R. O. Vergara, Front. Psychol. 8, 969 (2017).CrossRef

20.

P. D. Coleman, J. Acoust. Soc. Am. 34, 345 (1962).ADSCrossRef

21.

V. N. Sorokin and A. I. Tsyplikhin, Inf. Protsessy 10 (2), 87 (2010).

22.

M. Witkowski, S. Kacprzak, P. Zelasko, K. Kowalczyk, and J. Gałka, in Proc. InterSpeech 2017 (Stockholm, 2017), p. 27.

23.

M. R. Kamble, H. Tak, and H. A. Patil, Speech Commun. 125, 114 (2020).CrossRef

24.

M. R. Kamble and H. A. Patil, Comput. Speech Lang. 65, 101140 (2021).CrossRef

25.

H. Teager, IEEE Trans. Acoust. Speech Signal Proc. 28 (5), 599 (1980).CrossRef

26.

W. Shang and M. Stevenson, Comput. Speech Lang. 65, 101133 (2021).CrossRef

27.

Z. Oo, L. Wang, K. Phapatanaburi, M. Liu, S. Nakagawa, M. Iwahashi, and J. Dang, EURASIP J. Audio, Speech, Music, Art. No. 8 (2019).

28.

M. Liu, L. Wang, J. Danga, K. A. Lee, and S. Nakagawa, Comput. Speech Lang. 66, 101161 (2021).CrossRef

29.

V. N. Sorokin and A. S. Leonov, Acoust. Phys. 68 (2), 187 (2022).ADSCrossRef

30.

J. L. Flanagan, Speech Analysis Synthesis and Perception (Springer-Verlag, Berlin, Heidelberg, New York, 1965; Svyaz’, Moscow, 1968).

31.

P. M. Morse, Vibration and Sound (McGraw-Hill, 1948; Gos. izd. tekhniko-tekhnich. lit., Moscow-Leningrad, 1949).

Titel: Distant Speech Detection
verfasst von: V. N. Sorokin
Publikationsdatum: 01.08.2023
Verlag: Pleiades Publishing
Erschienen in: Acoustical Physics / Ausgabe 4/2023
Print ISSN: 1063-7710
Elektronische ISSN: 1562-6865
DOI: https://doi.org/10.1134/S1063771023600250

Premium Partner

Marktübersichten

Die im Laufe eines Jahres in der „adhäsion“ veröffentlichten Marktübersichten helfen Anwendern verschiedenster Branchen, sich einen gezielten Überblick über Lieferantenangebote zu verschaffen.

Zur Marktübersicht

Springer Professional

Abstract

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Springer Professional "Technik"

Weitere Artikel der Ausgabe 4/2023

Imaging of Multiple Cracks in Thin Plates using Circular Sensor Array Based on Lamb Waves

Mode-Matching Analysis for Sound Propagation in a Cylindrical Duct with a Partial Lining

Investigation of Ultra-Wide Acoustic Bandgap of Nano Phononic Crystals in GHZ Frequency Ranges with Delay Line Configuration

An Information-Statistical Approach to Analyzing Acoustic Emission Signals

Acoustic Studies of the Melting and Crystallization of Eutectic Gallium–Silver Alloys in Porous Glasses

Natural Oscillations of an Elastic Half-Strip with a Different Arrangement of Fixation Areas of Its Edges

Premium Partner

Marktübersichten