Published in: International Journal of Speech Technology | Issue 3/2022

02.06.2021

RETRACTED ARTICLE: Detecting adversarial attacks on audio-visual speech recognition using deep learning method

Author: Rabie A. Ramadan

Abstract

Deep learning techniques have made significant progress on machine learning tasks in many fields, yet deep learning models remain highly vulnerable to adversarial attacks. However, adversarial detection methods for audio-visual (AV) streaming data have received little attention. This research proposes an effective adversarial detection method that exploits the temporal correlation among distinct AV streams using a Deep Convolutional Neural Network (DCNN). The proposed method detects adversarial attacks against two audio-visual recognition models trained on the Lip Reading in the Wild (LRW) and GRID lip-reading datasets. Experimental results indicate that the proposed strategy identifies adversarial attacks more effectively than Supervised Kernel Machines, Combined Neural Networks, and Band Feature Selection methods. The precision, recall, accuracy, and F1-score of the proposed system are 88.10%, 89.30%, 95.60%, and 0.96, respectively, well ahead of the existing systems.
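
The abstract does not reproduce the paper's architecture, but the following PyTorch sketch illustrates the general shape of a DCNN detector of the kind it describes: an audio branch over MFCC features and a video branch over mouth-region frames are fused into a single benign-vs-adversarial classifier. The class name AVAdversarialDetector, all layer sizes, and the input shapes (13 MFCC coefficients, 29-frame 64×64 grayscale crops, roughly LRW-style clips) are illustrative assumptions, not the paper's implementation.

```python
import torch
import torch.nn as nn

class AVAdversarialDetector(nn.Module):
    """Hypothetical sketch of a DCNN that fuses audio and video
    features and scores a clip as benign vs. adversarial.
    Layer sizes are illustrative, not taken from the paper."""

    def __init__(self, n_mfcc=13):
        super().__init__()
        # Audio branch: 1-D convolution over the MFCC time axis.
        self.audio = nn.Sequential(
            nn.Conv1d(n_mfcc, 32, kernel_size=3, padding=1),
            nn.ReLU(),
            nn.AdaptiveAvgPool1d(1),
        )
        # Video branch: 3-D convolution over grayscale mouth crops,
        # capturing the temporal structure across frames.
        self.video = nn.Sequential(
            nn.Conv3d(1, 32, kernel_size=(3, 5, 5), padding=(1, 2, 2)),
            nn.ReLU(),
            nn.AdaptiveAvgPool3d(1),
        )
        # Fusion head: concatenated embeddings -> single logit.
        self.head = nn.Sequential(
            nn.Linear(64, 32),
            nn.ReLU(),
            nn.Linear(32, 1),
        )

    def forward(self, audio, video):
        # audio: (batch, n_mfcc, time); video: (batch, 1, frames, H, W)
        a = self.audio(audio).flatten(1)             # (batch, 32)
        v = self.video(video).flatten(1)             # (batch, 32)
        return self.head(torch.cat([a, v], dim=1))   # raw logit

# Smoke test with random tensors shaped like a 29-frame clip.
model = AVAdversarialDetector()
logit = model(torch.randn(2, 13, 100), torch.randn(2, 1, 29, 64, 64))
print(logit.shape)  # torch.Size([2, 1])
```

In a setup like this, the model would be trained with a binary cross-entropy loss (nn.BCEWithLogitsLoss) on pairs of benign and adversarially perturbed clips; the adaptive pooling layers let the same network accept clips of varying duration and resolution.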

Metadata
Title
RETRACTED ARTICLE: Detecting adversarial attacks on audio-visual speech recognition using deep learning method
Author
Rabie A. Ramadan
Publication date
02.06.2021
Publisher
Springer US
Published in
International Journal of Speech Technology / Issue 3/2022
Print ISSN: 1381-2416
Electronic ISSN: 1572-8110
DOI
https://doi.org/10.1007/s10772-021-09859-3
