nach oben

International Journal of Speech Technology

Erschienen in:

02.11.2016

Voice recognition package for ERTU’s cloud

Erschienen in: International Journal of Speech Technology | Ausgabe 1/2017

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config

KI-gestützte Suche

Aus

Abstract

This paper discusses the application of voice recognition for Egyptian Radio and Television Union’s cloud. It is used as secure access to the cloud by authorized group (AuthGs). The voice of each member from AuthGs is watermarked using singular value decomposition and then encrypted by Chaotic map. The results are transmitted through channel under several conditions and received at the receiver side, and then the recognition rate is calculated for various extraction of watermarking after decryption method. It is done to find out the suitable technique for AuthGs to access the cloud and insure the security and privacy of the cloud. Many tests are performed to compare between the voice before and after process to grantee the high robustness of the signal from illegal eavesdrops or any abuse behaviour. The feature extraction is performed using artificial neural network to store it in a database to compare with.

Vorheriger Artikel Speech based automatic personality perception using spectral features

Nächster Artikel Subjective speech quality measurement repeatability: comparison of laboratory test results

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Abd El-Fattah, M. A., Dessouky, M. I., Diab, S. M., & Abd El-Samie, F. E. (2008). Speech enhancement using an adaptive wiener filtering approach. Progress in Electromagnetics Research M, 4, 167–184.CrossRef

Abd El-samie, F. E. (2009). An efficient singular value decomposition algorithm for digital audio watermarking. International Journal of Speech Technology, 12(l), 27–45.CrossRef

AI-Nuaimy, W., El-Bendary, M. A. M., Shafik, A., Shawki, F., Abou El-azm, A. E., E1-Fishawy, N. A., Elhalafawy, S. M., Diab, S. M., Sallam, B. M., & Abd El-Samie, F. E. (2011). An SVD audio watermarking approach using chaotic encrypted images. Digital Signal Processing, 21(6), 764–779.

Anderws, H. C., & Hunt, B. R. (1977). Digital image restoration. Englewood Cliffs, NJ: Prentice-Hall.

Boll, S. F. (1979). Suppression of acoustic noise in speech using spectral subtraction. In IEEE transactions on ASSP (Vo1. 27(2), pp. 113–120).

Campbell, J. P. (1997). Speaker recognition: A tutorial. Proceedings of the IEEE, 85(9), 1437–1462.CrossRef

Childers, D. G., & Skinner, D. P. (1977). The cepstrum: A guide to processing. Proceedings of IEEE, 65(10), 1428–1443.CrossRef

Elashry, I. F., Farag Allah, O. S., Abbas, A. M., El-Rabaie, S., & Abd El-Samie, F. E. (2009). Homomorphic image encryption. Journal of Electronic Imaging, 18(3), 033002.CrossRef

El-Khamy, S. E., Hadhoud, M. M., Dessouky, M. I., Salatn, B. M., & Abd E1-Samie, F. E. (2004). Optimization of image interpolation as an inverse problem using the LMMSE algorithm. In Proceedings of the IEEE MELECON (pp. 247–250). Croatia.

Erkuguk, S., Krishnan, S., & Glu, M. Z. (2006). A robust audio watermark representation based on linear chirps. IEEE Transactions on Multimedia, 8(5), 925–936.CrossRef

Evans, N. W. D., & Mason, J. S. D. (2006). An assessment on the fundamental limitations of spectral subtraction. In IEEE international conference on Acoustic, speech and signal processing (pp. 1–1).

Jain, A. K. (1978). Fast inversion of banded toeplitz matrices by circular decomposition. In IEEE transaction on acoustics, speech and signal processing (Vol. ASSP-26, No. 2, pp. 121–126).

Khairwa, A., Abhishek, K., Prakash, S., & Pratap T. (2012). A comprehensive study of various biometric identification techniques. In 2012 third international conference on computing communication & networking technologies (ICCCNT) (pp. 1–6). 26–28 July 2012.

Kim, H. S., & Lee, H. K. (2003). Invariant image watermark using Zernike moments. IEEE Transactions on Circuits and Systems for Video Technology, 13(8), 766–775.CrossRef

Krishnamoorthy, P., & Mahadeva, S. R. (2006). Enhancement of noisy speech by spectral subtraction and residual modification. In Annual India conference (pp. 1–5).

Lu, Z. M., Xu, D. G., & Sun, S. H. (2005). Multipurpose image watermarking algorithm based on multistage vector quantization. IEEE Transactions on Image Processing, 14(6), 822–831.CrossRef

Macq, B., Dittmann, J., & Delp, E. J. (2004). Benchmarking of image watermarking algorithms for digital rights management. Proceedings of the IEEE, 92(6), 971–984.CrossRef

Meng, J., Zhang, J., & Zhao, H. (2012). Overview of the speech recognition technology. In 2012 fourth international conference on computational and information sciences (pp. 199–202). 17–19 August 2012.

Naeem, E. A., AbdElnaby, M. M., & Hadhoud, M. M. (2009). Chaotic image encryption in transform domains. IEEE 2009 (pp. 71–76).

Sotelo, E. E. A., Nakamura, T., Nagai, T., & Hernandez, E. E. (2012). Who said that? The crossmodal matching identity for inferring unfamiliar faces from voices. In 2012 eighth international conference on signal image technology and internet based systems (pp. 97–104). 25–29 November 2012.

Tsujino, K., Nakashima, Y., Iizuka, S., & Isoda, Y. (2013). Speech recognition and spoken language understanding for mobile personal assistants: A case study of “Shabette Concier”’. In 2013 IEEE 14th international conference on mobile data management (pp. 225–228). 3–6 June 2013.

Wang, X., Qi, W., & Niu, P. (2007). A new adaptive digital audio watermarking based on support vector regression. IEEE Transactions on Audio, Speech and Language Processing, 15(8), 2270–2277.CrossRef

Titel: Voice recognition package for ERTU’s cloud
Publikationsdatum: 02.11.2016
Erschienen in: International Journal of Speech Technology / Ausgabe 1/2017
Print ISSN: 1381-2416
Elektronische ISSN: 1572-8110
DOI: https://doi.org/10.1007/s10772-016-9387-8

Neuer Inhalt

Bildnachweise

VDI-Icon, Profil Icon, inhalt2, Springer Professional Modul/© Springer Fachmedien Wiesbaden GmbH, Nachhaltigkeitsaward Key Visual/© Cometis AG/Global ESG Monitor | Daniel Rupp | Generiert mit KI, Search Icon, Banner Hanser, Frank Urbansky/© Peter Eichler / Leipzig, CO2-Fußabdruck/© Jenny Sturm / stock.adobe.com, Interview Entropie Bild 1/© Bernhard Weßling, Zeitschrift Wissensmanagement Cover, PatentFit-Logo/© Springer Fachmedien Wiesbaden GmbH, Sustainibility Finance/© Robert Kneschke / stock.adobe.com / Springer Fachmedien Wiesbaden GmbH, Zukunftswerkstatt Sales Excellence 2024/© AndreyPopov / Getty Images / iStock, 2023_Antrieb/© supervisuell

Springer Professional

Abstract

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Springer Professional "Technik"

Springer Professional "Wirtschaft"

Weitere Artikel der Ausgabe 1/2017

Speech enhancement based on stationary bionic wavelet transform and maximum a posterior estimator of magnitude-squared spectrum

Text-independent speaker identification based on selection of the most similar feature vectors

Single channel noise reduction system in low SNR

Modification of energy spectra, epoch parameters and prosody for emotion conversion in speech

Glottal opening instants detection using zero frequency resonator

Single-channel speech separation using empirical mode decomposition and multi pitch information with estimation of number of speakers

Neuer Inhalt

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.