Skip to main content
Erschienen in: International Journal of Speech Technology 1/2017

02.11.2016

Voice recognition package for ERTU’s cloud

Erschienen in: International Journal of Speech Technology | Ausgabe 1/2017

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

This paper discusses the application of voice recognition for Egyptian Radio and Television Union’s cloud. It is used as secure access to the cloud by authorized group (AuthGs). The voice of each member from AuthGs is watermarked using singular value decomposition and then encrypted by Chaotic map. The results are transmitted through channel under several conditions and received at the receiver side, and then the recognition rate is calculated for various extraction of watermarking after decryption method. It is done to find out the suitable technique for AuthGs to access the cloud and insure the security and privacy of the cloud. Many tests are performed to compare between the voice before and after process to grantee the high robustness of the signal from illegal eavesdrops or any abuse behaviour. The feature extraction is performed using artificial neural network to store it in a database to compare with.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Abd El-Fattah, M. A., Dessouky, M. I., Diab, S. M., & Abd El-Samie, F. E. (2008). Speech enhancement using an adaptive wiener filtering approach. Progress in Electromagnetics Research M, 4, 167–184.CrossRef Abd El-Fattah, M. A., Dessouky, M. I., Diab, S. M., & Abd El-Samie, F. E. (2008). Speech enhancement using an adaptive wiener filtering approach. Progress in Electromagnetics Research M, 4, 167–184.CrossRef
Zurück zum Zitat Abd El-samie, F. E. (2009). An efficient singular value decomposition algorithm for digital audio watermarking. International Journal of Speech Technology, 12(l), 27–45.CrossRef Abd El-samie, F. E. (2009). An efficient singular value decomposition algorithm for digital audio watermarking. International Journal of Speech Technology, 12(l), 27–45.CrossRef
Zurück zum Zitat AI-Nuaimy, W., El-Bendary, M. A. M., Shafik, A., Shawki, F., Abou El-azm, A. E., E1-Fishawy, N. A., Elhalafawy, S. M., Diab, S. M., Sallam, B. M., & Abd El-Samie, F. E. (2011). An SVD audio watermarking approach using chaotic encrypted images. Digital Signal Processing, 21(6), 764–779. AI-Nuaimy, W., El-Bendary, M. A. M., Shafik, A., Shawki, F., Abou El-azm, A. E., E1-Fishawy, N. A., Elhalafawy, S. M., Diab, S. M., Sallam, B. M., & Abd El-Samie, F. E. (2011). An SVD audio watermarking approach using chaotic encrypted images. Digital Signal Processing, 21(6), 764–779.
Zurück zum Zitat Anderws, H. C., & Hunt, B. R. (1977). Digital image restoration. Englewood Cliffs, NJ: Prentice-Hall. Anderws, H. C., & Hunt, B. R. (1977). Digital image restoration. Englewood Cliffs, NJ: Prentice-Hall.
Zurück zum Zitat Boll, S. F. (1979). Suppression of acoustic noise in speech using spectral subtraction. In IEEE transactions on ASSP (Vo1. 27(2), pp. 113–120). Boll, S. F. (1979). Suppression of acoustic noise in speech using spectral subtraction. In IEEE transactions on ASSP (Vo1. 27(2), pp. 113–120).
Zurück zum Zitat Campbell, J. P. (1997). Speaker recognition: A tutorial. Proceedings of the IEEE, 85(9), 1437–1462.CrossRef Campbell, J. P. (1997). Speaker recognition: A tutorial. Proceedings of the IEEE, 85(9), 1437–1462.CrossRef
Zurück zum Zitat Childers, D. G., & Skinner, D. P. (1977). The cepstrum: A guide to processing. Proceedings of IEEE, 65(10), 1428–1443.CrossRef Childers, D. G., & Skinner, D. P. (1977). The cepstrum: A guide to processing. Proceedings of IEEE, 65(10), 1428–1443.CrossRef
Zurück zum Zitat Elashry, I. F., Farag Allah, O. S., Abbas, A. M., El-Rabaie, S., & Abd El-Samie, F. E. (2009). Homomorphic image encryption. Journal of Electronic Imaging, 18(3), 033002.CrossRef Elashry, I. F., Farag Allah, O. S., Abbas, A. M., El-Rabaie, S., & Abd El-Samie, F. E. (2009). Homomorphic image encryption. Journal of Electronic Imaging, 18(3), 033002.CrossRef
Zurück zum Zitat El-Khamy, S. E., Hadhoud, M. M., Dessouky, M. I., Salatn, B. M., & Abd E1-Samie, F. E. (2004). Optimization of image interpolation as an inverse problem using the LMMSE algorithm. In Proceedings of the IEEE MELECON (pp. 247–250). Croatia. El-Khamy, S. E., Hadhoud, M. M., Dessouky, M. I., Salatn, B. M., & Abd E1-Samie, F. E. (2004). Optimization of image interpolation as an inverse problem using the LMMSE algorithm. In Proceedings of the IEEE MELECON (pp. 247–250). Croatia.
Zurück zum Zitat Erkuguk, S., Krishnan, S., & Glu, M. Z. (2006). A robust audio watermark representation based on linear chirps. IEEE Transactions on Multimedia, 8(5), 925–936.CrossRef Erkuguk, S., Krishnan, S., & Glu, M. Z. (2006). A robust audio watermark representation based on linear chirps. IEEE Transactions on Multimedia, 8(5), 925–936.CrossRef
Zurück zum Zitat Evans, N. W. D., & Mason, J. S. D. (2006). An assessment on the fundamental limitations of spectral subtraction. In IEEE international conference on Acoustic, speech and signal processing (pp. 1–1). Evans, N. W. D., & Mason, J. S. D. (2006). An assessment on the fundamental limitations of spectral subtraction. In IEEE international conference on Acoustic, speech and signal processing (pp. 1–1).
Zurück zum Zitat Jain, A. K. (1978). Fast inversion of banded toeplitz matrices by circular decomposition. In IEEE transaction on acoustics, speech and signal processing (Vol. ASSP-26, No. 2, pp. 121–126). Jain, A. K. (1978). Fast inversion of banded toeplitz matrices by circular decomposition. In IEEE transaction on acoustics, speech and signal processing (Vol. ASSP-26, No. 2, pp. 121–126).
Zurück zum Zitat Khairwa, A., Abhishek, K., Prakash, S., & Pratap T. (2012). A comprehensive study of various biometric identification techniques. In 2012 third international conference on computing communication & networking technologies (ICCCNT) (pp. 1–6). 26–28 July 2012. Khairwa, A., Abhishek, K., Prakash, S., & Pratap T. (2012). A comprehensive study of various biometric identification techniques. In 2012 third international conference on computing communication & networking technologies (ICCCNT) (pp. 1–6). 26–28 July 2012.
Zurück zum Zitat Kim, H. S., & Lee, H. K. (2003). Invariant image watermark using Zernike moments. IEEE Transactions on Circuits and Systems for Video Technology, 13(8), 766–775.CrossRef Kim, H. S., & Lee, H. K. (2003). Invariant image watermark using Zernike moments. IEEE Transactions on Circuits and Systems for Video Technology, 13(8), 766–775.CrossRef
Zurück zum Zitat Krishnamoorthy, P., & Mahadeva, S. R. (2006). Enhancement of noisy speech by spectral subtraction and residual modification. In Annual India conference (pp. 1–5). Krishnamoorthy, P., & Mahadeva, S. R. (2006). Enhancement of noisy speech by spectral subtraction and residual modification. In Annual India conference (pp. 1–5).
Zurück zum Zitat Lu, Z. M., Xu, D. G., & Sun, S. H. (2005). Multipurpose image watermarking algorithm based on multistage vector quantization. IEEE Transactions on Image Processing, 14(6), 822–831.CrossRef Lu, Z. M., Xu, D. G., & Sun, S. H. (2005). Multipurpose image watermarking algorithm based on multistage vector quantization. IEEE Transactions on Image Processing, 14(6), 822–831.CrossRef
Zurück zum Zitat Macq, B., Dittmann, J., & Delp, E. J. (2004). Benchmarking of image watermarking algorithms for digital rights management. Proceedings of the IEEE, 92(6), 971–984.CrossRef Macq, B., Dittmann, J., & Delp, E. J. (2004). Benchmarking of image watermarking algorithms for digital rights management. Proceedings of the IEEE, 92(6), 971–984.CrossRef
Zurück zum Zitat Meng, J., Zhang, J., & Zhao, H. (2012). Overview of the speech recognition technology. In 2012 fourth international conference on computational and information sciences (pp. 199–202). 17–19 August 2012. Meng, J., Zhang, J., & Zhao, H. (2012). Overview of the speech recognition technology. In 2012 fourth international conference on computational and information sciences (pp. 199–202). 17–19 August 2012.
Zurück zum Zitat Naeem, E. A., AbdElnaby, M. M., & Hadhoud, M. M. (2009). Chaotic image encryption in transform domains. IEEE 2009 (pp. 71–76). Naeem, E. A., AbdElnaby, M. M., & Hadhoud, M. M. (2009). Chaotic image encryption in transform domains. IEEE 2009 (pp. 71–76).
Zurück zum Zitat Sotelo, E. E. A., Nakamura, T., Nagai, T., & Hernandez, E. E. (2012). Who said that? The crossmodal matching identity for inferring unfamiliar faces from voices. In 2012 eighth international conference on signal image technology and internet based systems (pp. 97–104). 25–29 November 2012. Sotelo, E. E. A., Nakamura, T., Nagai, T., & Hernandez, E. E. (2012). Who said that? The crossmodal matching identity for inferring unfamiliar faces from voices. In 2012 eighth international conference on signal image technology and internet based systems (pp. 97–104). 25–29 November 2012.
Zurück zum Zitat Tsujino, K., Nakashima, Y., Iizuka, S., & Isoda, Y. (2013). Speech recognition and spoken language understanding for mobile personal assistants: A case study of “Shabette Concier”’. In 2013 IEEE 14th international conference on mobile data management (pp. 225–228). 3–6 June 2013. Tsujino, K., Nakashima, Y., Iizuka, S., & Isoda, Y. (2013). Speech recognition and spoken language understanding for mobile personal assistants: A case study of “Shabette Concier”’. In 2013 IEEE 14th international conference on mobile data management (pp. 225–228). 3–6 June 2013.
Zurück zum Zitat Wang, X., Qi, W., & Niu, P. (2007). A new adaptive digital audio watermarking based on support vector regression. IEEE Transactions on Audio, Speech and Language Processing, 15(8), 2270–2277.CrossRef Wang, X., Qi, W., & Niu, P. (2007). A new adaptive digital audio watermarking based on support vector regression. IEEE Transactions on Audio, Speech and Language Processing, 15(8), 2270–2277.CrossRef
Metadaten
Titel
Voice recognition package for ERTU’s cloud
Publikationsdatum
02.11.2016
Erschienen in
International Journal of Speech Technology / Ausgabe 1/2017
Print ISSN: 1381-2416
Elektronische ISSN: 1572-8110
DOI
https://doi.org/10.1007/s10772-016-9387-8

Weitere Artikel der Ausgabe 1/2017

International Journal of Speech Technology 1/2017 Zur Ausgabe

Neuer Inhalt