Application of big data language recognition technology and GPU parallel computing in English teaching visualization system

Author: Long Shi

Published in: International Journal of Speech Technology | Issue 3/2022

Published: 18 October 2021

Abstract

Advances in multimedia and network technology have made it feasible for schools to run online teaching systems. This article presents the design of an online English teaching system built on interactive speech recognition and tailored to the characteristics of English course learning. The recognition model is built with deep learning, which fits a nonlinear function while performing linear computations. To improve recognition accuracy, a DNN-HMM (deep neural network, hidden Markov model) acoustic model is used, which considerably improves the recognition rate of the online English teaching system. Traditional acoustic-model training depends heavily on frame-level labeling of speech, and this causes many problems: labeling requires strong expertise and a heavy workload, and it does not scale to massive data. The system described in this paper no longer requires extensive annotation. A recurrent neural network, which is well suited to speech sequence signals because it can exploit the relationships between data points, is combined with a CTC (connectionist temporal classification) layer to build an LSTM-CTC model. This model speeds up data processing, so the English teaching system runs more smoothly and efficiently. During training, the paper observes that LSTM networks are computationally intensive to train. The system extracts the command from the voice message and then processes it further. The resulting system meets the English teaching needs of most schools and lets students communicate with teachers online, which helps them reach their learning goals.
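
The reason a CTC layer removes the need for frame-level labels can be illustrated with a minimal sketch. This is not the paper's implementation: the blank symbol `_`, the function names `ctc_collapse` and `greedy_decode`, and the toy scores are assumptions made for illustration. CTC lets the network emit one label (or a blank) per frame and then maps that path to the output sequence by merging repeated labels and dropping blanks, so training only needs the target transcript, not an alignment of each frame.

```python
from itertools import groupby

BLANK = "_"  # hypothetical blank symbol, chosen for this illustration


def ctc_collapse(frame_labels, blank=BLANK):
    """Map a per-frame label path to an output sequence.

    Step 1: merge consecutive repeated labels.
    Step 2: drop blank symbols.
    """
    merged = [label for label, _ in groupby(frame_labels)]
    return [label for label in merged if label != blank]


def greedy_decode(frame_scores, alphabet, blank=BLANK):
    """Pick the highest-scoring label in each frame, then collapse the path.

    frame_scores: list of per-frame score lists, one score per alphabet entry.
    alphabet: list of labels, including the blank symbol.
    """
    best_path = [
        alphabet[max(range(len(scores)), key=scores.__getitem__)]
        for scores in frame_scores
    ]
    return ctc_collapse(best_path, blank)


if __name__ == "__main__":
    # A blank between the two "l" runs keeps the double letter distinct.
    print("".join(ctc_collapse(list("hh_e_ll_ll_oo"))))

    # Toy scores over the alphabet ["_", "a", "b"] for four frames.
    alphabet = ["_", "a", "b"]
    scores = [
        [0.1, 0.8, 0.1],
        [0.2, 0.7, 0.1],
        [0.9, 0.05, 0.05],
        [0.1, 0.2, 0.7],
    ]
    print(greedy_decode(scores, alphabet))
```

Greedy decoding is the simplest choice here; a practical recognizer would more likely use beam search, possibly with a language model, over the same collapse rule.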

Title: Application of big data language recognition technology and GPU parallel computing in English teaching visualization system
Author: Long Shi
Publication date: 18 October 2021
Publisher: Springer US
Published in: International Journal of Speech Technology / Issue 3/2022
Print ISSN: 1381-2416
Electronic ISSN: 1572-8110
DOI: https://doi.org/10.1007/s10772-021-09904-1
