Skip to main content
Erschienen in: International Journal of Speech Technology 3/2022

18.10.2021

Application of big data language recognition technology and GPU parallel computing in English teaching visualization system

verfasst von: Long Shi

Erschienen in: International Journal of Speech Technology | Ausgabe 3/2022

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

With the development of multimedia technology and network technology applications, it is possible to implement online teaching systems in schools. This article aims to realize the design of online English teaching system based on interactive speech recognition system. The teaching system uses the characteristics of English course learning to develop an online English teaching design system based on interactive speech recognition technology. This article uses a deep learning model in the process of establishing the model, using mathematical methods, and fitting a nonlinear function while performing linear calculations. In order to improve the recognition accuracy, the DNN-HMM model is used, which greatly improves the recognition rate of the online English teaching system. The need for labeling speech frames is more urgent in traditional training acoustic models, and many problems have arisen as a result. The step of labeling speech requires strong professionalism and a large workload, and there is no way to adapt to the processing of massive data. The system established in this paper no longer requires a lot of work for annotation, but combined with the CTC layer, the cyclic neural network plays an important role in the process of processing speech sequence signals, and the relationship between data can be used in the process to build LSTM-The CTC model makes data processing faster, and finally makes the English teaching system work more fluently and improves work efficiency, which is of great significance. In the training, this paper discovered the characteristics of LSTM network training such as large amount of calculation. Extract the command from the voice message, and then proceed further. Through the establishment of this system, it meets the needs of English teaching in most schools, enables students to communicate with teachers online, can better achieve learning goals and obtain good benefits.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Alian, M., & Awajan, A. (2018). Semantic similarity approaches—Review. In 2018 international arab conference on information technology (ACIT2018), Werdanye, Lebanon (pp. 1–6). Alian, M., & Awajan, A. (2018). Semantic similarity approaches—Review. In 2018 international arab conference on information technology (ACIT2018), Werdanye, Lebanon (pp. 1–6).
Zurück zum Zitat Alian, M., & Awajan, A. (2020). Evaluating Factors affecting sentences similarity and paraphrasing identification using K-means clustering. In The 35th International Business Information Management Association (35th IBIMA) (pp. 952–959). Alian, M., & Awajan, A. (2020). Evaluating Factors affecting sentences similarity and paraphrasing identification using K-means clustering. In The 35th International Business Information Management Association (35th IBIMA) (pp. 952–959).
Zurück zum Zitat Bsoul, Q. W., & Mohd, M. (2011). Effect of ISRI stemming on similarity measure for arabic document clustering. In Asia information retrieval symposium (pp. 584–593). Bsoul, Q. W., & Mohd, M. (2011). Effect of ISRI stemming on similarity measure for arabic document clustering. In Asia information retrieval symposium (pp. 584–593).
Zurück zum Zitat Cakır, E., Parascandolo, G., Heittola, T., Huttunen, H., & Virtanen, T. (2017). Convolutional recurrent neural networks for polyphonic sound event detection. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 25(6), 1291–1303.CrossRef Cakır, E., Parascandolo, G., Heittola, T., Huttunen, H., & Virtanen, T. (2017). Convolutional recurrent neural networks for polyphonic sound event detection. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 25(6), 1291–1303.CrossRef
Zurück zum Zitat Drugman, T., Alku, P., Alwan, A., & Yegnanarayana, B. (2014). Glottal source processing: From analysis to applications. Computer Speech and Language, 28(5), 1117–1138.CrossRef Drugman, T., Alku, P., Alwan, A., & Yegnanarayana, B. (2014). Glottal source processing: From analysis to applications. Computer Speech and Language, 28(5), 1117–1138.CrossRef
Zurück zum Zitat Drugman, T., Thomas, M. R. P., Gudnason, J., Naylor, P. A., & Dutoit, T. (2012). Detection of glottal closure instants from speech signals: A quantitative review. IEEE Transactions on Audio Speech and Language Processing, 20(3), 994–1001.CrossRef Drugman, T., Thomas, M. R. P., Gudnason, J., Naylor, P. A., & Dutoit, T. (2012). Detection of glottal closure instants from speech signals: A quantitative review. IEEE Transactions on Audio Speech and Language Processing, 20(3), 994–1001.CrossRef
Zurück zum Zitat El Ouahabi, S., Atounti, M., & Bellouki, M. (2016). Building HMM independent isolated speech recognizer system for Amazigh Language, Europe and MENA. In: Cooperation Advances in Information and Communication Technologies. Volume 520 of the series Advances in Intelligent Systems and Computing, pp 299–307. https://doi.org/10.1007/978-3-319-46568-5-31. El Ouahabi, S., Atounti, M., & Bellouki, M. (2016). Building HMM independent isolated speech recognizer system for Amazigh Language, Europe and MENA. In: Cooperation Advances in Information and Communication Technologies. Volume 520 of the series Advances in Intelligent Systems and Computing, pp 299–307. https://​doi.​org/​10.​1007/​978-3-319-46568-5-31.
Zurück zum Zitat Fernando, S., & Stevenson, M. (2008). A semantic similarity approach to paraphrase detection. In The 11th annual research colloquium of the UK special interest group for computational linguistics. Fernando, S., & Stevenson, M. (2008). A semantic similarity approach to paraphrase detection. In The 11th annual research colloquium of the UK special interest group for computational linguistics.
Zurück zum Zitat Froud, H., & Lachkar, A. (2013). Agglomerative hierarchical clustering techniques for arabic documents. In Advances in computational science, engineering and information technology. Advances in intelligent systems and computing (Vol. 225). Heidelberg: Springer. Froud, H., & Lachkar, A. (2013). Agglomerative hierarchical clustering techniques for arabic documents. In Advances in computational science, engineering and information technology. Advances in intelligent systems and computing (Vol. 225). Heidelberg: Springer.
Zurück zum Zitat Grave, E., Bojanowski, P., Gupta, P., Joulin, A., & Mikolov, T. (2018). Learning word vectors for 157 languages. In Proceedings of the international conference on language resources and evaluation (LREC 2018). Grave, E., Bojanowski, P., Gupta, P., Joulin, A., & Mikolov, T. (2018). Learning word vectors for 157 languages. In Proceedings of the international conference on language resources and evaluation (LREC 2018).
Zurück zum Zitat Hussein, M., Alsammak, A., & Elshishtawy, T. (2016) In The 10th international conference on informatics and systems (pp. 61–67). Hussein, M., Alsammak, A., & Elshishtawy, T. (2016) In The 10th international conference on informatics and systems (pp. 61–67).
Zurück zum Zitat Jaradat, M., Al-Ayyoub, Z., Jararweh, M., & Al-Smadi, Y. (2017). Paraphrase identification and semantic text similarity analysis in Arabic news tweets using lexical, syntactic, and semantic features. Information Processing and Management, 53(3), 640–652.CrossRef Jaradat, M., Al-Ayyoub, Z., Jararweh, M., & Al-Smadi, Y. (2017). Paraphrase identification and semantic text similarity analysis in Arabic news tweets using lexical, syntactic, and semantic features. Information Processing and Management, 53(3), 640–652.CrossRef
Zurück zum Zitat Klavans, J., Eskin, E., & Hatzivassiloglou, V. (1999). Detecting text similarity over short passages: Exploring linguistic feature combinations via machine learning. In SIGDAT conference: empirical methods in NLP and very large corpora (pp. 204–212). Klavans, J., Eskin, E., & Hatzivassiloglou, V. (1999). Detecting text similarity over short passages: Exploring linguistic feature combinations via machine learning. In SIGDAT conference: empirical methods in NLP and very large corpora (pp. 204–212).
Zurück zum Zitat Koutrouvelis, A. I., Kafentzis, G. P., Gaubitch, N. D., & Heusdens, R. (2016). A fast method for high-resolution voiced/unvoiced detection and glottal closure/opening instant estimation of speech. IEEE Transactions on Audio, Speech, and Language Processing, 24(2), 316–328.CrossRef Koutrouvelis, A. I., Kafentzis, G. P., Gaubitch, N. D., & Heusdens, R. (2016). A fast method for high-resolution voiced/unvoiced detection and glottal closure/opening instant estimation of speech. IEEE Transactions on Audio, Speech, and Language Processing, 24(2), 316–328.CrossRef
Zurück zum Zitat Lintean, C. M., & Rus, V. (2012). Measuring semantic similarity in short texts through greedy pairing and word semantics. In The twenty-fifth international florida artificial intelligence research society conference (pp. 244–249). Lintean, C. M., & Rus, V. (2012). Measuring semantic similarity in short texts through greedy pairing and word semantics. In The twenty-fifth international florida artificial intelligence research society conference (pp. 244–249).
Zurück zum Zitat Lydia, E. L., Govindaswamy, P., Lakshmanaprabu, S., & Ramya, D. (2018). Document clustering based on text mining K-means algorithm using euclidean distance similarity. Journal of Advanced Research in Dynamical and Control Systems, 10(2), 208–214. Lydia, E. L., Govindaswamy, P., Lakshmanaprabu, S., & Ramya, D. (2018). Document clustering based on text mining K-means algorithm using euclidean distance similarity. Journal of Advanced Research in Dynamical and Control Systems, 10(2), 208–214.
Zurück zum Zitat Mohsen, G., Al-Ayyoub, M., Hmeidi, I., & Al-Aiad, A. (2018). On the automatic construction of an arabic thesaurus. In 9th international conference on information and communication systems (ICICS). Mohsen, G., Al-Ayyoub, M., Hmeidi, I., & Al-Aiad, A. (2018). On the automatic construction of an arabic thesaurus. In 9th international conference on information and communication systems (ICICS).
Zurück zum Zitat Naeem, S., & Wumaier, A. (2018). Study and implementing K-mean clustering algorithm on English text and techniques to find the optimal value of K. International Journal of Computer Applications, 182(31), 975–8887.CrossRef Naeem, S., & Wumaier, A. (2018). Study and implementing K-mean clustering algorithm on English text and techniques to find the optimal value of K. International Journal of Computer Applications, 182(31), 975–8887.CrossRef
Zurück zum Zitat Oshea, F. A., Bandar, J. D., Crockett, Z., & Almarsoomi, K. (2013). AWSS: An algorithm for measuring arabic word semantic similarity. In 2013 IEEE international conference on systems, man, and cybernetics (pp. 504–509). Oshea, F. A., Bandar, J. D., Crockett, Z., & Almarsoomi, K. (2013). AWSS: An algorithm for measuring arabic word semantic similarity. In 2013 IEEE international conference on systems, man, and cybernetics (pp. 504–509).
Zurück zum Zitat Pokhariya, J. S., & Mathur, S. (2014). Sanskrit speech recognition using Hidden Markov Model Toolkit. International Journal of Engineering Research & Technology (IJERT), 3(10), 93–98. Pokhariya, J. S., & Mathur, S. (2014). Sanskrit speech recognition using Hidden Markov Model Toolkit. International Journal of Engineering Research & Technology (IJERT), 3(10), 93–98.
Zurück zum Zitat Rahaman, I., & Hosein, P. (2017). Exploiting Gaussian word embeddings for document clustering. In Future technologies conference (FTC) (pp. 1015–1018). Rahaman, I., & Hosein, P. (2017). Exploiting Gaussian word embeddings for document clustering. In Future technologies conference (FTC) (pp. 1015–1018).
Zurück zum Zitat Rao, K. S., Prasanna, S. R. M., & Yegnanarayana, B. (2007). Determination of instants of significant excitation in speech using Hilbert envelope and group delay function. IEEE Signal Processing Letters, 14(10), 762–765.CrossRef Rao, K. S., Prasanna, S. R. M., & Yegnanarayana, B. (2007). Determination of instants of significant excitation in speech using Hilbert envelope and group delay function. IEEE Signal Processing Letters, 14(10), 762–765.CrossRef
Zurück zum Zitat Scharstein, D. (1994, October). Matching images by comparing their gradient fields. In Proceedings of 12th International Conference on Pattern Recognition (Vol. 1, pp. 572–575). IEEE. Scharstein, D. (1994, October). Matching images by comparing their gradient fields. In Proceedings of 12th International Conference on Pattern Recognition (Vol. 1, pp. 572–575). IEEE.
Zurück zum Zitat Soliman, H. R. H., Grida, M., & Hassan, M. (2019). Arabic text custering based on K-means algorithm with semantic word embedding. Journal of Theoretical and Applied Information Technology, 97(21), 2497–2509. Soliman, H. R. H., Grida, M., & Hassan, M. (2019). Arabic text custering based on K-means algorithm with semantic word embedding. Journal of Theoretical and Applied Information Technology, 97(21), 2497–2509.
Zurück zum Zitat Srivastava, S., & Govilkar, S. (2017). A survey on paraphrase detection techniques for Indian regional languages. International Journal of Computer Applications, 163(9), 0975–8887.CrossRef Srivastava, S., & Govilkar, S. (2017). A survey on paraphrase detection techniques for Indian regional languages. International Journal of Computer Applications, 163(9), 0975–8887.CrossRef
Zurück zum Zitat Thomas, M. R. P., Gudnason, J., Naylor, P. A., Geiser, B., & Vary, P. (2010). Voice source estimation for artificial bandwidth extension of telephone speech. In Proceedings of the IEEE international conference on acoustics speech and signal processing (ICASSP) (pp. 4794–4797). Thomas, M. R. P., Gudnason, J., Naylor, P. A., Geiser, B., & Vary, P. (2010). Voice source estimation for artificial bandwidth extension of telephone speech. In Proceedings of the IEEE international conference on acoustics speech and signal processing (ICASSP) (pp. 4794–4797).
Zurück zum Zitat Varga, A., & Steeneken, H. J. M. (1993). Assessment for automatic speech recognition II: NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems. Speech Communication, 12(3), 247–251.CrossRef Varga, A., & Steeneken, H. J. M. (1993). Assessment for automatic speech recognition II: NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems. Speech Communication, 12(3), 247–251.CrossRef
Metadaten
Titel
Application of big data language recognition technology and GPU parallel computing in English teaching visualization system
verfasst von
Long Shi
Publikationsdatum
18.10.2021
Verlag
Springer US
Erschienen in
International Journal of Speech Technology / Ausgabe 3/2022
Print ISSN: 1381-2416
Elektronische ISSN: 1572-8110
DOI
https://doi.org/10.1007/s10772-021-09904-1

Weitere Artikel der Ausgabe 3/2022

International Journal of Speech Technology 3/2022 Zur Ausgabe

Neuer Inhalt