Skip to main content

2020 | OriginalPaper | Buchkapitel

Neural Network Processing of Natural Russian Language for Building Intelligent Dialogue Systems

verfasst von : Danila Parygin, Nikolay Matyushin, Anton Finogeev, Natalia Sadovnikova, Tatyana Petrova, Ekaterina Fadeeva

Erschienen in: Electronic Governance and Open Society: Challenges in Eurasia

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Currently chatbots, dialogue systems and intelligent assistants increasingly found in an equipment of everyday life, used in technical support of commercial organizations and in entertainment services. Systems for the English language have good groundwork. However, the process of “recognition” of a natural language associated with a number of difficulties caused by the need to have a significant initial database of dialogues, explore various architectures of neural networks, solve problems of the perception and morphology of the Russian language. In this regard, the purpose of this study is the development of a neural network model for natural Russian language processing, capable of becoming an open platform for the development of specialized dialogue systems. For this, design and training of dialog models of neural networks based on modifications of the Transformer architecture are proposed. Own parsers for extracting and post-processing dialogues in natural Russian from the Otvet@mail.Ru portal and public chat rooms in the Telegram messenger for training neural networks were developed. The data set, prepared with their help and now publicly available on the Internet, contains more than 22.5 million question-answer pairs in natural Russian language. The prepared data set in various configurations applied when training a number of neural network models designed by modifying the Sequence2Sequence, Transformer and text2text architectures. The final version of developed neural network model generates answers to any user message up to 200 characters and is integrated into a dialogue system implemented using the client-server architecture for user interaction with the chat bot.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Parygin, D.: Implementation of exoactive management model for urbanized area: real-time monitoring and proactive planning. In: SMART-2019, Proceedings of the 8th International Conference on System Modeling and Advancement in Research Trends, pp. 310–316. IEEE (2020) Parygin, D.: Implementation of exoactive management model for urbanized area: real-time monitoring and proactive planning. In: SMART-2019, Proceedings of the 8th International Conference on System Modeling and Advancement in Research Trends, pp. 310–316. IEEE (2020)
8.
Zurück zum Zitat Nikolenko, S., Kudrin, A., Arkhangelskaya, E.: Glubokoye obucheniye, pogruzheniye v mir neyronnykh setey [Deep learning, immersion in the world of neural networks], St. Petersburg (2018). (in Russian) Nikolenko, S., Kudrin, A., Arkhangelskaya, E.: Glubokoye obucheniye, pogruzheniye v mir neyronnykh setey [Deep learning, immersion in the world of neural networks], St. Petersburg (2018). (in Russian)
11.
Zurück zum Zitat Popel, M., Bojar, O.: Training tips for the transformer model. Prague Bull. Math. Linguist. 110, 43–70 (2018)CrossRef Popel, M., Bojar, O.: Training tips for the transformer model. Prague Bull. Math. Linguist. 110, 43–70 (2018)CrossRef
12.
Zurück zum Zitat Donchenko, D., Sadovnikova, N., Parygin, D.: Prediction of Road Accidents’ Severity on Russian Roads Using Machine Learning Techniques. In: Radionov, Andrey A., Kravchenko, Oleg A., Guzeev, Victor I., Rozhdestvenskiy, Yurij V. (eds.) ICIE 2019. LNME, pp. 1493–1501. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-22063-1_157CrossRef Donchenko, D., Sadovnikova, N., Parygin, D.: Prediction of Road Accidents’ Severity on Russian Roads Using Machine Learning Techniques. In: Radionov, Andrey A., Kravchenko, Oleg A., Guzeev, Victor I., Rozhdestvenskiy, Yurij V. (eds.) ICIE 2019. LNME, pp. 1493–1501. Springer, Cham (2020). https://​doi.​org/​10.​1007/​978-3-030-22063-1_​157CrossRef
13.
Zurück zum Zitat Liu, C., Lowe, R., Serban, I.V., Noseworthy, M., Charlin, L., Pineau, J.: How NOT To Evaluate Your Dialogue System. In: Proceedings of 2016 Conference on Empirical Methods in Natural Language Processing, Austin, 1–5 November 2016, pp. 2122–2132 (2016) Liu, C., Lowe, R., Serban, I.V., Noseworthy, M., Charlin, L., Pineau, J.: How NOT To Evaluate Your Dialogue System. In: Proceedings of 2016 Conference on Empirical Methods in Natural Language Processing, Austin, 1–5 November 2016, pp. 2122–2132 (2016)
14.
Zurück zum Zitat Boiko, D., Parygin, D., Savina, O., Golubev, A., Zelenskiy, I., Mityagin, S.: Approaches to Analysis of Factors Affecting the Residential Real Estate Bid Prices in Case of Open Data Use. In: Chugunov, A., Khodachek, I., Misnikov, Y., Trutnev, D. (eds.) EGOSE 2019. CCIS, vol. 1135, pp. 360–375. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-39296-3_27CrossRef Boiko, D., Parygin, D., Savina, O., Golubev, A., Zelenskiy, I., Mityagin, S.: Approaches to Analysis of Factors Affecting the Residential Real Estate Bid Prices in Case of Open Data Use. In: Chugunov, A., Khodachek, I., Misnikov, Y., Trutnev, D. (eds.) EGOSE 2019. CCIS, vol. 1135, pp. 360–375. Springer, Cham (2020). https://​doi.​org/​10.​1007/​978-3-030-39296-3_​27CrossRef
15.
Zurück zum Zitat Golubev, A., Sadovnikova, N., Parygin, D., Glinyanova, I., Finogeev, A., Shcherbakov, M.: Woody plants area estimation using ordinary satellite images and deep learning. In: Alexandrov, D.A., Boukhanovsky, A.V., Chugunov, A.V., Kabanov, Y., Koltsova, O. (eds.) DTGS 2018. CCIS, vol. 858, pp. 302–313. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-02843-5_24CrossRef Golubev, A., Sadovnikova, N., Parygin, D., Glinyanova, I., Finogeev, A., Shcherbakov, M.: Woody plants area estimation using ordinary satellite images and deep learning. In: Alexandrov, D.A., Boukhanovsky, A.V., Chugunov, A.V., Kabanov, Y., Koltsova, O. (eds.) DTGS 2018. CCIS, vol. 858, pp. 302–313. Springer, Cham (2018). https://​doi.​org/​10.​1007/​978-3-030-02843-5_​24CrossRef
16.
Zurück zum Zitat Velichko, A.N., Budkov, V.Y., Karpov, A.A.: Analytical survey of computational paralinguistic systems for automatic recognition of deception in human speech. Informatsionno-upravliaiushchie sistemy, no. 5, pp. 30–41 (2017) Velichko, A.N., Budkov, V.Y., Karpov, A.A.: Analytical survey of computational paralinguistic systems for automatic recognition of deception in human speech. Informatsionno-upravliaiushchie sistemy, no. 5, pp. 30–41 (2017)
19.
Zurück zum Zitat Donchenko, D., Ovchar, N., Sadovnikova, N., Parygin, D., Shabalina, O., Ather, D.: Analysis of comments of users of social networks to assess the level of social tension. Procedia Comput. Sci. 119, 359–367 (2017)CrossRef Donchenko, D., Ovchar, N., Sadovnikova, N., Parygin, D., Shabalina, O., Ather, D.: Analysis of comments of users of social networks to assess the level of social tension. Procedia Comput. Sci. 119, 359–367 (2017)CrossRef
Metadaten
Titel
Neural Network Processing of Natural Russian Language for Building Intelligent Dialogue Systems
verfasst von
Danila Parygin
Nikolay Matyushin
Anton Finogeev
Natalia Sadovnikova
Tatyana Petrova
Ekaterina Fadeeva
Copyright-Jahr
2020
DOI
https://doi.org/10.1007/978-3-030-67238-6_17

Premium Partner