Skip to main content
Erschienen in: Automatic Documentation and Mathematical Linguistics 4/2020

01.07.2020 | TEXT PROCESSING AUTOMATION

Automatic Detection and Classification of Information Events in Media Texts

verfasst von: Al-dr A. Khoroshilov, R. R. Musabaev, Ya. D. Kozlovskaya, Yu. A. Nikitin, A. A. Khoroshilov

Erschienen in: Automatic Documentation and Mathematical Linguistics | Ausgabe 4/2020

Einloggen, um Zugang zu erhalten

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The solution to the problem of automatically identifying and classifying information events in media texts is described based on the model of phraseological conceptual analysis of texts. The proposed solution is based on the use of previously developed methods for formalizing the semantic structure of sentences, as well as methods and algorithms for identifying fragments of media texts that describe information events. The developed algorithm implements the rules of C. Fillmore’s case grammar, which are based on the procedures of semantic–syntactic and conceptual analysis of texts.
Fußnoten
1
The term event (information event) will be understood as mass media message descriptions of socially significant phenomena, incidents, facts of social activity of a global or regional scale, as well as facts and events of social conglomerations or facts of personal life of famous public figures, etc.
 
2
The term information line is understood as the main topic of a message, which forces the target audience to discuss it. As a rule, an information line reflects important facts of the content of an event.
 
3
Dictionary of Unified Formalized Representations of Concept Names (UFRCN) [2].
 
4
The structure of the sentence in the form of the main words of phrases and their relationships.
 
Literatur
1.
Zurück zum Zitat Bogatyrev, M.Yu., Extracting facts from natural language texts using conceptual graph models, Izv. Tul. Gos. Univ., Tekh. Nauki, 2016, no. 7, part 1, pp. 198–207. Bogatyrev, M.Yu., Extracting facts from natural language texts using conceptual graph models, Izv. Tul. Gos. Univ., Tekh. Nauki, 2016, no. 7, part 1, pp. 198–207.
2.
Zurück zum Zitat Vinogradov, A.N., Vlasova, N.A., Kurshev, E.P., and Podobryaev, A.V., Modern technologies of natural language processing in strategic management problems, in Tekhnologicheskaya perspektiva v ramkakh evraziiskogo prostranstva: Novye rynki i tochki ekonomicheskogo rosta (Technological Perspective within the Eurasian Space: New Markets and Points of Economic Growth), St. Petersburg: Tsentr Nauchno-Inf. Tekhnol. Asterion, 2018. Vinogradov, A.N., Vlasova, N.A., Kurshev, E.P., and Podobryaev, A.V., Modern technologies of natural language processing in strategic management problems, in Tekhnologicheskaya perspektiva v ramkakh evraziiskogo prostranstva: Novye rynki i tochki ekonomicheskogo rosta (Technological Perspective within the Eurasian Space: New Markets and Points of Economic Growth), St. Petersburg: Tsentr Nauchno-Inf. Tekhnol. Asterion, 2018.
3.
Zurück zum Zitat Ermakov, A.E., Automatic extraction of facts from dossier texts: The experience of establishing anaphoric connections, in Komp’yuternaya lingvistika i intellektual’nye tekhnologii: Trudy mezhdunarodnoi konferentsii “Dialog'2007" (Computational Linguistics and Intellectual Technologies: Proceedings of the International Conference "Dialogue'2007"), Moscow: Nauka, 2007. Ermakov, A.E., Automatic extraction of facts from dossier texts: The experience of establishing anaphoric connections, in Komp’yuternaya lingvistika i intellektual’nye tekhnologii: Trudy mezhdunarodnoi konferentsii “Dialog'2007" (Computational Linguistics and Intellectual Technologies: Proceedings of the International Conference "Dialogue'2007"), Moscow: Nauka, 2007.
4.
Zurück zum Zitat Khoroshilov, Al-dr.A., Nikitin, Yu.V., Khoroshilov, Al-ei.A., and Budsko, V.I., Automatic creation of a formalized representation of the semantic content of unstructured text messages of mass media and social networks, Sist. Vys. Dostupnosti, 2014, vol. 10, no.3. Khoroshilov, Al-dr.A., Nikitin, Yu.V., Khoroshilov, Al-ei.A., and Budsko, V.I., Automatic creation of a formalized representation of the semantic content of unstructured text messages of mass media and social networks, Sist. Vys. Dostupnosti, 2014, vol. 10, no.3.
5.
Zurück zum Zitat Helbig, H., Knowledge Representation and the Semantics of Natural Language, Berlin: Springer, 2006.MATH Helbig, H., Knowledge Representation and the Semantics of Natural Language, Berlin: Springer, 2006.MATH
6.
Zurück zum Zitat Kan, A.V., Revina, V.D., Rusnak, V.I., Khoroshilov, Al-dr.A., and Khoroshilov, A.A., Automatic formation of a syntactic language model for machine translation and information retrieval tasks, Nauchno-Tekh. Inf., Ser. 2, 2018, no. 12, pp. 25–41. Kan, A.V., Revina, V.D., Rusnak, V.I., Khoroshilov, Al-dr.A., and Khoroshilov, A.A., Automatic formation of a syntactic language model for machine translation and information retrieval tasks, Nauchno-Tekh. Inf., Ser. 2, 2018, no. 12, pp. 25–41.
7.
Zurück zum Zitat Fillmore, C.J., The case for case, 1967 Texas Symposium on Linguistic Universals, Columbus, OH: The Ohio State University, 1968. Fillmore, C.J., The case for case, 1967 Texas Symposium on Linguistic Universals, Columbus, OH: The Ohio State University, 1968.
8.
Zurück zum Zitat Ablov, I.V., Kozichev, V.N., Shirmanov, A.V., Khoroshilov, Al-dr.A., and Khoroshilov, Al-ei.A., The tools of a machine grammar of the Russian language (based on G.G. Belonogov), Autom. Doc. Math. Linguist., 2018, vol. 52, pp. 142–156.CrossRef Ablov, I.V., Kozichev, V.N., Shirmanov, A.V., Khoroshilov, Al-dr.A., and Khoroshilov, Al-ei.A., The tools of a machine grammar of the Russian language (based on G.G. Belonogov), Autom. Doc. Math. Linguist., 2018, vol. 52, pp. 142–156.CrossRef
9.
Zurück zum Zitat Kalinin, Yu.P., Khoroshilov, Al-dr.A., and Khoroshilov, Al-ei.A., Modern technologies of automated processing of textual information, Sist. Vys. Dostupnosti, 2015, vol. 11, no. 2, pp. 67–79. Kalinin, Yu.P., Khoroshilov, Al-dr.A., and Khoroshilov, Al-ei.A., Modern technologies of automated processing of textual information, Sist. Vys. Dostupnosti, 2015, vol. 11, no. 2, pp. 67–79.
10.
Zurück zum Zitat Zakharov, V.N., Musabaev, R.R., Krasovitskii, A.M., Kozlovskaya, Ya.D., Khoroshilov, Al-dr.A., and Khoroshilov, Al-ei.A., A method for clustering news media reports based on their conceptual analysis, Inf. Ee Primen., 2019, vol. 29, no. 3, pp. 52–65. Zakharov, V.N., Musabaev, R.R., Krasovitskii, A.M., Kozlovskaya, Ya.D., Khoroshilov, Al-dr.A., and Khoroshilov, Al-ei.A., A method for clustering news media reports based on their conceptual analysis, Inf. Ee Primen., 2019, vol. 29, no. 3, pp. 52–65.
11.
Zurück zum Zitat Aivazyan, S.A., Bukhshtaber, V.M., Enyukov, I.S., and Meshalkin, L.D., Prikladnaya statistika: Klassifikatsiya i snizhenie razmernosti (Applied Statistics: Classification and Dimensionality Reduction), Moscow: Finansy Stat., 1989. Aivazyan, S.A., Bukhshtaber, V.M., Enyukov, I.S., and Meshalkin, L.D., Prikladnaya statistika: Klassifikatsiya i snizhenie razmernosti (Applied Statistics: Classification and Dimensionality Reduction), Moscow: Finansy Stat., 1989.
12.
Zurück zum Zitat Alon, N., Spencer, J.H., and Erdős, P., The Probabilistic Method, Wiley, 1992. Alon, N., Spencer, J.H., and Erdős, P., The Probabilistic Method, Wiley, 1992.
13.
Zurück zum Zitat Gol'dberg, I., Neirosetevye metody v obrabotke estestvennogo yazyka (Neural Network Methods in Natural Language Processing), Moscow: DMK, 2019. Gol'dberg, I., Neirosetevye metody v obrabotke estestvennogo yazyka (Neural Network Methods in Natural Language Processing), Moscow: DMK, 2019.
14.
Zurück zum Zitat Osinga, D., Glubokoe obuchenie. Gotovye resheniya (Deep Learning. Ready-Made Solutions), St. Petersburg: Dialektika, 2019. Osinga, D., Glubokoe obuchenie. Gotovye resheniya (Deep Learning. Ready-Made Solutions), St. Petersburg: Dialektika, 2019.
Metadaten
Titel
Automatic Detection and Classification of Information Events in Media Texts
verfasst von
Al-dr A. Khoroshilov
R. R. Musabaev
Ya. D. Kozlovskaya
Yu. A. Nikitin
A. A. Khoroshilov
Publikationsdatum
01.07.2020
Verlag
Pleiades Publishing
Erschienen in
Automatic Documentation and Mathematical Linguistics / Ausgabe 4/2020
Print ISSN: 0005-1055
Elektronische ISSN: 1934-8371
DOI
https://doi.org/10.3103/S0005105520040032

Weitere Artikel der Ausgabe 4/2020

Automatic Documentation and Mathematical Linguistics 4/2020 Zur Ausgabe