Published in: Optical Memory and Neural Networks 2/2023

December 1, 2023

Application of the Variational Principle to Create a Measurable Assessment of the Relevance of Objects Included in Training Databases

Authors: V. A. Antonets, M. A. Antonets

Abstract

We consider the problem of obtaining a measurable assessment of the quality of empirical training data selected by experts. The problem can be solved when the data can be represented as histograms. This class includes any frequency diagram of linguistic objects in a sample, for example, lemmas in a text, as well as discretized time signals from various branches of science, technology, and medicine. Like other known methods, the proposed method relies on weight functions: the weight of a histogram is the sum, over all its columns, of the column height multiplied by the value of the weight function for that column. In contrast to well-known approaches, however, the weight function is not found empirically but is derived from the following variational principle: a weight function is considered optimal if the weight of the lightest histogram under it is greater than or equal to the weight of the lightest histogram under any other weight function. Applying this approach to the thematic classification of advertisement texts on electronic trading platforms showed that, for the selected topics, approximately 90% of the lemmas (words) encountered in the training corpus had zero weight, and almost all words with nonzero weight were semantically related to the topic.
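
The weighting rule and the optimality criterion described in the abstract can be illustrated with a minimal Python sketch. It assumes, beyond what the abstract states, that weight functions are non-negative and sum to 1 (otherwise scaling the weights would inflate every histogram weight). The function names, the toy histograms, and the coarse grid search are hypothetical illustrations, not the authors' algorithm.

```python
from itertools import product


def histogram_weight(heights, weights):
    """Weight of one histogram: sum over columns of height * weight-function value."""
    if len(heights) != len(weights):
        raise ValueError("histogram and weight function need the same number of columns")
    return sum(h * w for h, w in zip(heights, weights))


def lightest_weight(histograms, weights):
    """Weight of the lightest histogram in the collection under a given weight function."""
    return min(histogram_weight(h, weights) for h in histograms)


def grid_search_weights(histograms, steps=10):
    """Coarse maximin search over a grid of candidate weight functions.

    ASSUMPTION: candidates are non-negative and sum to 1. The abstract does not
    state the normalization; this constraint is chosen only for illustration.
    """
    n_columns = len(histograms[0])
    best_weights, best_value = None, float("-inf")
    for counts in product(range(steps + 1), repeat=n_columns):
        if sum(counts) != steps:
            continue  # keep only grid points whose weights sum to 1
        candidate = [c / steps for c in counts]
        value = lightest_weight(histograms, candidate)
        if value > best_value:
            best_weights, best_value = candidate, value
    return best_weights, best_value


# Hypothetical toy data: three normalized histograms with three columns each.
histograms = [
    [0.5, 0.3, 0.2],
    [0.4, 0.4, 0.2],
    [0.6, 0.1, 0.3],
]

uniform = [1 / 3, 1 / 3, 1 / 3]
best_weights, best_value = grid_search_weights(histograms)

print("lightest histogram, uniform weights:", lightest_weight(histograms, uniform))
print("lightest histogram, grid-search weights:", best_value)
print("grid-search weight function:", best_weights)
```

In this toy case the search assigns zero weight to the third column, which shows how a maximin criterion can drive the weights of some columns to zero. A grid search is only practical for a handful of columns; a realistic vocabulary would need a proper optimization method.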

Metadata
Title
Application of the Variational Principle to Create a Measurable Assessment of the Relevance of Objects Included in Training Databases
Authors
V. A. Antonets
M. A. Antonets
Publication date
December 1, 2023
Publisher
Pleiades Publishing
Published in
Optical Memory and Neural Networks / Special Issue 2/2023
Print ISSN: 1060-992X
Electronic ISSN: 1934-7898
DOI
https://doi.org/10.3103/S1060992X23060024
