Skip to main content

2018 | OriginalPaper | Buchkapitel

Unconventional Usage of Entropy in the Field of Web Usage Data Preprocessing and Machine Translation Evaluation

verfasst von : Michal Munk, Ľubomír Benko

Erschienen in: Applied Physics, System Science and Computers

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

This paper focuses on an unconventional usage of entropy. On one side it deals with preprocessing phase, especially the session identification using the Reference Length method. Entropy, in this case, offers an alternative to determining the ratio of auxiliary pages that is important for this method. With the approach introduced in this paper, the need of a sitemap becomes void. On the other hand, the paper looks at entropy in the case of reliability analysis of Machine Translation metrics. In this case, entropy offers also an alternative mean to validate the metrics.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Shannon, C.E.: A mathematical theory of communication. ACM SIGMOBILE Mob. Comput. Commun. Rev. 5, 3 (2001)CrossRef Shannon, C.E.: A mathematical theory of communication. ACM SIGMOBILE Mob. Comput. Commun. Rev. 5, 3 (2001)CrossRef
2.
Zurück zum Zitat Clausius, R.: On the Motive Power of Heat, and on the Laws which Can Be Deduced from It for the Theory of Heat. Dover (1960) Clausius, R.: On the Motive Power of Heat, and on the Laws which Can Be Deduced from It for the Theory of Heat. Dover (1960)
3.
Zurück zum Zitat Holzinger, A., Hörtenhuber, M., Mayer, C., Bachler, M., Wassertheurer, S., Pinho, A.J., Koslicki, D.: On entropy-based data mining. In: Holzinger, A., Jurisica, I. (eds.) Interactive Knowledge Discovery and Data Mining in Biomedical Informatics: State-of-the-Art and Future Challenges, pp. 209–226. Springer, Berlin (2014)CrossRef Holzinger, A., Hörtenhuber, M., Mayer, C., Bachler, M., Wassertheurer, S., Pinho, A.J., Koslicki, D.: On entropy-based data mining. In: Holzinger, A., Jurisica, I. (eds.) Interactive Knowledge Discovery and Data Mining in Biomedical Informatics: State-of-the-Art and Future Challenges, pp. 209–226. Springer, Berlin (2014)CrossRef
4.
Zurück zum Zitat Lima, C.F.L., de Assis, F.M., de Souza, C.P.: A Comparative Study of Use of Shannon, Rényi and Tsallis Entropy for Attribute Selecting in Network Intrusion Detection (2012) Lima, C.F.L., de Assis, F.M., de Souza, C.P.: A Comparative Study of Use of Shannon, Rényi and Tsallis Entropy for Attribute Selecting in Network Intrusion Detection (2012)
6.
Zurück zum Zitat Munk, M., Benko, Ľ., Gangur, M., Turčáni, M.: Influence of ratio of auxiliary pages on the pre-processing phase of Web Usage Mining. E+M Ekon. a Manag. 3, 144–159 (2015) Munk, M., Benko, Ľ., Gangur, M., Turčáni, M.: Influence of ratio of auxiliary pages on the pre-processing phase of Web Usage Mining. E+M Ekon. a Manag. 3, 144–159 (2015)
7.
Zurück zum Zitat Jin, X., Zhou, Y., Mobasher, B.: A maximum entropy web recommendation system. In: Proceeding of the eleventh ACM SIGKDD International Conference on Knowledge Discovery in Data Mining—KDD’05, p. 612. ACM Press, New York (2005) Jin, X., Zhou, Y., Mobasher, B.: A maximum entropy web recommendation system. In: Proceeding of the eleventh ACM SIGKDD International Conference on Knowledge Discovery in Data Mining—KDD’05, p. 612. ACM Press, New York (2005)
8.
Zurück zum Zitat Wang, H., Wang, L., Yi, L.: Maximum entropy framework used in text classification. In: 2010 IEEE International Conference on Intelligent Computing and Intelligent Systems, pp. 828–833. IEEE (2010) Wang, H., Wang, L., Yi, L.: Maximum entropy framework used in text classification. In: 2010 IEEE International Conference on Intelligent Computing and Intelligent Systems, pp. 828–833. IEEE (2010)
9.
Zurück zum Zitat Benko, Ľ., Reichel, J., Munk, M.: Analysis of student behavior in virtual learning environment depending on student assessments. In: ICETA 2015: 13th International Conference on Emerging eLearning Technologies and Applications, Stary Smokovec, November 26–27, 2015. pp. 33–38. IEEE, Stary Smokovec, Danvers (2015) Benko, Ľ., Reichel, J., Munk, M.: Analysis of student behavior in virtual learning environment depending on student assessments. In: ICETA 2015: 13th International Conference on Emerging eLearning Technologies and Applications, Stary Smokovec, November 26–27, 2015. pp. 33–38. IEEE, Stary Smokovec, Danvers (2015)
10.
Zurück zum Zitat Eetemadi, S., Lewis, W., Toutanova, K., Radha, H.: Survey of data-selection methods in statistical machine translation. Mach. Transl. 29, 189–223 (2015)CrossRef Eetemadi, S., Lewis, W., Toutanova, K., Radha, H.: Survey of data-selection methods in statistical machine translation. Mach. Transl. 29, 189–223 (2015)CrossRef
11.
Zurück zum Zitat Tomeh, N., Allauzen, A., Yvon, F.: Maximum-entropy word alignment and posterior-based phrase extraction for machine translation. Mach. Trans. 28, 19–56 (2014)CrossRef Tomeh, N., Allauzen, A., Yvon, F.: Maximum-entropy word alignment and posterior-based phrase extraction for machine translation. Mach. Trans. 28, 19–56 (2014)CrossRef
12.
Zurück zum Zitat Kapusta, J., Munk, M., Drlík, M.: Analysis of differences between expected and observed probability of accesses to web pages. In: Hwang, D., Jung, J., and Nguyen, N.-T. (eds.) Computational Collective Intelligence. Technologies and Applications SE-68, pp. 673–683. Springer International Publishing (2014) Kapusta, J., Munk, M., Drlík, M.: Analysis of differences between expected and observed probability of accesses to web pages. In: Hwang, D., Jung, J., and Nguyen, N.-T. (eds.) Computational Collective Intelligence. Technologies and Applications SE-68, pp. 673–683. Springer International Publishing (2014)
Metadaten
Titel
Unconventional Usage of Entropy in the Field of Web Usage Data Preprocessing and Machine Translation Evaluation
verfasst von
Michal Munk
Ľubomír Benko
Copyright-Jahr
2018
DOI
https://doi.org/10.1007/978-3-319-53934-8_34

Neuer Inhalt