Skip to main content

2017 | OriginalPaper | Buchkapitel

Markov Chain for Author Writing Style Profile Construction

verfasst von : Pavels Osipovs, Andrejs Rinkevičs, Galina Kuleshova, Arkady Borisov

Erschienen in: Recent Advances in Soft Computing

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In this paper, the latest results of research in the area of author’s personal style profile construction are reviewed. The main goal is to explore the ability to use Markov chain graph, educated based on original author texts to store specifics of his personal writing features. Having such personal profile enables text comparison for authorship confirmation. The ability to do it will be in demand in lot of different areas, for example, authorship detection of scientific articles, or artistic literature texts. This paper describes the main idea offered, the proposed algorithm for two graphs similarity level calculation, the structure of the experimental system created and results of the experiments conducted.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Osipov, P.A., Borisov, A.N.: Abnormal action detection based on Markov models. Autom. Control Comput. Sci. 45(2), 94–105 (2011)CrossRef Osipov, P.A., Borisov, A.N.: Abnormal action detection based on Markov models. Autom. Control Comput. Sci. 45(2), 94–105 (2011)CrossRef
2.
Zurück zum Zitat Elayidom, M.S., Jose, C., et al.: Text classification for authorship attribution analysis. Adv. Comput. Int. J. (ACIJ) 4(5), 1–10 (2013)CrossRef Elayidom, M.S., Jose, C., et al.: Text classification for authorship attribution analysis. Adv. Comput. Int. J. (ACIJ) 4(5), 1–10 (2013)CrossRef
3.
Zurück zum Zitat Homem, N., Carvalho, J.P.: Authorship identification and author fuzzy fingerprints. In: 2011 Annual Meeting of the North American Fuzzy Information Processing Society (NAFIPS), pp. 1–6. IEEE (2011). 978-1-61284-968-3/11/2011 Homem, N., Carvalho, J.P.: Authorship identification and author fuzzy fingerprints. In: 2011 Annual Meeting of the North American Fuzzy Information Processing Society (NAFIPS), pp. 1–6. IEEE (2011). 978-1-61284-968-3/11/2011
4.
Zurück zum Zitat Metwally, A., Agrawal, D., Abbadi, A.: Efficient computation of frequent and top-k elements in data streams, University of California, Santa Barbara, USA, Technical report 2005-23, September 2005 Metwally, A., Agrawal, D., Abbadi, A.: Efficient computation of frequent and top-k elements in data streams, University of California, Santa Barbara, USA, Technical report 2005-23, September 2005
5.
Zurück zum Zitat Dabagh, R.M.: Authorship attribution and statistical text analysis. Metodološki zvezki 4(2), 149–163 (2007) Dabagh, R.M.: Authorship attribution and statistical text analysis. Metodološki zvezki 4(2), 149–163 (2007)
6.
Zurück zum Zitat Zheng, R., Qin, Y., Huang, Z., Chen, H.: Authorship analysis in cybercrime investigation. In: Chen, H., Miranda, R., Zeng, D.D., Demchak, C., Schroeder, J., Madhusudan, T. (eds.) ISI 2003. LNCS, vol. 2665, pp. 59–73. Springer, Heidelberg (2003). doi:10.1007/3-540-44853-5_5 CrossRef Zheng, R., Qin, Y., Huang, Z., Chen, H.: Authorship analysis in cybercrime investigation. In: Chen, H., Miranda, R., Zeng, D.D., Demchak, C., Schroeder, J., Madhusudan, T. (eds.) ISI 2003. LNCS, vol. 2665, pp. 59–73. Springer, Heidelberg (2003). doi:10.​1007/​3-540-44853-5_​5 CrossRef
7.
Zurück zum Zitat Bennett, P.N., Dumais, S.T., Horvitz, E.: The combination of text classifiers using reliability indicators. Inf. Retrieval 8(1), 67–100 (2005)CrossRef Bennett, P.N., Dumais, S.T., Horvitz, E.: The combination of text classifiers using reliability indicators. Inf. Retrieval 8(1), 67–100 (2005)CrossRef
8.
Zurück zum Zitat Sanderson, C., Guenter, S.: On authorship attribution via Markov chains and sequence kernels. Presented at 18th International Conference on Pattern Recognition (ICPR 2006), Hong Kong, China, 20–24 August 2006 Sanderson, C., Guenter, S.: On authorship attribution via Markov chains and sequence kernels. Presented at 18th International Conference on Pattern Recognition (ICPR 2006), Hong Kong, China, 20–24 August 2006
9.
Zurück zum Zitat Stamatatos, E., Daelemans, W., et al.: Overview of the author identification task at PAN 2014. Presented at CLEF Conference, PAN part, Sheffield, UK, 15–18 September 2014 Stamatatos, E., Daelemans, W., et al.: Overview of the author identification task at PAN 2014. Presented at CLEF Conference, PAN part, Sheffield, UK, 15–18 September 2014
10.
Zurück zum Zitat Langtangen, H.P.: A Primer on Scientific Programming with Python. Texts in Computational Science and Engineering, vol. 6, 4th edn., XXXI, 872 p. Springer, Heidelberg (2014). ISBN: 978-3-642-54959-5 Langtangen, H.P.: A Primer on Scientific Programming with Python. Texts in Computational Science and Engineering, vol. 6, 4th edn., XXXI, 872 p. Springer, Heidelberg (2014). ISBN: 978-3-642-54959-5
11.
Zurück zum Zitat Johansson, J.R., Nation, P.D., Nori, F.: QuTiP: an open-source Python framework for the dynamics of open quantum systems. Comput. Phys. Commun. 183(8), 1760–1772 (2012)CrossRef Johansson, J.R., Nation, P.D., Nori, F.: QuTiP: an open-source Python framework for the dynamics of open quantum systems. Comput. Phys. Commun. 183(8), 1760–1772 (2012)CrossRef
12.
Zurück zum Zitat Smith, R.: Distinct word length frequencies: distributions and symbol entropies. Glottometrics 23, 7–22 (2012). ISBN: 978-3-942303-17-0 Smith, R.: Distinct word length frequencies: distributions and symbol entropies. Glottometrics 23, 7–22 (2012). ISBN: 978-3-942303-17-0
Metadaten
Titel
Markov Chain for Author Writing Style Profile Construction
verfasst von
Pavels Osipovs
Andrejs Rinkevičs
Galina Kuleshova
Arkady Borisov
Copyright-Jahr
2017
DOI
https://doi.org/10.1007/978-3-319-58088-3_14