2020 | OriginalPaper | Book chapter

Authorship Attribution by Functional Discriminant Analysis

Written by: Chahrazed Kettaf, Abderrahmane Yousfate

Published in: Mathematical Aspects of Computer and Information Sciences

Publisher: Springer International Publishing

Abstract

Recognizing the author of a given text is a very difficult task that relies on several complicated and correlated criteria. Several classification methods are used for this purpose (neural networks, discriminant analysis, SVM, ...). However, a text representation that preserves as much stylistic information as possible is very important and has a considerable influence on the result. In this paper, we tackle the problem of authorship attribution for very long texts using discriminant analysis extended to the functional case, after representing the texts as elements of a separable Hilbert space.
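
The abstract does not spell out the procedure, so the following is only a minimal sketch of one common way to carry out discriminant analysis on functional data, not the authors' method. It assumes each text has already been turned into a curve on [0, 1] (here, synthetic curves for two hypothetical authors), projects every curve onto a truncated orthonormal cosine basis of L2[0, 1], and applies linear discriminant analysis with equal priors to the resulting coefficients. The basis choice, the number of basis functions, the noise level, and all function names are illustrative assumptions.

```python
import numpy as np


def cosine_basis(n_basis, grid):
    """Orthonormal cosine basis of L2[0, 1], evaluated on `grid` (assumed choice)."""
    ks = np.arange(n_basis)[:, None]
    basis = np.sqrt(2.0) * np.cos(ks * np.pi * grid[None, :])
    basis[0] = 1.0  # the constant function has unit norm without the sqrt(2) factor
    return basis  # shape (n_basis, n_points)


def trapezoid_weights(grid):
    """Quadrature weights of the trapezoidal rule on a (possibly uneven) grid."""
    weights = np.zeros_like(grid)
    dx = np.diff(grid)
    weights[:-1] += dx / 2
    weights[1:] += dx / 2
    return weights


def basis_coefficients(curves, grid, n_basis):
    """Approximate the inner products <x, e_k> for every curve x."""
    basis = cosine_basis(n_basis, grid)
    return (curves * trapezoid_weights(grid)) @ basis.T  # (n_curves, n_basis)


def fit_lda(coeffs, labels, ridge=1e-6):
    """Class means and inverse pooled within-class covariance of the coefficients."""
    classes = np.unique(labels)
    means = np.array([coeffs[labels == c].mean(axis=0) for c in classes])
    centered = coeffs - means[np.searchsorted(classes, labels)]
    pooled = centered.T @ centered / (len(coeffs) - len(classes))
    pooled += ridge * np.eye(coeffs.shape[1])  # small ridge for numerical stability
    return classes, means, np.linalg.inv(pooled)


def predict_lda(model, coeffs):
    """Assign each curve to the class with the nearest mean (Mahalanobis distance)."""
    classes, means, precision = model
    diff = coeffs[:, None, :] - means[None, :, :]
    dist = np.einsum("nkd,de,nke->nk", diff, precision, diff)
    return classes[np.argmin(dist, axis=1)]


def simulate(rng, grid, n_per_author):
    """Synthetic 'style curves' for two hypothetical authors (illustration only)."""
    base = np.sin(2 * np.pi * grid)
    shift = 0.5 * np.cos(np.pi * grid)
    curves, labels = [], []
    for author in (0, 1):
        mean = base + author * shift
        noise = 0.3 * rng.standard_normal((n_per_author, grid.size))
        curves.append(mean + noise)
        labels.append(np.full(n_per_author, author))
    return np.vstack(curves), np.concatenate(labels)


rng = np.random.default_rng(0)
grid = np.linspace(0.0, 1.0, 101)
n_basis = 5  # truncation level, an assumed tuning parameter

train_curves, train_labels = simulate(rng, grid, 40)
test_curves, test_labels = simulate(rng, grid, 20)

model = fit_lda(basis_coefficients(train_curves, grid, n_basis), train_labels)
predictions = predict_lda(model, basis_coefficients(test_curves, grid, n_basis))
accuracy = float(np.mean(predictions == test_labels))
print(f"held-out accuracy: {accuracy:.2f}")
```

Truncating the basis expansion is what makes the pooled covariance invertible here: in the full infinite-dimensional space the covariance operator has no bounded inverse, so some form of dimension reduction or regularization is needed before the discriminant rule can be computed.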

Metadata
Title
Authorship Attribution by Functional Discriminant Analysis
Written by
Chahrazed Kettaf
Abderrahmane Yousfate
Copyright year
2020
DOI
https://doi.org/10.1007/978-3-030-43120-4_34