Skip to main content
Top
Published in: Automatic Documentation and Mathematical Linguistics 5/2020

01-09-2020 | TEXT PROCESSING AUTOMATION

A Methodology of Using a Concordancer and Table Processor for Authorship Attribution

Author: V. A. Yatsko

Published in: Automatic Documentation and Mathematical Linguistics | Issue 5/2020

Login to get access

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

The paper proposes an original methodology of authorship attribution based on the deviations from Zipf distribution and statistical data obtained with the help of a concordance program and computations performed in a table processor. The methodology involves finding distances between input texts and a reference text basing on deviations of stop-words frequencies. The results that have been achieved prove that the proposed methodology allows performing efficient authorship attribution and that it can be used in the educational process to develop student skills and competencies pertaining to natural language processing.
Footnotes
1
On the approval of the federal state educational standard of higher education in the direction of preparation 03.03.02 Linguistics (bachelor’s level): order of the Ministry of Education and Science of Russia dated 07.08.2014 N 940. - URL: http://​ fgosvo.​ru/​uploadfiles/​fgosvob/​450302_​Lingvistika.​pdf (date of the application: 25.06.2020).
 
Literature
1.
go back to reference Francis, W.N. and Kucera, H., Computational Analysis of Present Day American English, Providence, RI: Brown Univ. Press, 1967. Francis, W.N. and Kucera, H., Computational Analysis of Present Day American English, Providence, RI: Brown Univ. Press, 1967.
4.
go back to reference Yatsko, V.A., Automatic text classification method based on Zipf’s law, Autom. Doc. Math. Linguist., 2015, vol. 49, pp. 83–88.CrossRef Yatsko, V.A., Automatic text classification method based on Zipf’s law, Autom. Doc. Math. Linguist., 2015, vol. 49, pp. 83–88.CrossRef
13.
go back to reference Yatsko, V., Zonal text processing, Digital Scholarship Humanit., 2016, vol. 31, no. 4, pp. 773–781.CrossRef Yatsko, V., Zonal text processing, Digital Scholarship Humanit., 2016, vol. 31, no. 4, pp. 773–781.CrossRef
Metadata
Title
A Methodology of Using a Concordancer and Table Processor for Authorship Attribution
Author
V. A. Yatsko
Publication date
01-09-2020
Publisher
Pleiades Publishing
Published in
Automatic Documentation and Mathematical Linguistics / Issue 5/2020
Print ISSN: 0005-1055
Electronic ISSN: 1934-8371
DOI
https://doi.org/10.3103/S0005105520050088

Other articles of this Issue 5/2020

Automatic Documentation and Mathematical Linguistics 5/2020 Go to the issue

Premium Partner