Skip to main content
Erschienen in: Evolutionary Intelligence 5/2023

08.11.2022 | Special Issue

Evolutionary intelligence driven style recognition of English novels based on text analysis

verfasst von: Yue Hu

Erschienen in: Evolutionary Intelligence | Ausgabe 5/2023

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Novel style recognition is of great reference significance for analyzing the quality and readability of works. Writing stylistics is the analysis of an author's writing style through statistical methods. In this paper, our main purpose is to identify the style of English novels based on text analysis methods. We use the method of information entropy calculation which is an evolutionary intelligent method to identify the style of English novels, so as to provide reference for better grasping the content of literary works. We focus on how to process text data and mine text intrinsic features to better identify English novel style. To better solve the problems, we design following strategies in our proposed method. First, we need to collect and process English novel samples. Then tokenize the text and consider the object content that needs to be analyzed. The next step is to count the number of words and calculate the information entropy. Finally, the data is processed and analyzed to get a conclusion. The experimental results prove that the method proposed in this paper has good evaluation performance.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Holmes DI (1985) The analysis of literary style—a review. J Royal Stat Soc: Series A (General) 148(4):328–341CrossRef Holmes DI (1985) The analysis of literary style—a review. J Royal Stat Soc: Series A (General) 148(4):328–341CrossRef
3.
Zurück zum Zitat Swan A, Carr L (2008) Institutions, their repositories and the web. Ser Rev 34(1):31–35CrossRef Swan A, Carr L (2008) Institutions, their repositories and the web. Ser Rev 34(1):31–35CrossRef
4.
Zurück zum Zitat Chang CH, Kayed M, Girgis MR et al (2006) A survey of web information extraction systems. IEEE Trans Knowl Data Eng 18(10):1411–1428CrossRef Chang CH, Kayed M, Girgis MR et al (2006) A survey of web information extraction systems. IEEE Trans Knowl Data Eng 18(10):1411–1428CrossRef
5.
Zurück zum Zitat Khosmood F, Levinson R A (2008) Automatic natural language style classification and transformation, In: BCS-IRSG workshop on corpus profiling. 1–11. Khosmood F, Levinson R A (2008) Automatic natural language style classification and transformation, In: BCS-IRSG workshop on corpus profiling. 1–11.
6.
Zurück zum Zitat Vijayakumar T, Vinothkanna R (2020) Capsule network on font style classification. J Artif Intell 2(02):64–76 Vijayakumar T, Vinothkanna R (2020) Capsule network on font style classification. J Artif Intell 2(02):64–76
7.
Zurück zum Zitat Bharath V, Rani N S (2017) A font style classification system for English OCR. In: 2017 international conference on intelligent computing and control (I2C2). IEEE, 2017: 1–5. Bharath V, Rani N S (2017) A font style classification system for English OCR. In: 2017 international conference on intelligent computing and control (I2C2). IEEE, 2017: 1–5.
9.
Zurück zum Zitat Xu S (2018) Bayesian Naïve Bayes classifiers to text classification. J Inf Sci 44(1):48–59CrossRef Xu S (2018) Bayesian Naïve Bayes classifiers to text classification. J Inf Sci 44(1):48–59CrossRef
10.
Zurück zum Zitat Kolluri J, Razia S (2020) Text classification using Naïve Bayes classifier. Mater Today: Proc. Kolluri J, Razia S (2020) Text classification using Naïve Bayes classifier. Mater Today: Proc.
11.
Zurück zum Zitat Noble WS (2006) What is a support vector machine? Nat Biotechnol 24(12):1565–1567CrossRef Noble WS (2006) What is a support vector machine? Nat Biotechnol 24(12):1565–1567CrossRef
12.
Zurück zum Zitat Suthaharan S (2016) Support vector machine–machine learning models and algorithms for big data classification. Springer, Boston, pp 207–235MATH Suthaharan S (2016) Support vector machine–machine learning models and algorithms for big data classification. Springer, Boston, pp 207–235MATH
13.
Zurück zum Zitat Kramer O (2013) K-nearest neighbors//Dimensionality reduction with unsupervised nearest neighbors. Springer, Berlin, pp 13–23MATH Kramer O (2013) K-nearest neighbors//Dimensionality reduction with unsupervised nearest neighbors. Springer, Berlin, pp 13–23MATH
14.
Zurück zum Zitat Keller JM, Gray MR, Givens JA (1985) A fuzzy k-nearest neighbor algorithm. IEEE Trans Syst Man Cybern 4:580–585CrossRef Keller JM, Gray MR, Givens JA (1985) A fuzzy k-nearest neighbor algorithm. IEEE Trans Syst Man Cybern 4:580–585CrossRef
15.
Zurück zum Zitat Joachims T (1996) A probabilistic analysis of the rocchio algorithm with TFIDF for text categorization. Carnegie-mellon univ pittsburgh pa dept of computer science. Joachims T (1996) A probabilistic analysis of the rocchio algorithm with TFIDF for text categorization. Carnegie-mellon univ pittsburgh pa dept of computer science.
16.
Zurück zum Zitat Chowdhary K R (2020) Natural language processing. Fund Artif Intell, 603–649. Chowdhary K R (2020) Natural language processing. Fund Artif Intell, 603–649.
17.
Zurück zum Zitat Nadkarni PM, Ohno-Machado L, Chapman WW (2011) Natural language processing: an introduction. J Am Med Inform Assoc 18(5):544–551CrossRef Nadkarni PM, Ohno-Machado L, Chapman WW (2011) Natural language processing: an introduction. J Am Med Inform Assoc 18(5):544–551CrossRef
18.
Zurück zum Zitat Loper E, Bird S (2002) Nltk: the natural language toolkit. arXiv preprint cs/0205028. Loper E, Bird S (2002) Nltk: the natural language toolkit. arXiv preprint cs/0205028.
19.
Zurück zum Zitat van Rossum G (1995) Python reference manual. Department of Computer Science [CS], (R 9525). van Rossum G (1995) Python reference manual. Department of Computer Science [CS], (R 9525).
20.
Zurück zum Zitat Csiszár I, Shields PC (2004) Information theory and statistics: a tutorial. Found Trends® Commun Inf Theory 1(4):417–528CrossRefMATH Csiszár I, Shields PC (2004) Information theory and statistics: a tutorial. Found Trends® Commun Inf Theory 1(4):417–528CrossRefMATH
22.
Zurück zum Zitat Brillouin L (2013) Science and information theory. Courier Corporation Brillouin L (2013) Science and information theory. Courier Corporation
Metadaten
Titel
Evolutionary intelligence driven style recognition of English novels based on text analysis
verfasst von
Yue Hu
Publikationsdatum
08.11.2022
Verlag
Springer Berlin Heidelberg
Erschienen in
Evolutionary Intelligence / Ausgabe 5/2023
Print ISSN: 1864-5909
Elektronische ISSN: 1864-5917
DOI
https://doi.org/10.1007/s12065-022-00790-3

Weitere Artikel der Ausgabe 5/2023

Evolutionary Intelligence 5/2023 Zur Ausgabe

Premium Partner