Skip to main content
Top

2019 | OriginalPaper | Chapter

Vector Space Model of Text Classification Based on Inertia Contribution of Document

Authors : Demba Kandé, Fodé Camara, Reine Marie Marone, Samba Ndiaye

Published in: Emerging Technologies for Developing Countries

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

The use of textual data has increased exponentially in recent years due to the networking infrastructure such as Facebook, Twitter, Wikipedia, Blogs, and so one. Analysis of this massive textual data can help to automatically categorize and label new content. Before classification process, term weighting scheme is the crucial step for representing the documents in a way suitable for classification algorithms. In this paper, we are conducting a survey on the term weighting schemes and we propose an efficient term weighting scheme that provide a better classification accuracy than those obtening with the famous TF-IDF, the recent IF-IGM and the others term weighting schemes in the literature.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
2.
go back to reference Lan, M., Tan, C.L., Jian, S., Yue, L.: Supervised and traditional term weighting methods for automatic text categorization. IEEE Trans. Pattern Anal. Mach. Intell. 31(4), 721–735 (2009)CrossRef Lan, M., Tan, C.L., Jian, S., Yue, L.: Supervised and traditional term weighting methods for automatic text categorization. IEEE Trans. Pattern Anal. Mach. Intell. 31(4), 721–735 (2009)CrossRef
4.
go back to reference Debole, F., Sebastiani, F.: Supervised term weighting for automated text categorization. In: Proceedings of the 2003 ACM Symposium on Applied Computing, pp. 784–788 (2003) Debole, F., Sebastiani, F.: Supervised term weighting for automated text categorization. In: Proceedings of the 2003 ACM Symposium on Applied Computing, pp. 784–788 (2003)
8.
go back to reference Yoshida, T.M.M.K.K.: Term weighting method based on information gain ratio for summarizing documents retrieved by IR systems. J. Nat. Lang. Process. 9(4), 3–32 (2001) Yoshida, T.M.M.K.K.: Term weighting method based on information gain ratio for summarizing documents retrieved by IR systems. J. Nat. Lang. Process. 9(4), 3–32 (2001)
9.
go back to reference Chen, K., Zhang, Z., Long, J., Zhang, H.: Turning from TF-IDF to TF-IGM for term weighting in text classification. J. Expert Syst. Appl. 66(C), 245–260 (2016)CrossRef Chen, K., Zhang, Z., Long, J., Zhang, H.: Turning from TF-IDF to TF-IGM for term weighting in text classification. J. Expert Syst. Appl. 66(C), 245–260 (2016)CrossRef
10.
go back to reference Cormack, G.V., Gómez Hidalgo, J.M., Puertas Sánz, E.: Spam filtering for short messages. In: Proceedings of the Sixteenth ACM Conference on Conference on information and Knowledge Management, Lisbon, Portugal, 06–10 November 2007, CIKM 2007, pp. 313–320. ACM, New York (2007). http://doi.acm.org/10.1145/1321440.1321486 Cormack, G.V., Gómez Hidalgo, J.M., Puertas Sánz, E.: Spam filtering for short messages. In: Proceedings of the Sixteenth ACM Conference on Conference on information and Knowledge Management, Lisbon, Portugal, 06–10 November 2007, CIKM 2007, pp. 313–320. ACM, New York (2007). http://​doi.​acm.​org/​10.​1145/​1321440.​1321486
Metadata
Title
Vector Space Model of Text Classification Based on Inertia Contribution of Document
Authors
Demba Kandé
Fodé Camara
Reine Marie Marone
Samba Ndiaye
Copyright Year
2019
DOI
https://doi.org/10.1007/978-3-030-05198-3_14

Premium Partner