Skip to main content

2019 | OriginalPaper | Buchkapitel

Comparison of Support Vector Machines With and Without Latent Semantic Analysis for Document Classification

verfasst von : Vaibhav Khatavkar, Parag Kulkarni

Erschienen in: Data Management, Analytics and Innovation

Verlag: Springer Singapore

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Document Classification is a key technique in Information Retrieval. Various techniques have been developed for document classification. Every technique aims for higher accuracy and greater speed. Its performance depends on various parameters like algorithms, size, and type of dataset used. Support Vector Machine (SVM) is a prominent technique used for classifying large datasets. This paper attempts to study the effect of Latent Semantic Analysis (LSA) on SVM. LSA is used for dimensionality reduction. The performance of SVM is studied on reduced dataset generated by LSA.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
2.
Zurück zum Zitat Xu, J., & Croft, W. B. (2017). Query expansion using local and global document analysis. In ACM SIGIR Forum (Vol. 51, No. 2), July 2017. Xu, J., & Croft, W. B. (2017). Query expansion using local and global document analysis. In ACM SIGIR Forum (Vol. 51, No. 2), July 2017.
3.
Zurück zum Zitat Khatavkar, V., & Kulkarni, P. (2016, December). Context vector machine for information retrieval. International Conference on Communication and Signal Processing. Khatavkar, V., & Kulkarni, P. (2016, December). Context vector machine for information retrieval. International Conference on Communication and Signal Processing.
4.
Zurück zum Zitat Khatavkar, V., & Kulkarni, P. (2017). Document context identification using latent semantic analysis. Presented in 3rd International Conference on Computing, Communication, Control and Automation, August 17–18, 2017, Pune, MS, India. (To be published on IEEE). Khatavkar, V., & Kulkarni, P. (2017). Document context identification using latent semantic analysis. Presented in 3rd International Conference on Computing, Communication, Control and Automation, August 17–18, 2017, Pune, MS, India. (To be published on IEEE).
5.
Zurück zum Zitat Sheikh, I., Illina, I., Fohr, D., & Linar, G. (2016). Document level semantic context for retrieving OOV proper names. ICASSP. Sheikh, I., Illina, I., Fohr, D., & Linar, G. (2016). Document level semantic context for retrieving OOV proper names. ICASSP.
6.
Zurück zum Zitat Wissner-Gross, A. (2016). Datasets over algorithms, Edge.com. Retrieved January 8, 2016. Wissner-Gross, A. (2016). Datasets over algorithms, Edge.com. Retrieved January 8, 2016.
12.
Zurück zum Zitat Onal Suzek, T. (2017). Using latent semantic analysis for automated keyword extraction from large document corpora. Turkish Journal of Electrical Engineering & Computer Sciences. Onal Suzek, T. (2017). Using latent semantic analysis for automated keyword extraction from large document corpora. Turkish Journal of Electrical Engineering & Computer Sciences.
13.
Zurück zum Zitat Rajkumar, K., & Karthik, K. (2017). Contextual plagiarism detection using latent semantic analysis. International Research Journal of Advanced Engineering and Science, 2(1), 214–217. Rajkumar, K., & Karthik, K. (2017). Contextual plagiarism detection using latent semantic analysis. International Research Journal of Advanced Engineering and Science, 2(1), 214–217.
14.
Zurück zum Zitat Marcolin, C. B., & Luiz Becker, J. (2016). Exploring latent semantic analysis in a big data (base). In Twenty-second Americas Conference on Information Systems, San Diego. Marcolin, C. B., & Luiz Becker, J. (2016). Exploring latent semantic analysis in a big data (base). In Twenty-second Americas Conference on Information Systems, San Diego.
15.
Zurück zum Zitat Hofmann, T. (2017). Probabilistic latent semantic indexing. In ACM SIGIR Forum (Vol. 51, No. 2), July 2017. Hofmann, T. (2017). Probabilistic latent semantic indexing. In ACM SIGIR Forum (Vol. 51, No. 2), July 2017.
17.
Zurück zum Zitat Dumais, S. T. (2005). Latent semantic analysis. Annual Review of Information Science and Technology. Dumais, S. T. (2005). Latent semantic analysis. Annual Review of Information Science and Technology.
20.
Zurück zum Zitat Fatima, S., & Srinivasu, B. Dr. (2017, February). Text document categorization using support vector machine. International Research Journal of Engineering and Technology (IRJET). Fatima, S., & Srinivasu, B. Dr. (2017, February). Text document categorization using support vector machine. International Research Journal of Engineering and Technology (IRJET).
21.
Zurück zum Zitat Cortes, C., & Vapnik, V. (2003). Support-vector networks. Machine Learning, 20(3), 273–297.MATH Cortes, C., & Vapnik, V. (2003). Support-vector networks. Machine Learning, 20(3), 273–297.MATH
22.
Zurück zum Zitat Joachims, T. (1991). Text categorization with support vector machines: Learning with many relevant features. University at Dortmund Informatik LS8, Baroper Str. 30144221 Dortmund, Germany. Joachims, T. (1991). Text categorization with support vector machines: Learning with many relevant features. University at Dortmund Informatik LS8, Baroper Str. 30144221 Dortmund, Germany.
23.
Zurück zum Zitat Abdiansah, A., & Wardoyo, R. (2015). Time complexity analysis of support vector machines (SVM) in LibSVM. International Journal of Computer Applications. Abdiansah, A., & Wardoyo, R. (2015). Time complexity analysis of support vector machines (SVM) in LibSVM. International Journal of Computer Applications.
Metadaten
Titel
Comparison of Support Vector Machines With and Without Latent Semantic Analysis for Document Classification
verfasst von
Vaibhav Khatavkar
Parag Kulkarni
Copyright-Jahr
2019
Verlag
Springer Singapore
DOI
https://doi.org/10.1007/978-981-13-1402-5_20

Neuer Inhalt