Skip to main content

2021 | OriginalPaper | Buchkapitel

Categorizing Documents by Support Vector Machine Trained Using Self-Organizing Maps Clustering Approach

verfasst von : Vishal Patil, Yogesh Jadhav, Ajay Sirsat

Erschienen in: Techno-Societal 2020

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

This paper mainly emphasis on the use of machine learning algorithms such as self-organizing maps (SOM) and support vector machines (SVM) for classifying text documents. We have to classify documents effectively and accurately to different classes based on their content. We tested classification of self-organizing map on Reuters R-8 data set and compared the results to three other popular machine learning algorithms: k-means clustering, k nearest neighbor searching, and Naive Bayes classifier. Self-organizing map yielded the highest accuracies as an unsupervised method. Furthermore, the accuracy of self-organizing maps was improved when used together with support vector machines.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Saarikoski J, Laurikkala J, Jrvelin K, Juhola M (2011) Self-organising maps in document classification: a comparison with six machine learning methods. Adaptive and Natural Computing Algorithms, pp 260–269 Saarikoski J, Laurikkala J, Jrvelin K, Juhola M (2011) Self-organising maps in document classification: a comparison with six machine learning methods. Adaptive and Natural Computing Algorithms, pp 260–269
2.
Zurück zum Zitat Haykin S (1999) In neural networks: a comprehensive foundation, Second Edition. Prentice Hall, Upper Saddle River Haykin S (1999) In neural networks: a comprehensive foundation, Second Edition. Prentice Hall, Upper Saddle River
3.
Zurück zum Zitat Mary Amala Bai V, Manimegalai D (2010) An analysis of document clustering algorithms. In: International conference on communication control and computing technologies, Ramanathapuram, pp 402–406 Mary Amala Bai V, Manimegalai D (2010) An analysis of document clustering algorithms. In: International conference on communication control and computing technologies, Ramanathapuram, pp 402–406
4.
Zurück zum Zitat Ko Y, Seo J (2000) Automatic text categorization by unsupervised learning. In: Proceedings of the 18th international conference on computational linguistics, (COLING2000), pp 453459 Ko Y, Seo J (2000) Automatic text categorization by unsupervised learning. In: Proceedings of the 18th international conference on computational linguistics, (COLING2000), pp 453459
5.
Zurück zum Zitat Indu M, Kavitha KV (2009) Review on text summarization evaluation methods. In: International conference on research advances in integrated navigation systems (RAINS), Bangalore, 2016, pp 1–4 Indu M, Kavitha KV (2009) Review on text summarization evaluation methods. In: International conference on research advances in integrated navigation systems (RAINS), Bangalore, 2016, pp 1–4
6.
Zurück zum Zitat Rui W, Liu J, Jia Y (2016) Unsupervised feature selection for text classification via word embedding. In: ICBDA Rui W, Liu J, Jia Y (2016) Unsupervised feature selection for text classification via word embedding. In: ICBDA
7.
Zurück zum Zitat Shafiabady N, Lee LH, Rajkumar R, Kallimani VP, Akram NA, Isa D (2016) Using unsupervised clustering approach to train the support vector machine for text classification. Neurocomputing 211:4–10 Shafiabady N, Lee LH, Rajkumar R, Kallimani VP, Akram NA, Isa D (2016) Using unsupervised clustering approach to train the support vector machine for text classification. Neurocomputing 211:4–10
8.
Zurück zum Zitat Brucher H, Knowlmayer G, Mittermayer MA (2002) Document classification methods for organizing explicit knowledge. In: Proceedings of the 3rd European conference on organizational knowledge, learning and capabilities, (ECOKLC02), Institute of Information Systems, University of Bern, Engehaldenstrasse Bern, Switzerland, Athens, Greece, pp 124–126 Brucher H, Knowlmayer G, Mittermayer MA (2002) Document classification methods for organizing explicit knowledge. In: Proceedings of the 3rd European conference on organizational knowledge, learning and capabilities, (ECOKLC02), Institute of Information Systems, University of Bern, Engehaldenstrasse Bern, Switzerland, Athens, Greece, pp 124–126
9.
Zurück zum Zitat Lobo VJAS (2009) Application of self-organizing maps to the maritime environment. In: Popovich VV, Claramunt C, Schrenk M, Korolenko KV (eds) Information fusion and geographic information systems. Lecture Notes in Geoinformation and Cartography. Springer, Berlin, Heidelberg Lobo VJAS (2009) Application of self-organizing maps to the maritime environment. In: Popovich VV, Claramunt C, Schrenk M, Korolenko KV (eds) Information fusion and geographic information systems. Lecture Notes in Geoinformation and Cartography. Springer, Berlin, Heidelberg
10.
Zurück zum Zitat Sigogne A, Constant M (2009) Real-time unsupervised classification of web documents. In: IMCSIT, pp 281–286. IEEE Sigogne A, Constant M (2009) Real-time unsupervised classification of web documents. In: IMCSIT, pp 281–286. IEEE
11.
Zurück zum Zitat Li TS, Huang CL (2009) Defect spatial pattern recognition using a hybrid SOMSVM approach in semiconductor manufacturing. Expert Syst Appl 36(1):374–385 Li TS, Huang CL (2009) Defect spatial pattern recognition using a hybrid SOMSVM approach in semiconductor manufacturing. Expert Syst Appl 36(1):374–385
12.
Zurück zum Zitat Wu W, Liu X, Xu M, Peng J, Setiono R (2004) A hybrid SOM-SVM method for analyzing zebra fish gene expression. In: Proceedings of the 17th international conference on pattern recognition (ICPR04) vol 2, pp 323–326 Wu W, Liu X, Xu M, Peng J, Setiono R (2004) A hybrid SOM-SVM method for analyzing zebra fish gene expression. In: Proceedings of the 17th international conference on pattern recognition (ICPR04) vol 2, pp 323–326
Metadaten
Titel
Categorizing Documents by Support Vector Machine Trained Using Self-Organizing Maps Clustering Approach
verfasst von
Vishal Patil
Yogesh Jadhav
Ajay Sirsat
Copyright-Jahr
2021
DOI
https://doi.org/10.1007/978-3-030-69921-5_2

    Marktübersichten

    Die im Laufe eines Jahres in der „adhäsion“ veröffentlichten Marktübersichten helfen Anwendern verschiedenster Branchen, sich einen gezielten Überblick über Lieferantenangebote zu verschaffen.