Skip to main content
Top

2018 | OriginalPaper | Chapter

Text Classification Algorithm Based on SLAS-C

Authors : Zhichao Yin, Jun Xiang, Chunyong Yin, Jin Wang

Published in: Advances in Computer Science and Ubiquitous Computing

Publisher: Springer Singapore

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Nowadays, mobile marketing is becoming increasingly important both strategically and economically because of the mobile devices. Short text is becoming a popular text form which can be seen in many fields such as network news, QQ messages, comments in BBS and so forth. Besides, our mobile devices also contain a lot of data of short text. To extract useful information from the short text more efficiently, this paper proposes SLAS (semi-supervised learning method and SVM classifier) and CART (classification and regression tree) to improve the traditional methods, which can classify massive short texts to mining the useful information from the short texts. The experiment also shows a better result than before, which has a more than 10% increase, including precision rate, recall rate and F1 value, besides, the running time is reduced by half than the KNN algorithm.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Deng, N., Tian, Y.: New method in data mining—support vector machine (SVM), vol. 16, no. 2, pp. 113–126. Science Press (2004) Deng, N., Tian, Y.: New method in data mining—support vector machine (SVM), vol. 16, no. 2, pp. 113–126. Science Press (2004)
2.
go back to reference Breiman, L., Friedman, J.H., Olshen, R.A., et al.: Classification and regression tree. Wadsworth Int. Group 37(15), 237–251 (1984) Breiman, L., Friedman, J.H., Olshen, R.A., et al.: Classification and regression tree. Wadsworth Int. Group 37(15), 237–251 (1984)
3.
go back to reference Cover, T.M., Hart, P.E.: Nearest neighbor pattern classification. IEEE Trans. Inf. Theor. 13(1), 21–27 (1967)CrossRef Cover, T.M., Hart, P.E.: Nearest neighbor pattern classification. IEEE Trans. Inf. Theor. 13(1), 21–27 (1967)CrossRef
4.
go back to reference Yang, Y., Liu, X.: A re-examination of text categorization methods. In: Proceedings of the 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 42–49 (1999) Yang, Y., Liu, X.: A re-examination of text categorization methods. In: Proceedings of the 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 42–49 (1999)
5.
go back to reference Fan, Y., Liu, H.: Research on Chinese short text classification based on Wikipedia. Mod. Libr. Inf. Technol. 24(3), 47–52 (2012)MathSciNet Fan, Y., Liu, H.: Research on Chinese short text classification based on Wikipedia. Mod. Libr. Inf. Technol. 24(3), 47–52 (2012)MathSciNet
6.
go back to reference Banerjee, S., Ramanathan, K., Gupta, A.: Clustering short texts using Wikipedia. In: Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 787–788 (2007) Banerjee, S., Ramanathan, K., Gupta, A.: Clustering short texts using Wikipedia. In: Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 787–788 (2007)
7.
go back to reference Lin, X., Zhang, M., Bao, X.: Short text classification method based on concept network. Comput. Eng. 36(21), 4–10 (2010) Lin, X., Zhang, M., Bao, X.: Short text classification method based on concept network. Comput. Eng. 36(21), 4–10 (2010)
8.
go back to reference Li, X., Pang, J., et al.: Deep neural network for short-text sentiment classification. In: International Conference on Database Systems for Advanced Applications, pp. 168–175 (2016) Li, X., Pang, J., et al.: Deep neural network for short-text sentiment classification. In: International Conference on Database Systems for Advanced Applications, pp. 168–175 (2016)
9.
go back to reference Francisco, P., Julián-Iranzo, P., et al.: Classifying unlabeled short texts using a fuzzy declarative approach. Lang. Resour. Eval. 47(1), 151–178 (2013)CrossRef Francisco, P., Julián-Iranzo, P., et al.: Classifying unlabeled short texts using a fuzzy declarative approach. Lang. Resour. Eval. 47(1), 151–178 (2013)CrossRef
Metadata
Title
Text Classification Algorithm Based on SLAS-C
Authors
Zhichao Yin
Jun Xiang
Chunyong Yin
Jin Wang
Copyright Year
2018
Publisher
Springer Singapore
DOI
https://doi.org/10.1007/978-981-10-7605-3_63