Skip to main content

2018 | OriginalPaper | Buchkapitel

Text Classification Algorithm Based on SLAS-C

verfasst von : Zhichao Yin, Jun Xiang, Chunyong Yin, Jin Wang

Erschienen in: Advances in Computer Science and Ubiquitous Computing

Verlag: Springer Singapore

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Nowadays, mobile marketing is becoming increasingly important both strategically and economically because of the mobile devices. Short text is becoming a popular text form which can be seen in many fields such as network news, QQ messages, comments in BBS and so forth. Besides, our mobile devices also contain a lot of data of short text. To extract useful information from the short text more efficiently, this paper proposes SLAS (semi-supervised learning method and SVM classifier) and CART (classification and regression tree) to improve the traditional methods, which can classify massive short texts to mining the useful information from the short texts. The experiment also shows a better result than before, which has a more than 10% increase, including precision rate, recall rate and F1 value, besides, the running time is reduced by half than the KNN algorithm.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Deng, N., Tian, Y.: New method in data mining—support vector machine (SVM), vol. 16, no. 2, pp. 113–126. Science Press (2004) Deng, N., Tian, Y.: New method in data mining—support vector machine (SVM), vol. 16, no. 2, pp. 113–126. Science Press (2004)
2.
Zurück zum Zitat Breiman, L., Friedman, J.H., Olshen, R.A., et al.: Classification and regression tree. Wadsworth Int. Group 37(15), 237–251 (1984) Breiman, L., Friedman, J.H., Olshen, R.A., et al.: Classification and regression tree. Wadsworth Int. Group 37(15), 237–251 (1984)
3.
Zurück zum Zitat Cover, T.M., Hart, P.E.: Nearest neighbor pattern classification. IEEE Trans. Inf. Theor. 13(1), 21–27 (1967)CrossRef Cover, T.M., Hart, P.E.: Nearest neighbor pattern classification. IEEE Trans. Inf. Theor. 13(1), 21–27 (1967)CrossRef
4.
Zurück zum Zitat Yang, Y., Liu, X.: A re-examination of text categorization methods. In: Proceedings of the 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 42–49 (1999) Yang, Y., Liu, X.: A re-examination of text categorization methods. In: Proceedings of the 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 42–49 (1999)
5.
Zurück zum Zitat Fan, Y., Liu, H.: Research on Chinese short text classification based on Wikipedia. Mod. Libr. Inf. Technol. 24(3), 47–52 (2012)MathSciNet Fan, Y., Liu, H.: Research on Chinese short text classification based on Wikipedia. Mod. Libr. Inf. Technol. 24(3), 47–52 (2012)MathSciNet
6.
Zurück zum Zitat Banerjee, S., Ramanathan, K., Gupta, A.: Clustering short texts using Wikipedia. In: Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 787–788 (2007) Banerjee, S., Ramanathan, K., Gupta, A.: Clustering short texts using Wikipedia. In: Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 787–788 (2007)
7.
Zurück zum Zitat Lin, X., Zhang, M., Bao, X.: Short text classification method based on concept network. Comput. Eng. 36(21), 4–10 (2010) Lin, X., Zhang, M., Bao, X.: Short text classification method based on concept network. Comput. Eng. 36(21), 4–10 (2010)
8.
Zurück zum Zitat Li, X., Pang, J., et al.: Deep neural network for short-text sentiment classification. In: International Conference on Database Systems for Advanced Applications, pp. 168–175 (2016) Li, X., Pang, J., et al.: Deep neural network for short-text sentiment classification. In: International Conference on Database Systems for Advanced Applications, pp. 168–175 (2016)
9.
Zurück zum Zitat Francisco, P., Julián-Iranzo, P., et al.: Classifying unlabeled short texts using a fuzzy declarative approach. Lang. Resour. Eval. 47(1), 151–178 (2013)CrossRef Francisco, P., Julián-Iranzo, P., et al.: Classifying unlabeled short texts using a fuzzy declarative approach. Lang. Resour. Eval. 47(1), 151–178 (2013)CrossRef
Metadaten
Titel
Text Classification Algorithm Based on SLAS-C
verfasst von
Zhichao Yin
Jun Xiang
Chunyong Yin
Jin Wang
Copyright-Jahr
2018
Verlag
Springer Singapore
DOI
https://doi.org/10.1007/978-981-10-7605-3_63

Neuer Inhalt