Skip to main content
Top

2018 | OriginalPaper | Chapter

Identification of Sentiment Labels Based on Self-training

Authors : Zhaowei Qu, Chunye Wu, Xiaoru Wang, Yanjiao Zhao

Published in: Data Mining and Big Data

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Traditional methods for sentiment classification based on supervised learning require a large amount of labeled data for training. However, It is hard to obtain enough labeled data because it can be too expensive compared with unlabeled data. In this paper, we propose an identification of sentiment labels based on self-training (ISLS) method that can make full use of the large number of labeled data. We extract sentiment expressions based on sentiment seeds by self-training, learn sentiment words on unlabeled data and annotate unlabeled data. The sentiment expressions include processing and extracting for the negative meaning of the text. The ISLS method avoids the subjective problems of manual annotation. Experiments validate the effectiveness of the proposed ISLS method.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Pang, B., Lee, L.: Opinion mining and sentiment analysis. Found. Trends Inf. Retrieval 2(1–2), 1–135 (2008)CrossRef Pang, B., Lee, L.: Opinion mining and sentiment analysis. Found. Trends Inf. Retrieval 2(1–2), 1–135 (2008)CrossRef
2.
go back to reference Rosenberg, C., Hebert, M., Schneiderman, H.: Semi-supervised self-training of object detection models. In: IEEE Workshops on Application of Computer Vision, pp. 29–36. IEEE Computer Society (2005) Rosenberg, C., Hebert, M., Schneiderman, H.: Semi-supervised self-training of object detection models. In: IEEE Workshops on Application of Computer Vision, pp. 29–36. IEEE Computer Society (2005)
3.
go back to reference Blum, A., Chawla, S.: Learning from labeled and unlabeled data using graph mincuts. In: Proceedings of the 18th International Conference on Machine Learning (2001) Blum, A., Chawla, S.: Learning from labeled and unlabeled data using graph mincuts. In: Proceedings of the 18th International Conference on Machine Learning (2001)
4.
go back to reference Zhu, X., Ghahramani, Z.: Learning from labeled and unlabeled data with label propagation. School Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, Technical Report, CMU-CALD-02-107 (2002) Zhu, X., Ghahramani, Z.: Learning from labeled and unlabeled data with label propagation. School Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, Technical Report, CMU-CALD-02-107 (2002)
5.
go back to reference Hassan, A., Abu-Jbara, A., Jha, R., Radev, D.: Identifying the semantic orientation of foreign words. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: shortpapers (ACL-2011) (2011) Hassan, A., Abu-Jbara, A., Jha, R., Radev, D.: Identifying the semantic orientation of foreign words. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: shortpapers (ACL-2011) (2011)
6.
go back to reference Deng, Z.H., Yu, H., Yang, Y.: Identifying sentiment words using an optimization model with L 1, regularization. In: Thirtieth AAAI Conference on Artificial Intelligence, pp. 115–121. AAAI Press (2016) Deng, Z.H., Yu, H., Yang, Y.: Identifying sentiment words using an optimization model with L 1, regularization. In: Thirtieth AAAI Conference on Artificial Intelligence, pp. 115–121. AAAI Press (2016)
7.
go back to reference Pang, B., Lee, L., Vaithyanathan, S.: Thumbs up?: sentiment classification using machine learning techniques. In: Proceedings of EMNLP, Philadelphia, PA, pp. 79–86 (2002) Pang, B., Lee, L., Vaithyanathan, S.: Thumbs up?: sentiment classification using machine learning techniques. In: Proceedings of EMNLP, Philadelphia, PA, pp. 79–86 (2002)
8.
go back to reference Zhu, X., Ghahramani, Z., Mit, T.J.: Semi-supervised learning with graphs. In: International Joint Conference on Natural Language Processing, pp. 2465–2472 (2005) Zhu, X., Ghahramani, Z., Mit, T.J.: Semi-supervised learning with graphs. In: International Joint Conference on Natural Language Processing, pp. 2465–2472 (2005)
9.
go back to reference Maas, A.L., Daly, R.E., Pham, P.T., et al.: Learning word vectors for sentiment analysis. In: Meeting of the Association for Computational Linguistics: Human Language Technologies, pp. 142–150. Association for Computational Linguistics (2011) Maas, A.L., Daly, R.E., Pham, P.T., et al.: Learning word vectors for sentiment analysis. In: Meeting of the Association for Computational Linguistics: Human Language Technologies, pp. 142–150. Association for Computational Linguistics (2011)
10.
go back to reference Hatzivassiloglou, V., McKeown, K.: Predicting the semantic orientation of adjectives. In: Proceedings of the Eighth Conference on European Chapter of the Association for Computational Linguistics, pp. 174–181 (1997) Hatzivassiloglou, V., McKeown, K.: Predicting the semantic orientation of adjectives. In: Proceedings of the Eighth Conference on European Chapter of the Association for Computational Linguistics, pp. 174–181 (1997)
11.
go back to reference Turney, P., Littman, M.: Measuring praise and criticism: inference of semantic orientation from association (2003)CrossRef Turney, P., Littman, M.: Measuring praise and criticism: inference of semantic orientation from association (2003)CrossRef
12.
go back to reference Kaji, N., Kitsuregawa, M.: Building lexicon for sentiment analysis from massive collection of html documents. In: Proceedings of the Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, pp. 1075–1083 (2007) Kaji, N., Kitsuregawa, M.: Building lexicon for sentiment analysis from massive collection of html documents. In: Proceedings of the Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, pp. 1075–1083 (2007)
13.
go back to reference Qiu, G., Liu, B., Bu, J., Chen, C.: Expanding domain sentiment lexicon through double propagation. In: Proceedings of the 21st International Jiont Conference on Artifical intelligence, pp. 1199–1204 (2009) Qiu, G., Liu, B., Bu, J., Chen, C.: Expanding domain sentiment lexicon through double propagation. In: Proceedings of the 21st International Jiont Conference on Artifical intelligence, pp. 1199–1204 (2009)
14.
go back to reference Chen, L., Wang, W., Nagarajan, M., et al.: Extracting diverse sentiment expressions with target-dependent polarity from twitter. In: International AAAI Conference on Weblogs and Social Media (2012) Chen, L., Wang, W., Nagarajan, M., et al.: Extracting diverse sentiment expressions with target-dependent polarity from twitter. In: International AAAI Conference on Weblogs and Social Media (2012)
15.
go back to reference Zhu, X., Kiritchenko, S., Mohammad, S.: NRC-Canada-2014: recent improvements in the sentiment analysis of tweets. In: International Workshop on Semantic Evaluation, pp. 443–447 (2014) Zhu, X., Kiritchenko, S., Mohammad, S.: NRC-Canada-2014: recent improvements in the sentiment analysis of tweets. In: International Workshop on Semantic Evaluation, pp. 443–447 (2014)
16.
go back to reference Mikolov, T., Chen, K., Corrado, G., et al.: Efficient estimation of word representations in vector space. Comput. Sci. (2013) Mikolov, T., Chen, K., Corrado, G., et al.: Efficient estimation of word representations in vector space. Comput. Sci. (2013)
17.
go back to reference Mikolov, T., Sutskever, I., Chen, K., et al.: Distributed representations of words and phrases and their compositionality. In: International Conference on Neural Information Processing Systems, pp. 3111–3119. Curran Associates Inc. (2013) Mikolov, T., Sutskever, I., Chen, K., et al.: Distributed representations of words and phrases and their compositionality. In: International Conference on Neural Information Processing Systems, pp. 3111–3119. Curran Associates Inc. (2013)
18.
go back to reference Turney, P., Littman, M.: Measuring praise and CRIT-ICISM: inference of semantic orientation from association (2003) Turney, P., Littman, M.: Measuring praise and CRIT-ICISM: inference of semantic orientation from association (2003)
19.
go back to reference Bo, P., Lee, L.: Seeing stars: exploiting class relationships for sentiment categorization with respect to rating scales. In: Meeting on Association for Computational Linguistics, pp. 115–124. Association for Computational Linguistics (2005) Bo, P., Lee, L.: Seeing stars: exploiting class relationships for sentiment categorization with respect to rating scales. In: Meeting on Association for Computational Linguistics, pp. 115–124. Association for Computational Linguistics (2005)
Metadata
Title
Identification of Sentiment Labels Based on Self-training
Authors
Zhaowei Qu
Chunye Wu
Xiaoru Wang
Yanjiao Zhao
Copyright Year
2018
DOI
https://doi.org/10.1007/978-3-319-93803-5_38

Premium Partner