Skip to main content

2018 | OriginalPaper | Buchkapitel

Identification of Sentiment Labels Based on Self-training

verfasst von : Zhaowei Qu, Chunye Wu, Xiaoru Wang, Yanjiao Zhao

Erschienen in: Data Mining and Big Data

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Traditional methods for sentiment classification based on supervised learning require a large amount of labeled data for training. However, It is hard to obtain enough labeled data because it can be too expensive compared with unlabeled data. In this paper, we propose an identification of sentiment labels based on self-training (ISLS) method that can make full use of the large number of labeled data. We extract sentiment expressions based on sentiment seeds by self-training, learn sentiment words on unlabeled data and annotate unlabeled data. The sentiment expressions include processing and extracting for the negative meaning of the text. The ISLS method avoids the subjective problems of manual annotation. Experiments validate the effectiveness of the proposed ISLS method.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Pang, B., Lee, L.: Opinion mining and sentiment analysis. Found. Trends Inf. Retrieval 2(1–2), 1–135 (2008)CrossRef Pang, B., Lee, L.: Opinion mining and sentiment analysis. Found. Trends Inf. Retrieval 2(1–2), 1–135 (2008)CrossRef
2.
Zurück zum Zitat Rosenberg, C., Hebert, M., Schneiderman, H.: Semi-supervised self-training of object detection models. In: IEEE Workshops on Application of Computer Vision, pp. 29–36. IEEE Computer Society (2005) Rosenberg, C., Hebert, M., Schneiderman, H.: Semi-supervised self-training of object detection models. In: IEEE Workshops on Application of Computer Vision, pp. 29–36. IEEE Computer Society (2005)
3.
Zurück zum Zitat Blum, A., Chawla, S.: Learning from labeled and unlabeled data using graph mincuts. In: Proceedings of the 18th International Conference on Machine Learning (2001) Blum, A., Chawla, S.: Learning from labeled and unlabeled data using graph mincuts. In: Proceedings of the 18th International Conference on Machine Learning (2001)
4.
Zurück zum Zitat Zhu, X., Ghahramani, Z.: Learning from labeled and unlabeled data with label propagation. School Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, Technical Report, CMU-CALD-02-107 (2002) Zhu, X., Ghahramani, Z.: Learning from labeled and unlabeled data with label propagation. School Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, Technical Report, CMU-CALD-02-107 (2002)
5.
Zurück zum Zitat Hassan, A., Abu-Jbara, A., Jha, R., Radev, D.: Identifying the semantic orientation of foreign words. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: shortpapers (ACL-2011) (2011) Hassan, A., Abu-Jbara, A., Jha, R., Radev, D.: Identifying the semantic orientation of foreign words. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: shortpapers (ACL-2011) (2011)
6.
Zurück zum Zitat Deng, Z.H., Yu, H., Yang, Y.: Identifying sentiment words using an optimization model with L 1, regularization. In: Thirtieth AAAI Conference on Artificial Intelligence, pp. 115–121. AAAI Press (2016) Deng, Z.H., Yu, H., Yang, Y.: Identifying sentiment words using an optimization model with L 1, regularization. In: Thirtieth AAAI Conference on Artificial Intelligence, pp. 115–121. AAAI Press (2016)
7.
Zurück zum Zitat Pang, B., Lee, L., Vaithyanathan, S.: Thumbs up?: sentiment classification using machine learning techniques. In: Proceedings of EMNLP, Philadelphia, PA, pp. 79–86 (2002) Pang, B., Lee, L., Vaithyanathan, S.: Thumbs up?: sentiment classification using machine learning techniques. In: Proceedings of EMNLP, Philadelphia, PA, pp. 79–86 (2002)
8.
Zurück zum Zitat Zhu, X., Ghahramani, Z., Mit, T.J.: Semi-supervised learning with graphs. In: International Joint Conference on Natural Language Processing, pp. 2465–2472 (2005) Zhu, X., Ghahramani, Z., Mit, T.J.: Semi-supervised learning with graphs. In: International Joint Conference on Natural Language Processing, pp. 2465–2472 (2005)
9.
Zurück zum Zitat Maas, A.L., Daly, R.E., Pham, P.T., et al.: Learning word vectors for sentiment analysis. In: Meeting of the Association for Computational Linguistics: Human Language Technologies, pp. 142–150. Association for Computational Linguistics (2011) Maas, A.L., Daly, R.E., Pham, P.T., et al.: Learning word vectors for sentiment analysis. In: Meeting of the Association for Computational Linguistics: Human Language Technologies, pp. 142–150. Association for Computational Linguistics (2011)
10.
Zurück zum Zitat Hatzivassiloglou, V., McKeown, K.: Predicting the semantic orientation of adjectives. In: Proceedings of the Eighth Conference on European Chapter of the Association for Computational Linguistics, pp. 174–181 (1997) Hatzivassiloglou, V., McKeown, K.: Predicting the semantic orientation of adjectives. In: Proceedings of the Eighth Conference on European Chapter of the Association for Computational Linguistics, pp. 174–181 (1997)
11.
Zurück zum Zitat Turney, P., Littman, M.: Measuring praise and criticism: inference of semantic orientation from association (2003)CrossRef Turney, P., Littman, M.: Measuring praise and criticism: inference of semantic orientation from association (2003)CrossRef
12.
Zurück zum Zitat Kaji, N., Kitsuregawa, M.: Building lexicon for sentiment analysis from massive collection of html documents. In: Proceedings of the Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, pp. 1075–1083 (2007) Kaji, N., Kitsuregawa, M.: Building lexicon for sentiment analysis from massive collection of html documents. In: Proceedings of the Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, pp. 1075–1083 (2007)
13.
Zurück zum Zitat Qiu, G., Liu, B., Bu, J., Chen, C.: Expanding domain sentiment lexicon through double propagation. In: Proceedings of the 21st International Jiont Conference on Artifical intelligence, pp. 1199–1204 (2009) Qiu, G., Liu, B., Bu, J., Chen, C.: Expanding domain sentiment lexicon through double propagation. In: Proceedings of the 21st International Jiont Conference on Artifical intelligence, pp. 1199–1204 (2009)
14.
Zurück zum Zitat Chen, L., Wang, W., Nagarajan, M., et al.: Extracting diverse sentiment expressions with target-dependent polarity from twitter. In: International AAAI Conference on Weblogs and Social Media (2012) Chen, L., Wang, W., Nagarajan, M., et al.: Extracting diverse sentiment expressions with target-dependent polarity from twitter. In: International AAAI Conference on Weblogs and Social Media (2012)
15.
Zurück zum Zitat Zhu, X., Kiritchenko, S., Mohammad, S.: NRC-Canada-2014: recent improvements in the sentiment analysis of tweets. In: International Workshop on Semantic Evaluation, pp. 443–447 (2014) Zhu, X., Kiritchenko, S., Mohammad, S.: NRC-Canada-2014: recent improvements in the sentiment analysis of tweets. In: International Workshop on Semantic Evaluation, pp. 443–447 (2014)
16.
Zurück zum Zitat Mikolov, T., Chen, K., Corrado, G., et al.: Efficient estimation of word representations in vector space. Comput. Sci. (2013) Mikolov, T., Chen, K., Corrado, G., et al.: Efficient estimation of word representations in vector space. Comput. Sci. (2013)
17.
Zurück zum Zitat Mikolov, T., Sutskever, I., Chen, K., et al.: Distributed representations of words and phrases and their compositionality. In: International Conference on Neural Information Processing Systems, pp. 3111–3119. Curran Associates Inc. (2013) Mikolov, T., Sutskever, I., Chen, K., et al.: Distributed representations of words and phrases and their compositionality. In: International Conference on Neural Information Processing Systems, pp. 3111–3119. Curran Associates Inc. (2013)
18.
Zurück zum Zitat Turney, P., Littman, M.: Measuring praise and CRIT-ICISM: inference of semantic orientation from association (2003) Turney, P., Littman, M.: Measuring praise and CRIT-ICISM: inference of semantic orientation from association (2003)
19.
Zurück zum Zitat Bo, P., Lee, L.: Seeing stars: exploiting class relationships for sentiment categorization with respect to rating scales. In: Meeting on Association for Computational Linguistics, pp. 115–124. Association for Computational Linguistics (2005) Bo, P., Lee, L.: Seeing stars: exploiting class relationships for sentiment categorization with respect to rating scales. In: Meeting on Association for Computational Linguistics, pp. 115–124. Association for Computational Linguistics (2005)
Metadaten
Titel
Identification of Sentiment Labels Based on Self-training
verfasst von
Zhaowei Qu
Chunye Wu
Xiaoru Wang
Yanjiao Zhao
Copyright-Jahr
2018
DOI
https://doi.org/10.1007/978-3-319-93803-5_38

Premium Partner