Skip to main content
Top

2015 | OriginalPaper | Chapter

Incorporating Sample Filtering into Subject-Based Ensemble Model for Cross-Domain Sentiment Classification

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Recently, cross-domain sentiment classification is becoming popular owing to its potential applications, such as marketing et al. It seeks to generalize a model, which is trained on a source domain and using it to label samples in the target domain. However, the source and target distributions differ substantially in many cases. To address this issue, we propose a comprehensive model, which takes sample filtering and labeling adaptation into account simultaneously, named joint Sample Filtering with Subject-based Ensemble Model (SF-SE). Firstly, a sentence level Latent Dirichlet Allocation (LDA) model, which incorporates topic and sentiment together (SS-LDA) is introduced. Under this model, a high-quality training dataset is constructed in an unsupervised way. Secondly, inspired by the distribution variance of domain-independent and domain-specific features related to the subject of a sentence, we introduce a Subject-based Ensemble model to efficiently improve the classification performance. Experimental results show that the proposed model is effective for cross-domain sentiment classification.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
go back to reference Pang, B., Lee, L., Vaithyanathan, S.: Thumbs up?: Sentiment classification using machine learning techniques. In: Proceedings of the ACL-2002 Conference on Empirical Methods in Natural Language Processing, pp. 79–86 (2002) Pang, B., Lee, L., Vaithyanathan, S.: Thumbs up?: Sentiment classification using machine learning techniques. In: Proceedings of the ACL-2002 Conference on Empirical Methods in Natural Language Processing, pp. 79–86 (2002)
go back to reference Pang, B., Lee, L.: Opinion mining and sentiment analysis. J. Found. Trends Inf. Retrieval 2, 1–135 (2008)CrossRef Pang, B., Lee, L.: Opinion mining and sentiment analysis. J. Found. Trends Inf. Retrieval 2, 1–135 (2008)CrossRef
go back to reference Liu, Y., Huang, X., An, A., Yu, X.: ARSA: a sentiment-aware model for predicting sales performance using blogs. In: Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 607–614 (2007) Liu, Y., Huang, X., An, A., Yu, X.: ARSA: a sentiment-aware model for predicting sales performance using blogs. In: Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 607–614 (2007)
go back to reference Yu, X., Liu, Y., Huang, X., An, A.: Mining online reviews for predicting sales performance: a case study in the movie domain. IEEE Trans. J. Knowl. Data Eng. 24, 720–734 (2012)CrossRef Yu, X., Liu, Y., Huang, X., An, A.: Mining online reviews for predicting sales performance: a case study in the movie domain. IEEE Trans. J. Knowl. Data Eng. 24, 720–734 (2012)CrossRef
go back to reference Pan, S.J., Yang, Q.: A survey on transfer learning. IEEE Trans. J. Knowl. Data Eng. 22, 1345–1359 (2010)CrossRef Pan, S.J., Yang, Q.: A survey on transfer learning. IEEE Trans. J. Knowl. Data Eng. 22, 1345–1359 (2010)CrossRef
go back to reference Blitzer, J., McDonald, R., Pereira, F.: Domain adaptation with structural correspondence learning. In: Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, pp. 120–128 (2006) Blitzer, J., McDonald, R., Pereira, F.: Domain adaptation with structural correspondence learning. In: Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, pp. 120–128 (2006)
go back to reference Blitzer J., Dredze M., Pereira, F.: Biographies, bollywood, boom-boxes and blenders: domain adaptation for sentiment classification. In: Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics, vol. 7, pp. 440–447 (2007) Blitzer J., Dredze M., Pereira, F.: Biographies, bollywood, boom-boxes and blenders: domain adaptation for sentiment classification. In: Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics, vol. 7, pp. 440–447 (2007)
go back to reference Pan, S. J., Ni, X., Sun, J., Yang, Q., Chen, Z.: Cross-domain sentiment classification via spectral feature alignment. In: Proceedings of the 19th International Conference on World Wide Web, pp. 751–760 (2010) Pan, S. J., Ni, X., Sun, J., Yang, Q., Chen, Z.: Cross-domain sentiment classification via spectral feature alignment. In: Proceedings of the 19th International Conference on World Wide Web, pp. 751–760 (2010)
go back to reference He, Y., Lin, C., Alani, H.: Automatically extracting polarity-bearing topics for cross-domain sentiment classification. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, vol. 1, pp. 123–131 (2011) He, Y., Lin, C., Alani, H.: Automatically extracting polarity-bearing topics for cross-domain sentiment classification. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, vol. 1, pp. 123–131 (2011)
go back to reference Duan, L., Xu, D., Tsang, I.: Learning with augmented features for heterogeneous domain adaptation. J. arXiv preprint (2012). arXiv:1206.4660 Duan, L., Xu, D., Tsang, I.: Learning with augmented features for heterogeneous domain adaptation. J. arXiv preprint (2012). arXiv:​1206.​4660
go back to reference Jiang, J., Zhai, C.: Instance weighting for domain adaptation in NLP. In: Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics, vol. 7, pp. 264–271 (2007) Jiang, J., Zhai, C.: Instance weighting for domain adaptation in NLP. In: Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics, vol. 7, pp. 264–271 (2007)
go back to reference Xia, R., Zong, C.: A POS-based ensemble model for cross-domain sentiment classification. In: Proceedings of the 5th International Joint Conference on Natural Language Processing, pp. 614–622. Citeseer (2011) Xia, R., Zong, C.: A POS-based ensemble model for cross-domain sentiment classification. In: Proceedings of the 5th International Joint Conference on Natural Language Processing, pp. 614–622. Citeseer (2011)
go back to reference Samdani, R., Yih, W.: Domain adaptation with ensemble of feature groups. In: Proceedings-International Joint Conference on Artificial Intelligence, vol. 22, p. 1458 (2011) Samdani, R., Yih, W.: Domain adaptation with ensemble of feature groups. In: Proceedings-International Joint Conference on Artificial Intelligence, vol. 22, p. 1458 (2011)
go back to reference Gao, J., Fan, W., Jiang, J., Han, J.: Knowledge transfer via multiple model local structure mapping. In: Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 283–291 (2008) Gao, J., Fan, W., Jiang, J., Han, J.: Knowledge transfer via multiple model local structure mapping. In: Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 283–291 (2008)
go back to reference Yoshida, Y., Hirao, T., Iwata, T., Nagata, M., Matsumoto, Y.: Twenty-Fifth AAAI Conference on Artificial Intelligence (2011) Yoshida, Y., Hirao, T., Iwata, T., Nagata, M., Matsumoto, Y.: Twenty-Fifth AAAI Conference on Artificial Intelligence (2011)
go back to reference Lin, C., He, Y.: Joint sentiment/topic model for sentiment analysis. In: Proceedings of the 18th ACM Conference on Information and Knowledge Management, pp. 375–384 (2009) Lin, C., He, Y.: Joint sentiment/topic model for sentiment analysis. In: Proceedings of the 18th ACM Conference on Information and Knowledge Management, pp. 375–384 (2009)
go back to reference Xia, R., Zong, C., Hu, X., Cambria, E.: Feature ensemble plus sample selection: domain adaptation for sentiment classification. J. Intell. Syst. 28, 10–18 (2013)CrossRef Xia, R., Zong, C., Hu, X., Cambria, E.: Feature ensemble plus sample selection: domain adaptation for sentiment classification. J. Intell. Syst. 28, 10–18 (2013)CrossRef
go back to reference Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)MATH Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)MATH
go back to reference Lu, B., Ott, M., Cardie, C., Tsou, B.K.: Multi-aspect sentiment analysis with topic models. In: IEEE 11th International Conference on Data Mining Workshops (ICDMW), pp. 81–88 (2011) Lu, B., Ott, M., Cardie, C., Tsou, B.K.: Multi-aspect sentiment analysis with topic models. In: IEEE 11th International Conference on Data Mining Workshops (ICDMW), pp. 81–88 (2011)
go back to reference Fumera, G., Roli, F.: A theoretical and experimental analysis of linear combiners for multiple classifier systems. IEEE Trans. J. Pattern Analy. Mach. Intell. 27, 942–956 (2005)CrossRef Fumera, G., Roli, F.: A theoretical and experimental analysis of linear combiners for multiple classifier systems. IEEE Trans. J. Pattern Analy. Mach. Intell. 27, 942–956 (2005)CrossRef
go back to reference Juang, B.H., Katagiri, S.: Discriminative learning for minimum error classification. IEEE Trans. J. Signal Process. 40, 3043–3054 (1992)CrossRef Juang, B.H., Katagiri, S.: Discriminative learning for minimum error classification. IEEE Trans. J. Signal Process. 40, 3043–3054 (1992)CrossRef
go back to reference Pang, B., Lee, L.: A sentimental education: sentiment analysis using subjectivity summarization based on minimum cuts. In: Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics, p. 271 (2004) Pang, B., Lee, L.: A sentimental education: sentiment analysis using subjectivity summarization based on minimum cuts. In: Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics, p. 271 (2004)
Metadata
Title
Incorporating Sample Filtering into Subject-Based Ensemble Model for Cross-Domain Sentiment Classification
Authors
Liang Yang
Shaowu Zhang
Hongfei Lin
Xianhui Wei
Copyright Year
2015
DOI
https://doi.org/10.1007/978-3-319-25816-4_10

Premium Partner