Skip to main content

2020 | OriginalPaper | Buchkapitel

Multi-domain Sentiment Classification on Self-constructed Indonesian Dataset

verfasst von : Nankai Lin, Boyu Chen, Sihui Fu, Xiaotian Lin, Shengyi Jiang

Erschienen in: Natural Language Processing and Chinese Computing

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Domain-dependence limits the application of a well-trained sentiment classifier based on one domain data in other different domains. To solve this problem, multi-domain sentiment classification has received great attention recently. It aims to construct a domain-specific sentiment classifier at once from datasets of multi-domains. However, research on multi-domain sentiment classification mainly focuses on high-resource languages, and there is no research on Indonesian multi-domain sentiment classification. To fill the gap, we constructed an Indonesian multi-domain dataset, including 489,000 reviews from four domains with three sentiment polarities (positive, neutral, and negative), and proposed an integrated model for Indonesian multi-domain sentiment classification. This model is consisted of lemmatization layer, domain-general module, domain-specific module, and domain classifier module. Based on the Indonesian multi-domain dataset, the model was evaluated and compared with baseline methods commonly used in the sentiment analysis of high-resource languages. The effectiveness of some essential components in the model was also verified. The model achieved an average weighted F1 over four domains with 87.24%, outperforming the baseline methods and demonstrating its effectiveness.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Naradhipa, A.R., Purwarianti, A.: Sentiment classification for Indonesian message in social media. In: Proceedings - International Conference on Cloud Computing and Social Networking 2012: Cloud Computing and Social Networking for Smart and Productive Society (ICCCSN 2012) (2012). https://doi.org/10.1109/ICCCSN.2012.6215730 Naradhipa, A.R., Purwarianti, A.: Sentiment classification for Indonesian message in social media. In: Proceedings - International Conference on Cloud Computing and Social Networking 2012: Cloud Computing and Social Networking for Smart and Productive Society (ICCCSN 2012) (2012). https://​doi.​org/​10.​1109/​ICCCSN.​2012.​6215730
2.
Zurück zum Zitat Wicaksono, A.F., Vania, C., Distiawan, B.T., Adriani, M.: Automatically building a corpus for sentiment analysis on Indonesian tweets. In: Proceedings of the 28th Pacific Asia Conference on Language, Information and Computation (PACLIC 2014) (2014) Wicaksono, A.F., Vania, C., Distiawan, B.T., Adriani, M.: Automatically building a corpus for sentiment analysis on Indonesian tweets. In: Proceedings of the 28th Pacific Asia Conference on Language, Information and Computation (PACLIC 2014) (2014)
4.
Zurück zum Zitat Ilmania, A., Cahyawijaya, S., Purwarianti, A., Abdurahman: Aspect detection and sentiment classification using deep neural network for Indonesian aspect-based sentiment analysis. In: Presented at the (2018) Ilmania, A., Cahyawijaya, S., Purwarianti, A., Abdurahman: Aspect detection and sentiment classification using deep neural network for Indonesian aspect-based sentiment analysis. In: Presented at the (2018)
5.
Zurück zum Zitat Li, S., Zong, C.: Multi-domain sentiment classification. In: Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies: Short Papers, pp. 257–260. Association for Computational Linguistics, Stroudsburg, PA, USA (2008) Li, S., Zong, C.: Multi-domain sentiment classification. In: Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies: Short Papers, pp. 257–260. Association for Computational Linguistics, Stroudsburg, PA, USA (2008)
7.
Zurück zum Zitat Yang, F., Mukherjee, A., Zhang, Y.: Leveraging multiple domains for sentiment classification. In: COLING 2016 - 26th International Conference on Computational Linguistics, Proceedings of COLING 2016: Technical Papers (2016) Yang, F., Mukherjee, A., Zhang, Y.: Leveraging multiple domains for sentiment classification. In: COLING 2016 - 26th International Conference on Computational Linguistics, Proceedings of COLING 2016: Technical Papers (2016)
8.
Zurück zum Zitat Liu, P., Qiu, X., Xuanjing, H.: Recurrent neural network for text classification with multi-task learning. In: IJCAI International Joint Conference on Artificial Intelligence (2016) Liu, P., Qiu, X., Xuanjing, H.: Recurrent neural network for text classification with multi-task learning. In: IJCAI International Joint Conference on Artificial Intelligence (2016)
11.
Zurück zum Zitat Vania, C., Ibrahim, M., Adriani, M.: Sentiment Lexicon generation for an under-resourced language. Int. J. Comput. Linguistics Appl 5, 59–72 (2014) Vania, C., Ibrahim, M., Adriani, M.: Sentiment Lexicon generation for an under-resourced language. Int. J. Comput. Linguistics Appl 5, 59–72 (2014)
17.
Zurück zum Zitat Vaswani, A., et al.: Attention is all you need. In: Advances in Neural Information Processing Systems (2017) Vaswani, A., et al.: Attention is all you need. In: Advances in Neural Information Processing Systems (2017)
18.
Zurück zum Zitat Bahdanau, D., Cho, K.H., Bengio, Y.: Neural machine translation by jointly learning to align and translate. In: 3rd International Conference on Learning Representations (ICLR 2015) - Conference Track Proceedings (2015) Bahdanau, D., Cho, K.H., Bengio, Y.: Neural machine translation by jointly learning to align and translate. In: 3rd International Conference on Learning Representations (ICLR 2015) - Conference Track Proceedings (2015)
19.
Zurück zum Zitat Weston, J., Chopra, S., Bordes, A.: Memory networks. In: 3rd International Conference on Learning Representations (ICLR 2015) - Conference Track Proceedings (2015) Weston, J., Chopra, S., Bordes, A.: Memory networks. In: 3rd International Conference on Learning Representations (ICLR 2015) - Conference Track Proceedings (2015)
20.
Zurück zum Zitat Sukhbaatar, S., Szlam, A., Weston, J., Fergus, R.: End-to-end memory networks. In: Advances in Neural Information Processing Systems (2015) Sukhbaatar, S., Szlam, A., Weston, J., Fergus, R.: End-to-end memory networks. In: Advances in Neural Information Processing Systems (2015)
21.
Zurück zum Zitat Kumar, A., et al.: Ask me anything: dynamic memory networks for natural language processing. In: 33rd International Conference on Machine Learning (ICML 2016) (2016) Kumar, A., et al.: Ask me anything: dynamic memory networks for natural language processing. In: 33rd International Conference on Machine Learning (ICML 2016) (2016)
22.
Zurück zum Zitat Blitzer, J., Dredze, M., Pereira, F.: Biographies, bollywood, boom-boxes and blenders: domain adaptation for sentiment classification. In: ACL 2007 - Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics (2007) Blitzer, J., Dredze, M., Pereira, F.: Biographies, bollywood, boom-boxes and blenders: domain adaptation for sentiment classification. In: ACL 2007 - Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics (2007)
Metadaten
Titel
Multi-domain Sentiment Classification on Self-constructed Indonesian Dataset
verfasst von
Nankai Lin
Boyu Chen
Sihui Fu
Xiaotian Lin
Shengyi Jiang
Copyright-Jahr
2020
DOI
https://doi.org/10.1007/978-3-030-60450-9_62

Premium Partner