2020 | OriginalPaper | Chapter

Multi-domain Sentiment Classification on Self-constructed Indonesian Dataset

Authors: Nankai Lin, Boyu Chen, Sihui Fu, Xiaotian Lin, Shengyi Jiang

Published in: Natural Language Processing and Chinese Computing

Publisher: Springer International Publishing

Abstract

Domain dependence limits how well a sentiment classifier trained on data from one domain can be applied to other domains. To address this problem, multi-domain sentiment classification has received considerable attention recently. It aims to construct domain-specific sentiment classifiers simultaneously from datasets of multiple domains. However, research on multi-domain sentiment classification mainly focuses on high-resource languages, and there is no research on Indonesian multi-domain sentiment classification. To fill this gap, we constructed an Indonesian multi-domain dataset comprising 489,000 reviews from four domains with three sentiment polarities (positive, neutral, and negative), and proposed an integrated model for Indonesian multi-domain sentiment classification. The model consists of a lemmatization layer, a domain-general module, a domain-specific module, and a domain classifier module. The model was evaluated on the Indonesian multi-domain dataset and compared with baseline methods commonly used in sentiment analysis of high-resource languages. The effectiveness of some essential components of the model was also verified. The model achieved an average weighted F1 of 87.24% over the four domains, outperforming the baseline methods and demonstrating its effectiveness.
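
The reported metric, an average weighted F1 over the domains, can be illustrated with a minimal sketch. It assumes that "weighted F1" means the per-class F1 scores weighted by class support, and that the per-domain scores are combined with an unweighted mean; the abstract does not spell out either detail. The function names, domain names, and toy labels below are hypothetical and are not taken from the paper or its dataset.

```python
from collections import Counter


def weighted_f1(y_true, y_pred):
    """Support-weighted mean of per-class F1 scores (assumed definition)."""
    support = Counter(y_true)  # number of gold examples per class
    total = len(y_true)
    score = 0.0
    for label, n in support.items():
        tp = sum(1 for t, p in zip(y_true, y_pred) if t == label and p == label)
        fp = sum(1 for t, p in zip(y_true, y_pred) if t != label and p == label)
        fn = n - tp
        denom = 2 * tp + fp + fn
        f1 = 2 * tp / denom if denom else 0.0
        score += (n / total) * f1  # weight each class by its share of examples
    return score


def average_weighted_f1(per_domain):
    """Unweighted mean of the weighted F1 obtained on each domain (assumption)."""
    scores = {d: weighted_f1(t, p) for d, (t, p) in per_domain.items()}
    return sum(scores.values()) / len(scores), scores


# Hypothetical toy labels for two made-up domains: (gold labels, predictions).
toy = {
    "domain_a": (["pos", "neg", "neu", "pos"], ["pos", "neg", "pos", "pos"]),
    "domain_b": (["neg", "neg", "pos", "neu"], ["neg", "neg", "pos", "neu"]),
}
avg, per_domain_scores = average_weighted_f1(toy)
print(per_domain_scores, avg)
```

With a support-weighted score, frequent classes dominate each domain's result, while the unweighted mean across domains gives every domain the same influence regardless of its size.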

Metadata
Title
Multi-domain Sentiment Classification on Self-constructed Indonesian Dataset
Authors
Nankai Lin
Boyu Chen
Sihui Fu
Xiaotian Lin
Shengyi Jiang
Copyright Year
2020
DOI
https://doi.org/10.1007/978-3-030-60450-9_62
