Published in: Social Network Analysis and Mining 1/2019

1 December 2019 | Original Article

Deep learning approaches for Arabic sentiment analysis

Authors: Ammar Mohammed, Rania Kora

Abstract

Social media are considered an excellent source of information and can provide opinions, thoughts and insights on various important topics. Sentiment analysis has become a hot research topic because of its importance in making decisions based on opinions derived from analyzing users' content on social media. Although Arabic is one of the widely spoken languages used for content sharing across social media, sentiment analysis of Arabic content is limited by several challenges, including the morphological structure of the language, the variety of dialects and the lack of appropriate corpora. Hence, research in Arabic sentiment analysis has grown slowly in contrast to other languages such as English. The contribution of this paper is twofold: First, we introduce a corpus of forty thousand labeled Arabic tweets spanning several topics. Second, we present three deep learning models, namely CNN, LSTM and RCNN, for Arabic sentiment analysis. With the help of word embedding, we validate the performance of the three models on the proposed corpus. The experimental results indicate that LSTM, with an average accuracy of 81.31%, outperforms CNN and RCNN. Also, applying data augmentation to the corpus increases LSTM accuracy by 8.3%.

Metadata
Title
Deep learning approaches for Arabic sentiment analysis
Authors
Ammar Mohammed
Rania Kora
Publication date
1 December 2019
Publisher
Springer Vienna
Published in
Social Network Analysis and Mining / Issue 1/2019
Print ISSN: 1869-5450
Electronic ISSN: 1869-5469
DOI
https://doi.org/10.1007/s13278-019-0596-4