Skip to main content
Erschienen in: Cognitive Computation 4/2017

03.06.2017

Application of Rough Set-Based Feature Selection for Arabic Sentiment Analysis

verfasst von: Qasem A. Al-Radaideh, Ghufran Y. Al-Qudah

Erschienen in: Cognitive Computation | Ausgabe 4/2017

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Sentiment analysis is considered as one of the recent applications of text categorization that categories the emotions expressed in text as negative, positive, and natural. Rough set theory is a mathematical tool used to analyze uncertainty, incomplete information, and data reduction. Indiscernibility, reduct, and core are essential concepts in rough set theory that can be employed for data classification and knowledge reduction. This paper proposes to use the rough set-based methods for sentiment analysis to classify tweets that are written in the Arabic language. The paper investigates the application of the reduct concept of rough set theory as a feature selection method for sentiment analysis. This paper investigates four reduct computation techniques to generate the set of reducts. For classification purposes, two rule generation algorithms have been studied to build the rough set rule-based classifier. An Arabic data set of 4800 tweets is used in the experiments to validate the use of reduct computation for Arabic sentiment analysis. The results of the experiments showed that using rough set reducts techniques lead to different results and some of them can perform better than non-rough set classifier. The best classification accuracy rate was for rough set classifier using the full attribute weighting reduct generation algorithm which achieved an accuracy of 74%. The primary results indicate that using the rough set theory framework for sentiment analysis is an appealing option where it can enhance the overall accuracy and reduce the number of used terms for classification which in turn will lead to a faster classification process, especially with a large dataset.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Kumari U, Soni D, Sharma A. A cognitive study of sentiment analysis techniques and tools: a survey. International Journal of Computer Science And Technology. 2017;8(1):58–62. Kumari U, Soni D, Sharma A. A cognitive study of sentiment analysis techniques and tools: a survey. International Journal of Computer Science And Technology. 2017;8(1):58–62.
2.
Zurück zum Zitat Vohra M, Teraiya J. A Comparative Study Of Sentiment Analysis Techniques. J Inf Knowl Res Comput Eng. 2013;2:313–7. Vohra M, Teraiya J. A Comparative Study Of Sentiment Analysis Techniques. J Inf Knowl Res Comput Eng. 2013;2:313–7.
3.
Zurück zum Zitat Wang J, Dong A. A comparison of two text representations for sentiment analysis. In Computer Application and System Modeling (ICCASM), 2010 International Conference on IEEE, 2010;(11): 11–35. Wang J, Dong A. A comparison of two text representations for sentiment analysis. In Computer Application and System Modeling (ICCASM), 2010 International Conference on IEEE, 2010;(11): 11–35.
4.
Zurück zum Zitat Varela P L, Martins A F, Aguiar P M, Figueiredo, M A. An Empirical Study of Feature Selection for Sentiment Analysis. Figueiredo Conference on Telecommunications, Castelo Branco, Portugal. 2013. Varela P L, Martins A F, Aguiar P M, Figueiredo, M A. An Empirical Study of Feature Selection for Sentiment Analysis. Figueiredo Conference on Telecommunications, Castelo Branco, Portugal. 2013.
5.
Zurück zum Zitat Jianping F, Zhenzhong K, Baopeng Z, Jun Y, Dan L. iPrivacy: image privacy protection by identifying sensitive objects via deep multi-task learning. IEEE Transactions on Information Forensics and Security. 2016;12(5):1005–16. Jianping F, Zhenzhong K, Baopeng Z, Jun Y, Dan L. iPrivacy: image privacy protection by identifying sensitive objects via deep multi-task learning. IEEE Transactions on Information Forensics and Security. 2016;12(5):1005–16.
6.
Zurück zum Zitat Jun Y., Xiaokang Y., Fei G., Dacheng T.. Deep multimodal distance metric learning using click constraints for image ranking, In: IEEE Transactions on Cybernetics, vol. PP, no.99; 2016. pp.1–11. Jun Y., Xiaokang Y., Fei G., Dacheng T.. Deep multimodal distance metric learning using click constraints for image ranking, In: IEEE Transactions on Cybernetics, vol. PP, no.99; 2016. pp.1–11.
7.
Zurück zum Zitat Jun Y, Yong R, Yuan Y, Tang Dacheng T. High-order distance-based multiview stochastic learning in image classification. IEEE Transactions on Cybernetics. 2014;44(12):2431–42.CrossRef Jun Y, Yong R, Yuan Y, Tang Dacheng T. High-order distance-based multiview stochastic learning in image classification. IEEE Transactions on Cybernetics. 2014;44(12):2431–42.CrossRef
8.
Zurück zum Zitat Chaoqun H, Jun Y, Jian W, Dacheng T, Meng W. Multimodal deep autoencoder for human pose recovery. IEEE Trans Image Process. 2016;24(12):5659–70. Chaoqun H, Jun Y, Jian W, Dacheng T, Meng W. Multimodal deep autoencoder for human pose recovery. IEEE Trans Image Process. 2016;24(12):5659–70.
9.
Zurück zum Zitat Duwairi R, El-Orfali M. A study of the effects of preprocessing strategies on sentiment analysis for Arabic text. J Inf Sci. 2014;40(4):501–13.CrossRef Duwairi R, El-Orfali M. A study of the effects of preprocessing strategies on sentiment analysis for Arabic text. J Inf Sci. 2014;40(4):501–13.CrossRef
10.
Zurück zum Zitat Rahmath, H, Ahmad, T. Sentiment Analysis Techniques - A Comparative Study. IJCEM International Journal of Computational Engineering & Management. 2014;4(17):25–29. Rahmath, H, Ahmad, T. Sentiment Analysis Techniques - A Comparative Study. IJCEM International Journal of Computational Engineering & Management. 2014;4(17):25–29.
11.
Zurück zum Zitat Pawlak Z. Rough sets. Int J of Information and Computer Sciences. 1982;11(5):341–56.CrossRef Pawlak Z. Rough sets. Int J of Information and Computer Sciences. 1982;11(5):341–56.CrossRef
12.
Zurück zum Zitat Pawlak Z. Rough classification. International Journal of Man-Machine Studies. 1984;20(5):469–83.CrossRef Pawlak Z. Rough classification. International Journal of Man-Machine Studies. 1984;20(5):469–83.CrossRef
13.
Zurück zum Zitat Chouchoulas A, Shen Q. A rough set-based approach to text classification. Lectures Notes in Artificial Intelligence. 1999;1711:118–27. Chouchoulas A, Shen Q. A rough set-based approach to text classification. Lectures Notes in Artificial Intelligence. 1999;1711:118–27.
14.
Zurück zum Zitat Abdul-Mageed, M., Kübler, S., and Diab, M.. SAMAR: a system for subjectivity and sentiment analysis of arabic social media. In: Proceedings of the 3rd Workshop on Computational Approaches to Subjectivity and Sentiment Analysis; 2012. pp. 19–28. Abdul-Mageed, M., Kübler, S., and Diab, M.. SAMAR: a system for subjectivity and sentiment analysis of arabic social media. In: Proceedings of the 3rd Workshop on Computational Approaches to Subjectivity and Sentiment Analysis; 2012. pp. 19–28.
15.
Zurück zum Zitat Al-Kabi, M., Abdulla, N., and Al-Ayyoub, M.. An analytical study of arabic sentiments: Maktoob Case Study. In: Proceedings of 8th IEEE International Conference on Internet Technology and Secured Transactions (ICITST); 2013a. pp. 89–94. Al-Kabi, M., Abdulla, N., and Al-Ayyoub, M.. An analytical study of arabic sentiments: Maktoob Case Study. In: Proceedings of 8th IEEE International Conference on Internet Technology and Secured Transactions (ICITST); 2013a. pp. 89–94.
16.
Zurück zum Zitat Al-Kabi, M., Al-Qudah, N., Alsmadi, I., Dabour, M., and Wahsheh, H.. Arabic/English sentiment analysis: an empirical study. In: Proceedings of the 4th International Conference on Information and Communication Systems (ICICS); 2013b. Al-Kabi, M., Al-Qudah, N., Alsmadi, I., Dabour, M., and Wahsheh, H.. Arabic/English sentiment analysis: an empirical study. In: Proceedings of the 4th International Conference on Information and Communication Systems (ICICS); 2013b.
17.
Zurück zum Zitat Abdulla, N. A., Ahmed, N. A., Shehab, M. A., and Al-Ayyoub, M.. Arabic sentiment analysis: lexicon-based and corpus-based. In: Proceedings of IEEE Jordan Conference on Applied Electrical Engineering and Computing Technologies (AEECT); 2013. pp. 1–6. Abdulla, N. A., Ahmed, N. A., Shehab, M. A., and Al-Ayyoub, M.. Arabic sentiment analysis: lexicon-based and corpus-based. In: Proceedings of IEEE Jordan Conference on Applied Electrical Engineering and Computing Technologies (AEECT); 2013. pp. 1–6.
19.
Zurück zum Zitat Shoukry A, Rafea A. Sentence-level Arabic sentiment analysis. In: Proceedings of International Conference on Collaboration Technologies and Systems (CTS). Denver; 2012 pp. 546–550. Shoukry A, Rafea A. Sentence-level Arabic sentiment analysis. In: Proceedings of International Conference on Collaboration Technologies and Systems (CTS). Denver; 2012 pp. 546–550.
20.
Zurück zum Zitat Al-Subaihin AS, Al-Khalifa HS. A system for sentiment analysis of colloquial Arabic using human computation. The Scientific World Journal pp. 2014:1–8. Al-Subaihin AS, Al-Khalifa HS. A system for sentiment analysis of colloquial Arabic using human computation. The Scientific World Journal pp. 2014:1–8.
21.
Zurück zum Zitat Pandarachalil R, Sendhilkumar S, Mahalakshmi G. Twitter sentiment analysis for large-scale data: an unsupervised approach. Cogn Comput. 2015;7(2):254–62.CrossRef Pandarachalil R, Sendhilkumar S, Mahalakshmi G. Twitter sentiment analysis for large-scale data: an unsupervised approach. Cogn Comput. 2015;7(2):254–62.CrossRef
22.
Zurück zum Zitat Recupero D, Presutti V, Consoli S, Gangemi A, Nuzzolese A. Sentilo: frame-based sentiment analysis. Cogn Comput. 2015;7(2):211–25.CrossRef Recupero D, Presutti V, Consoli S, Gangemi A, Nuzzolese A. Sentilo: frame-based sentiment analysis. Cogn Comput. 2015;7(2):211–25.CrossRef
23.
Zurück zum Zitat Bayoudhi A., Hadrich L, and Ghorbel B.. Sentiment classification of Arabic documents: experiments with multi-type features and ensemble algorithms. In: Proceedings of the 29th Pacific Asia Conference on Language, Information and Computation, Shanghai, China; 2015 pp. 196–205. Bayoudhi A., Hadrich L, and Ghorbel B.. Sentiment classification of Arabic documents: experiments with multi-type features and ensemble algorithms. In: Proceedings of the 29th Pacific Asia Conference on Language, Information and Computation, Shanghai, China; 2015 pp. 196–205.
24.
Zurück zum Zitat Bharti S, Vachha B, Pradhan R, Babu K, Jena S. Sarcastic sentiment detection in tweets streamed in real time: a big data approach. Digital Communications and Networks. 2016;2:108–21.CrossRef Bharti S, Vachha B, Pradhan R, Babu K, Jena S. Sarcastic sentiment detection in tweets streamed in real time: a big data approach. Digital Communications and Networks. 2016;2:108–21.CrossRef
25.
Zurück zum Zitat Al-Kabi M, Al-Ayyoub M, Alsmadi I, Wahsheh H. A prototype for a standard Arabic sentiment analysis corpus. The International Arab Journal of Information Technology. 2016;13(1A):163–70. Al-Kabi M, Al-Ayyoub M, Alsmadi I, Wahsheh H. A prototype for a standard Arabic sentiment analysis corpus. The International Arab Journal of Information Technology. 2016;13(1A):163–70.
26.
Zurück zum Zitat Dashtipour K, Poria S, Hussain A, Cambria E, Hawalah A, Gelbukh A, Zhou Q. Multilingual sentiment analysis: state of the art and independent comparison of techniques. Cogn Comput. 2016;8:757–71.CrossRef Dashtipour K, Poria S, Hussain A, Cambria E, Hawalah A, Gelbukh A, Zhou Q. Multilingual sentiment analysis: state of the art and independent comparison of techniques. Cogn Comput. 2016;8:757–71.CrossRef
27.
Zurück zum Zitat Al-Radaideh, Q., Sulaiman, M., Selamat, M., and Ibrahim, H.. An empirical comparison of reduct generation approaches in the context of rough set based classification. In: Proceedings of International Conference on Information Technology and Natural Sciences (ICITNS). 2003. Al-Radaideh, Q., Sulaiman, M., Selamat, M., and Ibrahim, H.. An empirical comparison of reduct generation approaches in the context of rough set based classification. In: Proceedings of International Conference on Information Technology and Natural Sciences (ICITNS). 2003.
28.
Zurück zum Zitat Qablan, T., Al-Radaideh, Q., and Shuqeir, S.. A reduct computation approach based on ant colony optimization. ABHATH AL-YARMOUK; 2012. pp. 29–40. Qablan, T., Al-Radaideh, Q., and Shuqeir, S.. A reduct computation approach based on ant colony optimization. ABHATH AL-YARMOUK; 2012. pp. 29–40.
29.
Zurück zum Zitat Al-Radaideh, Q. A., Sulaiman, M. N., Selamat, M. H., and Ibrahim, H.. Approximate reduct computation by rough sets based attribute weighting. In: Proceedings of the IEEE International Conference on Granular Computing; 2005a (2): 383–386. Al-Radaideh, Q. A., Sulaiman, M. N., Selamat, M. H., and Ibrahim, H.. Approximate reduct computation by rough sets based attribute weighting. In: Proceedings of the IEEE International Conference on Granular Computing; 2005a (2): 383–386.
30.
Zurück zum Zitat Al-Radaideh Q, Sulaiman M, Selamat M, Ibrahim H. Heuristic reduct computation approach by attributes weighting for rough set based classification. J Comput Sci. 2005b:41–7. Al-Radaideh Q, Sulaiman M, Selamat M, Ibrahim H. Heuristic reduct computation approach by attributes weighting for rough set based classification. J Comput Sci. 2005b:41–7.
31.
Zurück zum Zitat Arafat H, Elawady R, Barakat S, Elrashidy N. Different feature selection for sentiment classification. International Journal of Information Science and Intelligent Systems. 2014;3(1):137–50. Arafat H, Elawady R, Barakat S, Elrashidy N. Different feature selection for sentiment classification. International Journal of Information Science and Intelligent Systems. 2014;3(1):137–50.
32.
Zurück zum Zitat Al-Abrat, M., and Al-Radaideh, Q.. A rough set based approach for arabic text categorization. Non-published Master Thesis, Department Of Computer Information Systems, Yarmouk University, Irbid, Jordan; 2013. Al-Abrat, M., and Al-Radaideh, Q.. A rough set based approach for arabic text categorization. Non-published Master Thesis, Department Of Computer Information Systems, Yarmouk University, Irbid, Jordan; 2013.
33.
Zurück zum Zitat Yahia, M.. Arabic text categorization based on rough set classification. In Proceedings of 2011 9th IEEE/ACS International Conference on Computer Systems and Applications (AICCSA); 2011. pp. 293–294. Yahia, M.. Arabic text categorization based on rough set classification. In Proceedings of 2011 9th IEEE/ACS International Conference on Computer Systems and Applications (AICCSA); 2011. pp. 293–294.
34.
Zurück zum Zitat Al-Radaideh, Q., and Twaiq, L.. Rough set theory approaches for Arabic sentiment classification. In: Proceedings of International Conference on Future of Things and Cloud, IEEE Computer Society; 2014. Al-Radaideh, Q., and Twaiq, L.. Rough set theory approaches for Arabic sentiment classification. In: Proceedings of International Conference on Future of Things and Cloud, IEEE Computer Society; 2014.
36.
Zurück zum Zitat Bazan J, Szczuka M. RSES and RSESlib—a collection of tools for rough set computations. In Proc. of RSCTC’ 2000. LNAI. 2005;2005:106–13. Bazan J, Szczuka M. RSES and RSESlib—a collection of tools for rough set computations. In Proc. of RSCTC’ 2000. LNAI. 2005;2005:106–13.
37.
Zurück zum Zitat Wroblewski J.. Finding minimal reducts using genetic algorithms. In: Proceedings of the 2nd Annual Join Conference on Information Sciences; 1995. pp.186–189. Wroblewski J.. Finding minimal reducts using genetic algorithms. In: Proceedings of the 2nd Annual Join Conference on Information Sciences; 1995. pp.186–189.
38.
Zurück zum Zitat Bazan, J. G., Nguyen, H. S., Nguyen, S. H., Synak, P., and Wróblewski, J.. Rough set algorithms in the classification problem. In: Rough set methods and applications; 2000. pp. 49–88. Bazan, J. G., Nguyen, H. S., Nguyen, S. H., Synak, P., and Wróblewski, J.. Rough set algorithms in the classification problem. In: Rough set methods and applications; 2000. pp. 49–88.
39.
Zurück zum Zitat Bazan, J. G.. A comparison of dynamic and non-dynamic rough set methods for extracting laws from decision tables. Rough sets in knowledge discovery. 1998; (1): 321–365. Bazan, J. G.. A comparison of dynamic and non-dynamic rough set methods for extracting laws from decision tables. Rough sets in knowledge discovery. 1998; (1): 321–365.
40.
Zurück zum Zitat Sengupta, S., and Das, A. K.. A study on rough set theory based dynamic reduct for classification system optimization. Int J Artif Intell Appl. 2014; 5:(4). Sengupta, S., and Das, A. K.. A study on rough set theory based dynamic reduct for classification system optimization. Int J Artif Intell Appl. 2014; 5:(4).
41.
Zurück zum Zitat Stefanowski J, Vanderpooten D. Induction of decision rules in classification and discovery-oriented perspectives. Int J Intell Syst. 2001;16(1):13–27.CrossRef Stefanowski J, Vanderpooten D. Induction of decision rules in classification and discovery-oriented perspectives. Int J Intell Syst. 2001;16(1):13–27.CrossRef
42.
Zurück zum Zitat Wroblewski J. Rough Sets and Current Trends in Computing. In: Covering with reducts—a fast algorithm for rule generation. Heidelberg: Springer; 1998. p. 402–7. Wroblewski J. Rough Sets and Current Trends in Computing. In: Covering with reducts—a fast algorithm for rule generation. Heidelberg: Springer; 1998. p. 402–7.
43.
Zurück zum Zitat Witten I, Frank E, Hall M, Pal C. “Data mining: practical machine learning tools and techniques”, USA: Morgan Kaufmann. 4th ed. 2016. Witten I, Frank E, Hall M, Pal C. “Data mining: practical machine learning tools and techniques”, USA: Morgan Kaufmann. 4th ed. 2016.
Metadaten
Titel
Application of Rough Set-Based Feature Selection for Arabic Sentiment Analysis
verfasst von
Qasem A. Al-Radaideh
Ghufran Y. Al-Qudah
Publikationsdatum
03.06.2017
Verlag
Springer US
Erschienen in
Cognitive Computation / Ausgabe 4/2017
Print ISSN: 1866-9956
Elektronische ISSN: 1866-9964
DOI
https://doi.org/10.1007/s12559-017-9477-1

Weitere Artikel der Ausgabe 4/2017

Cognitive Computation 4/2017 Zur Ausgabe