Skip to main content
Erschienen in: Social Network Analysis and Mining 1/2022

01.12.2022 | Original Article

Sport-fanaticism lexicons for sentiment analysis in Arabic social text

verfasst von: Mohammed Alqmase, Husni Al-Muhtaseb

Erschienen in: Social Network Analysis and Mining | Ausgabe 1/2022

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Sport-fanaticism is one of the social problems. Studying this problem in social network sites such as Twitter becomes important where social sites provide a mean for people to communicate and share emotions. Hence, a huge amount of data is posted on social media every day where text mining and sentiment analysis are essential to automatically analyze such data to extract the desired information and knowledge. In this paper, two main contributions are introduced. The first contribution is that we generated twelve large-scale fanatic-lexicons that can help in building fanatic-classification to automatically classify Arabic social text (e.g., tweets) into fanatic-text or non-fanatic text. The generated fanatic-lexicons can help in building anti-fanatic tools and automatically detecting and measuring sport-fanaticism in Arabic social text. As far as we know, the generated fanatic-lexicons are the first large-scale fanatic-lexicons. The generated resources are publicly available for research purpose. The second contribution is that we proposed a new method to automatically generate sentiment lexicons which is called Term Frequency-Inverse Context Frequency (TFICF). The performance of the proposed-TFICF method is analyzed and compared with one of the common methods in this path which is called Pointwise Mutual Information (PMI). Our proposed-TFICF method showed better performance where the highest accuracy of TFICF is 89%, and the highest accuracy of PMI is 82%.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Aldayel HK, Azmi AM (2016) Arabic tweets sentiment analysis–a hybrid scheme. J Inf Sci 42(6):782–797CrossRef Aldayel HK, Azmi AM (2016) Arabic tweets sentiment analysis–a hybrid scheme. J Inf Sci 42(6):782–797CrossRef
Zurück zum Zitat Alshahrani HA, Fong AC (2018) Arabic domain-oriented sentiment lexicon construction using latent Dirichlet Allocation, In: 2018 IEEE International Conference on Electro/Information Technology (EIT) Alshahrani HA, Fong AC (2018) Arabic domain-oriented sentiment lexicon construction using latent Dirichlet Allocation, In: 2018 IEEE International Conference on Electro/Information Technology (EIT)
Zurück zum Zitat Al-Twairesh N, Al-Khalifa H, Al-Salman A (2014) Subjectivity and sentiment analysis of Arabic: trends and challenges, In: IEEE/ACS 11th International Conference on Computer Systems and Applications (AICCSA), Doha, Qatar Al-Twairesh N, Al-Khalifa H, Al-Salman A (2014) Subjectivity and sentiment analysis of Arabic: trends and challenges, In: IEEE/ACS 11th International Conference on Computer Systems and Applications (AICCSA), Doha, Qatar
Zurück zum Zitat Al-Twairesh N, Al-Khalifa H, AlSalman A (2016) Arasenti: large-scale twitter-specific arabic sentiment lexicons, In: Proceedings of the 54th annual meeting of the association for computational linguistics Al-Twairesh N, Al-Khalifa H, AlSalman A (2016) Arasenti: large-scale twitter-specific arabic sentiment lexicons, In: Proceedings of the 54th annual meeting of the association for computational linguistics
Zurück zum Zitat Church KW, Hanks P (1990) Word association norms, mutual information, and lexicography. Comput Linguist 16(1):22–29 Church KW, Hanks P (1990) Word association norms, mutual information, and lexicography. Comput Linguist 16(1):22–29
Zurück zum Zitat Darwish K, Mubarak H (2016) "Farasa: A new fast and accurate Arabic word segmenter," In: Proceedings of the Tenth International Conference on Language Resources and Evaluation Darwish K, Mubarak H (2016) "Farasa: A new fast and accurate Arabic word segmenter," In: Proceedings of the Tenth International Conference on Language Resources and Evaluation
Zurück zum Zitat E--Beltagy SR (2017) WeightedNileULex: a scored Arabic sentiment lexicon for improved sentiment analysis. In: Gayar NE, Suen CY (eds) Language processing, pattern recognition and intelligent systems. Special issue on computational linguistics, speech and image processing for Arabic language. World Scientific Publishing Co E--Beltagy SR (2017) WeightedNileULex: a scored Arabic sentiment lexicon for improved sentiment analysis. In: Gayar NE, Suen CY (eds) Language processing, pattern recognition and intelligent systems. Special issue on computational linguistics, speech and image processing for Arabic language. World Scientific Publishing Co
Zurück zum Zitat El-Beltagy SR (2016) NileULex: a phrase and word level sentiment lexicon for Egyptian and Modern Standard Arabic, In: LREC El-Beltagy SR (2016) NileULex: a phrase and word level sentiment lexicon for Egyptian and Modern Standard Arabic, In: LREC
Zurück zum Zitat El-Beltagy SR, Ali A (2013a) "Open issues in the sentiment analysis of Arabic social media: A case study," In: 2013a 9th International Conference on Innovations in Information Technology (IIT) El-Beltagy SR, Ali A (2013a) "Open issues in the sentiment analysis of Arabic social media: A case study," In: 2013a 9th International Conference on Innovations in Information Technology (IIT)
Zurück zum Zitat El-Beltagy S, Ali A (2013b) “unWeighted opinion mining Lexicon (Egyptian Arabic) El-Beltagy S, Ali A (2013b) “unWeighted opinion mining Lexicon (Egyptian Arabic)
Zurück zum Zitat El-Masri M, Altrabsheh N, Mansour H (2017) Successes and challenges of Arabic sentiment analysis research: a literature review. Soc Netw Anal Min Springer 1(7):54CrossRef El-Masri M, Altrabsheh N, Mansour H (2017) Successes and challenges of Arabic sentiment analysis research: a literature review. Soc Netw Anal Min Springer 1(7):54CrossRef
Zurück zum Zitat ElSahar H, El-Beltagy SR (2014) A fully automated approach for arabic slang lexicon extraction from microblogs," In: International conference on intelligent text processing and computational linguistics ElSahar H, El-Beltagy SR (2014) A fully automated approach for arabic slang lexicon extraction from microblogs," In: International conference on intelligent text processing and computational linguistics
Zurück zum Zitat ElSahar H, El-Beltagy SR (2015) Building large arabic multi-domain resources for sentiment analysis, In: International Conference on Intelligent Text Processing and Computational Linguistics ElSahar H, El-Beltagy SR (2015) Building large arabic multi-domain resources for sentiment analysis, In: International Conference on Intelligent Text Processing and Computational Linguistics
Zurück zum Zitat Esuli A, Sebastiani F (2006) Sentiwordnet: a publicly available lexical resource for opinion mining, In: LREC, Citeseer, pp 417–422 Esuli A, Sebastiani F (2006) Sentiwordnet: a publicly available lexical resource for opinion mining, In: LREC, Citeseer, pp 417–422
Zurück zum Zitat Hu M, Liu B (2014) Mining and summarizing customer reviews, In: Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining Hu M, Liu B (2014) Mining and summarizing customer reviews, In: Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Zurück zum Zitat Kiritchenko S, Zhu X, Mohammad SM (2014) Sentiment analysis of short informal texts. J Artif Intell Res 50:723–762CrossRef Kiritchenko S, Zhu X, Mohammad SM (2014) Sentiment analysis of short informal texts. J Artif Intell Res 50:723–762CrossRef
Zurück zum Zitat Liu B (2012) Sentiment analysis and opinion mining, Morgan and Claypool Publishers Liu B (2012) Sentiment analysis and opinion mining, Morgan and Claypool Publishers
Zurück zum Zitat Mahyoub FH, Siddiqui MA, Dahab MY (2014) Building an Arabic sentiment lexicon using semi-supervised learning. J King Saud Univ Comput Inf Sci 26(4):417–424 Mahyoub FH, Siddiqui MA, Dahab MY (2014) Building an Arabic sentiment lexicon using semi-supervised learning. J King Saud Univ Comput Inf Sci 26(4):417–424
Zurück zum Zitat Mataoui M, Zelmati O, Boumechache M (2016) A proposed lexicon-based sentiment analysis approach for the vernacular Algerian Arabic. Res Comput Sci 110:55–70CrossRef Mataoui M, Zelmati O, Boumechache M (2016) A proposed lexicon-based sentiment analysis approach for the vernacular Algerian Arabic. Res Comput Sci 110:55–70CrossRef
Zurück zum Zitat Mohammad S, Turney P (2010) Emotions evoked by common words and phrases: Using mechanical turk to create an emotion lexicon, In: Proceedings of the NAACL HLT 2010 workshop on computational approaches to analysis and generation of emotion in text Mohammad S, Turney P (2010) Emotions evoked by common words and phrases: Using mechanical turk to create an emotion lexicon, In: Proceedings of the NAACL HLT 2010 workshop on computational approaches to analysis and generation of emotion in text
Zurück zum Zitat Mohammad SM, Kiritchenko S, Zhu X (2013) Nrc-canada: Building the state-of-the-art in sentiment analysis of tweets, arXiv preprint arXiv:1308.6242 Mohammad SM, Kiritchenko S, Zhu X (2013) Nrc-canada: Building the state-of-the-art in sentiment analysis of tweets, arXiv preprint arXiv:1308.6242
Zurück zum Zitat Mohammad S, Salameh M, Kiritchenko S (2016) Sentiment Lexicons for Arabic Social Media, In: LREC Mohammad S, Salameh M, Kiritchenko S (2016) Sentiment Lexicons for Arabic Social Media, In: LREC
Zurück zum Zitat Mohammad H, Sulaiman MN (2015) A review on evaluation metrics for data classification evaluations. Int J Data Min Knowl Manag Process 5(2):1CrossRef Mohammad H, Sulaiman MN (2015) A review on evaluation metrics for data classification evaluations. Int J Data Min Knowl Manag Process 5(2):1CrossRef
Zurück zum Zitat Mohammad SM, Turney PD (2013) Crowdsourcing a word–emotion association lexicon. Comput Intell 29(3):436–465MathSciNetCrossRef Mohammad SM, Turney PD (2013) Crowdsourcing a word–emotion association lexicon. Comput Intell 29(3):436–465MathSciNetCrossRef
Zurück zum Zitat Nielsen F (2011) A new ANEW: Evaluation of a word list for sentiment analysis in microblogs, arXiv preprint arXiv:1103.2903 Nielsen F (2011) A new ANEW: Evaluation of a word list for sentiment analysis in microblogs, arXiv preprint arXiv:1103.2903
Zurück zum Zitat Niwa Y, Nitta Y (1994) Co-occurrence vectors from corpora vs. distance vectors from dictionaries, In: Proceedings of the 15th conference on Computational linguistics Niwa Y, Nitta Y (1994) Co-occurrence vectors from corpora vs. distance vectors from dictionaries, In: Proceedings of the 15th conference on Computational linguistics
Zurück zum Zitat Pasha A, Al-Badrashiny M, Mohamed MT, El Kholy A, Eskander R, Habash N, Pooleery M, Rambow O, Roth R (2014) Madamira: a fast, comprehensive tool for morphological analysis and disambiguation of arabic," In: LREC Pasha A, Al-Badrashiny M, Mohamed MT, El Kholy A, Eskander R, Habash N, Pooleery M, Rambow O, Roth R (2014) Madamira: a fast, comprehensive tool for morphological analysis and disambiguation of arabic," In: LREC
Zurück zum Zitat QCRI Arabic Language Technologies Tools & Demos (2016) “FARASA” developed by Arabic Language Technologies Group at Qatar Computing Research Institute (QCRI), website accessed (2022) [Online]. https://farasa.qcri.org/. Accessed 13 May 2022 QCRI Arabic Language Technologies Tools & Demos (2016) “FARASA” developed by Arabic Language Technologies Group at Qatar Computing Research Institute (QCRI), website accessed (2022) [Online]. https://​farasa.​qcri.​org/​. Accessed 13 May 2022
Zurück zum Zitat Refaee E, Rieser V (2014) An arabic twitter corpus for subjectivity and sentiment analysis, In: LREC Refaee E, Rieser V (2014) An arabic twitter corpus for subjectivity and sentiment analysis, In: LREC
Zurück zum Zitat Wilson T, Wiebe J, Hoffmann P (2005) Recognizing contextual polarity in phrase-level sentiment analysis, In: Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing Wilson T, Wiebe J, Hoffmann P (2005) Recognizing contextual polarity in phrase-level sentiment analysis, In: Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing
Zurück zum Zitat Youssef M, El-Beltagy SR (2018) MoArLex: an Arabic sentiment lexicon built through automatic lexicon expansion. Procedia Comput Sci 142:94–103CrossRef Youssef M, El-Beltagy SR (2018) MoArLex: an Arabic sentiment lexicon built through automatic lexicon expansion. Procedia Comput Sci 142:94–103CrossRef
Metadaten
Titel
Sport-fanaticism lexicons for sentiment analysis in Arabic social text
verfasst von
Mohammed Alqmase
Husni Al-Muhtaseb
Publikationsdatum
01.12.2022
Verlag
Springer Vienna
Erschienen in
Social Network Analysis and Mining / Ausgabe 1/2022
Print ISSN: 1869-5450
Elektronische ISSN: 1869-5469
DOI
https://doi.org/10.1007/s13278-022-00871-2

Weitere Artikel der Ausgabe 1/2022

Social Network Analysis and Mining 1/2022 Zur Ausgabe

Premium Partner