
2020 | OriginalPaper | Book chapter

PerSent 2.0: Persian Sentiment Lexicon Enriched with Domain-Specific Words

Authors: Kia Dashtipour, Ali Raza, Alexander Gelbukh, Rui Zhang, Erik Cambria, Amir Hussain

Published in: Advances in Brain Inspired Cognitive Systems

Publisher: Springer International Publishing

Abstract

Sentiment analysis is probably the fastest-growing area of natural language processing today. It leverages the huge amount of user-contributed data on the Internet to improve the income of businesses and the quality of life of consumers. The majority of existing sentiment-analysis systems focus on English, owing to a lack of resources and tools for other languages. To fill this gap for the Persian language, in our previous work we compiled the first version of the PerSent Persian sentiment lexicon, which was small and included only words and phrases from the general domain. In this paper, we present its extension with words from three different domains and evaluate its performance on the polarity classification task using various machine-learning classifiers. We use a multi-domain dataset to evaluate the performance of the new lexicon across domains. Our results demonstrate the usefulness of the new lexicon for the analysis of product and movie reviews, and especially of political news, in Persian.
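
To make the idea of lexicon-based polarity classification concrete, the following is a minimal sketch and not the method evaluated in the chapter. It assumes a lexicon that maps words to scores in [-1, 1]. The four entries, their scores, and the function names `polarity_score` and `classify` are hypothetical and are not taken from PerSent. Whitespace tokenization is also a simplifying assumption.

```python
# Toy lexicon: entries and scores are illustrative assumptions, NOT PerSent data.
TOY_LEXICON = {
    "خوب": 0.8,    # "good"
    "جذاب": 0.6,   # "attractive"
    "بد": -0.8,    # "bad"
    "خراب": -0.6,  # "broken"
}


def polarity_score(text, lexicon):
    """Average the scores of whitespace-separated tokens found in the lexicon.

    Returns 0.0 when no token is covered by the lexicon.
    """
    scores = [lexicon[token] for token in text.split() if token in lexicon]
    return sum(scores) / len(scores) if scores else 0.0


def classify(text, lexicon):
    """Map the averaged score to a polarity label by its sign."""
    score = polarity_score(text, lexicon)
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"


if __name__ == "__main__":
    print(classify("فیلم خوب بود", TOY_LEXICON))   # "the film was good"
    print(classify("صدا بد و خراب", TOY_LEXICON))  # "the sound is bad and broken"
```

In a machine-learning setting such as the one the abstract describes, a score like this would more plausibly serve as one input feature to a trained classifier than as the final decision rule.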

Metadata
Title
PerSent 2.0: Persian Sentiment Lexicon Enriched with Domain-Specific Words
Authors
Kia Dashtipour
Ali Raza
Alexander Gelbukh
Rui Zhang
Erik Cambria
Amir Hussain
Copyright year
2020
DOI
https://doi.org/10.1007/978-3-030-39431-8_48