Skip to main content
Erschienen in: Cognitive Computation 1/2022

18.01.2021

Distant Supervised Construction and Evaluation of a Novel Dataset of Emotion-Tagged Social Media Comments in Spanish

verfasst von: Juan Pablo Tessore, Leonardo Martín Esnaola, Laura Lanzarini, Sandra Baldassarri

Erschienen in: Cognitive Computation | Ausgabe 1/2022

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Tagged language resources are an essential requirement for developing machine-learning text-based classifiers. However, manual tagging is extremely time consuming and the resulting datasets are rather small, containing only a few thousand samples. Basic emotion datasets are particularly difficult to classify manually because categorization is prone to subjectivity, and thus, redundant classification is required to validate the assigned tag. Even though, in recent years, the amount of emotion-tagged text datasets in Spanish has been growing, it cannot be compared with the number, size, and quality of the datasets in English. Quality is a particularly concerning issue, as not many datasets in Spanish included a validation step in the construction process. In this article, a dataset of social media comments in Spanish is compiled, selected, filtered, and presented. A sample of the dataset is reclassified by a group of psychologists and validated using the Fleiss Kappa interrater agreement measure. Error analysis is performed by using the Sentic Computing tool BabelSenticNet. Results indicate that the agreement between the human raters and the automatically acquired tag is moderate, similar to other manually tagged datasets, with the advantages that the presented dataset contains several hundreds of thousands of tagged comments and it does not require extensive manual tagging. The agreement measured between human raters is very similar to the one between human raters and the original tag. Every measure presented is in the moderate agreement zone and, as such, suitable for training classification algorithms in sentiment analysis field.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Anhänge
Nur mit Berechtigung zugänglich
Literatur
1.
Zurück zum Zitat Cambria E. Affective computing and sentiment analysis. IEEE Intell Syst. 2016;31(2):102–7. Cambria E. Affective computing and sentiment analysis. IEEE Intell Syst. 2016;31(2):102–7.
2.
Zurück zum Zitat Picard R. Affective Computing. MIT Press; 1997. Picard R. Affective Computing. MIT Press; 1997.
3.
Zurück zum Zitat Cambria E, Poria S, Gelbukh A, Thelwall M. Sentiment analysis is a big suitcase. IEEE Intell Syst. 2017;32(6):74–80.CrossRef Cambria E, Poria S, Gelbukh A, Thelwall M. Sentiment analysis is a big suitcase. IEEE Intell Syst. 2017;32(6):74–80.CrossRef
4.
Zurück zum Zitat Chaturvedi I, Cambria E, Vilares D. Lyapunov filtering of objectivity for Spanish Sentiment Model. In: 2016 International Joint Conference on Neural Networks (IJCNN). Vancouver, British Columbia, Canada: IEEE; 2016. p. 4474–4481. Chaturvedi I, Cambria E, Vilares D. Lyapunov filtering of objectivity for Spanish Sentiment Model. In: 2016 International Joint Conference on Neural Networks (IJCNN). Vancouver, British Columbia, Canada: IEEE; 2016. p. 4474–4481.
5.
Zurück zum Zitat Majumder N, Poria S, Peng H, Chhaya N, Cambria E, Gelbukh A, et al. Sentiment and Sarcasm Classification With Multitask Learning. IEEE Intell Syst. 2019 May-June 1;34(3):38–43. Majumder N, Poria S, Peng H, Chhaya N, Cambria E, Gelbukh A, et al. Sentiment and Sarcasm Classification With Multitask Learning. IEEE Intell Syst. 2019 May-June 1;34(3):38–43.
6.
Zurück zum Zitat Majumder N, Poria S, Gelbukh A, Cambria E. Deep learning-based document modeling for personality detection from text. IEEE Intell Syst. 2017;32(2):74–9.CrossRef Majumder N, Poria S, Gelbukh A, Cambria E. Deep learning-based document modeling for personality detection from text. IEEE Intell Syst. 2017;32(2):74–9.CrossRef
7.
Zurück zum Zitat Medhat W, Hassan A, Korashy H. Sentiment analysis algorithms and applications: a survey. Ain Shams Eng J. 2014;5(4):1093–113.CrossRef Medhat W, Hassan A, Korashy H. Sentiment analysis algorithms and applications: a survey. Ain Shams Eng J. 2014;5(4):1093–113.CrossRef
8.
Zurück zum Zitat Cambria E, Hussain A, Havasi C, Eckl C. Sentic Computing: Exploitation of Common Sense for the Development of Emotion-Sensitive Systems. In: Esposito A, Campbell N, Vogel C, Hussain A, Nijholt A, editors. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Berlin, Heidelberg: Springer Berlin Heidelberg; 2010. p. 148–156. (Lecture Notes in Computer Science; vol. 5967). Cambria E, Hussain A, Havasi C, Eckl C. Sentic Computing: Exploitation of Common Sense for the Development of Emotion-Sensitive Systems. In: Esposito A, Campbell N, Vogel C, Hussain A, Nijholt A, editors. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Berlin, Heidelberg: Springer Berlin Heidelberg; 2010. p. 148–156. (Lecture Notes in Computer Science; vol. 5967).
9.
Zurück zum Zitat Bi J-W, Liu Y, Fan Z-P, Cambria E. Modelling customer satisfaction from online reviews using ensemble neural network and effect-based Kano model. Int J Prod Res. 2019;57(22):7068–88.CrossRef Bi J-W, Liu Y, Fan Z-P, Cambria E. Modelling customer satisfaction from online reviews using ensemble neural network and effect-based Kano model. Int J Prod Res. 2019;57(22):7068–88.CrossRef
10.
Zurück zum Zitat Chen L, Qi L. Social opinion mining for supporting buyers’ complex decision making: exploratory user study and algorithm comparison. Soc Netw Anal Min. 2011;1(4):301–20.CrossRef Chen L, Qi L. Social opinion mining for supporting buyers’ complex decision making: exploratory user study and algorithm comparison. Soc Netw Anal Min. 2011;1(4):301–20.CrossRef
11.
Zurück zum Zitat Bae Y, Lee H. Sentiment analysis of twitter audiences: measuring the positive or negative influence of popular twitterers. J Am Soc Inf Sci Technol. 2012;63(12):2521–35.CrossRef Bae Y, Lee H. Sentiment analysis of twitter audiences: measuring the positive or negative influence of popular twitterers. J Am Soc Inf Sci Technol. 2012;63(12):2521–35.CrossRef
12.
Zurück zum Zitat Mahata D, Friedrichs J, Hitkul, Shah RR. Phramacovigilance - exploring deep learning techniques for identifying mentions of medication intake from twitter. 2018. arXiv preprint arXiv 1805.06375 Mahata D, Friedrichs J, Hitkul, Shah RR. Phramacovigilance - exploring deep learning techniques for identifying mentions of medication intake from twitter. 2018. arXiv preprint arXiv 1805.06375
13.
Zurück zum Zitat Wang Z, Chong CS, Lan L, Yang Y, Beng S, Ho JC. Tong Fine-grained sentiment analysis of social media with emotion sensing. In, 2016 Future Technologies Conference (FTC) [Internet] San Francisco, California, USA: IEEE 2016;1361-1364 Wang Z, Chong CS, Lan L, Yang Y, Beng S, Ho JC. Tong Fine-grained sentiment analysis of social media with emotion sensing. In, 2016 Future Technologies Conference (FTC) [Internet] San Francisco, California, USA: IEEE 2016;1361-1364
14.
Zurück zum Zitat Munezero M, Montero CS, Sutinen E, Pajunen J. Are they different? affect, feeling, emotion, sentiment, and opinion detection in text. IEEE Trans Affect Comput. 2014 Apr-June 1;5(2):101–111. Munezero M, Montero CS, Sutinen E, Pajunen J. Are they different? affect, feeling, emotion, sentiment, and opinion detection in text. IEEE Trans Affect Comput. 2014 Apr-June 1;5(2):101–111.
15.
Zurück zum Zitat Wang Z, Ho S-B, Cambria E. A review of emotion sensing: categorization models and algorithms. Multimed Tools Appl. 2020;3:1–30. Wang Z, Ho S-B, Cambria E. A review of emotion sensing: categorization models and algorithms. Multimed Tools Appl. 2020;3:1–30.
16.
Zurück zum Zitat Ekman P, Friesen WV. Constants across cultures in the face and emotion. J Pers Soc Psychol. 1971;17(2):124–9.CrossRef Ekman P, Friesen WV. Constants across cultures in the face and emotion. J Pers Soc Psychol. 1971;17(2):124–9.CrossRef
17.
Zurück zum Zitat Susanto Y, Livingstone AG, Ng BC, Cambria E, Cambria E. The hourglass model revisited. IEEE Intell Syst. 2020 Sept-Oct 1;35(5):96–102. Susanto Y, Livingstone AG, Ng BC, Cambria E, Cambria E. The hourglass model revisited. IEEE Intell Syst. 2020 Sept-Oct 1;35(5):96–102.
18.
Zurück zum Zitat Mintz M, Bills S, Snow R, Jurafsky D. Distant supervision for relation extraction without labeled data. In: Su K-Y, Su J, Wiebe J, Haizhou L, editors. Proceedings of the 47th Annual Meeting ofthe ACL and the 4th IJCNLP of the AFNLP. Suntec, Singapore: Association for Computational Linguistics and Asian Federation of Natural Language Processing Associations; 2009. p. 1003–1011. Mintz M, Bills S, Snow R, Jurafsky D. Distant supervision for relation extraction without labeled data. In: Su K-Y, Su J, Wiebe J, Haizhou L, editors. Proceedings of the 47th Annual Meeting ofthe ACL and the 4th IJCNLP of the AFNLP. Suntec, Singapore: Association for Computational Linguistics and Asian Federation of Natural Language Processing Associations; 2009. p. 1003–1011.
19.
Zurück zum Zitat Pool C, Nissim M. Distant supervision for emotion detection using Facebook reactions. 2016. arXiv preprint arXiv 1611.02988 Pool C, Nissim M. Distant supervision for emotion detection using Facebook reactions. 2016. arXiv preprint arXiv 1611.02988
20.
Zurück zum Zitat Kaur W, Balakrishnan V, Rana O, Sinniah A. Liking, sharing, commenting and reacting on Facebook: user behaviors’ impact on sentiment intensity. Telemat Informatics. 2019;39(June):25–36.CrossRef Kaur W, Balakrishnan V, Rana O, Sinniah A. Liking, sharing, commenting and reacting on Facebook: user behaviors’ impact on sentiment intensity. Telemat Informatics. 2019;39(June):25–36.CrossRef
21.
Zurück zum Zitat Tian Y, Galery T, Dulcinati G, Molimpakis E, Sun C. Facebook sentiment: reactions and emojis. In: Proceedings of the Fifth International Workshop on Natural Language Processing for Social Media. Stroudsburg, PA, USA: Association for Computational Linguistics; 2017. p. 11–16. Tian Y, Galery T, Dulcinati G, Molimpakis E, Sun C. Facebook sentiment: reactions and emojis. In: Proceedings of the Fifth International Workshop on Natural Language Processing for Social Media. Stroudsburg, PA, USA: Association for Computational Linguistics; 2017. p. 11–16.
22.
Zurück zum Zitat Balakrishnan V, Govindan V, Arshad NI, Shuib L, Cachia E. Facebook user reactions and emotion: an analysis of their relationships among the online diabetes community. Malaysian J Comput Sci. 2019;Special Issue 3:87–97. Balakrishnan V, Govindan V, Arshad NI, Shuib L, Cachia E. Facebook user reactions and emotion: an analysis of their relationships among the online diabetes community. Malaysian J Comput Sci. 2019;Special Issue 3:87–97.
23.
Zurück zum Zitat Bilal M, Malik N, Bashir N, Marjani M, Hashem IAT, Gani A. Profiling social media campaigns and political influence: the case of pakistani politics. In: 2019 13th International Conference on Mathematics, Actuarial Science, Computer Science and Statistics (MACS). Karachi, Pakistan, Pakistan: IEEE; 2019. p. 1–7. Bilal M, Malik N, Bashir N, Marjani M, Hashem IAT, Gani A. Profiling social media campaigns and political influence: the case of pakistani politics. In: 2019 13th International Conference on Mathematics, Actuarial Science, Computer Science and Statistics (MACS). Karachi, Pakistan, Pakistan: IEEE; 2019. p. 1–7.
24.
Zurück zum Zitat Hoque MT, Islam A, Ahmed E, Mamun KA, Huda MN. Analyzing performance of different machine learning approaches with doc2vec for classifying sentiment of bengali natural language. In: 2019 International Conference on Electrical, Computer and Communication Engineering (ECCE). Cox’s Bazar, Bangladesh: IEEE; 2019. p. 1–5. Hoque MT, Islam A, Ahmed E, Mamun KA, Huda MN. Analyzing performance of different machine learning approaches with doc2vec for classifying sentiment of bengali natural language. In: 2019 International Conference on Electrical, Computer and Communication Engineering (ECCE). Cox’s Bazar, Bangladesh: IEEE; 2019. p. 1–5.
25.
Zurück zum Zitat Raad BT, Philipp B, Patrick H, Christoph M. ASEDS: Towards Automatic Social Emotion Detection System Using Facebook Reactions. In: 2018 IEEE 20th International Conference on High Performance Computing and Communications; IEEE 16th International Conference on Smart City; IEEE 4th International Conference on Data Science and Systems (HPCC/SmartCity/DSS). Exeter, United Kingdom: IEEE; 2018. p. 860–866. Raad BT, Philipp B, Patrick H, Christoph M. ASEDS: Towards Automatic Social Emotion Detection System Using Facebook Reactions. In: 2018 IEEE 20th International Conference on High Performance Computing and Communications; IEEE 16th International Conference on Smart City; IEEE 4th International Conference on Data Science and Systems (HPCC/SmartCity/DSS). Exeter, United Kingdom: IEEE; 2018. p. 860–866.
26.
Zurück zum Zitat Baj-Rogowska A. Sentiment analysis of Facebook posts: The Uber case. In: 2017 Eighth International Conference on Intelligent Computing and Information Systems (ICICIS). Cairo, Egypt: IEEE; 2017. p. 391–395. Baj-Rogowska A. Sentiment analysis of Facebook posts: The Uber case. In: 2017 Eighth International Conference on Intelligent Computing and Information Systems (ICICIS). Cairo, Egypt: IEEE; 2017. p. 391–395.
27.
Zurück zum Zitat Sandoval-Almazan R, Valle-Cruz D. Facebook impact and sentiment analysis on political campaigns. In: Proceedings of the 19th Annual International Conference on Digital Government Research Governance in the Data Age - dgo ’18. New York, New York, USA: ACM Press; 2018. p. 1–7. Sandoval-Almazan R, Valle-Cruz D. Facebook impact and sentiment analysis on political campaigns. In: Proceedings of the 19th Annual International Conference on Digital Government Research Governance in the Data Age - dgo ’18. New York, New York, USA: ACM Press; 2018. p. 1–7.
28.
Zurück zum Zitat Fleiss JL. Measuring nominal scale agreement among many raters. Psychol Bull. 1971;76(5):378–82.CrossRef Fleiss JL. Measuring nominal scale agreement among many raters. Psychol Bull. 1971;76(5):378–82.CrossRef
29.
Zurück zum Zitat Mercado V, Villagra A, Errecalde M. Political alignment identification: a study with documents of Argentinian journalists. J Comput Sci Technol. 2020;20(1):43–52.CrossRef Mercado V, Villagra A, Errecalde M. Political alignment identification: a study with documents of Argentinian journalists. J Comput Sci Technol. 2020;20(1):43–52.CrossRef
30.
Zurück zum Zitat Lo SL, Cambria E, Chiong R, Cornforth D. Multilingual sentiment analysis: from formal to informal and scarce resource languages. Artif Intell Rev. 2017;48(4):499–527.CrossRef Lo SL, Cambria E, Chiong R, Cornforth D. Multilingual sentiment analysis: from formal to informal and scarce resource languages. Artif Intell Rev. 2017;48(4):499–527.CrossRef
31.
Zurück zum Zitat Cambria E, Li Y, Xing FZ, Poria S, Kwok K. SenticNet 6: Ensemble application of symbolic and subsymbolic AI for sentiment analysis. In: Proceedings of the 29th ACM International Conference on Information & Knowledge Management. New York, NY, USA: ACM; 2020. p. 105–114. Cambria E, Li Y, Xing FZ, Poria S, Kwok K. SenticNet 6: Ensemble application of symbolic and subsymbolic AI for sentiment analysis. In: Proceedings of the 29th ACM International Conference on Information & Knowledge Management. New York, NY, USA: ACM; 2020. p. 105–114.
32.
Zurück zum Zitat Vilares D, Peng H, Satapathy R, CambriaE. BabelSenticNet: A commonsense reasoning framework for multilingual sentiment analysis. In, 2018 IEEE Symposium Series on Computational Intelligence (SSCI) Bangalore, India: IEEE 2018 1292 1298 Vilares D, Peng H, Satapathy R, CambriaE. BabelSenticNet: A commonsense reasoning framework for multilingual sentiment analysis. In, 2018 IEEE Symposium Series on Computational Intelligence (SSCI) Bangalore, India: IEEE 2018 1292 1298
33.
Zurück zum Zitat Justo R, Alcaide JM, Torres MI, Walker M. Detection of sarcasm and nastiness: new resources for Spanish language. Cognit Comput. 2018;10(6):1135–51.CrossRef Justo R, Alcaide JM, Torres MI, Walker M. Detection of sarcasm and nastiness: new resources for Spanish language. Cognit Comput. 2018;10(6):1135–51.CrossRef
34.
Zurück zum Zitat Dashtipour K, Poria S, Hussain A, Cambria E, Hawalah AYA, Gelbukh A, et al. Multilingual sentiment analysis: state of the art and independent comparison of techniques. Cognit Comput. 2016;8(4):757–71.CrossRef Dashtipour K, Poria S, Hussain A, Cambria E, Hawalah AYA, Gelbukh A, et al. Multilingual sentiment analysis: state of the art and independent comparison of techniques. Cognit Comput. 2016;8(4):757–71.CrossRef
35.
Zurück zum Zitat Roth B, Barth T, Wiegand M, Klakow D. A survey of noise reduction methods for distant supervision. In: AKBC 2013 - Proceedings of the 2013 Workshop on Automated Knowledge Base Construction, Co-located with CIKM 2013. San Francisco, California: Association for Computing Machinery; 2013. p. 73–77. Roth B, Barth T, Wiegand M, Klakow D. A survey of noise reduction methods for distant supervision. In: AKBC 2013 - Proceedings of the 2013 Workshop on Automated Knowledge Base Construction, Co-located with CIKM 2013. San Francisco, California: Association for Computing Machinery; 2013. p. 73–77.
37.
Zurück zum Zitat Bandhakavi A, Wiratunga N, Massie S, Padmanabhan D. Lexicon generation for emotion detection from text. IEEE Intell Syst. 2017;32(1):102–8.CrossRef Bandhakavi A, Wiratunga N, Massie S, Padmanabhan D. Lexicon generation for emotion detection from text. IEEE Intell Syst. 2017;32(1):102–8.CrossRef
39.
Zurück zum Zitat Suttles J, Ide N. Distant supervision for emotion classification with discrete binary values. In: International Conference on Intelligent Text Processing and Computational Linguistics. Berlin, Heidelberg: Springer; 2013. p. 121–136. Suttles J, Ide N. Distant supervision for emotion classification with discrete binary values. In: International Conference on Intelligent Text Processing and Computational Linguistics. Berlin, Heidelberg: Springer; 2013. p. 121–136.
40.
Zurück zum Zitat Felbo B, Mislove A, Søgaard A, Rahwan I, Lehmann S. Using millions of emoji occurrences to learn any-domain representations for detecting sentiment, emotion and sarcasm. In: Palmer M, Hwa R, Riedel S, editors. Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing. Stroudsburg, PA, USA: Association for Computational Linguistics; 2017. p. 1615–1625. Felbo B, Mislove A, Søgaard A, Rahwan I, Lehmann S. Using millions of emoji occurrences to learn any-domain representations for detecting sentiment, emotion and sarcasm. In: Palmer M, Hwa R, Riedel S, editors. Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing. Stroudsburg, PA, USA: Association for Computational Linguistics; 2017. p. 1615–1625.
41.
Zurück zum Zitat Moctezuma D, Graff M, Miranda-Jiménez S, Tellez ES, Coronado A, Sánchez CN, et al. A Genetic programming approach to sentiment analysis for twitter. In: Villena Román J, García Cumbreras MA, Martínez Cámara E, Díaz Galiano MC, García Vega M, editors. Proceedings of TASS 2017: Workshop on Sentiment Analysis at SEPLN co-located with 33nd SEPLN Conference [Internet]; 2017 Sept 19; CEUR Workshop Proc. Volume 1896, 2017 [cited 2020 May 15]. p. 23–28. Available from: http://ceur-ws.org/Vol-1896/p1_ingeotec_tass2017.pdf Moctezuma D, Graff M, Miranda-Jiménez S, Tellez ES, Coronado A, Sánchez CN, et al. A Genetic programming approach to sentiment analysis for twitter. In: Villena Román J, García Cumbreras MA, Martínez Cámara E, Díaz Galiano MC, García Vega M, editors. Proceedings of TASS 2017: Workshop on Sentiment Analysis at SEPLN co-located with 33nd SEPLN Conference [Internet]; 2017 Sept 19; CEUR Workshop Proc. Volume 1896, 2017 [cited 2020 May 15]. p. 23–28. Available from: http://​ceur-ws.​org/​Vol-1896/​p1_​ingeotec_​tass2017.​pdf
43.
Zurück zum Zitat Sociedad Española del Procesamiento del Lenguaje Natural (SEPLN). Taller de Análisis de sentimientos en Español (TASS) [Internet]. 2020 [cited 15 May 2020] Available from: http://tass.sepln.org Sociedad Española del Procesamiento del Lenguaje Natural (SEPLN). Taller de Análisis de sentimientos en Español (TASS) [Internet]. 2020 [cited 15 May 2020] Available from: http://​tass.​sepln.​org
44.
Zurück zum Zitat Cumbreras MÁG, Gonzalo J, Cámara EM, Unanue RM, Rosso P, Carrillo-de-Albornoz J, et al., editors. Proc Iber Lang Eval Forum (IberLEF 2019) co-located with 35th Conf Spanish Soc Nat Lang Process (SEPLN 2019) [Internet]. CEUR Workshop Proc. Volume 2421, 2019 [cited 2020 May 15]. Available from: http://ceur-ws.org/Vol-2421/ Cumbreras MÁG, Gonzalo J, Cámara EM, Unanue RM, Rosso P, Carrillo-de-Albornoz J, et al., editors. Proc Iber Lang Eval Forum (IberLEF 2019) co-located with 35th Conf Spanish Soc Nat Lang Process (SEPLN 2019) [Internet]. CEUR Workshop Proc. Volume 2421, 2019 [cited 2020 May 15]. Available from: http://​ceur-ws.​org/​Vol-2421/​
46.
Zurück zum Zitat Sahni T, Chandak C, Reddy N, Singh M. Efficient twitter sentiment classification using subjective distant supervision. In: 2017 9th International Conference on Communication Systems and Networks (COMSNETS). Bangalore, India: IEEE; 2017. p. 548–553. Sahni T, Chandak C, Reddy N, Singh M. Efficient twitter sentiment classification using subjective distant supervision. In: 2017 9th International Conference on Communication Systems and Networks (COMSNETS). Bangalore, India: IEEE; 2017. p. 548–553.
47.
Zurück zum Zitat Refaee E, Rieser V. Evaluating distant supervision for subjectivity and sentiment analysis on arabic twitter feeds. In: Proceedings of the EMNLP 2014 Workshop on Arabic Natural Langauge Processing (ANLP). Stroudsburg, PA, USA: Association for Computational Linguistics; 2014. p. 174–179. Refaee E, Rieser V. Evaluating distant supervision for subjectivity and sentiment analysis on arabic twitter feeds. In: Proceedings of the EMNLP 2014 Workshop on Arabic Natural Langauge Processing (ANLP). Stroudsburg, PA, USA: Association for Computational Linguistics; 2014. p. 174–179.
48.
Zurück zum Zitat Deriu J, Lucchi A, De Luca V, Severyn A, Müller S, Cieliebak M, et al. Leveraging large amounts of weakly supervised data for multi-language sentiment classification. In: WWW '17 Companion: Proceedings of the 26th International Conference on World Wide Web Companion. Republic and Canton of Geneva, Switzerland: International World Wide Web Conferences Steering Committee; 2017. p. 1045–1052. Deriu J, Lucchi A, De Luca V, Severyn A, Müller S, Cieliebak M, et al. Leveraging large amounts of weakly supervised data for multi-language sentiment classification. In: WWW '17 Companion: Proceedings of the 26th International Conference on World Wide Web Companion. Republic and Canton of Geneva, Switzerland: International World Wide Web Conferences Steering Committee; 2017. p. 1045–1052.
49.
Zurück zum Zitat Marchetti-Bowick M, Chambers N. Learning for microblogs with distant supervision: political forecasting with twitter. In: Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics. Avignon, France: Association for Computational Linguistics; 2012. p. 603–612. Marchetti-Bowick M, Chambers N. Learning for microblogs with distant supervision: political forecasting with twitter. In: Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics. Avignon, France: Association for Computational Linguistics; 2012. p. 603–612.
50.
Zurück zum Zitat Carletta J. Squibs and discussions: assessing agreement on classification tasks: the kappa statistic. Comput Linguist. 1996;22(2):248–54. Carletta J. Squibs and discussions: assessing agreement on classification tasks: the kappa statistic. Comput Linguist. 1996;22(2):248–54.
51.
Zurück zum Zitat Hearst MA. TextTiling: segmenting text into multi-paragraph subtopic passages. Comput Linguist. 1997;23(1):33–64. Hearst MA. TextTiling: segmenting text into multi-paragraph subtopic passages. Comput Linguist. 1997;23(1):33–64.
52.
Zurück zum Zitat Cohen J. A coefficient of agreement for nominal scales. Educ Psychol Meas. 1960;20(1):37–46.CrossRef Cohen J. A coefficient of agreement for nominal scales. Educ Psychol Meas. 1960;20(1):37–46.CrossRef
53.
Zurück zum Zitat Gambino OJ, Calvo H. Predicting emotional reactions to news articles in social networks. Comput Speech Lang. 2019;58:280–303.CrossRef Gambino OJ, Calvo H. Predicting emotional reactions to news articles in social networks. Comput Speech Lang. 2019;58:280–303.CrossRef
54.
Zurück zum Zitat Chatterjee A, Narahari KN, Joshi M, Agrawal P. SemEval-2019 Task 3: EmoContext Contextual Emotion Detection in Text. In: May J, Shutova E, Herbelot A, Zhu X, Apidianaki M, Mohammad SM, editors. Proceedings of the 13th International Workshop on Semantic Evaluation. Stroudsburg, PA, USA: Association for Computational Linguistics; 2019. p. 39–48. Chatterjee A, Narahari KN, Joshi M, Agrawal P. SemEval-2019 Task 3: EmoContext Contextual Emotion Detection in Text. In: May J, Shutova E, Herbelot A, Zhu X, Apidianaki M, Mohammad SM, editors. Proceedings of the 13th International Workshop on Semantic Evaluation. Stroudsburg, PA, USA: Association for Computational Linguistics; 2019. p. 39–48.
58.
Zurück zum Zitat Bird S, Klein E, Loper E. Natural language processing with python. O'Reilly Media Inc.; 2009. Bird S, Klein E, Loper E. Natural language processing with python. O'Reilly Media Inc.; 2009.
61.
Zurück zum Zitat Craker N, March E. The dark side of Facebook®: The Dark Tetrad, negative social potency, and trolling behaviours. Pers Individ Dif. 2016;102:79–84.CrossRef Craker N, March E. The dark side of Facebook®: The Dark Tetrad, negative social potency, and trolling behaviours. Pers Individ Dif. 2016;102:79–84.CrossRef
63.
Zurück zum Zitat Hsueh P, Melville P, Sindhwani V. Data quality from crowdsourcing: A Study of Annotation Selection Criteria. In: Ringger E, Haertel R, Tomanek K, editors. Proceedings of the NAACL HLT 2009 Workshop on Active Learning for Natural Language Processing - HLT '09. Morristown, NJ, USA: Association for Computational Linguistics; 2009. p. 27–35. Available from: https://www.aclweb.org/anthology/W09-1904.pdf Hsueh P, Melville P, Sindhwani V. Data quality from crowdsourcing: A Study of Annotation Selection Criteria. In: Ringger E, Haertel R, Tomanek K, editors. Proceedings of the NAACL HLT 2009 Workshop on Active Learning for Natural Language Processing - HLT '09. Morristown, NJ, USA: Association for Computational Linguistics; 2009. p. 27–35. Available from: https://​www.​aclweb.​org/​anthology/​W09-1904.​pdf
64.
Zurück zum Zitat Poria S, Cambria E, Gelbukh A, Bisio F, Hussain A. Sentiment data flow analysis by means of dynamic linguistic patterns. IEEE Comput Intell Mag. 2015;10(4):26–36.CrossRef Poria S, Cambria E, Gelbukh A, Bisio F, Hussain A. Sentiment data flow analysis by means of dynamic linguistic patterns. IEEE Comput Intell Mag. 2015;10(4):26–36.CrossRef
65.
Zurück zum Zitat Burdisso SG, Errecalde M, Montes-y-Gómez M. PySS3: A Python package implementing a novel text classifier with visualization tools for Explainable AI. 2019. arXiv preprint arXiv 1912.09322 Burdisso SG, Errecalde M, Montes-y-Gómez M. PySS3: A Python package implementing a novel text classifier with visualization tools for Explainable AI. 2019. arXiv preprint arXiv 1912.09322
66.
Zurück zum Zitat Ferretti E, Errecalde M, Rosso P. Does semantic information help in the text categorization task? J Intell Syst. 2008;17(1–3):91–106. Ferretti E, Errecalde M, Rosso P. Does semantic information help in the text categorization task? J Intell Syst. 2008;17(1–3):91–106.
Metadaten
Titel
Distant Supervised Construction and Evaluation of a Novel Dataset of Emotion-Tagged Social Media Comments in Spanish
verfasst von
Juan Pablo Tessore
Leonardo Martín Esnaola
Laura Lanzarini
Sandra Baldassarri
Publikationsdatum
18.01.2021
Verlag
Springer US
Erschienen in
Cognitive Computation / Ausgabe 1/2022
Print ISSN: 1866-9956
Elektronische ISSN: 1866-9964
DOI
https://doi.org/10.1007/s12559-020-09800-x

Weitere Artikel der Ausgabe 1/2022

Cognitive Computation 1/2022 Zur Ausgabe