Skip to main content

2019 | OriginalPaper | Buchkapitel

Improving Implicit Stance Classification in Tweets Using Word and Sentence Embeddings

verfasst von : Robin Schaefer, Manfred Stede

Erschienen in: KI 2019: Advances in Artificial Intelligence

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Argumentation Mining aims at finding components of arguments, as well as relations between them, in text. One of the largely unsolved problems is implicitness, where the text invites the reader to infer a missing component, such as the claim or a supporting statement. In the work of Wojatzki and Zesch (2016), an interesting implicitness problem is addressed on a Twitter data set. They showed that implicit stances toward a claim can be found with some success using just token and character n-grams. Using the same dataset, we show that results for this task can be improved using word and sentence embeddings, but that not all embedding variants perform alike. Specifically, we compare fastText, GloVe, and Universal Sentence Encoder (USE); and we find that, to our knowledge, USE yields state-of-the-art results for this task.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
2
Note that WZ16 apply the DKPro Core [6] and DKPro TC frameworks [8].
 
3
Note that during punctuation removal #’s are ignored in order to maintain hashtags, which we assume to be meaningful for our task.
 
4
The Snowball Stemmer is implemented using NLTK [13].
 
5
As the USE model has been trained exclusively for 512-dimensional vectors [7], we are unable to create 300-dimensional vectors that would have been more directly comparable to the fastText and GloVe vectors.
 
Literatur
2.
Zurück zum Zitat Boltužić, F., Šnajder, J.: Back up your stance: recognizing arguments in online discussions. In: Proceedings of the First Workshop on Argumentation Mining, Baltimore, Maryland, pp. 49–58. Association for Computational Linguistics, June 2014. https://doi.org/10.3115/v1/W14-2107 Boltužić, F., Šnajder, J.: Back up your stance: recognizing arguments in online discussions. In: Proceedings of the First Workshop on Argumentation Mining, Baltimore, Maryland, pp. 49–58. Association for Computational Linguistics, June 2014. https://​doi.​org/​10.​3115/​v1/​W14-2107
3.
Zurück zum Zitat Bosc, T., Cabrio, E., Villata, S.: DART: a dataset of arguments and their relations on twitter. In: Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016), Portorož, Slovenia, pp. 1258–1263. European Language Resources Association (ELRA), May 2016 Bosc, T., Cabrio, E., Villata, S.: DART: a dataset of arguments and their relations on twitter. In: Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016), Portorož, Slovenia, pp. 1258–1263. European Language Resources Association (ELRA), May 2016
4.
Zurück zum Zitat Bosc, T., Cabrio, E., Villata, S.: Tweeties squabbling: positive and negative results in applying argument mining on social media. In: Proceedings of the 6th International Conference on Computational Models of Argument, Potsdam, Germany, September 2016 Bosc, T., Cabrio, E., Villata, S.: Tweeties squabbling: positive and negative results in applying argument mining on social media. In: Proceedings of the 6th International Conference on Computational Models of Argument, Potsdam, Germany, September 2016
5.
Zurück zum Zitat Cabrio, E., Villata, S.: Combining textual entailment and argumentation theory for supporting online debates interactions. In: Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), Jeju Island, Korea, pp. 208–212. Association for Computational Linguistics, July 2012 Cabrio, E., Villata, S.: Combining textual entailment and argumentation theory for supporting online debates interactions. In: Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), Jeju Island, Korea, pp. 208–212. Association for Computational Linguistics, July 2012
6.
Zurück zum Zitat de Castilho, R.E., Gurevych, I.: A broad-coverage collection of portable NLP components for building shareable analysis pipelines. In: Proceedings of the Workshop on Open Infrastructures and Analysis Frameworks for HLT, Dublin, Ireland, pp. 1–11. Association for Computational Linguistics and Dublin City University, August 2014. https://doi.org/10.3115/v1/W14-5201 de Castilho, R.E., Gurevych, I.: A broad-coverage collection of portable NLP components for building shareable analysis pipelines. In: Proceedings of the Workshop on Open Infrastructures and Analysis Frameworks for HLT, Dublin, Ireland, pp. 1–11. Association for Computational Linguistics and Dublin City University, August 2014. https://​doi.​org/​10.​3115/​v1/​W14-5201
7.
Zurück zum Zitat Chidambaram, M., et al.: Learning cross-lingual sentence representations via a multi-task dual-encoder model. CoRR abs/1810.12836 (2018) Chidambaram, M., et al.: Learning cross-lingual sentence representations via a multi-task dual-encoder model. CoRR abs/1810.12836 (2018)
8.
Zurück zum Zitat Daxenberger, J., Ferschke, O., Gurevych, I., Zesch, T.: DKPro TC: a Java-based framework for supervised learning experiments on textual data. In: Proceedings of 52nd Annual Meeting of the Association for Computational Linguistics: System Demonstrations, Baltimore, Maryland, pp. 61–66. Association for Computational Linguistics, June 2014. https://doi.org/10.3115/v1/P14-5011 Daxenberger, J., Ferschke, O., Gurevych, I., Zesch, T.: DKPro TC: a Java-based framework for supervised learning experiments on textual data. In: Proceedings of 52nd Annual Meeting of the Association for Computational Linguistics: System Demonstrations, Baltimore, Maryland, pp. 61–66. Association for Computational Linguistics, June 2014. https://​doi.​org/​10.​3115/​v1/​P14-5011
9.
Zurück zum Zitat Dusmanu, M., Cabrio, E., Villata, S.: Argument mining on Twitter: arguments, facts and sources. In: Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, Copenhagen, Denmark, pp. 2317–2322. Association for Computational Linguistics, September 2017. https://doi.org/10.18653/v1/D17-1245 Dusmanu, M., Cabrio, E., Villata, S.: Argument mining on Twitter: arguments, facts and sources. In: Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, Copenhagen, Denmark, pp. 2317–2322. Association for Computational Linguistics, September 2017. https://​doi.​org/​10.​18653/​v1/​D17-1245
10.
Zurück zum Zitat Gimpel, K., et al.: Part-of-speech tagging for Twitter: annotation, features, and experiments. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Portland, Oregon, USA, pp. 42–47. Association for Computational Linguistics, June 2011 Gimpel, K., et al.: Part-of-speech tagging for Twitter: annotation, features, and experiments. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Portland, Oregon, USA, pp. 42–47. Association for Computational Linguistics, June 2011
11.
Zurück zum Zitat Grave, E., Bojanowski, P., Gupta, P., Joulin, A., Mikolov, T.: Learning word vectors for 157 languages. In: Proceedings of the International Conference on Language Resources and Evaluation (LREC 2018) (2018) Grave, E., Bojanowski, P., Gupta, P., Joulin, A., Mikolov, T.: Learning word vectors for 157 languages. In: Proceedings of the International Conference on Language Resources and Evaluation (LREC 2018) (2018)
12.
Zurück zum Zitat Grosse, K., González, M.P., Chesñevar, C.I., Maguitman, A.G.: Integrating argumentation and sentiment analysis for mining opinions from twitter. AI Commun. 28(3), 387–401 (2015)MathSciNetCrossRef Grosse, K., González, M.P., Chesñevar, C.I., Maguitman, A.G.: Integrating argumentation and sentiment analysis for mining opinions from twitter. AI Commun. 28(3), 387–401 (2015)MathSciNetCrossRef
13.
Zurück zum Zitat Loper, E., Bird, S.: NLTK: the natural language toolkit. In: Proceedings of the ACL 2002 Workshop on Effective Tools and Methodologies for Teaching Natural Language Processing and Computational Linguistics, ETMTNLP 2002, Stroudsburg, PA, USA, vol. 1, pp. 63–70. Association for Computational Linguistics (2002). https://doi.org/10.3115/1118108.1118117 Loper, E., Bird, S.: NLTK: the natural language toolkit. In: Proceedings of the ACL 2002 Workshop on Effective Tools and Methodologies for Teaching Natural Language Processing and Computational Linguistics, ETMTNLP 2002, Stroudsburg, PA, USA, vol. 1, pp. 63–70. Association for Computational Linguistics (2002). https://​doi.​org/​10.​3115/​1118108.​1118117
14.
Zurück zum Zitat Moens, M.F., Boiy, E., Palau, R.M., Reed, C.: Automatic detection of arguments in legal texts. In: Proceedings of the 11th International Conference on Artificial Intelligence and Law, ICAIL 2007, pp. 225–230. ACM, New York (2007). https://doi.org/10.1145/1276318.1276362 Moens, M.F., Boiy, E., Palau, R.M., Reed, C.: Automatic detection of arguments in legal texts. In: Proceedings of the 11th International Conference on Artificial Intelligence and Law, ICAIL 2007, pp. 225–230. ACM, New York (2007). https://​doi.​org/​10.​1145/​1276318.​1276362
15.
Zurück zum Zitat Mohammad, S., Kiritchenko, S., Sobhani, P., Zhu, X., Cherry, C.: SemEval-2016 task 6: detecting stance in tweets. In: Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval-2016), San Diego, California, pp. 31–41. Association for Computational Linguistics, June 2016. https://doi.org/10.18653/v1/S16-1003 Mohammad, S., Kiritchenko, S., Sobhani, P., Zhu, X., Cherry, C.: SemEval-2016 task 6: detecting stance in tweets. In: Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval-2016), San Diego, California, pp. 31–41. Association for Computational Linguistics, June 2016. https://​doi.​org/​10.​18653/​v1/​S16-1003
16.
Zurück zum Zitat Pedregosa, F., et al.: Scikit-learn: machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011)MathSciNetMATH Pedregosa, F., et al.: Scikit-learn: machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011)MathSciNetMATH
17.
Zurück zum Zitat Pennington, J., Socher, R., Manning, C.D.: GloVe: global vectors for word representation. In: Empirical Methods in Natural Language Processing (EMNLP), pp. 1532–1543 (2014) Pennington, J., Socher, R., Manning, C.D.: GloVe: global vectors for word representation. In: Empirical Methods in Natural Language Processing (EMNLP), pp. 1532–1543 (2014)
18.
Zurück zum Zitat Snajder, J.: Social media argumentation mining: the quest for deliberateness in raucousness. CoRR abs/1701.00168 (2017) Snajder, J.: Social media argumentation mining: the quest for deliberateness in raucousness. CoRR abs/1701.00168 (2017)
19.
Zurück zum Zitat Stab, C., Gurevych, I.: Identifying argumentative discourse structures in persuasive essays. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), Doha, Qatar, pp. 46–56. Association for Computational Linguistics, October 2014. https://doi.org/10.3115/v1/D14-1006 Stab, C., Gurevych, I.: Identifying argumentative discourse structures in persuasive essays. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), Doha, Qatar, pp. 46–56. Association for Computational Linguistics, October 2014. https://​doi.​org/​10.​3115/​v1/​D14-1006
20.
Zurück zum Zitat Wojatzki, M., Zesch, T.: Stance-based argument mining - modeling implicit argumentation using stance. In: Proceedings of the KONVENS, pp. 313–322 (2016) Wojatzki, M., Zesch, T.: Stance-based argument mining - modeling implicit argumentation using stance. In: Proceedings of the KONVENS, pp. 313–322 (2016)
Metadaten
Titel
Improving Implicit Stance Classification in Tweets Using Word and Sentence Embeddings
verfasst von
Robin Schaefer
Manfred Stede
Copyright-Jahr
2019
DOI
https://doi.org/10.1007/978-3-030-30179-8_26

Premium Partner