Skip to main content
Erschienen in: Social Network Analysis and Mining 1/2024

01.12.2024 | Original Article

Enhancing stance detection through sequential weighted multi-task learning

verfasst von: Nora Alturayeif, Hamzah Luqman, Moataz Ahmed

Erschienen in: Social Network Analysis and Mining | Ausgabe 1/2024

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The exponential growth of user-generated content on social media platforms, online news outlets, and digital communication has necessitated the development of automated tools for analyzing opinions and attitudes expressed in text. Stance detection, a critical task in Natural Language Processing, aims to identify the underlying perspective or viewpoint of an individual or group toward a specific topic or target. This paper explores the challenges of stance detection, particularly in the context of social media, where brevity, informality, and limited contextual information prevail. While sentiment analysis focuses on explicit sentiment polarity, stance detection classifies the stance or viewpoint of a text toward a target, often of an abstract nature. Motivated by recent achievements in Multi-Task Learning (MTL), this paper addresses the identified gap in the field, advocating further exploration in developing a joint neural architecture that integrates different opinion dimensions. In response, this study introduces two MTL models, Parallel Multi-Task Learning (PMTL) and Sequential Multi-Task Learning (SMTL), which incorporate sentiment analysis and sarcasm detection tasks to enhance stance detection performance. We address the complexities of MTL implementation with Transformer-based architectures and present an accessible architecture for this purpose. This study also proposes and evaluates four task weighting techniques, providing empirical evidence for their effectiveness in MTL models. Through comprehensive evaluations on benchmark datasets in both English and Arabic, we demonstrate that our most proficient model, a multi-target sequential MTL model with hierarchical weighting (SMTL-HW), achieves state-of-the-art results. These contributions underscore the potential of MTL in enhancing stance detection and offer valuable insights into the interaction between sentiment, stance, and sarcasm in text analysis.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Fußnoten
2
For clarity, we use the term “MTL objective" to refer to the final learning objective of a model, while “loss" represents an individual component within this objective function.
 
Literatur
Zurück zum Zitat Aldayel A, Magdy W (2019b) Your stance is exposed! analyzing possible factors forstance detection on social media. Proc ACM on Hum Comput Interact 3:1–20CrossRef Aldayel A, Magdy W (2019b) Your stance is exposed! analyzing possible factors forstance detection on social media. Proc ACM on Hum Comput Interact 3:1–20CrossRef
Zurück zum Zitat Alec R, Jeffrey W, Rewon C, et al (2019) Language models are unsupervised multitask learners. OpenAI Blog 1 Alec R, Jeffrey W, Rewon C, et al (2019) Language models are unsupervised multitask learners. OpenAI Blog 1
Zurück zum Zitat Alturayeif N, Luqman H, Ahmed M (2023) A systematic review of machine learning techniques for stance detection and its applications. Neural Comput Appl 35(7):5113–5144CrossRef Alturayeif N, Luqman H, Ahmed M (2023) A systematic review of machine learning techniques for stance detection and its applications. Neural Comput Appl 35(7):5113–5144CrossRef
Zurück zum Zitat Antoun W, Baly F, Hajj H (2020) Arabert: Transformer-based model for Arabic language understanding. LREC 2020 workshop language resources and evaluation conference Antoun W, Baly F, Hajj H (2020) Arabert: Transformer-based model for Arabic language understanding. LREC 2020 workshop language resources and evaluation conference
Zurück zum Zitat Bahuleyan H, Vechtomova O (2017) Uwaterloo at semeval-2017 task 8: Detecting stance toward rumors with topic independent features. In: Proceedings of the 11th international workshop on semantic evaluations (SemEval-2017), pp 461–464 Bahuleyan H, Vechtomova O (2017) Uwaterloo at semeval-2017 task 8: Detecting stance toward rumors with topic independent features. In: Proceedings of the 11th international workshop on semantic evaluations (SemEval-2017), pp 461–464
Zurück zum Zitat Chai H, Tang S, Cui J, et al (2022) Improving multi-task stance detection with multi-task interaction network. In: Empirical methods in natural language processing, pp 2990–3000 Chai H, Tang S, Cui J, et al (2022) Improving multi-task stance detection with multi-task interaction network. In: Empirical methods in natural language processing, pp 2990–3000
Zurück zum Zitat Chauhan DS, Kumar R, Ekbal A (2019) Attention based shared representation for multi-task stance detection and sentiment analysis. In: Neural information processing: 26th international conference, ICONIP 2019, Sydney, NSW, Australia, December 12–15, 2019, proceedings, part V 26. Springer, pp 661–669 Chauhan DS, Kumar R, Ekbal A (2019) Attention based shared representation for multi-task stance detection and sentiment analysis. In: Neural information processing: 26th international conference, ICONIP 2019, Sydney, NSW, Australia, December 12–15, 2019, proceedings, part V 26. Springer, pp 661–669
Zurück zum Zitat Chen P, Ye K, Cui X (2021) Integrating n-gram features into pre-trained model: a novel ensemble model for multi-target stance detection. In: Springer Science and Business Media, Deutschland GmbH, international conference on artificial neural networks, pp 269–279. https://doi.org/10.1007/978-3-030-86365-4_22 Chen P, Ye K, Cui X (2021) Integrating n-gram features into pre-trained model: a novel ensemble model for multi-target stance detection. In: Springer Science and Business Media, Deutschland GmbH, international conference on artificial neural networks, pp 269–279. https://​doi.​org/​10.​1007/​978-3-030-86365-4_​22
Zurück zum Zitat Clark K, Luong MT, Le QV, et al (2020) Electra: pre-training text encoders as discriminators rather than generators Clark K, Luong MT, Le QV, et al (2020) Electra: pre-training text encoders as discriminators rather than generators
Zurück zum Zitat Devlin J, Chang MW, Lee K, et al (2019) Bert: pre-training of deep bidirectional transformers for language understanding Devlin J, Chang MW, Lee K, et al (2019) Bert: pre-training of deep bidirectional transformers for language understanding
Zurück zum Zitat Dey K, Shrivastava R, Kaushik S (2017) Twitter stance detection-a subjectivity and sentiment polarity inspired two-phase approach. In: IEEE international conference on data mining workshops (ICDMW), pp 365–372. http://www.noslang.com/dictionary Dey K, Shrivastava R, Kaushik S (2017) Twitter stance detection-a subjectivity and sentiment polarity inspired two-phase approach. In: IEEE international conference on data mining workshops (ICDMW), pp 365–372. http://​www.​noslang.​com/​dictionary
Zurück zum Zitat Ebrahimi J, Dou D, Lowd D (2016) A joint sentiment-target-stance model for stance classification in tweets. In: Proceedings of COLING 2016, the 26th international conference on computational linguistics, pp 2656–2665 Ebrahimi J, Dou D, Lowd D (2016) A joint sentiment-target-stance model for stance classification in tweets. In: Proceedings of COLING 2016, the 26th international conference on computational linguistics, pp 2656–2665
Zurück zum Zitat Fang W, Nadeem M, Mohtarami M, et al (2019) Neural multi-task learning for stance prediction. In: Proceedings of the second workshop on fact extraction and verification (FEVER), pp 13–19. https://data.quora.com/ Fang W, Nadeem M, Mohtarami M, et al (2019) Neural multi-task learning for stance prediction. In: Proceedings of the second workshop on fact extraction and verification (FEVER), pp 13–19. https://​data.​quora.​com/​
Zurück zum Zitat Gómez-Suta M, Echeverry-Correa J, Soto-Mejía JA (2023) Stance detection in tweets: a topic modeling approach supporting explainability. Expert Syst Appl 214(119):046 Gómez-Suta M, Echeverry-Correa J, Soto-Mejía JA (2023) Stance detection in tweets: a topic modeling approach supporting explainability. Expert Syst Appl 214(119):046
Zurück zum Zitat Hacohen-Kerner Y, Ido Z, Ya’akobov R (2017) Stance classification of tweets using skip char ngrams. In: Joint European conference on machine learning and knowledge discovery in databases, pp 266–278 Hacohen-Kerner Y, Ido Z, Ya’akobov R (2017) Stance classification of tweets using skip char ngrams. In: Joint European conference on machine learning and knowledge discovery in databases, pp 266–278
Zurück zum Zitat Hanselowski A, Schiller PVSAB, et al (2018) A retrospective analysis of the fake news challenge stance-detection task. In: Proceedings of the 27th international conference on computational linguistics (COLING 2018) Hanselowski A, Schiller PVSAB, et al (2018) A retrospective analysis of the fake news challenge stance-detection task. In: Proceedings of the 27th international conference on computational linguistics (COLING 2018)
Zurück zum Zitat Hardalov M, Arora A, Nakov P, et al (2021) Cross-domain label-adaptive stance detection. In: Proceedings of the 2021 conference on empirical methods in natural language processing, pp 9011–9028 Hardalov M, Arora A, Nakov P, et al (2021) Cross-domain label-adaptive stance detection. In: Proceedings of the 2021 conference on empirical methods in natural language processing, pp 9011–9028
Zurück zum Zitat Hosseinia M, Dragut E, Mukherjee A (2020) Stance prediction for contemporary issues: data and experiments. In: Proceedings of the eighth international workshop on natural language processing for social media. https://doi.org/10.18653/v1/P17 Hosseinia M, Dragut E, Mukherjee A (2020) Stance prediction for contemporary issues: data and experiments. In: Proceedings of the eighth international workshop on natural language processing for social media. https://​doi.​org/​10.​18653/​v1/​P17
Zurück zum Zitat Kendall A, Gal Y, Cipolla R (2018) Multi-task learning using uncertainty to weigh losses for scene geometry and semantics. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7482–7491 Kendall A, Gal Y, Cipolla R (2018) Multi-task learning using uncertainty to weigh losses for scene geometry and semantics. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7482–7491
Zurück zum Zitat Kingma DP, Ba JL (2015) Adam: a method for stochastic optimization. In: 3rd International conference on learning representations, ICLR 2015-conference track proceedings Kingma DP, Ba JL (2015) Adam: a method for stochastic optimization. In: 3rd International conference on learning representations, ICLR 2015-conference track proceedings
Zurück zum Zitat Lai M, Cignarella AT, Irazú D, et al (2017) itacos at ibereval2017: detecting stance in catalan and spanish tweets. In: Proceedings of the second workshop on evaluation of human language technologies for Iberian languages (IberEval 2017), pp 185–192 Lai M, Cignarella AT, Irazú D, et al (2017) itacos at ibereval2017: detecting stance in catalan and spanish tweets. In: Proceedings of the second workshop on evaluation of human language technologies for Iberian languages (IberEval 2017), pp 185–192
Zurück zum Zitat Li Y, Caragea C (2019) Multi-task stance detection with sentiment and stance lexicons. In: Proceedings of the 2019 conference on empirical methods in natural language processing and the 9th international joint conference on natural language processing, pp 6299–6305 Li Y, Caragea C (2019) Multi-task stance detection with sentiment and stance lexicons. In: Proceedings of the 2019 conference on empirical methods in natural language processing and the 9th international joint conference on natural language processing, pp 6299–6305
Zurück zum Zitat Li Y, Tian X, Liu T, et al (2015) Multi-task model and feature joint learning, pp 3643–3649 Li Y, Tian X, Liu T, et al (2015) Multi-task model and feature joint learning, pp 3643–3649
Zurück zum Zitat Liu X, He P, Chen W, et al (2019a) Multi-task deep neural networks for natural language understanding. In: Proceedings of the 57th annual meeting of the association for computational linguistics, pp 4487–4496 Liu X, He P, Chen W, et al (2019a) Multi-task deep neural networks for natural language understanding. In: Proceedings of the 57th annual meeting of the association for computational linguistics, pp 4487–4496
Zurück zum Zitat Liu Y, Zhang X, Wegsman D, et al (2022) Politics: pretraining with same-story article comparison for ideology prediction and stance detection Liu Y, Zhang X, Wegsman D, et al (2022) Politics: pretraining with same-story article comparison for ideology prediction and stance detection
Zurück zum Zitat Mahabadi RK, Ruder S, Dehghani M, et al (2021) Parameter-efficient multi-task fine-tuning for transformers via shared hypernetworks. In: Proceedings of the 59th annual meeting of the association for computational linguistics and the 11th international joint conference on natural language processing (volume 1: long papers), pp 565–576 Mahabadi RK, Ruder S, Dehghani M, et al (2021) Parameter-efficient multi-task fine-tuning for transformers via shared hypernetworks. In: Proceedings of the 59th annual meeting of the association for computational linguistics and the 11th international joint conference on natural language processing (volume 1: long papers), pp 565–576
Zurück zum Zitat Mao Y, Wang Z, Liu W, et al (2021) Banditmtl: bandit-based multi-task learning for text classification. In: Proceedings of the 59th annual meeting of the association for computational linguistics and the 11th international joint conference on natural language processing (volume 1: long papers), pp 5506–5516 Mao Y, Wang Z, Liu W, et al (2021) Banditmtl: bandit-based multi-task learning for text classification. In: Proceedings of the 59th annual meeting of the association for computational linguistics and the 11th international joint conference on natural language processing (volume 1: long papers), pp 5506–5516
Zurück zum Zitat Mao Y, Wang Z, Liu W et al (2022) Metaweighting: learning to weight tasks in multi-task learning. Find Assoc Comput Linguist ACL 2022:3436–3448CrossRef Mao Y, Wang Z, Liu W et al (2022) Metaweighting: learning to weight tasks in multi-task learning. Find Assoc Comput Linguist ACL 2022:3436–3448CrossRef
Zurück zum Zitat Mohtarami M, Glass J, Nakov P (2019) Contrastive language adaptation for cross-lingual stance detection. In: 2019 Conference on empirical methods in natural language processing and the 9th international joint conference on natural language processing (EMNLP-IJCNLP), pp 4442–4452. arXiv:1910.02076 Mohtarami M, Glass J, Nakov P (2019) Contrastive language adaptation for cross-lingual stance detection. In: 2019 Conference on empirical methods in natural language processing and the 9th international joint conference on natural language processing (EMNLP-IJCNLP), pp 4442–4452. arXiv:​1910.​02076
Zurück zum Zitat Poddar L, Hsu W, Lee ML, et al (2018) Predicting stances in twitter conversations for detecting veracity of rumors: A neural approach. In: 2018 IEEE 30th international conference on tools with artificial intelligence (ICTAI). IEEE, pp 65–72 Poddar L, Hsu W, Lee ML, et al (2018) Predicting stances in twitter conversations for detecting veracity of rumors: A neural approach. In: 2018 IEEE 30th international conference on tools with artificial intelligence (ICTAI). IEEE, pp 65–72
Zurück zum Zitat Raffel C, Shazeer N, Roberts A et al (2020) Exploring the limits of transfer learning with a unified text-to-text transformer. J Mach Learn Res 21(1):5485–5551MathSciNet Raffel C, Shazeer N, Roberts A et al (2020) Exploring the limits of transfer learning with a unified text-to-text transformer. J Mach Learn Res 21(1):5485–5551MathSciNet
Zurück zum Zitat Ribeiro MT, Singh S, Guestrin C (2016) "why should I trust you?": explaining the predictions of any classifier. In: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, San Francisco, CA, USA, August 13–17, 2016, pp 1135–1144 Ribeiro MT, Singh S, Guestrin C (2016) "why should I trust you?": explaining the predictions of any classifier. In: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, San Francisco, CA, USA, August 13–17, 2016, pp 1135–1144
Zurück zum Zitat Ruder S, Peters M, Swayamdipta S, et al (2019) Transfer learning in natural language processing tutorial. In: NAACL HLT 2019–2019 conference of the north american chapter of the association for computational linguistics: human language technologies-tutorial abstracts Ruder S, Peters M, Swayamdipta S, et al (2019) Transfer learning in natural language processing tutorial. In: NAACL HLT 2019–2019 conference of the north american chapter of the association for computational linguistics: human language technologies-tutorial abstracts
Zurück zum Zitat Sobhani P, Mohammad SM, Kiritchenko S (2016) Detecting stance in tweets and analyzing its interaction with sentiment. In: Proceedings of the fifth joint conference on lexical and computational semantics (SEM 2016), pp 159–169 Sobhani P, Mohammad SM, Kiritchenko S (2016) Detecting stance in tweets and analyzing its interaction with sentiment. In: Proceedings of the fifth joint conference on lexical and computational semantics (SEM 2016), pp 159–169
Zurück zum Zitat Song W, Song Z, Liu L, et al (2020) Hierarchical multi-task learning for organization evaluation of argumentative student essays. In: IJCAI, pp 3875–3881 Song W, Song Z, Liu L, et al (2020) Hierarchical multi-task learning for organization evaluation of argumentative student essays. In: IJCAI, pp 3875–3881
Zurück zum Zitat Sun L, Li X, Zhang B, et al (2019a) Learning stance classification with recurrent neural capsule network. In: CCF international conference on natural language processing and Chinese computing, pp 277–289 Sun L, Li X, Zhang B, et al (2019a) Learning stance classification with recurrent neural capsule network. In: CCF international conference on natural language processing and Chinese computing, pp 277–289
Zurück zum Zitat Upadhyaya A, Fisichella M, Nejdl W (2023a) A multi-task model for sentiment aided stance detection of climate change tweets. In: Proceedings of the international AAAI conference on web and social media, pp 854–865 Upadhyaya A, Fisichella M, Nejdl W (2023a) A multi-task model for sentiment aided stance detection of climate change tweets. In: Proceedings of the international AAAI conference on web and social media, pp 854–865
Zurück zum Zitat Upadhyaya A, Fisichella M, Nejdl W (2023b) A multi-task model for sentiment aided stance detection of climate change tweets. In: Proceedings of the international AAAI conference on web and social media, pp 854–865 Upadhyaya A, Fisichella M, Nejdl W (2023b) A multi-task model for sentiment aided stance detection of climate change tweets. In: Proceedings of the international AAAI conference on web and social media, pp 854–865
Zurück zum Zitat Vamvas J, Sennrich R (2020) X-stance: a multilingual multi-target dataset for stance detection. In: 5th SwissText and 16th KONVENS joint conference 2020. arXiv:2003.08385 Vamvas J, Sennrich R (2020) X-stance: a multilingual multi-target dataset for stance detection. In: 5th SwissText and 16th KONVENS joint conference 2020. arXiv:​2003.​08385
Zurück zum Zitat Wang H, Wang Y, Song X et al (2023) Quantifying controversy from stance, sentiment, offensiveness and sarcasm: a fine-grained controversy intensity measurement framework on a Chinese dataset. World Wide Web 26(5):3607–3632CrossRef Wang H, Wang Y, Song X et al (2023) Quantifying controversy from stance, sentiment, offensiveness and sarcasm: a fine-grained controversy intensity measurement framework on a Chinese dataset. World Wide Web 26(5):3607–3632CrossRef
Zurück zum Zitat Wei P, Xu N, Mao W (2019) Modeling conversation structure and temporal dynamics for jointly predicting rumor stance and veracity. In: Proceedings of the 2019 conference on empirical methods in natural language processing and the 9th international joint conference on natural language processing (EMNLP-IJCNLP), pp 4787–4798. arXiv:1909.08211 Wei P, Xu N, Mao W (2019) Modeling conversation structure and temporal dynamics for jointly predicting rumor stance and veracity. In: Proceedings of the 2019 conference on empirical methods in natural language processing and the 9th international joint conference on natural language processing (EMNLP-IJCNLP), pp 4787–4798. arXiv:​1909.​08211
Zurück zum Zitat Wu Y, Schuster M, Chen Z, et al (2016) Google’s neural machine translation system: Bridging the gap between human and machine translation. arXiv:1609.08144 Wu Y, Schuster M, Chen Z, et al (2016) Google’s neural machine translation system: Bridging the gap between human and machine translation. arXiv:​1609.​08144
Zurück zum Zitat Yang M, Chen L, Chen X, et al (2019) Knowledge-enhanced hierarchical attention for community question answering with multi-task and adaptive learning. In: IJCAI, pp 5349–5355 Yang M, Chen L, Chen X, et al (2019) Knowledge-enhanced hierarchical attention for community question answering with multi-task and adaptive learning. In: IJCAI, pp 5349–5355
Zurück zum Zitat Ye K, Piao Y, Zhao K, et al (2021) Graph enhanced bert for stance-aware rumor verification on social media. In: International conference on artificial neural networks. Springer, pp 422–435 Ye K, Piao Y, Zhao K, et al (2021) Graph enhanced bert for stance-aware rumor verification on social media. In: International conference on artificial neural networks. Springer, pp 422–435
Zurück zum Zitat Zhang Y, Yang Q (2021) A survey on multi-task learning. In: IEEE transactions on knowledge and data engineering, pp 1–20. arXiv:1707.08114 Zhang Y, Yang Q (2021) A survey on multi-task learning. In: IEEE transactions on knowledge and data engineering, pp 1–20. arXiv:​1707.​08114
Zurück zum Zitat Zhang Y, Ma D, Tiwari P et al (2023) Stance-level sarcasm detection with bert and stance-centered graph attention networks. ACM Trans Internet Technol 23(2):1–21CrossRef Zhang Y, Ma D, Tiwari P et al (2023) Stance-level sarcasm detection with bert and stance-centered graph attention networks. ACM Trans Internet Technol 23(2):1–21CrossRef
Metadaten
Titel
Enhancing stance detection through sequential weighted multi-task learning
verfasst von
Nora Alturayeif
Hamzah Luqman
Moataz Ahmed
Publikationsdatum
01.12.2024
Verlag
Springer Vienna
Erschienen in
Social Network Analysis and Mining / Ausgabe 1/2024
Print ISSN: 1869-5450
Elektronische ISSN: 1869-5469
DOI
https://doi.org/10.1007/s13278-023-01169-7

Weitere Artikel der Ausgabe 1/2024

Social Network Analysis and Mining 1/2024 Zur Ausgabe

Premium Partner