Skip to main content
Erschienen in: Social Network Analysis and Mining 1/2023

01.12.2023 | Original Article

Towards more robust hate speech detection: using social context and user data

verfasst von: Seema Nagar, Ferdous Ahmed Barbhuiya, Kuntal Dey

Erschienen in: Social Network Analysis and Mining | Ausgabe 1/2023

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In this paper, we present a novel approach to detecting hate speech on Twitter. Our method incorporates textual, social context and language features of the author to better capture the nuances of hate speech and improve detection accuracy. We formalize the idea that an individual’s hateful content is influenced by their social circle and propose a framework that combines text content with social context to detect hate speech. Our framework uses a Variational Graph Auto-encoder to jointly learn the unified features of authors using a social network, language features, and profile information. Additionally, to accommodate emerging and future language models, our framework is designed to be flexible and can incorporate any text encoder as a plug-in to obtain the textual features of the content. We evaluate our method on two diverse Twitter datasets and show that it outperforms existing state-of-the-art methods by a significant margin. Our results suggest that considering social context is a promising direction for improving hate speech detection on Twitter.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Arnold A, Liu Y, Abe N (2007) Temporal causal modeling with graphical granger methods. In: Proceedings of the 13th ACM SIGKDD international conference on knowledge discovery and data mining. ACM, KDD-07, pp 66–75 Arnold A, Liu Y, Abe N (2007) Temporal causal modeling with graphical granger methods. In: Proceedings of the 13th ACM SIGKDD international conference on knowledge discovery and data mining. ACM, KDD-07, pp 66–75
Zurück zum Zitat Badjatiya P, Gupta S, Gupta M, et al (2017) Deep learning for hate speech detection in tweets. In: WWW, pp 759–760 Badjatiya P, Gupta S, Gupta M, et al (2017) Deep learning for hate speech detection in tweets. In: WWW, pp 759–760
Zurück zum Zitat Bahadori MT, Liu Y (2013) An examination of practical granger causality inference. In: Proceedings of the 2013 SIAM international conference on data mining, pp 467–475 Bahadori MT, Liu Y (2013) An examination of practical granger causality inference. In: Proceedings of the 2013 SIAM international conference on data mining, pp 467–475
Zurück zum Zitat Burnap P, Williams ML (2016) Us and them: identifying cyber hate on twitter across multiple protected characteristics. EPJ Data Sci 5(1):11CrossRef Burnap P, Williams ML (2016) Us and them: identifying cyber hate on twitter across multiple protected characteristics. EPJ Data Sci 5(1):11CrossRef
Zurück zum Zitat Coussi-Korbel S, Fragaszy DM (1995) On the relation between social dynamics and social learning. Anim Behav 50(6):1441–1453CrossRef Coussi-Korbel S, Fragaszy DM (1995) On the relation between social dynamics and social learning. Anim Behav 50(6):1441–1453CrossRef
Zurück zum Zitat Davidson T, Warmsley D, Macy M, et al (2017) Automated hate speech detection and the problem of offensive language. In: Eleventh international aaai conference on web and social media Davidson T, Warmsley D, Macy M, et al (2017) Automated hate speech detection and the problem of offensive language. In: Eleventh international aaai conference on web and social media
Zurück zum Zitat Djuric N, Zhou J, Morris R, et al (2015) Hate speech detection with comment embeddings. In: WWW, pp 29–30 Djuric N, Zhou J, Morris R, et al (2015) Hate speech detection with comment embeddings. In: WWW, pp 29–30
Zurück zum Zitat Fadaee M, Fathian M, Beigy H, et al (2013) Combining lexical and machine learning methods for persian hate speech detection. In: 2013 2nd international conference on computer and knowledge engineering (ICCKE), pp 169–174 Fadaee M, Fathian M, Beigy H, et al (2013) Combining lexical and machine learning methods for persian hate speech detection. In: 2013 2nd international conference on computer and knowledge engineering (ICCKE), pp 169–174
Zurück zum Zitat Flache A, Mäs M, Feliciani T et al (2017) Models of social influence: towards the next frontiers. J Artif Soc Soc Simul 20:4CrossRef Flache A, Mäs M, Feliciani T et al (2017) Models of social influence: towards the next frontiers. J Artif Soc Soc Simul 20:4CrossRef
Zurück zum Zitat Founta AM, Djouvas C, Chatzakou D, et al (2018) Large scale crowdsourcing and characterization of twitter abusive behavior. In: ICWSM. AAAI Press Founta AM, Djouvas C, Chatzakou D, et al (2018) Large scale crowdsourcing and characterization of twitter abusive behavior. In: ICWSM. AAAI Press
Zurück zum Zitat Founta AM, Chatzakou D, Kourtellis N, et al (2019) A unified deep learning architecture for abuse detection. In: ACM WebSci, pp 105–114 Founta AM, Chatzakou D, Kourtellis N, et al (2019) A unified deep learning architecture for abuse detection. In: ACM WebSci, pp 105–114
Zurück zum Zitat Friedkin NE, Johnsen EC (2011) Social influence network theory: a sociological examination of small group dynamics, vol 33. Cambridge University Press, CambridgeCrossRef Friedkin NE, Johnsen EC (2011) Social influence network theory: a sociological examination of small group dynamics, vol 33. Cambridge University Press, CambridgeCrossRef
Zurück zum Zitat Friedkin NE, Proskurnikov AV, Tempo R et al (2016) Network science on belief system dynamics under logic constraints. Science 354(6310):321–326MathSciNetCrossRef Friedkin NE, Proskurnikov AV, Tempo R et al (2016) Network science on belief system dynamics under logic constraints. Science 354(6310):321–326MathSciNetCrossRef
Zurück zum Zitat Geweke JF (1984) Measures of conditional linear dependence and feedback between time series. J Am Stat Assoc 79(388):907–915MathSciNetCrossRef Geweke JF (1984) Measures of conditional linear dependence and feedback between time series. J Am Stat Assoc 79(388):907–915MathSciNetCrossRef
Zurück zum Zitat Granger CWJ (1969) Investigating causal relations by econometric models and cross-spectral methods. Econometrica 37(3):424–438CrossRef Granger CWJ (1969) Investigating causal relations by econometric models and cross-spectral methods. Econometrica 37(3):424–438CrossRef
Zurück zum Zitat Gröndahl T, Pajola L, Juuti M, et al (2018) All you need is“ love” evading hate speech detection. In: Proceedings of the 11th ACM workshop on artificial intelligence and security, pp 2–12 Gröndahl T, Pajola L, Juuti M, et al (2018) All you need is“ love” evading hate speech detection. In: Proceedings of the 11th ACM workshop on artificial intelligence and security, pp 2–12
Zurück zum Zitat Hosseini H, Kannan S, Zhang B, et al (2017) Deceiving google’s perspective api built for detecting toxic comments. arXiv:1702.08138 Hosseini H, Kannan S, Zhang B, et al (2017) Deceiving google’s perspective api built for detecting toxic comments. arXiv:​1702.​08138
Zurück zum Zitat Isaksen V, Gambäck B (2020) Using transfer-based language models to detect hateful and offensive language online. In: Proceedings of the fourth workshop on online abuse and harms, pp 16–27 Isaksen V, Gambäck B (2020) Using transfer-based language models to detect hateful and offensive language online. In: Proceedings of the fourth workshop on online abuse and harms, pp 16–27
Zurück zum Zitat Jeong C, Jang S, Shin H, et al (2019) A context-aware citation recommendation model with bert and graph convolutional networks. arXiv:1903.06464 Jeong C, Jang S, Shin H, et al (2019) A context-aware citation recommendation model with bert and graph convolutional networks. arXiv:​1903.​06464
Zurück zum Zitat Kuzma R, Cruickshank IJ, Carley KM (2021) Influencing the influencers: evaluating person-to-person influence on social networks using granger causality. In: International conference on complex networks and their applications. Springer, pp 89–99 Kuzma R, Cruickshank IJ, Carley KM (2021) Influencing the influencers: evaluating person-to-person influence on social networks using granger causality. In: International conference on complex networks and their applications. Springer, pp 89–99
Zurück zum Zitat Kwok I, Wang Y (2013) Locate the hate: detecting tweets against blacks. In: Twenty-seventh AAAI conference on artificial intelligence Kwok I, Wang Y (2013) Locate the hate: detecting tweets against blacks. In: Twenty-seventh AAAI conference on artificial intelligence
Zurück zum Zitat Magu R, Joshi K, Luo J (2017) Detecting the hate code on social media. In: Proceedings of the international AAAI conference on web and social media, pp 608–611 Magu R, Joshi K, Luo J (2017) Detecting the hate code on social media. In: Proceedings of the international AAAI conference on web and social media, pp 608–611
Zurück zum Zitat Mishra P, Del Tredici M, Yannakoudakis H, et al (2019) Abusive language detection with graph convolutional networks. arXiv:1904.04073 Mishra P, Del Tredici M, Yannakoudakis H, et al (2019) Abusive language detection with graph convolutional networks. arXiv:​1904.​04073
Zurück zum Zitat Nagar S, Gupta S, Bahushruth C, et al (2021) Hate speech detection on social media using graph convolutional networks. In: International conference on complex networks and their applications. Springer, pp 3–14 Nagar S, Gupta S, Bahushruth C, et al (2021) Hate speech detection on social media using graph convolutional networks. In: International conference on complex networks and their applications. Springer, pp 3–14
Zurück zum Zitat Nedić A, Olshevsky A, Uribe CA (2019) Graph-theoretic analysis of belief system dynamics under logic constraints. Sci Rep 9(1):1–16CrossRef Nedić A, Olshevsky A, Uribe CA (2019) Graph-theoretic analysis of belief system dynamics under logic constraints. Sci Rep 9(1):1–16CrossRef
Zurück zum Zitat Nobata C, Tetreault J, Thomas A, et al (2016) Abusive language detection in online user content. In: Proceedings of the 25th international conference on world wide web, pp 145–153 Nobata C, Tetreault J, Thomas A, et al (2016) Abusive language detection in online user content. In: Proceedings of the 25th international conference on world wide web, pp 145–153
Zurück zum Zitat Pitsilis G, Piperidis S, Papageorgiou H (2018) Hybrid hate speech detection in greek language. In: 2018 8th international conference on information. Intelligence, systems and applications (IISA), pp 1–6 Pitsilis G, Piperidis S, Papageorgiou H (2018) Hybrid hate speech detection in greek language. In: 2018 8th international conference on information. Intelligence, systems and applications (IISA), pp 1–6
Zurück zum Zitat Ribeiro M, Calais P, dos Santos Y, et al (2017) “like sheep among wolves”: characterizing hateful users on twitter. In: MIS2 workshop at WSDM’2018 Ribeiro M, Calais P, dos Santos Y, et al (2017) “like sheep among wolves”: characterizing hateful users on twitter. In: MIS2 workshop at WSDM’2018
Zurück zum Zitat Rodriguez CAA, Chao YL (2019) Automatic detection of hate speech on facebook using sentiment and emotion analysis. In: 2019 International conference on artificial intelligence in information and communication (ICAIIC), pp 1–5 Rodriguez CAA, Chao YL (2019) Automatic detection of hate speech on facebook using sentiment and emotion analysis. In: 2019 International conference on artificial intelligence in information and communication (ICAIIC), pp 1–5
Zurück zum Zitat Sap M, Card D, Gabriel S, et al (2019) The risk of racial bias in hate speech detection. In: Proceedings of the 57th annual meeting of the association for computational linguistics, pp 1668–1678 Sap M, Card D, Gabriel S, et al (2019) The risk of racial bias in hate speech detection. In: Proceedings of the 57th annual meeting of the association for computational linguistics, pp 1668–1678
Zurück zum Zitat Schmidt A, Wiegand M (2017) A survey on hate speech detection using natural language processing. In: Proceedings of the fifth international workshop on natural language processing for social media, pp 1–10 Schmidt A, Wiegand M (2017) A survey on hate speech detection using natural language processing. In: Proceedings of the fifth international workshop on natural language processing for social media, pp 1–10
Zurück zum Zitat Unsvåg EF, Gambäck B (2018) The effects of user features on twitter hate speech detection. In: Proceedings of the 2nd workshop on abusive language online (ALW2), pp 75–85 Unsvåg EF, Gambäck B (2018) The effects of user features on twitter hate speech detection. In: Proceedings of the 2nd workshop on abusive language online (ALW2), pp 75–85
Zurück zum Zitat Valdés-Sosa P, Sanchez JM, Castellanos A et al (2005) Estimating brain functional connectivity with sparse multivariate autoregression. Philos Trans R Soc Lond B Biol Sci 360:969–81CrossRef Valdés-Sosa P, Sanchez JM, Castellanos A et al (2005) Estimating brain functional connectivity with sparse multivariate autoregression. Philos Trans R Soc Lond B Biol Sci 360:969–81CrossRef
Zurück zum Zitat Warner W, Hirschberg J (2012) Detecting hate speech on the world wide web. In: Second workshop on language in social media, association for computational linguistics, pp 19–26 Warner W, Hirschberg J (2012) Detecting hate speech on the world wide web. In: Second workshop on language in social media, association for computational linguistics, pp 19–26
Zurück zum Zitat Waseem Z, Hovy D (2016) Hateful symbols or hateful people? predictive features for hate speech detection on twitter. In: NAACL student research workshop, pp 88–93 Waseem Z, Hovy D (2016) Hateful symbols or hateful people? predictive features for hate speech detection on twitter. In: NAACL student research workshop, pp 88–93
Metadaten
Titel
Towards more robust hate speech detection: using social context and user data
verfasst von
Seema Nagar
Ferdous Ahmed Barbhuiya
Kuntal Dey
Publikationsdatum
01.12.2023
Verlag
Springer Vienna
Erschienen in
Social Network Analysis and Mining / Ausgabe 1/2023
Print ISSN: 1869-5450
Elektronische ISSN: 1869-5469
DOI
https://doi.org/10.1007/s13278-023-01051-6

Weitere Artikel der Ausgabe 1/2023

Social Network Analysis and Mining 1/2023 Zur Ausgabe

Original Article

Social bot metrics

Premium Partner