
2020 | OriginalPaper | Book chapter

Detecting Hate Speech Online: A Case of Croatian

Written by: Kristina Kocijan, Lucija Košković, Petra Bajac

Published in: Formalizing Natural Languages with NooJ 2019 and Its Natural Language Processing Applications

Publisher: Springer International Publishing


Abstract

This project proposes a NooJ algorithm for finding and categorizing slurs, insults and, ultimately, hate speech in Croatian. The results also give a more detailed insight into inappropriate language in Croatian. We strongly emphasize the ethical considerations of (mis)identifying hate speech, which can result in unethical and undeserved censorship of speech that is inappropriate but free. We therefore tried to draw a clear distinction between insults and hate speech.
The test corpus consists of written online comments and remarks posted on five Croatian Facebook news pages during a one-week period. Because informal online communication departs from standard Croatian grammar and syntax, false negatives present the biggest difficulty: some variations (substandard use of cases, spelling errors, colloquialisms) are impossible to predict and therefore extremely hard to implement in the algorithm.


Footnotes
1
At this time, we did not deal with detecting irony in the text.
 
3
Rating of Croatian portals - https://rating.gemius.com/.
 
Metadata
Title
Detecting Hate Speech Online: A Case of Croatian
Written by
Kristina Kocijan
Lucija Košković
Petra Bajac
Copyright year
2020
DOI
https://doi.org/10.1007/978-3-030-38833-1_16
