Skip to main content
Erschienen in: Journal of Intelligent Information Systems 2/2023

20.08.2022

Multi-task learning for toxic comment classification and rationale extraction

verfasst von: Kiran Babu Nelatoori, Hima Bindu Kommanti

Erschienen in: Journal of Intelligent Information Systems | Ausgabe 2/2023

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Social media content moderation is the standard practice as on today to promote healthy discussion forums. Toxic span prediction is helpful for explaining the toxic comment classification labels, thus is an important step towards building automated moderation systems. The relation between toxic comment classification and toxic span prediction makes joint learning objective meaningful. We propose a multi-task learning model using ToxicXLMR for bidirectional contextual embeddings of input text for toxic comment classification, and a Bi-LSTM CRF layer for toxic span or rationale identification. To enable multi-task learning in this domain, we have curated a dataset from Jigsaw and Toxic span prediction datasets. The proposed model outperformed the single task models on the curated and toxic span prediction datasets with 4% and 2% improvement for classification and rationale identification, respectively. We investigated the domain adaptation ability of the proposed MTL model on HASOC and OLID datasets that contain the out of domain text from Twitter and found a 3% improvement in the F1 score over single task models.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Caruana, R.A (1993). Multitask connectionist learning. In Proceedings of the 1993 connectionist models summer school. Caruana, R.A (1993). Multitask connectionist learning. In Proceedings of the 1993 connectionist models summer school.
Zurück zum Zitat Chen, Q., Zhuo, Z., & Wang, W. (2019). Bert for joint intent classification and slot filling. arXiv:1902.10909 Chen, Q., Zhuo, Z., & Wang, W. (2019). Bert for joint intent classification and slot filling. arXiv:1902.​10909
Zurück zum Zitat Da San Martino, G., Yu, S., Barrón-Cedeño, A., & et al (2019). Fine-grained analysis of propaganda in news article. In Proceedings of the 2019 conference on empirical methods in natural language processing and the 9th international joint conference on natural language processing (EMNLP-IJCNLP). https://doi.org/10.18653/v1/D19-1565 (pp. 5636–5646). Da San Martino, G., Yu, S., Barrón-Cedeño, A., & et al (2019). Fine-grained analysis of propaganda in news article. In Proceedings of the 2019 conference on empirical methods in natural language processing and the 9th international joint conference on natural language processing (EMNLP-IJCNLP). https://​doi.​org/​10.​18653/​v1/​D19-1565 (pp. 5636–5646).
Zurück zum Zitat Dellerman, D. (2022). Influence of cyberbullying on suicidal behaviors. Ph.D. Thesis, Walden University. Dellerman, D. (2022). Influence of cyberbullying on suicidal behaviors. Ph.D. Thesis, Walden University.
Zurück zum Zitat Devlin, J., Chang, M.-W., Lee, K., & et al (2019). BERT: pre-training of deep bidirectional transformers for language understanding. In Proceedings of the 2019 conference of the North American chapter of the association for computational linguistics: human language technologies, volume 1 (Long and Short Papers). https://doi.org/10.18653/v1/N19-1423 (pp. 4171–4186). Devlin, J., Chang, M.-W., Lee, K., & et al (2019). BERT: pre-training of deep bidirectional transformers for language understanding. In Proceedings of the 2019 conference of the North American chapter of the association for computational linguistics: human language technologies, volume 1 (Long and Short Papers). https://​doi.​org/​10.​18653/​v1/​N19-1423 (pp. 4171–4186).
Zurück zum Zitat Huang, Z., Xu, W., & Yu, K. (2015). Bidirectional lstm-crf models for sequence tagging. arXiv:1508.01991 Huang, Z., Xu, W., & Yu, K. (2015). Bidirectional lstm-crf models for sequence tagging. arXiv:1508.​01991
Zurück zum Zitat Karen, S., Andrea, V., & Andrew, Z. (2014). Deep inside convolutional networks: visualising image classification models and saliency maps. arXiv:1312.6034 Karen, S., Andrea, V., & Andrew, Z. (2014). Deep inside convolutional networks: visualising image classification models and saliency maps. arXiv:1312.​6034
Zurück zum Zitat Liu, P., Li, W., & Zou, L. (2019a). NULI at SemEval-2019 task 6: Transfer learning for offensive language detection using bidirectional transformers. In Proceedings of the 13th international workshop on semantic evaluation. https://doi.org/10.18653/v1/S19-2011 (pp. 87–91). Liu, P., Li, W., & Zou, L. (2019a). NULI at SemEval-2019 task 6: Transfer learning for offensive language detection using bidirectional transformers. In Proceedings of the 13th international workshop on semantic evaluation. https://​doi.​org/​10.​18653/​v1/​S19-2011 (pp. 87–91).
Zurück zum Zitat Liu, Y., Ott, M., Goyal, N., & et al. (2019b). Roberta: A robustly optimized bert pretraining approach. arXiv:1907.11692 Liu, Y., Ott, M., Goyal, N., & et al. (2019b). Roberta: A robustly optimized bert pretraining approach. arXiv:1907.​11692
Zurück zum Zitat Ma, X., & Hovy, E. (2016). End-to-end sequence labeling via bi-directional LSTM-CNNs-CRF. In Proceedings of the 54th annual meeting of the association for computational linguistics (volume 1: long papers). https://doi.org/10.18653/v1/P16-1101(pp. 1064–1074). Ma, X., & Hovy, E. (2016). End-to-end sequence labeling via bi-directional LSTM-CNNs-CRF. In Proceedings of the 54th annual meeting of the association for computational linguistics (volume 1: long papers). https://​doi.​org/​10.​18653/​v1/​P16-1101(pp. 1064–1074).
Zurück zum Zitat McCann, B., Keskar, N.S., Xiong, C., & et al. (2018). The natural language decathlon: Multitask learning as question answering. arXiv:1806.08730 McCann, B., Keskar, N.S., Xiong, C., & et al. (2018). The natural language decathlon: Multitask learning as question answering. arXiv:1806.​08730
Zurück zum Zitat Mozafari, M., Farahbakhsh, R., & Crespi, N. (2019). A BERT-based transfer learning approach for hate speech detection in online social media. In Complex networks 2019: 8th international conference on complex networks and their applications. https://doi.org/10.1007/978-3-030-36687-277 (pp. 928–940). Mozafari, M., Farahbakhsh, R., & Crespi, N. (2019). A BERT-based transfer learning approach for hate speech detection in online social media. In Complex networks 2019: 8th international conference on complex networks and their applications. https://​doi.​org/​10.​1007/​978-3-030-36687-277 (pp. 928–940).
Zurück zum Zitat Pamungkas, EW, & Patti, V. (2019). Cross-domain and cross-lingual abusive language detection: A hybrid approach with deep learning and a multilingual lexicon. In Proceedings of the 57th annual meeting of the association for computational linguistics: student research workshop. https://doi.org/10.18653/v1/P19-2051 (pp. 363–370). Pamungkas, EW, & Patti, V. (2019). Cross-domain and cross-lingual abusive language detection: A hybrid approach with deep learning and a multilingual lexicon. In Proceedings of the 57th annual meeting of the association for computational linguistics: student research workshop. https://​doi.​org/​10.​18653/​v1/​P19-2051 (pp. 363–370).
Zurück zum Zitat Park, JH, & Fung, P. (2017). One-step and two-step classification for abusive language detection on Twitter. In Proceedings of the first workshop on abusive language online. https://doi.org/10.18653/v1/W17-3006 (pp. 41–45). Vancouver: Association for Computational Linguistics. Park, JH, & Fung, P. (2017). One-step and two-step classification for abusive language detection on Twitter. In Proceedings of the first workshop on abusive language online. https://​doi.​org/​10.​18653/​v1/​W17-3006 (pp. 41–45). Vancouver: Association for Computational Linguistics.
Zurück zum Zitat Ramsundar, B., Kearnes, S., Riley, P., & et al. (2015). Massively multitask networks for drug discovery. arXiv:1502.02072 Ramsundar, B., Kearnes, S., Riley, P., & et al. (2015). Massively multitask networks for drug discovery. arXiv:1502.​02072
Zurück zum Zitat Ribeiro, MT, Singh, S., & Guestrin, C. (2016). “Why should i trust you?” Explaining the predictions of any classifier. In Proceedings of the ACM SIGKDD international conference on knowledge discovery and data mining. https://doi.org/10.1145/2939672.2939778 (pp. 1135–1144). Ribeiro, MT, Singh, S., & Guestrin, C. (2016). “Why should i trust you?” Explaining the predictions of any classifier. In Proceedings of the ACM SIGKDD international conference on knowledge discovery and data mining. https://​doi.​org/​10.​1145/​2939672.​2939778 (pp. 1135–1144).
Zurück zum Zitat Ruder, S. (2017). An overview of multi-task learning in deep neural networks. arXiv:1706.05098 Ruder, S. (2017). An overview of multi-task learning in deep neural networks. arXiv:1706.​05098
Zurück zum Zitat Sharma, M., Kandasamy, I., & Vasantha, W.b. (2021). YoungSheldon at SemEval-2021 task 5: Fine-tuning pre-trained language models for toxic spans detection using token classification objective. In Proceedings of the 15th international workshop on semantic evaluation (SemEval-2021). https://doi.org/10.18653/v1/2021.semeval-1.130(pp. 953–959). Sharma, M., Kandasamy, I., & Vasantha, W.b. (2021). YoungSheldon at SemEval-2021 task 5: Fine-tuning pre-trained language models for toxic spans detection using token classification objective. In Proceedings of the 15th international workshop on semantic evaluation (SemEval-2021). https://​doi.​org/​10.​18653/​v1/​2021.​semeval-1.​130(pp. 953–959).
Zurück zum Zitat Temper, M., Poisel, R., & Tjoa, S. (2013). Facebook watchdog: A research agenda for detecting online grooming and bullying activities. In IEEE International conference on systems, man, and cybernetics, SMC. https://doi.org/10.1109/SMC.2013.487 (pp. 2854–2859). Temper, M., Poisel, R., & Tjoa, S. (2013). Facebook watchdog: A research agenda for detecting online grooming and bullying activities. In IEEE International conference on systems, man, and cybernetics, SMC. https://​doi.​org/​10.​1109/​SMC.​2013.​487 (pp. 2854–2859).
Zurück zum Zitat Wang, B., Ding, Y., Liu, S., & Zhou, X. (2019). Ynu_wb at HASOC 2019: Ordered neurons LSTM with attention for identifying hate speech and offensive language. In Working notes of FIRE 2019 - forum for information retrieval evaluation. http://ceur-ws.org/Vol-2517/T3-2.pdf (pp. 191–198). Wang, B., Ding, Y., Liu, S., & Zhou, X. (2019). Ynu_wb at HASOC 2019: Ordered neurons LSTM with attention for identifying hate speech and offensive language. In Working notes of FIRE 2019 - forum for information retrieval evaluation. http://​ceur-ws.​org/​Vol-2517/​T3-2.​pdf (pp. 191–198).
Zurück zum Zitat Wiegreffe, S., & Pinter, Y. (2019). Attention is not not explanation. In Proceedings of the 2019 conference on empirical methods in natural language processing and the 9th international joint conference on natural language processing (EMNLP-IJCNLP). https://doi.org/10.18653/v1/D19-1002 (pp. 11–20). Wiegreffe, S., & Pinter, Y. (2019). Attention is not not explanation. In Proceedings of the 2019 conference on empirical methods in natural language processing and the 9th international joint conference on natural language processing (EMNLP-IJCNLP). https://​doi.​org/​10.​18653/​v1/​D19-1002 (pp. 11–20).
Zurück zum Zitat Xiang, T., Macavaney, S., Yang, E., & et al (2021). Toxccin: Toxic content classification with interpretability. In Proceedings of the 11th workshop on computational approaches to subjectivity, sentiment and social media analysis. https://aclanthology.org/2021.wassa-1.1 (pp. 1–12). Xiang, T., Macavaney, S., Yang, E., & et al (2021). Toxccin: Toxic content classification with interpretability. In Proceedings of the 11th workshop on computational approaches to subjectivity, sentiment and social media analysis. https://​aclanthology.​org/​2021.​wassa-1.​1 (pp. 1–12).
Zurück zum Zitat Zaidan, O., Eisner, J., & Piatko, C. (2007). Using “annotator rationales” to improve machine learning for text categorization. In Human language technologies 2007: the conference of the North American chapter of the association for computational linguistics; proceedings of the main conference. https://aclanthology.org/N07-1033 (pp. 260–267). Zaidan, O., Eisner, J., & Piatko, C. (2007). Using “annotator rationales” to improve machine learning for text categorization. In Human language technologies 2007: the conference of the North American chapter of the association for computational linguistics; proceedings of the main conference. https://​aclanthology.​org/​N07-1033 (pp. 260–267).
Zurück zum Zitat Zhu, Q., Lin, Z., Zhang, Y., & et al (2021). HITSZ-HLT at SemEval-2021 task 5: Ensemble sequence labeling and span boundary detection for toxic span detection. In Proceedings of the 15th international workshop on semantic evaluation (SemEval-2021). https://doi.org/10.18653/v1/2021.semeval-1.63 (pp. 521–526). Zhu, Q., Lin, Z., Zhang, Y., & et al (2021). HITSZ-HLT at SemEval-2021 task 5: Ensemble sequence labeling and span boundary detection for toxic span detection. In Proceedings of the 15th international workshop on semantic evaluation (SemEval-2021). https://​doi.​org/​10.​18653/​v1/​2021.​semeval-1.​63 (pp. 521–526).
Metadaten
Titel
Multi-task learning for toxic comment classification and rationale extraction
verfasst von
Kiran Babu Nelatoori
Hima Bindu Kommanti
Publikationsdatum
20.08.2022
Verlag
Springer US
Erschienen in
Journal of Intelligent Information Systems / Ausgabe 2/2023
Print ISSN: 0925-9902
Elektronische ISSN: 1573-7675
DOI
https://doi.org/10.1007/s10844-022-00726-4

Weitere Artikel der Ausgabe 2/2023

Journal of Intelligent Information Systems 2/2023 Zur Ausgabe

Premium Partner