Skip to main content
Top

2021 | OriginalPaper | Chapter

Misogynous Text Classification Using SVM and LSTM

Authors : Maibam Debina Devi, Navanath Saharia

Published in: Advanced Computing

Publisher: Springer Singapore

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Discrimination and manipulation are becoming predominant in social network activities. Comments bearing attitudes, such as distress, hate, and aggression in Social Networking Sites (SNS) add fuel to the process of discrimination. This research aims to classify texts, which are misogynous in nature using Support Vector Machine (SVM) and Long-Short Term Memory (LSTM) for user-generated texts of English and Hindi languages written using the Roman script. Approximately 87% accuracy was achieved while SVM was trained with Term Frequency-Inverse Document Frequency (TF-IDF) feature and for Hindi comments approximately 93.43% accuracy was achieved for English using Bidirectional LSTM (Bi-LSTM).

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
2.
go back to reference Ahluwalia, R., Shcherbinina, E., Callow, E., Nascimento, A.C., De Cock, M.: Detecting misogynous tweets. In: IberEval@ SEPLN, pp. 242–248 (2018) Ahluwalia, R., Shcherbinina, E., Callow, E., Nascimento, A.C., De Cock, M.: Detecting misogynous tweets. In: IberEval@ SEPLN, pp. 242–248 (2018)
3.
go back to reference Altın, L.S.M., Bravo, A., Saggion, H.: LaSTUS/TALN at TRAC-2020 trolling, aggression and cyberbullying. In: Proceedings of the Second Workshop on Trolling, Aggression and Cyberbullying, pp. 83–86 (2020) Altın, L.S.M., Bravo, A., Saggion, H.: LaSTUS/TALN at TRAC-2020 trolling, aggression and cyberbullying. In: Proceedings of the Second Workshop on Trolling, Aggression and Cyberbullying, pp. 83–86 (2020)
4.
go back to reference Arras, L., Montavon, G.G., Müller, K.R., Samek, W.: Explaining recurrent neural network predictions in sentiment analysis. arXiv preprint arXiv:1706.07206 (2017) Arras, L., Montavon, G.G., Müller, K.R., Samek, W.: Explaining recurrent neural network predictions in sentiment analysis. arXiv preprint arXiv:​1706.​07206 (2017)
5.
go back to reference Badjatiya, P., Gupta, S., Gupta, M., Varma, V.: Deep learning for hate speech detection in tweets. In: Proceedings of the 26th International Conference on World Wide Web Companion, pp. 759–760 (2017) Badjatiya, P., Gupta, S., Gupta, M., Varma, V.: Deep learning for hate speech detection in tweets. In: Proceedings of the 26th International Conference on World Wide Web Companion, pp. 759–760 (2017)
6.
go back to reference Basile, V., et al.: SemEval-2019 task 5: multilingual detection of hate speech against immigrants and women in twitter. In: Proceedings of the 13th International Workshop on Semantic Evaluation, pp. 54–63 (2019) Basile, V., et al.: SemEval-2019 task 5: multilingual detection of hate speech against immigrants and women in twitter. In: Proceedings of the 13th International Workshop on Semantic Evaluation, pp. 54–63 (2019)
7.
go back to reference Beltagy, I., Lo, K., Cohan, A.: SciBERT: a pretrained language model for scientific text. In: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pp. 3606–3611 (2019) Beltagy, I., Lo, K., Cohan, A.: SciBERT: a pretrained language model for scientific text. In: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pp. 3606–3611 (2019)
8.
go back to reference Bhattacharya, S., et al.: Developing a multilingual annotated corpus of misogyny and aggression. arXiv preprint arXiv:2003.07428 (2020) Bhattacharya, S., et al.: Developing a multilingual annotated corpus of misogyny and aggression. arXiv preprint arXiv:​2003.​07428 (2020)
10.
go back to reference Charton, E., Meurs, M.J., Jean-Louis, L., Gagnon, M.: Using collaborative tagging for text classification: from text classification to opinion mining. In: Informatics, vol. 1, pp. 32–51. Multidisciplinary Digital Publishing Institute (2014) Charton, E., Meurs, M.J., Jean-Louis, L., Gagnon, M.: Using collaborative tagging for text classification: from text classification to opinion mining. In: Informatics, vol. 1, pp. 32–51. Multidisciplinary Digital Publishing Institute (2014)
11.
go back to reference Chekuri, C., Goldwasser, M.H., Raghavan, P., Upfal, E.: Web search using automatic classification. In: Proceedings of the Sixth International Conference on the World Wide Web (1997) Chekuri, C., Goldwasser, M.H., Raghavan, P., Upfal, E.: Web search using automatic classification. In: Proceedings of the Sixth International Conference on the World Wide Web (1997)
12.
go back to reference Conneau, A., Schwenk, H., Barrault, L., Lecun, Y.: Very deep convolutional networks for text classification. In: Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, Long Papers, vol. 1, pp. 1107–1116 (2017) Conneau, A., Schwenk, H., Barrault, L., Lecun, Y.: Very deep convolutional networks for text classification. In: Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, Long Papers, vol. 1, pp. 1107–1116 (2017)
13.
go back to reference Corazza, M., Menini, S., Cabrio, E., Tonelli, S., Villata, S.: A multilingual evaluation for online hate speech detection. ACM Trans. Internet Technol. (TOIT) 20(2), 1–22 (2020)CrossRef Corazza, M., Menini, S., Cabrio, E., Tonelli, S., Villata, S.: A multilingual evaluation for online hate speech detection. ACM Trans. Internet Technol. (TOIT) 20(2), 1–22 (2020)CrossRef
14.
go back to reference Das, A., Gambäck, B.: Code-mixing in social media text: the last language identification frontier? (2015) Das, A., Gambäck, B.: Code-mixing in social media text: the last language identification frontier? (2015)
15.
go back to reference Davidson, T., Warmsley, D., Macy, M., Weber, I.: Automated hate speech detection and the problem of offensive language. In: Eleventh International AAAI Conference on Web and Social Media (2017) Davidson, T., Warmsley, D., Macy, M., Weber, I.: Automated hate speech detection and the problem of offensive language. In: Eleventh International AAAI Conference on Web and Social Media (2017)
16.
go back to reference Devi, M.D., Saharia, N.: Learning adaptable approach to classify sentiment with incremental datasets. Procedia Comput. Sci. 171, 2426–2434 (2020)CrossRef Devi, M.D., Saharia, N.: Learning adaptable approach to classify sentiment with incremental datasets. Procedia Comput. Sci. 171, 2426–2434 (2020)CrossRef
17.
go back to reference Fan, R.E., Chang, K.W., Hsieh, C.J., Wang, X.R., Lin, C.J.: LIBLINEAR: a library for large linear classification. J. Mach. Learn. Res. 9, 1871–1874 (2008)MATH Fan, R.E., Chang, K.W., Hsieh, C.J., Wang, X.R., Lin, C.J.: LIBLINEAR: a library for large linear classification. J. Mach. Learn. Res. 9, 1871–1874 (2008)MATH
18.
go back to reference Frenda, S., Ghanem, B., Montes-y Gómez, M., Rosso, P.: Online hate speech against women: automatic identification of misogyny and sexism on Twitter. J. Intell. Fuzzy Syst. 36(5), 4743–4752 (2019)CrossRef Frenda, S., Ghanem, B., Montes-y Gómez, M., Rosso, P.: Online hate speech against women: automatic identification of misogyny and sexism on Twitter. J. Intell. Fuzzy Syst. 36(5), 4743–4752 (2019)CrossRef
19.
go back to reference Goenaga, I., et al.: Automatic misogyny identification using neural networks. In: IberEval@ SEPLN, pp. 249–254 (2018) Goenaga, I., et al.: Automatic misogyny identification using neural networks. In: IberEval@ SEPLN, pp. 249–254 (2018)
20.
go back to reference Joulin, A., Grave, É., Bojanowski, P., Mikolov, T.: Bag of tricks for efficient text classification. In: Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, Short Papers, vol. 2, pp. 427–431 (2017) Joulin, A., Grave, É., Bojanowski, P., Mikolov, T.: Bag of tricks for efficient text classification. In: Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, Short Papers, vol. 2, pp. 427–431 (2017)
21.
go back to reference Kim, J.: User-generated content (UGC) revolution?: critique of the promise of Youtube. Ph.D. thesis, University of Iowa (2010) Kim, J.: User-generated content (UGC) revolution?: critique of the promise of Youtube. Ph.D. thesis, University of Iowa (2010)
22.
go back to reference Kim, J.: # iamafeminist as the “mother tag”: feminist identification and activism against misogyny on Twitter in South Korea. Fem. Media Stud. 17(5), 804–820 (2017)CrossRef Kim, J.: # iamafeminist as the “mother tag”: feminist identification and activism against misogyny on Twitter in South Korea. Fem. Media Stud. 17(5), 804–820 (2017)CrossRef
23.
go back to reference Kim, Y., Nam, T.: An efficient text filter for adult web documents. In: 2006 8th International Conference Advanced Communication Technology, vol. 1, pp. 3-pp. IEEE (2006) Kim, Y., Nam, T.: An efficient text filter for adult web documents. In: 2006 8th International Conference Advanced Communication Technology, vol. 1, pp. 3-pp. IEEE (2006)
24.
go back to reference Liu, H., Chiroma, F., Cocea, M.: Identification and classification of misogynous tweets using multi-classifier fusion. In: IberEval@ SEPLN, pp. 268–273 (2018) Liu, H., Chiroma, F., Cocea, M.: Identification and classification of misogynous tweets using multi-classifier fusion. In: IberEval@ SEPLN, pp. 268–273 (2018)
25.
go back to reference Lynn, T., Endo, P.T., Rosati, P., Silva, I., Santos, G.L., Ging, D.: A comparison of machine learning approaches for detecting misogynistic speech in urban dictionary. In: 2019 International Conference on Cyber Situational Awareness, Data Analytics And Assessment (CyberSA), pp. 1–8. IEEE (2019) Lynn, T., Endo, P.T., Rosati, P., Silva, I., Santos, G.L., Ging, D.: A comparison of machine learning approaches for detecting misogynistic speech in urban dictionary. In: 2019 International Conference on Cyber Situational Awareness, Data Analytics And Assessment (CyberSA), pp. 1–8. IEEE (2019)
26.
go back to reference MacAvaney, S., Yao, H.R., Yang, E., Russell, K., Goharian, N., Frieder, O.: Hate speech detection: challenges and solutions. PloS One 14(8), e0221152 (2019)CrossRef MacAvaney, S., Yao, H.R., Yang, E., Russell, K., Goharian, N., Frieder, O.: Hate speech detection: challenges and solutions. PloS One 14(8), e0221152 (2019)CrossRef
27.
go back to reference Mullany, L., Trickett, L.: Misogyny hate crime evaluation report (2018) Mullany, L., Trickett, L.: Misogyny hate crime evaluation report (2018)
28.
29.
go back to reference Saharia, N.: Phone-based identification of language in code-mixed social network data. J. Stat. Manag. Syst. 20(4), 565–574 (2017) Saharia, N.: Phone-based identification of language in code-mixed social network data. J. Stat. Manag. Syst. 20(4), 565–574 (2017)
30.
go back to reference Schmidt, S., Schnitzer, S., Rensing, C.: Text classification based filters for a domain-specific search engine. Comput. Ind. 78, 70–79 (2016) CrossRef Schmidt, S., Schnitzer, S., Rensing, C.: Text classification based filters for a domain-specific search engine. Comput. Ind. 78, 70–79 (2016) CrossRef
31.
go back to reference Shushkevich, E., Cardiff, J.: Automatic misogyny detection in social media: a survey. Computación y Sistemas 23(4) (2019) Shushkevich, E., Cardiff, J.: Automatic misogyny detection in social media: a survey. Computación y Sistemas 23(4) (2019)
32.
go back to reference Teodorescu, H.N., Saharia, N.: An internet slang annotated dictionary and its use in assessing message attitude and sentiments. In: 2015 International Conference on Speech Technology and Human-Computer Dialogue (SpeD), pp. 1–8. IEEE (2015) Teodorescu, H.N., Saharia, N.: An internet slang annotated dictionary and its use in assessing message attitude and sentiments. In: 2015 International Conference on Speech Technology and Human-Computer Dialogue (SpeD), pp. 1–8. IEEE (2015)
33.
go back to reference Teodorescu, H.N., Saharia, N.: A semantic analyzer for detecting attitudes on SNs. In: 2016 International Conference on Communications (COMM), pp. 47–50. IEEE (2016) Teodorescu, H.N., Saharia, N.: A semantic analyzer for detecting attitudes on SNs. In: 2016 International Conference on Communications (COMM), pp. 47–50. IEEE (2016)
34.
go back to reference Waseem, Z., Hovy, D.: Hateful symbols or hateful people? Predictive features for hate speech detection on Twitter. In: Proceedings of the NAACL Student Research Workshop, pp. 88–93 (2016) Waseem, Z., Hovy, D.: Hateful symbols or hateful people? Predictive features for hate speech detection on Twitter. In: Proceedings of the NAACL Student Research Workshop, pp. 88–93 (2016)
35.
go back to reference Wiegand, M., Ruppenhofer, J., Kleinbauer, T.: Detection of abusive language: the problem of biased datasets. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Long and Short Papers), vol. 1, pp. 602–608 (2019) Wiegand, M., Ruppenhofer, J., Kleinbauer, T.: Detection of abusive language: the problem of biased datasets. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Long and Short Papers), vol. 1, pp. 602–608 (2019)
36.
go back to reference Yao, M., Chelmis, C., Zois, D.S.: Cyberbullying ends here: towards robust detection of cyberbullying in social media. In: The World Wide Web Conference, pp. 3427–3433 (2019) Yao, M., Chelmis, C., Zois, D.S.: Cyberbullying ends here: towards robust detection of cyberbullying in social media. In: The World Wide Web Conference, pp. 3427–3433 (2019)
Metadata
Title
Misogynous Text Classification Using SVM and LSTM
Authors
Maibam Debina Devi
Navanath Saharia
Copyright Year
2021
Publisher
Springer Singapore
DOI
https://doi.org/10.1007/978-981-16-0401-0_26

Premium Partner