2020 | OriginalPaper | Book chapter

Irony Detection in Bengali Tweets: A New Dataset, Experimentation and Results

Authors: Adhiraj Ghosh, Kamal Sarkar

Published in: Computational Intelligence in Data Science

Publisher: Springer International Publishing

Abstract

Irony detection is difficult because the intended meaning of a sentence differs from its literal meaning or sentiment. Most existing work on this subject has focused on irony detection in English. Since no public dataset is available for this task in Bengali, we have created a Bengali irony detection dataset containing 1500 labeled Bengali tweets. This paper describes the dataset and reports results obtained on it using several widely used machine learning algorithms such as Naïve Bayes, Support Vector Machine, K-Nearest Neighbor and Random Forest.
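
As a rough illustration of the kind of classifier the abstract names, the following is a minimal sketch of a multinomial Naïve Bayes text classifier with add-one smoothing. It is not the authors' implementation: the whitespace tokenizer, the class name `NaiveBayesSketch`, the label strings, and the toy English sentences are all hypothetical placeholders, and the actual features and preprocessing used in the chapter are not described on this page.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    # Hypothetical simplification: lowercase and split on whitespace.
    return text.lower().split()


class NaiveBayesSketch:
    """Multinomial Naive Bayes with add-one (Laplace) smoothing."""

    def fit(self, texts, labels):
        # Count documents per class and word occurrences per class.
        self.class_counts = Counter(labels)
        self.word_counts = defaultdict(Counter)
        self.vocab = set()
        for text, label in zip(texts, labels):
            tokens = tokenize(text)
            self.word_counts[label].update(tokens)
            self.vocab.update(tokens)
        return self

    def predict(self, text):
        total_docs = sum(self.class_counts.values())
        best_label, best_score = None, -math.inf
        for label, n_docs in self.class_counts.items():
            # Log prior of the class.
            score = math.log(n_docs / total_docs)
            total_words = sum(self.word_counts[label].values())
            for token in tokenize(text):
                if token not in self.vocab:
                    continue  # ignore words never seen in training
                count = self.word_counts[label][token]
                # Smoothed log likelihood of the token given the class.
                score += math.log((count + 1) / (total_words + len(self.vocab)))
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Toy placeholder sentences, NOT taken from the Bengali dataset.
train_texts = [
    "great another monday morning traffic jam",
    "love waiting two hours for the bus",
    "the match starts at seven tonight",
    "rain is expected in kolkata tomorrow",
]
train_labels = ["ironic", "ironic", "not_ironic", "not_ironic"]

model = NaiveBayesSketch().fit(train_texts, train_labels)
prediction = model.predict("love another traffic jam")
print(prediction)
```

A bag-of-words model like this only sees surface vocabulary, which is one reason irony, where the literal wording and the intended meaning diverge, is hard for such classifiers.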

Metadata
Title
Irony Detection in Bengali Tweets: A New Dataset, Experimentation and Results
Authors
Adhiraj Ghosh
Kamal Sarkar
Copyright year
2020
DOI
https://doi.org/10.1007/978-3-030-63467-4_9
