Skip to main content
Top

2017 | OriginalPaper | Chapter

Detecting Interethnic Relations with the Data from Social Media

Authors : Olessia Koltsova, Sergey Nikolenko, Svetlana Alexeeva, Oleg Nagornyy, Sergei Koltcov

Published in: Digital Transformation and Global Society

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

The ability of social media to rapidly disseminate judgements on ethnicity and to influence offline ethnic relations creates demand for the methods of automatic monitoring of ethnicity related online content. In this study we seek to measure the overall volume of ethnicity related discussion in the Russian language social media and to develop an approach that would automatically detect various aspects of attitudes to those ethnic groups. We develop a comprehensive list of ethnonyms and related bigrams that embrace 97 Post-Soviet ethnic groups and obtain all messages containing one of those words from a two-year period from all Russian language social media (N = 2,660,222 texts). We hand-code 7,181 messages where rare ethnicities are overrepresented and train a number of classifiers to recognize different aspects of authors’ attitudes and other text features. After calculating a number of standard quality metrics, we find that we reach good quality in detecting intergroup conflict, positive intergroup contact, and overall negative and positive sentiment. Relevance to the topic of ethnicity and general attitude to an ethnic group are least well predicted, while some aspects such as calls for violence against an ethnic group are not sufficiently present in the data to be predicted.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Apishev, M., Koltsov, S., Koltsova, E.Y., Nikolenko, S., Vorontsov, K.: Mining ethnic content online with additively regularized topic models. Computacion y Sistemas 20, 387–403 (2016). doi:10.13053/CyS-20-3-2473 Apishev, M., Koltsov, S., Koltsova, E.Y., Nikolenko, S., Vorontsov, K.: Mining ethnic content online with additively regularized topic models. Computacion y Sistemas 20, 387–403 (2016). doi:10.​13053/​CyS-20-3-2473
2.
go back to reference Attenberg, J., Ipeirotis, P.G., Provost, F.J.: Beat the machine: challenging workers to find the unknown unknowns. In: Proceedings of 11th AAAI Conference on Human Computation, pp. 2–7 (2011) Attenberg, J., Ipeirotis, P.G., Provost, F.J.: Beat the machine: challenging workers to find the unknown unknowns. In: Proceedings of 11th AAAI Conference on Human Computation, pp. 2–7 (2011)
3.
go back to reference Bartlett, J., et al.: Anti-Social Media. Demos, London (2014) Bartlett, J., et al.: Anti-Social Media. Demos, London (2014)
4.
go back to reference Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent Dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)MATH Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent Dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)MATH
5.
go back to reference Bodrunova, S., Koltsova, O., Nikolenko, S.: Are migranty all the same? Attitudes to re-settlers from post-soviet South in the Russian blogosphere (2016). Unpublished manuscript Bodrunova, S., Koltsova, O., Nikolenko, S.: Are migranty all the same? Attitudes to re-settlers from post-soviet South in the Russian blogosphere (2016). Unpublished manuscript
6.
go back to reference Bodrunova, S.S., Litvinenko, A.A., Gavra, D.P., Yakunin, A.V.: Twitter-based discourse on migrants in Russia: the case of 2013 bashings in Biryulyovo. Int. Rev. Manag. Mark. 5, 97–104 (2015) Bodrunova, S.S., Litvinenko, A.A., Gavra, D.P., Yakunin, A.V.: Twitter-based discourse on migrants in Russia: the case of 2013 bashings in Biryulyovo. Int. Rev. Manag. Mark. 5, 97–104 (2015)
7.
go back to reference Bohlin, L., Edler, D., Lancichinetti, A., Rosvall, M.: Community detection and visualization of networks with the map equation framework. In: Ding, Y., Rousseau, R., Wolfram, D. (eds.) Measuring Scholarly Impact, pp. 3–34. Springer, Cham (2014). doi:10.1007/978-3-319-10377-8_1 Bohlin, L., Edler, D., Lancichinetti, A., Rosvall, M.: Community detection and visualization of networks with the map equation framework. In: Ding, Y., Rousseau, R., Wolfram, D. (eds.) Measuring Scholarly Impact, pp. 3–34. Springer, Cham (2014). doi:10.​1007/​978-3-319-10377-8_​1
8.
go back to reference Burnap, P., Williams, M.L.: Cyber hate speech on Twitter: an application of machine classification and statistical modeling for policy and decision making. Policy Internet 7, 223–242 (2015). doi:10.1002/poi3.85 CrossRef Burnap, P., Williams, M.L.: Cyber hate speech on Twitter: an application of machine classification and statistical modeling for policy and decision making. Policy Internet 7, 223–242 (2015). doi:10.​1002/​poi3.​85 CrossRef
9.
go back to reference Chan, J., et al.: The internet and racial hate crime: offline spillovers from online access. MIS Q.: Manag. Inf. Syst. 40(2), 381–403 (2016)MathSciNetCrossRef Chan, J., et al.: The internet and racial hate crime: offline spillovers from online access. MIS Q.: Manag. Inf. Syst. 40(2), 381–403 (2016)MathSciNetCrossRef
11.
12.
go back to reference Djuric, N., Zhou, J., Morris, R., Grbovic, M., Radosavljevic, V., Bhamidipati, N.: Hate speech detection with comment embeddings. In: Proceedings of the 24th International Conference on World Wide Web, pp. 29–30. ACM (2015). doi:10.1145/2740908.2742760 Djuric, N., Zhou, J., Morris, R., Grbovic, M., Radosavljevic, V., Bhamidipati, N.: Hate speech detection with comment embeddings. In: Proceedings of the 24th International Conference on World Wide Web, pp. 29–30. ACM (2015). doi:10.​1145/​2740908.​2742760
13.
go back to reference Faris, R., Ashar, A., Gasser, U., Joo, D.: Understanding harmful speech online. Berkman Klein Center Research Publication No. 2016-21 (2016). doi:10.2139/ssrn.2882824 Faris, R., Ashar, A., Gasser, U., Joo, D.: Understanding harmful speech online. Berkman Klein Center Research Publication No. 2016-21 (2016). doi:10.​2139/​ssrn.​2882824
14.
go back to reference Gagliardone, I.: Mapping and Analysing Hate Speech Online. Social Science Research Network, Rochester (2014) Gagliardone, I.: Mapping and Analysing Hate Speech Online. Social Science Research Network, Rochester (2014)
15.
go back to reference Gibson, S., Lando, A.L.: Impact of Communication and the Media on Ethnic Conflict. IGI Global, Hershey (2015) Gibson, S., Lando, A.L.: Impact of Communication and the Media on Ethnic Conflict. IGI Global, Hershey (2015)
18.
20.
go back to reference Grishhenko, A.I., Nikolina, N.A.: Expressive ethnonyms as markers of hate speech [Jekspressivnye jetnonimy kak primety jazyka vrazhdy]. In: Hate Speech and Speech of Consent in the Socio-Cultural Context of Modern Society [Jazyk vrazhdy i jazyk soglasija v sociokul’turnom kontekste sovremennosti], pp. 175–187 (2006). (in Russian) Grishhenko, A.I., Nikolina, N.A.: Expressive ethnonyms as markers of hate speech [Jekspressivnye jetnonimy kak primety jazyka vrazhdy]. In: Hate Speech and Speech of Consent in the Socio-Cultural Context of Modern Society [Jazyk vrazhdy i jazyk soglasija v sociokul’turnom kontekste sovremennosti], pp. 175–187 (2006). (in Russian)
21.
go back to reference Kim, Y.-C., Jung, J.-Y., Ball-Rokeach, S.J.: Ethnicity, place, and communication technology: effects of ethnicity on multi-dimensional internet connectedness. Inf. Technol. People 20, 282–303 (2007). doi:10.1108/09593840710822877 CrossRef Kim, Y.-C., Jung, J.-Y., Ball-Rokeach, S.J.: Ethnicity, place, and communication technology: effects of ethnicity on multi-dimensional internet connectedness. Inf. Technol. People 20, 282–303 (2007). doi:10.​1108/​0959384071082287​7 CrossRef
22.
go back to reference Korobkova, O.S.: Hate speech indicators in ethnic membership nominations: sociolinguistic aspect [Markery jazyka vrazhdy v nominacijah jetnicheskoj prinadlezhnosti: so-ciolingvisticheskij aspekt]. Izvestia: Herzen Univ. J. Humanit. Sci. [Izvestija Rossijskogo gosudarstvennogo pedagogicheskogo universiteta im. AI Gercena] 200–205 (2009). (in Russian) Korobkova, O.S.: Hate speech indicators in ethnic membership nominations: sociolinguistic aspect [Markery jazyka vrazhdy v nominacijah jetnicheskoj prinadlezhnosti: so-ciolingvisticheskij aspekt]. Izvestia: Herzen Univ. J. Humanit. Sci. [Izvestija Rossijskogo gosudarstvennogo pedagogicheskogo universiteta im. AI Gercena] 200–205 (2009). (in Russian)
23.
go back to reference Kwok, I., Wang, Y.: Locate the hate: detecting tweets against blacks. In: Proceedings of the 27th AAAI Conference on Artificial Intelligence, AAAI 2013, pp. 1621–1622 (2013) Kwok, I., Wang, Y.: Locate the hate: detecting tweets against blacks. In: Proceedings of the 27th AAAI Conference on Artificial Intelligence, AAAI 2013, pp. 1621–1622 (2013)
24.
go back to reference McLaine, S.: Ethnic online communities. In: Cyberactivism: Online Activism in Theory and Practice, pp. 233–254 (2003) McLaine, S.: Ethnic online communities. In: Cyberactivism: Online Activism in Theory and Practice, pp. 233–254 (2003)
25.
go back to reference Mustafa, H., Hamid, H.A., Ahmad, J., Siarap, K.: Intercultural relationship, prejudice and ethnocentrism in a computer-mediated communication (CMC): a time-series experiment. Asian Soc. Sci. 8, 34–48 (2012). doi:10.5539/ass.v8n3p34 Mustafa, H., Hamid, H.A., Ahmad, J., Siarap, K.: Intercultural relationship, prejudice and ethnocentrism in a computer-mediated communication (CMC): a time-series experiment. Asian Soc. Sci. 8, 34–48 (2012). doi:10.​5539/​ass.​v8n3p34
26.
go back to reference Nikolenko, S.I., et al.: Topic modelling for qualitative studies. J. Inf. Sci. 43(1), 88–102 (2017)CrossRef Nikolenko, S.I., et al.: Topic modelling for qualitative studies. J. Inf. Sci. 43(1), 88–102 (2017)CrossRef
27.
go back to reference Nakamura, L.: Cybertypes: Race, Ethnicity, and Identity on the Internet. Routledge, Abingdon (2013) Nakamura, L.: Cybertypes: Race, Ethnicity, and Identity on the Internet. Routledge, Abingdon (2013)
28.
go back to reference Nobata, C., Tetreault, J., Thomas, A., Mehdad, Y., Chang, Y.: Abusive language detection in online user content. In: Proceedings of the 25th International Conference on World Wide Web, pp. 145–153. International World Wide Web Conferences Steering Committee (2016). doi:10.1145/2872427.2883062 Nobata, C., Tetreault, J., Thomas, A., Mehdad, Y., Chang, Y.: Abusive language detection in online user content. In: Proceedings of the 25th International Conference on World Wide Web, pp. 145–153. International World Wide Web Conferences Steering Committee (2016). doi:10.​1145/​2872427.​2883062
30.
go back to reference Silva, L., Mondal, M., Correa, D., Benevenuto, F., Weber, I.: Analyzing the targets of hate in online social media. In: Proceedings of the 10th International Conference on Web and Social Media, ICWSM 2016, pp. 687–690 (2016) Silva, L., Mondal, M., Correa, D., Benevenuto, F., Weber, I.: Analyzing the targets of hate in online social media. In: Proceedings of the 10th International Conference on Web and Social Media, ICWSM 2016, pp. 687–690 (2016)
31.
go back to reference Steinfeldt, J.A., Foltz, B.D., Kaladow, J.K., Carlson, T.N., Pagano Jr., L.A., Benton, E., Steinfeldt, M.C.: Racism in the electronic age: role of online forums in expressing racial attitudes about American Indians. Cult. Divers. Ethnic Minor. Psychol. 16, 362–371 (2010). doi:10.1037/a0018692 CrossRef Steinfeldt, J.A., Foltz, B.D., Kaladow, J.K., Carlson, T.N., Pagano Jr., L.A., Benton, E., Steinfeldt, M.C.: Racism in the electronic age: role of online forums in expressing racial attitudes about American Indians. Cult. Divers. Ethnic Minor. Psychol. 16, 362–371 (2010). doi:10.​1037/​a0018692 CrossRef
32.
go back to reference Sternin, I.A.: Politically incorrect national names in language consciousness of language’s possessor [Nepolitkorrektnye naimenovanija lic v jazykovom soznanii nositelja jazyka]. Polit. linguist. [Politicheskaja lingvistika] 1, 191–193 (2013) Sternin, I.A.: Politically incorrect national names in language consciousness of language’s possessor [Nepolitkorrektnye naimenovanija lic v jazykovom soznanii nositelja jazyka]. Polit. linguist. [Politicheskaja lingvistika] 1, 191–193 (2013)
33.
go back to reference Trebbe, J., Schoenhagen, P.: Ethnic minorities in the mass media: how migrants perceive their representation in Swiss public television. J. Int. Migr. Integr. 12, 411–428 (2011). doi:10.1007/s12134-011-0175-7 Trebbe, J., Schoenhagen, P.: Ethnic minorities in the mass media: how migrants perceive their representation in Swiss public television. J. Int. Migr. Integr. 12, 411–428 (2011). doi:10.​1007/​s12134-011-0175-7
34.
go back to reference Tukachinsky, R., Mastro, D., Yarchi, M.: Documenting portrayals of race/ethnicity on primetime television over a 20-year span and their association with national-level racial/ethnic attitudes. J. Soc. Issues 71, 17–38 (2015). doi:10.1111/josi.12094 CrossRef Tukachinsky, R., Mastro, D., Yarchi, M.: Documenting portrayals of race/ethnicity on primetime television over a 20-year span and their association with national-level racial/ethnic attitudes. J. Soc. Issues 71, 17–38 (2015). doi:10.​1111/​josi.​12094 CrossRef
35.
go back to reference Tulkens, S., Hilte, L., Lodewyckx, E., Verhoeven, B., Daelemans, W.: A dictionary-based approach to racism detection in Dutch social media. arXiv preprint arXiv:1608.08738 (2016) Tulkens, S., Hilte, L., Lodewyckx, E., Verhoeven, B., Daelemans, W.: A dictionary-based approach to racism detection in Dutch social media. arXiv preprint arXiv:​1608.​08738 (2016)
36.
go back to reference Tynes, B.M., Giang, M.T., Thompson, G.N.: Ethnic identity, intergroup contact, and outgroup orientation among diverse groups of adolescents on the Internet. CyberPsychol. Behav. 11, 459–465 (2008). doi:10.1089/cpb.2007.0085 CrossRef Tynes, B.M., Giang, M.T., Thompson, G.N.: Ethnic identity, intergroup contact, and outgroup orientation among diverse groups of adolescents on the Internet. CyberPsychol. Behav. 11, 459–465 (2008). doi:10.​1089/​cpb.​2007.​0085 CrossRef
37.
go back to reference Vepreva, I.T., Kupina, N.A.: The words of unrest in the world today: unofficial ethnonyms in real usage [Trevozhnaja leksika tekushhego vremeni: neoficial’nye jetnonimy v funkcii aktu-al’nyh slov]. Polit. linguist. [Politicheskaja lingvistika] 43–50 (2014). (in Russian) Vepreva, I.T., Kupina, N.A.: The words of unrest in the world today: unofficial ethnonyms in real usage [Trevozhnaja leksika tekushhego vremeni: neoficial’nye jetnonimy v funkcii aktu-al’nyh slov]. Polit. linguist. [Politicheskaja lingvistika] 43–50 (2014). (in Russian)
38.
go back to reference Warner, W., Hirschberg, J.: Detecting hate speech on the world wide web. In: Proceedings of the Second Workshop on Language in Social Media, pp. 19–26. Association for Computational Linguistics (2012) Warner, W., Hirschberg, J.: Detecting hate speech on the world wide web. In: Proceedings of the Second Workshop on Language in Social Media, pp. 19–26. Association for Computational Linguistics (2012)
39.
go back to reference Waseem, Z.: Are you a racist or am I seeing things? Annotator influence on hate speech detection on Twitter. In: Proceedings of the 1st Workshop on Natural Language Processing and Computational Social Science, pp. 138–142 (2016) Waseem, Z.: Are you a racist or am I seeing things? Annotator influence on hate speech detection on Twitter. In: Proceedings of the 1st Workshop on Natural Language Processing and Computational Social Science, pp. 138–142 (2016)
40.
go back to reference Waseem, Z., Hovy, D.: Hateful symbols or hateful people? Predictive features for hate speech detection on Twitter. In: Proceedings of NAACL-HLT 2016, pp. 88–93 (2016) Waseem, Z., Hovy, D.: Hateful symbols or hateful people? Predictive features for hate speech detection on Twitter. In: Proceedings of NAACL-HLT 2016, pp. 88–93 (2016)
Metadata
Title
Detecting Interethnic Relations with the Data from Social Media
Authors
Olessia Koltsova
Sergey Nikolenko
Svetlana Alexeeva
Oleg Nagornyy
Sergei Koltcov
Copyright Year
2017
DOI
https://doi.org/10.1007/978-3-319-69784-0_2

Premium Partner