Skip to main content
Erschienen in: Cognitive Computation 2/2021

26.01.2021

Detection of Sociolinguistic Features in Digital Social Networks for the Detection of Communities

verfasst von: Edwin Puertas, Luis Gabriel Moreno-Sandoval, Javier Redondo, Jorge Andres Alvarado-Valencia, Alexandra Pomares-Quimbaya

Erschienen in: Cognitive Computation | Ausgabe 2/2021

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The emergence of digital social networks has transformed society, social groups, and institutions in terms of the communication and expression of their opinions. Determining how language variations allow the detection of communities, together with the relevance of specific vocabulary (proposed by the National Council of Accreditation of Colombia (Consejo Nacional de Acreditación - CNA) to determine the quality evaluation parameters for universities in Colombia) in digital assemblages could lead to a better understanding of their dynamics and social foundations, thus resulting in better communication policies and intervention where necessary. The approach presented in this paper intends to determine what are the semantic spaces (sociolinguistic features) shared by social groups in digital social networks. It includes five layers based on Design Science Research, which are integrated with Natural Language Processing techniques (NLP), Computational Linguistics (CL), and Artificial Intelligence (AI). The approach is validated through a case study wherein the semantic values of a series of “Twitter” institutional accounts belonging to Colombian Universities are analyzed in terms of the 12 quality factors established by CNA. In addition, the topics and the sociolect used by different actors in the university communities are also analyzed. The current approach allows determining the sociolinguistic features of social groups in digital social networks. Its application allows detecting the words or concepts to which each actor of a social group (university) gives more importance in terms of vocabulary.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Dumbill E. A revolution that will transform how we live, work, and think: An interview with the authors of big data. Big data. 2013;1(2):73–7.CrossRef Dumbill E. A revolution that will transform how we live, work, and think: An interview with the authors of big data. Big data. 2013;1(2):73–7.CrossRef
2.
Zurück zum Zitat Meyerhoff M. Introducing sociolinguistics. Taylor & Francis Group: Routledge; 2015.CrossRef Meyerhoff M. Introducing sociolinguistics. Taylor & Francis Group: Routledge; 2015.CrossRef
3.
Zurück zum Zitat Meyerhoff M. Introducing sociolinguistics. Routledge; 2018. Meyerhoff M. Introducing sociolinguistics. Routledge; 2018.
4.
Zurück zum Zitat Scott J. Social network analysis: developments, advances, and prospects. Social network analysis and mining. 2011;1(1):21–6.CrossRef Scott J. Social network analysis: developments, advances, and prospects. Social network analysis and mining. 2011;1(1):21–6.CrossRef
5.
Zurück zum Zitat Zeinab Kafi, Khalil Motallebzadeh. An introduction to sociolinguistics. International Journal of Society, Culture & Language. 2016;4(2):134–40. Zeinab Kafi, Khalil Motallebzadeh. An introduction to sociolinguistics. International Journal of Society, Culture & Language. 2016;4(2):134–40.
6.
Zurück zum Zitat Bryden J, Funk S, Jansen VA. Word usage mirrors community structure in the online social network twitter. EPJ Data Science, 2013;2(1):3. Bryden J, Funk S, Jansen VA. Word usage mirrors community structure in the online social network twitter. EPJ Data Science, 2013;2(1):3.
7.
Zurück zum Zitat Ríos SA, Muñoz R. Dark web portal overlapping community detection based on topic models. In Proceedings of the ACM SIGKDD workshop on intelligence and security informatics. 2012. p. 1–7. Ríos SA, Muñoz R. Dark web portal overlapping community detection based on topic models. In Proceedings of the ACM SIGKDD workshop on intelligence and security informatics. 2012. p. 1–7.
8.
Zurück zum Zitat Nguyen D. A Seza Doğruöz, Carolyn P Rosé, and Franciska de Jong. Computational sociolinguistics: A survey Computational linguistics. 2016;42(3):537–93.CrossRef Nguyen D. A Seza Doğruöz, Carolyn P Rosé, and Franciska de Jong. Computational sociolinguistics: A survey Computational linguistics. 2016;42(3):537–93.CrossRef
9.
Zurück zum Zitat Reynolds WN, Salter WJ, Farber RM, Corley C, Dowling CP, Beeman WO, et al. Sociolect-based community detection. In 2013 IEEE International Conference on Intelligence and Security Informatics. 2013. p. 221–226, IEEE. Reynolds WN, Salter WJ, Farber RM, Corley C, Dowling CP, Beeman WO, et al. Sociolect-based community detection. In 2013 IEEE International Conference on Intelligence and Security Informatics. 2013. p. 221–226, IEEE.
10.
Zurück zum Zitat Mansouri F, Abdelalim S, Ikram EA. A modeling framework for the moroccan sociolect recognition used on the social media. In Proceedings of the 2nd international Conference on Big Data, Cloud and Applications. ACM. 2017. p. 34. Mansouri F, Abdelalim S, Ikram EA. A modeling framework for the moroccan sociolect recognition used on the social media. In Proceedings of the 2nd international Conference on Big Data, Cloud and Applications. ACM. 2017. p. 34. 
11.
Zurück zum Zitat Gibson KR. Tool use, language and social behavior in relationship to information processing capacities. Tools, language and cognition in human evolution. 1993. p. 251-269. Gibson KR. Tool use, language and social behavior in relationship to information processing capacities. Tools, language and cognition in human evolution. 1993. p. 251-269.
12.
Zurück zum Zitat K Adnan, R Akbar. An analytical study of information extraction from unstructured and multidimensional big data. Journal of Big Data. 2019;6(1):91.CrossRef K Adnan, R Akbar. An analytical study of information extraction from unstructured and multidimensional big data. Journal of Big Data. 2019;6(1):91.CrossRef
13.
Zurück zum Zitat Louwerse MM. Semantic variation in idiolect and sociolect: Corpus linguistic evidence from literary texts. Computers and the Humanities. 2004;38(2):207–21.CrossRef Louwerse MM. Semantic variation in idiolect and sociolect: Corpus linguistic evidence from literary texts. Computers and the Humanities. 2004;38(2):207–21.CrossRef
14.
Zurück zum Zitat Paradis RD, Davenport D, Menaker D, Taylor SM. Detection of groups in non-structured data. Procedia Computer Science. 2012;12:412–7.CrossRef Paradis RD, Davenport D, Menaker D, Taylor SM. Detection of groups in non-structured data. Procedia Computer Science. 2012;12:412–7.CrossRef
15.
Zurück zum Zitat A Hussain, E Cambria. Semi-supervised learning for big social data analysis. Neurocomputing. 2018;275:1662–733.CrossRef A Hussain, E Cambria. Semi-supervised learning for big social data analysis. Neurocomputing. 2018;275:1662–733.CrossRef
16.
Zurück zum Zitat Li L, Wu L, Evans JA. Social centralization and semantic collapse: Hyperbolic embeddings of networks and text. CoRR, abs/2001.09493, 2020. Li L, Wu L, Evans JA. Social centralization and semantic collapse: Hyperbolic embeddings of networks and text. CoRR, abs/2001.09493, 2020.
17.
Zurück zum Zitat Balaanand M, Karthikeyan N, Karthik S, Varatharajan R, Manogaran G, Sivaparthipan C. An enhanced graph-based semi-supervised learning algorithm to detect fake users on twitter. The Journal of Supercomputing. 2019;75(9):6085–105.CrossRef Balaanand M, Karthikeyan N, Karthik S, Varatharajan R, Manogaran G, Sivaparthipan C. An enhanced graph-based semi-supervised learning algorithm to detect fake users on twitter. The Journal of Supercomputing. 2019;75(9):6085–105.CrossRef
18.
Zurück zum Zitat Cavallari S, Cambria E, Cai H, Chang KC, Zheng VW. Embedding both finite and infinite communities on graphs [application notes]. IEEE Computational Intelligence Magazine. 2019;14(3):39–50.CrossRef Cavallari S, Cambria E, Cai H, Chang KC, Zheng VW. Embedding both finite and infinite communities on graphs [application notes]. IEEE Computational Intelligence Magazine. 2019;14(3):39–50.CrossRef
19.
Zurück zum Zitat H Fani, E Jiang, E Bagheri, F Al-Obeidat, W Du, M Kargar. User community detection via embedding of social network structure and temporal content. Information Processing & Management. 2020;57(2):102056.CrossRef H Fani, E Jiang, E Bagheri, F Al-Obeidat, W Du, M Kargar. User community detection via embedding of social network structure and temporal content. Information Processing & Management. 2020;57(2):102056.CrossRef
20.
Zurück zum Zitat Park C, Han J, Yu H. Deep multiplex graph infomax: Attentive multiplex network embedding using global information. Knowledge-Based Systems. 2020. p.105861. Park C, Han J, Yu H. Deep multiplex graph infomax: Attentive multiplex network embedding using global information. Knowledge-Based Systems. 2020. p.105861.
21.
Zurück zum Zitat Liu P, Zhang L, Gulla JA. Real-time social recommendation based on graph embedding and temporal context. International Journal of Human-Computer Studies. 2019;121:58–72.CrossRef Liu P, Zhang L, Gulla JA. Real-time social recommendation based on graph embedding and temporal context. International Journal of Human-Computer Studies. 2019;121:58–72.CrossRef
22.
Zurück zum Zitat Tkachenko N, Guo W. Conflict detection in linguistically diverse on-line social networks: A russia-ukraine case study. In Proceedings of the 11th International Conference on Management of Digital EcoSystems, MEDES ’19. Association for Computing Machinery. New York, NY, USA. 2019. p. 23-28. Tkachenko N, Guo W. Conflict detection in linguistically diverse on-line social networks: A russia-ukraine case study. In Proceedings of the 11th International Conference on Management of Digital EcoSystems, MEDES ’19. Association for Computing Machinery. New York, NY, USA. 2019. p. 23-28.
23.
Zurück zum Zitat E Cambria. Affective computing and sentiment analysis. IEEE intelligent systems. 2016;31(2):102–7. E Cambria. Affective computing and sentiment analysis. IEEE intelligent systems. 2016;31(2):102–7.
24.
Zurück zum Zitat Poria S, Chaturvedi I, Cambria E, Bisio F. Sentic lda: Improving on lda with semantic similarity for aspect-based sentiment analysis. In 2016 international joint conference on neural networks (IJCNN). 2016. p. 4465–4473, IEEE. Poria S, Chaturvedi I, Cambria E, Bisio F. Sentic lda: Improving on lda with semantic similarity for aspect-based sentiment analysis. In 2016 international joint conference on neural networks (IJCNN). 2016. p. 4465–4473, IEEE.
25.
Zurück zum Zitat Hevner A, Chatterjee S. Design research in information systems: theory and practice. Springer Science & Business Media. 2010;2. Hevner A, Chatterjee S. Design research in information systems: theory and practice. Springer Science & Business Media. 2010;2.
26.
Zurück zum Zitat González RA, Pomares A. La investigación científica basada en el diseño como eje de proyectos de investigación en ingeniería. Reunión Nacional ACOFI. 2012. p. 12–14. González RA, Pomares A. La investigación científica basada en el diseño como eje de proyectos de investigación en ingeniería. Reunión Nacional ACOFI. 2012. p. 12–14.
27.
Zurück zum Zitat Kietzmann JH, Hermkens K, McCarthy IP, Silvestre BS. Social media? get serious! understanding the functional building blocks of social media. Business horizons. 2011;54(3):241–51.CrossRef Kietzmann JH, Hermkens K, McCarthy IP, Silvestre BS. Social media? get serious! understanding the functional building blocks of social media. Business horizons. 2011;54(3):241–51.CrossRef
28.
Zurück zum Zitat Española RA. Banco de datos (CREA). Corpus de referencia del español actual. 2015. p. 2011–10. Española RA. Banco de datos (CREA). Corpus de referencia del español actual. 2015. p. 2011–10.
29.
Zurück zum Zitat Spitkovsky VI, Alshawi H, Chang AX, Jurafsky D. Unsupervised dependency parsing without gold part-of-speech tags. In Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics. Edinburgh, Scotland, UK. 2011. p. 1281–1290. Spitkovsky VI, Alshawi H, Chang AX, Jurafsky D. Unsupervised dependency parsing without gold part-of-speech tags. In Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics. Edinburgh, Scotland, UK. 2011. p. 1281–1290.
30.
Zurück zum Zitat Khurshid A, Gillam L, Tostevin L. University of surrey participation in trec8: Weirdness indexing for logical document extrapolation and retrieval (wilder). In The Eighth Text REtrieval Conference (TREC-8). Gaithersburg, Maryland. 1999. p. 1–8. Khurshid A, Gillam L, Tostevin L. University of surrey participation in trec8: Weirdness indexing for logical document extrapolation and retrieval (wilder). In The Eighth Text REtrieval Conference (TREC-8). Gaithersburg, Maryland. 1999. p. 1–8.
31.
Zurück zum Zitat Joseph K, Carley KM, Hong JI. Check-ins in blau space applying blau macrosociological theory to foursquare check-ins from new york city. ACM Transactions on Intelligent Systems and Technology (TIST). 2014;5(3):1–22.CrossRef Joseph K, Carley KM, Hong JI. Check-ins in blau space applying blau macrosociological theory to foursquare check-ins from new york city. ACM Transactions on Intelligent Systems and Technology (TIST). 2014;5(3):1–22.CrossRef
32.
Zurück zum Zitat Park Y, Alam MH, Ryu WJ, and Sangkeun Lee. Bl-lda: Bringing bigram to supervised topic model. In 2015 International Conference on Computational Science and Computational Intelligence (CSCI). 2015. p. 83–88, IEEE. Park Y, Alam MH, Ryu WJ, and Sangkeun Lee. Bl-lda: Bringing bigram to supervised topic model. In 2015 International Conference on Computational Science and Computational Intelligence (CSCI). 2015. p. 83–88, IEEE.
33.
Zurück zum Zitat Camacho D, Panizo-LLedot A, Bello-Orgaz G, Gonzalez-Pardo A, Cambria E. The four dimensions of social network analysis: An overview of research methods, applications, and software tools. Information Fusion. 2020;63:88–120.CrossRef Camacho D, Panizo-LLedot A, Bello-Orgaz G, Gonzalez-Pardo A, Cambria E. The four dimensions of social network analysis: An overview of research methods, applications, and software tools. Information Fusion. 2020;63:88–120.CrossRef
34.
Zurück zum Zitat Varelo AR. Hacia un modelo de aseguramiento de la calidad en la educación superior en colombia: estándares básicos y acreditación de excelencia. Educación superior, calidad y acreditación. CNA., 2003. Varelo AR. Hacia un modelo de aseguramiento de la calidad en la educación superior en colombia: estándares básicos y acreditación de excelencia. Educación superior, calidad y acreditación. CNA., 2003.
35.
Zurück zum Zitat Beeferman D, Berger A, Lafferty J. Statistical models for text segmentation. Machine learning. 1999;34(1–3):177–21010.CrossRef Beeferman D, Berger A, Lafferty J. Statistical models for text segmentation. Machine learning. 1999;34(1–3):177–21010.CrossRef
36.
Zurück zum Zitat Damani OP, Ghonge S. Appropriately incorporating statistical significance in pmi. In Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics. 2013. p. 163–169. Damani OP, Ghonge S. Appropriately incorporating statistical significance in pmi. In Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics. 2013. p. 163–169.
37.
Zurück zum Zitat Arora S, Li Y, Liang Y, Ma T, Risteski A. A latent variable model approach to pmi-based word embeddings. Transactions of the Association for Computational Linguistics. 2016;4:385–99.CrossRef Arora S, Li Y, Liang Y, Ma T, Risteski A. A latent variable model approach to pmi-based word embeddings. Transactions of the Association for Computational Linguistics. 2016;4:385–99.CrossRef
38.
Zurück zum Zitat Ahmad K, Gillman L, Tostevin L. Weirdness indexing for logical document extrapolation and retrieval. In Proceedings of the Eighth Text Retrieval Conference (TREC-8). 2000. p. 1–8. Ahmad K, Gillman L, Tostevin L. Weirdness indexing for logical document extrapolation and retrieval. In Proceedings of the Eighth Text Retrieval Conference (TREC-8). 2000. p. 1–8.
Metadaten
Titel
Detection of Sociolinguistic Features in Digital Social Networks for the Detection of Communities
verfasst von
Edwin Puertas
Luis Gabriel Moreno-Sandoval
Javier Redondo
Jorge Andres Alvarado-Valencia
Alexandra Pomares-Quimbaya
Publikationsdatum
26.01.2021
Verlag
Springer US
Erschienen in
Cognitive Computation / Ausgabe 2/2021
Print ISSN: 1866-9956
Elektronische ISSN: 1866-9964
DOI
https://doi.org/10.1007/s12559-021-09818-9

Weitere Artikel der Ausgabe 2/2021

Cognitive Computation 2/2021 Zur Ausgabe