Skip to main content

2016 | OriginalPaper | Buchkapitel

The Social Dynamics of Language Change in Online Networks

verfasst von : Rahul Goel, Sandeep Soni, Naman Goyal, John Paparrizos, Hanna Wallach, Fernando Diaz, Jacob Eisenstein

Erschienen in: Social Informatics

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Language change is a complex social phenomenon, revealing pathways of communication and sociocultural influence. But, while language change has long been a topic of study in sociolinguistics, traditional linguistic research methods rely on circumstantial evidence, estimating the direction of change from differences between older and younger speakers. In this paper, we use a data set of several million Twitter users to track language changes in progress. First, we show that language change can be viewed as a form of social influence: we observe complex contagion for phonetic spellings and “netspeak” abbreviations (e.g., lol), but not for older dialect markers from spoken language. Next, we test whether specific types of social network connections are more influential than others, using a parametric Hawkes process model. We find that tie strength plays an important role: densely embedded social ties are significantly better conduits of linguistic influence. Geographic locality appears to play a more limited role: we find relatively little evidence to support the hypothesis that individuals are more influenced by geographically local social ties, even in their usage of geographical dialect markers.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
1
The basic unit of linguistic differentiation is referred to as a “variable” in the sociolinguistic and dialectological literature [50]. We maintain this terminology here.
 
2
After running SAGE to identify words with coefficients above 2.0, we manually removed hashtags, named entities, non-English words, and descriptions of events.
 
3
Other sources, such as http://​urbandictionary.​com, report asl to be an abbreviation of age, sex, location? However, this definition is not compatible with typical usage on Twitter, e.g., currently hungry asl or that movie was funny asl.
 
4
ard, inna, and lls appear on multiple cities’ lists. These words are characteristic of the neighboring cities of Baltimore, Philadelphia, and Washington D.C.
 
5
The shuffle test assumes that the likelihood of two users forming a social network connection does not change over time. Researchers have proposed a test [32] that removes this assumption; we will scale this test to our data set in future work.
 
6
We also compared the full feature set—i.e., F1+F2+F3+F4—to feature set F1+F2+F3 and feature set F1+F2+F4. The results were almost identical, indicating that F3 (tie strength) and F4 (local) provide complementary information.
 
Literatur
1.
Zurück zum Zitat Adamic, L.A., Adar, E.: Friends and neighbors on the web. Soc. Netw. 25(3), 211–230 (2003)CrossRef Adamic, L.A., Adar, E.: Friends and neighbors on the web. Soc. Netw. 25(3), 211–230 (2003)CrossRef
2.
Zurück zum Zitat Al Zamal, F., Liu, W., Ruths, D.: Homophily and latent attribute inference: inferring latent attributes of Twitter users from neighbors. In: Proceedings of the International Conference on Web and Social Media (ICWSM), pp. 387–390 (2012) Al Zamal, F., Liu, W., Ruths, D.: Homophily and latent attribute inference: inferring latent attributes of Twitter users from neighbors. In: Proceedings of the International Conference on Web and Social Media (ICWSM), pp. 387–390 (2012)
3.
Zurück zum Zitat Alim, H.S.: Hip hop nation language. In: Duranti, A. (ed.) Linguistic Anthropology: A Reader, pp. 272–289. Wiley-Blackwell, Malden (2009) Alim, H.S.: Hip hop nation language. In: Duranti, A. (ed.) Linguistic Anthropology: A Reader, pp. 272–289. Wiley-Blackwell, Malden (2009)
4.
Zurück zum Zitat Anagnostopoulos, A., Kumar, R., Mahdian, M.: Influence and correlation in social networks. In: Proceedings of Knowledge Discovery and Data Mining (KDD), pp. 7–15 (2008) Anagnostopoulos, A., Kumar, R., Mahdian, M.: Influence and correlation in social networks. In: Proceedings of Knowledge Discovery and Data Mining (KDD), pp. 7–15 (2008)
5.
Zurück zum Zitat Androutsopoulos, J.: Language change and digital media: a review of conceptions and evidence. In: Coupland, N., Kristiansen, T. (eds.) Standard Languages and Language Standards in a Changing Europe. Novus, Oslo (2011) Androutsopoulos, J.: Language change and digital media: a review of conceptions and evidence. In: Coupland, N., Kristiansen, T. (eds.) Standard Languages and Language Standards in a Changing Europe. Novus, Oslo (2011)
6.
Zurück zum Zitat Anis, J.: Neography: unconventional spelling in French SMS text messages. In: Danet, B., Herring, S.C. (eds.) The Multilingual Internet: Language, Culture, and Communication Online, pp. 87–115. Oxford University Press, Oxford (2007)CrossRef Anis, J.: Neography: unconventional spelling in French SMS text messages. In: Danet, B., Herring, S.C. (eds.) The Multilingual Internet: Language, Culture, and Communication Online, pp. 87–115. Oxford University Press, Oxford (2007)CrossRef
7.
Zurück zum Zitat Backstrom, L., Sun, E., Marlow, C.: Find me if you can: improving geographical prediction with social and spatial proximity. In: Proceedings of the Conference on World-Wide Web (WWW), pp. 61–70 (2010) Backstrom, L., Sun, E., Marlow, C.: Find me if you can: improving geographical prediction with social and spatial proximity. In: Proceedings of the Conference on World-Wide Web (WWW), pp. 61–70 (2010)
8.
Zurück zum Zitat Bakshy, E., Rosenn, I., Marlow, C., Adamic, L.: The role of social networks in information diffusion. In: Proceedings of the Conference on World-Wide Web (WWW), Lyon, France, pp. 519–528 (2012) Bakshy, E., Rosenn, I., Marlow, C., Adamic, L.: The role of social networks in information diffusion. In: Proceedings of the Conference on World-Wide Web (WWW), Lyon, France, pp. 519–528 (2012)
9.
Zurück zum Zitat Baldwin, T., Cook, P., Lui, M., MacKinlay, A., Wang, L.: How noisy social media text, how diffrnt social media sources. In: Proceedings of the 6th International Joint Conference on Natural Language Processing (IJCNLP 2013), pp. 356–364 (2013) Baldwin, T., Cook, P., Lui, M., MacKinlay, A., Wang, L.: How noisy social media text, how diffrnt social media sources. In: Proceedings of the 6th International Joint Conference on Natural Language Processing (IJCNLP 2013), pp. 356–364 (2013)
10.
Zurück zum Zitat Benjamini, Y., Hochberg, Y.: Controlling the false discovery rate: a practical and powerful approach to multiple testing. J. Roy. Stat. Soc. Ser. B (Methodol.) 57(1), 289–300 (1995)MathSciNetMATH Benjamini, Y., Hochberg, Y.: Controlling the false discovery rate: a practical and powerful approach to multiple testing. J. Roy. Stat. Soc. Ser. B (Methodol.) 57(1), 289–300 (1995)MathSciNetMATH
11.
Zurück zum Zitat Bucholtz, M., Hall, K.: Identity and interaction: a sociocultural linguistic approach. Discourse Stud. 7(4–5), 585–614 (2005)CrossRef Bucholtz, M., Hall, K.: Identity and interaction: a sociocultural linguistic approach. Discourse Stud. 7(4–5), 585–614 (2005)CrossRef
12.
Zurück zum Zitat Bucholtz, M., Bermudez, N., Fung, V., Edwards, L., Vargas, R.: Hella Nor Cal or totally So Cal? The perceptual dialectology of California. J. Engl. Linguist. 35(4), 325–352 (2007)CrossRef Bucholtz, M., Bermudez, N., Fung, V., Edwards, L., Vargas, R.: Hella Nor Cal or totally So Cal? The perceptual dialectology of California. J. Engl. Linguist. 35(4), 325–352 (2007)CrossRef
13.
Zurück zum Zitat Centola, D., Macy, M.: Complex contagions and the weakness of long ties. Am. J. Sociol. 113(3), 702–734 (2007)CrossRef Centola, D., Macy, M.: Complex contagions and the weakness of long ties. Am. J. Sociol. 113(3), 702–734 (2007)CrossRef
14.
Zurück zum Zitat Crystal, D.: Language and the Internet, 2nd edn. Cambridge University Press, Cambridge (2006)CrossRef Crystal, D.: Language and the Internet, 2nd edn. Cambridge University Press, Cambridge (2006)CrossRef
15.
Zurück zum Zitat Dunbar, R.I.: Neocortex size as a constraint on group size in primates. J. Hum. Evol. 22(6), 469–493 (1992)MathSciNetCrossRef Dunbar, R.I.: Neocortex size as a constraint on group size in primates. J. Hum. Evol. 22(6), 469–493 (1992)MathSciNetCrossRef
16.
Zurück zum Zitat Eckert, P.: Linguistic Variation as Social Practice. Blackwell, Oxford (2000) Eckert, P.: Linguistic Variation as Social Practice. Blackwell, Oxford (2000)
17.
Zurück zum Zitat Eisenstein, J.: What to do about bad language on the internet. In: Proceedings of the North American Chapter of the Association for Computational Linguistics (NAACL), pp. 359–369 (2013) Eisenstein, J.: What to do about bad language on the internet. In: Proceedings of the North American Chapter of the Association for Computational Linguistics (NAACL), pp. 359–369 (2013)
18.
Zurück zum Zitat Eisenstein, J.: Systematic patterning in phonologically-motivated orthographic variation. J. Sociolinguistics 19, 161–188 (2015)CrossRef Eisenstein, J.: Systematic patterning in phonologically-motivated orthographic variation. J. Sociolinguistics 19, 161–188 (2015)CrossRef
19.
Zurück zum Zitat Eisenstein, J.: Written dialect variation in online social media. In: Boberg, C., Nerbonne, J., Watt, D. (eds.) Handbook of Dialectology. Wiley, Hoboken (2016) Eisenstein, J.: Written dialect variation in online social media. In: Boberg, C., Nerbonne, J., Watt, D. (eds.) Handbook of Dialectology. Wiley, Hoboken (2016)
20.
Zurück zum Zitat Eisenstein, J., Ahmed, A., Xing, E.P.: Sparse additive generative models of text. In: Proceedings of the International Conference on Machine Learning (ICML), pp. 1041–1048 (2011) Eisenstein, J., Ahmed, A., Xing, E.P.: Sparse additive generative models of text. In: Proceedings of the International Conference on Machine Learning (ICML), pp. 1041–1048 (2011)
21.
Zurück zum Zitat Fagyal, Z., Swarup, S., Escobar, A.M., Gasser, L., Lakkaraju, K.: Centers and peripheries: network roles in language change. Lingua 120(8), 2061–2079 (2010)CrossRef Fagyal, Z., Swarup, S., Escobar, A.M., Gasser, L., Lakkaraju, K.: Centers and peripheries: network roles in language change. Lingua 120(8), 2061–2079 (2010)CrossRef
22.
Zurück zum Zitat Granovetter, M.S.: The strength of weak ties. Am. J. Sociol. 78(6), 1360–1380 (1973)CrossRef Granovetter, M.S.: The strength of weak ties. Am. J. Sociol. 78(6), 1360–1380 (1973)CrossRef
23.
Zurück zum Zitat Green, L.J.: African American English: A Linguistic Introduction. Cambridge University Press, Cambridge (2002)CrossRef Green, L.J.: African American English: A Linguistic Introduction. Cambridge University Press, Cambridge (2002)CrossRef
24.
Zurück zum Zitat Griffiths, T.L., Kalish, M.L.: Language evolution by iterated learning with Bayesian agents. Cogn. Sci. 31(3), 441–480 (2007)CrossRef Griffiths, T.L., Kalish, M.L.: Language evolution by iterated learning with Bayesian agents. Cogn. Sci. 31(3), 441–480 (2007)CrossRef
25.
Zurück zum Zitat Hamilton, W.L., Leskovec, J., Jurafsky, D.: Diachronic word embeddings reveal statistical laws of semantic change. In: Proceedings of the Association for Computational Linguistics (ACL), Berlin (2016) Hamilton, W.L., Leskovec, J., Jurafsky, D.: Diachronic word embeddings reveal statistical laws of semantic change. In: Proceedings of the Association for Computational Linguistics (ACL), Berlin (2016)
26.
27.
Zurück zum Zitat Herring, S.C.: Grammar and electronic communication. In: Chapelle, C.A. (ed.) The Encyclopedia of Applied Linguistics. Wiley, Hoboken (2012) Herring, S.C.: Grammar and electronic communication. In: Chapelle, C.A. (ed.) The Encyclopedia of Applied Linguistics. Wiley, Hoboken (2012)
28.
Zurück zum Zitat Huberman, B., Romero, D.M., Wu, F.: Social networks that matter: Twitter under the microscope. First Monday 14(1) (2008) Huberman, B., Romero, D.M., Wu, F.: Social networks that matter: Twitter under the microscope. First Monday 14(1) (2008)
29.
Zurück zum Zitat Johnstone, B., Bhasin, N., Wittkofski, D.: “Dahntahn” Pittsburgh: monophthongal /aw/ and representations of localness in Southwestern Pennsylvania. Am. Speech 77(2), 148–176 (2002)CrossRef Johnstone, B., Bhasin, N., Wittkofski, D.: “Dahntahn” Pittsburgh: monophthongal /aw/ and representations of localness in Southwestern Pennsylvania. Am. Speech 77(2), 148–176 (2002)CrossRef
30.
Zurück zum Zitat Kulkarni, V., Al-Rfou, R., Perozzi, B., Skiena, S.: Statistically significant detection of linguistic change. In: Proceedings of the Conference on World-Wide Web (WWW), pp. 625–635 (2015) Kulkarni, V., Al-Rfou, R., Perozzi, B., Skiena, S.: Statistically significant detection of linguistic change. In: Proceedings of the Conference on World-Wide Web (WWW), pp. 625–635 (2015)
31.
Zurück zum Zitat Kwak, H., Lee, C., Park, H., Moon, S.: What is Twitter, a social network or a news media? In: Proceedings of the Conference on World-Wide Web (WWW), pp. 591–600 (2010) Kwak, H., Lee, C., Park, H., Moon, S.: What is Twitter, a social network or a news media? In: Proceedings of the Conference on World-Wide Web (WWW), pp. 591–600 (2010)
32.
Zurück zum Zitat La Fond, T., Neville, J.: Randomization tests for distinguishing social influence and homophily effects. In: Proceedings of the Conference on World-Wide Web (WWW), pp. 601–610 (2010) La Fond, T., Neville, J.: Randomization tests for distinguishing social influence and homophily effects. In: Proceedings of the Conference on World-Wide Web (WWW), pp. 601–610 (2010)
33.
Zurück zum Zitat Labov, W.: The social motivation of a sound change. Word 19(3), 273–309 (1963)CrossRef Labov, W.: The social motivation of a sound change. Word 19(3), 273–309 (1963)CrossRef
34.
Zurück zum Zitat Labov, W.: Principles of Linguistic Change, vol. 2: Social Factors, vol. 2. Wiley-Blackwell, Hoboken (2001) Labov, W.: Principles of Linguistic Change, vol. 2: Social Factors, vol. 2. Wiley-Blackwell, Hoboken (2001)
35.
Zurück zum Zitat Labov, W.: Review of linguistic variation as social practice, by Penelope Eckert. Lang. Soc. 31, 277–284 (2002)CrossRef Labov, W.: Review of linguistic variation as social practice, by Penelope Eckert. Lang. Soc. 31, 277–284 (2002)CrossRef
36.
Zurück zum Zitat Labov, W.: Principles of Linguistic Change, vol. 3: Cognitive and Cultural Factors, vol. 3. Wiley-Blackwell, Hoboken (2011) Labov, W.: Principles of Linguistic Change, vol. 3: Cognitive and Cultural Factors, vol. 3. Wiley-Blackwell, Hoboken (2011)
37.
Zurück zum Zitat Latour, B., Woolgar, S.: Laboratory Life: The Construction of Scientific Facts. Princeton University Press, Princeton (2013) Latour, B., Woolgar, S.: Laboratory Life: The Construction of Scientific Facts. Princeton University Press, Princeton (2013)
38.
Zurück zum Zitat Li, L., Deng, H., Dong, A., Chang, Y., Zha, H.: Identifying and labeling search tasks via query-based Hawkes processes. In: Proceedings of Knowledge Discovery and Data Mining (KDD), pp. 731–740 (2014) Li, L., Deng, H., Dong, A., Chang, Y., Zha, H.: Identifying and labeling search tasks via query-based Hawkes processes. In: Proceedings of Knowledge Discovery and Data Mining (KDD), pp. 731–740 (2014)
39.
Zurück zum Zitat Li, L., Zha, H.: Learning parametric models for social infectivity in multi-dimensional Hawkes processes. In: Proceedings of the National Conference on Artificial Intelligence (AAAI) (2015) Li, L., Zha, H.: Learning parametric models for social infectivity in multi-dimensional Hawkes processes. In: Proceedings of the National Conference on Artificial Intelligence (AAAI) (2015)
40.
Zurück zum Zitat Milroy, L., Milroy, J.: Social network and social class: toward an integrated sociolinguistic model. Lang. Soc. 21(01), 1–26 (1992)CrossRef Milroy, L., Milroy, J.: Social network and social class: toward an integrated sociolinguistic model. Lang. Soc. 21(01), 1–26 (1992)CrossRef
41.
Zurück zum Zitat Niyogi, P., Berwick, R.C.: A dynamical systems model for language change. Complex Syst. 11(3), 161–204 (1997)MathSciNet Niyogi, P., Berwick, R.C.: A dynamical systems model for language change. Complex Syst. 11(3), 161–204 (1997)MathSciNet
42.
Zurück zum Zitat Ogata, Y.: On Lewis’ simulation method for point processes. IEEE Trans. Inf. Theor. 27(1), 23–31 (1981)MATHCrossRef Ogata, Y.: On Lewis’ simulation method for point processes. IEEE Trans. Inf. Theor. 27(1), 23–31 (1981)MATHCrossRef
43.
Zurück zum Zitat Pavalanathan, U., Eisenstein, J.: Audience-modulated variation in online social media. Am. Speech 90(2), 187–213 (2015)CrossRef Pavalanathan, U., Eisenstein, J.: Audience-modulated variation in online social media. Am. Speech 90(2), 187–213 (2015)CrossRef
44.
Zurück zum Zitat Pavalanathan, U., Eisenstein, J.: Confounds and consequences in geotagged Twitter data. In: Proceedings of Empirical Methods for Natural Language Processing (EMNLP), September 2015 Pavalanathan, U., Eisenstein, J.: Confounds and consequences in geotagged Twitter data. In: Proceedings of Empirical Methods for Natural Language Processing (EMNLP), September 2015
45.
Zurück zum Zitat Rickford, J.R.: Geographical diversity, residential segregation, and the vitality of African American vernacular English and its speakers. Transform. Anthropol. 18(1), 28–34 (2010)CrossRef Rickford, J.R.: Geographical diversity, residential segregation, and the vitality of African American vernacular English and its speakers. Transform. Anthropol. 18(1), 28–34 (2010)CrossRef
46.
Zurück zum Zitat Sadilek, A., Kautz, H., Bigham, J.P.: Finding your friends and following them to where you are. In: Proceedings of the Conference on Web Search and Data Mining (WSDM), pp. 723–732 (2012) Sadilek, A., Kautz, H., Bigham, J.P.: Finding your friends and following them to where you are. In: Proceedings of the Conference on Web Search and Data Mining (WSDM), pp. 723–732 (2012)
47.
Zurück zum Zitat Squires, L.: Enregistering internet language. Lang. Soc. 39, 457–492 (2010)CrossRef Squires, L.: Enregistering internet language. Lang. Soc. 39, 457–492 (2010)CrossRef
48.
Zurück zum Zitat Tagliamonte, S.A., Denis, D.: Linguistic ruin? LOL! Instant messaging and teen language. Am. Speech 83(1), 3–34 (2008)CrossRef Tagliamonte, S.A., Denis, D.: Linguistic ruin? LOL! Instant messaging and teen language. Am. Speech 83(1), 3–34 (2008)CrossRef
49.
Zurück zum Zitat Trudgill, P.: Sex, covert prestige and linguistic change in the urban British English of Norwich. Lang. Soc. 1(2), 179–195 (1972)CrossRef Trudgill, P.: Sex, covert prestige and linguistic change in the urban British English of Norwich. Lang. Soc. 1(2), 179–195 (1972)CrossRef
50.
Zurück zum Zitat Wolfram, W.: The linguistic variable: fact and fantasy. Am. Speech 66(1), 22–32 (1991)CrossRef Wolfram, W.: The linguistic variable: fact and fantasy. Am. Speech 66(1), 22–32 (1991)CrossRef
51.
Zurück zum Zitat Zhao, Q., Erdogdu, M.A., He, H.Y., Rajaraman, A., Leskovec, J.: Seismic: a self-exciting point process model for predicting tweet popularity. In: Proceedings of Knowledge Discovery and Data Mining (KDD), pp. 1513–1522 (2015) Zhao, Q., Erdogdu, M.A., He, H.Y., Rajaraman, A., Leskovec, J.: Seismic: a self-exciting point process model for predicting tweet popularity. In: Proceedings of Knowledge Discovery and Data Mining (KDD), pp. 1513–1522 (2015)
Metadaten
Titel
The Social Dynamics of Language Change in Online Networks
verfasst von
Rahul Goel
Sandeep Soni
Naman Goyal
John Paparrizos
Hanna Wallach
Fernando Diaz
Jacob Eisenstein
Copyright-Jahr
2016
DOI
https://doi.org/10.1007/978-3-319-47880-7_3

Neuer Inhalt