Skip to main content
Top

2018 | OriginalPaper | Chapter

Leveraging Local Interactions for Geolocating Social Media Users

Authors : Mohammad Ebrahimi, Elaheh ShafieiBavani, Raymond Wong, Fang Chen

Published in: Advances in Knowledge Discovery and Data Mining

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Predicting the geolocation of social media users is one of the core tasks in many applications, such as rapid disaster response, targeted advertisement, and recommending local events. In this paper, we introduce a new approach for user geolocation that unifies users’ social relationships, textual content, and metadata. Our two key contributions are as follows: (1) We leverage semantic similarity between users’ posts to predict their geographic proximity. To achieve this, we train and utilize a powerful word embedding model over millions of tweets. (2) To deal with isolated users in the social graph, we utilize a stacking-based learning approach to predict users’ locations based on their tweets’ textual content and metadata. Evaluation on three standard Twitter benchmark datasets shows that our approach outperforms state-of-the-art user geolocation methods.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Footnotes
1
We consider uni-directional mentions, since bi-directional mentions are too rare to be useful in the datasets used in our experiments [30].
 
Literature
1.
go back to reference Ashktorab, Z., Brown, C., Nandi, M., Culotta, A.: Tweedr: mining twitter to inform disaster response. In: ISCRAM 2014 (2014) Ashktorab, Z., Brown, C., Nandi, M., Culotta, A.: Tweedr: mining twitter to inform disaster response. In: ISCRAM 2014 (2014)
2.
go back to reference Cha, M., Gwon, Y., Kung, H.T.: Twitter geolocation and regional classification via sparse coding. In: ICWSM 2015, pp. 582–585 (2015) Cha, M., Gwon, Y., Kung, H.T.: Twitter geolocation and regional classification via sparse coding. In: ICWSM 2015, pp. 582–585 (2015)
3.
go back to reference Cheng, Z., Caverlee, J., Lee, K.: You are where you tweet: a content-based approach to geo-locating twitter users. In: CIKM 2010, pp. 759–768. ACM (2010) Cheng, Z., Caverlee, J., Lee, K.: You are where you tweet: a content-based approach to geo-locating twitter users. In: CIKM 2010, pp. 759–768. ACM (2010)
4.
go back to reference Compton, R., Jurgens, D., Allen, D.: Geotagging one hundred million twitter accounts with total variation minimization. In: BigData 2014, pp. 393–401. IEEE (2014) Compton, R., Jurgens, D., Allen, D.: Geotagging one hundred million twitter accounts with total variation minimization. In: BigData 2014, pp. 393–401. IEEE (2014)
5.
go back to reference Davis Jr., C.A., Pappa, G.L., Rocha de Oliveira, D.R., Arcanjo, F.L.: Inferring the location of twitter messages based on user relationships. Trans. GIS 15(6), 735–751 (2011)CrossRef Davis Jr., C.A., Pappa, G.L., Rocha de Oliveira, D.R., Arcanjo, F.L.: Inferring the location of twitter messages based on user relationships. Trans. GIS 15(6), 735–751 (2011)CrossRef
8.
go back to reference Eisenstein, J., O’Connor, B., Smith, N.A., Xing, E.P.: A latent variable model for geographic lexical variation. In: EMNLP 2010, pp. 1277–1287. ACL (2010) Eisenstein, J., O’Connor, B., Smith, N.A., Xing, E.P.: A latent variable model for geographic lexical variation. In: EMNLP 2010, pp. 1277–1287. ACL (2010)
9.
go back to reference Ester, M., Kriegel, H.-P., Sander, J., Xu, X., et al.: A density-based algorithm for discovering clusters in large spatial databases with noise. In: KDD 1996, vol. 96, pp. 226–231 (1996) Ester, M., Kriegel, H.-P., Sander, J., Xu, X., et al.: A density-based algorithm for discovering clusters in large spatial databases with noise. In: KDD 1996, vol. 96, pp. 226–231 (1996)
10.
go back to reference Han, B., Baldwin,T.: Lexical normalisation of short text messages: Makn sens a# twitter. In: ACL-HLT 2011, pp. 368–378. ACL (2011) Han, B., Baldwin,T.: Lexical normalisation of short text messages: Makn sens a# twitter. In: ACL-HLT 2011, pp. 368–378. ACL (2011)
11.
go back to reference Han, B., Cook, P., Baldwin, T.: Geolocation prediction in social media data by finding location indicative words. In: COLING 2012, pp. 1045–1062 (2012) Han, B., Cook, P., Baldwin, T.: Geolocation prediction in social media data by finding location indicative words. In: COLING 2012, pp. 1045–1062 (2012)
12.
go back to reference Han, B., Cook, P., Baldwin, T.: Text-based twitter user geolocation prediction. Artif. Intell. Res. 49, 451–500 (2014) Han, B., Cook, P., Baldwin, T.: Text-based twitter user geolocation prediction. Artif. Intell. Res. 49, 451–500 (2014)
13.
go back to reference Han, B., Hugo, A., Rahimi, A., Derczynski, L., Baldwin, T.: Twitter geolocation prediction shared task of the 2016 workshop on noisy user-generated text. In: WNUT 2016, pp. 213–217 (2016) Han, B., Hugo, A., Rahimi, A., Derczynski, L., Baldwin, T.: Twitter geolocation prediction shared task of the 2016 workshop on noisy user-generated text. In: WNUT 2016, pp. 213–217 (2016)
14.
go back to reference Hecht, B., Hong, L., Suh, B., Chi, E.H.: Tweets from Justin Bieber’s heart: the dynamics of the location field in user profiles. In: ACM SIGCHI 2011, pp. 237–246. ACM (2011) Hecht, B., Hong, L., Suh, B., Chi, E.H.: Tweets from Justin Bieber’s heart: the dynamics of the location field in user profiles. In: ACM SIGCHI 2011, pp. 237–246. ACM (2011)
15.
go back to reference Hong, L., Ahmed, A., Gurumurthy, S., Smola, A.J., Tsioutsiouliklis, K.: Discovering geographical topics in the twitter stream. In: WWW 2012, pp. 769–778. ACM (2012) Hong, L., Ahmed, A., Gurumurthy, S., Smola, A.J., Tsioutsiouliklis, K.: Discovering geographical topics in the twitter stream. In: WWW 2012, pp. 769–778. ACM (2012)
16.
go back to reference Hulden, M., Silfverberg, M., Francom, J.: Kernel density estimation for text-based geolocation. In: AAAI 2015, pp. 145–150 (2015) Hulden, M., Silfverberg, M., Francom, J.: Kernel density estimation for text-based geolocation. In: AAAI 2015, pp. 145–150 (2015)
17.
go back to reference Jayasinghe, G., Jin, B., Mchugh, J., Robinson, B., Wan, S.: Csiro data61 at the WNUT geo shared task. In: WNUT 2016, pp. 218–226 (2016) Jayasinghe, G., Jin, B., Mchugh, J., Robinson, B., Wan, S.: Csiro data61 at the WNUT geo shared task. In: WNUT 2016, pp. 218–226 (2016)
18.
go back to reference Jurgens, D.: That’s what friends are for: inferring location in online social media platforms based on social relationships. In: ICWSM 2013, vol. 13, pp. 273–282 (2013) Jurgens, D.: That’s what friends are for: inferring location in online social media platforms based on social relationships. In: ICWSM 2013, vol. 13, pp. 273–282 (2013)
19.
go back to reference Kusner, M.J., Sun, Y., Kolkin, N.I., Weinberger, K.Q.: From word embeddings to document distances. In: ICML 2015, pp. 957–966 (2015) Kusner, M.J., Sun, Y., Kolkin, N.I., Weinberger, K.Q.: From word embeddings to document distances. In: ICML 2015, pp. 957–966 (2015)
20.
go back to reference Li, R., Wang, S., Deng, H., Wang, R., Chang, K.C.-C.: Towards social user profiling: unified and discriminative influence model for inferring home locations. In: SIGKDD 2012, pp. 1023–1031. ACM (2012) Li, R., Wang, S., Deng, H., Wang, R., Chang, K.C.-C.: Towards social user profiling: unified and discriminative influence model for inferring home locations. In: SIGKDD 2012, pp. 1023–1031. ACM (2012)
21.
go back to reference Lian, D., Ge, Y., Zhang, F., Yuan, N.J., Xie, X., Zhou, T., Rui, Y.: Content-aware collaborative filtering for location recommendation based on human mobility data. In: ICDM 2015, pp. 261–270. IEEE (2015) Lian, D., Ge, Y., Zhang, F., Yuan, N.J., Xie, X., Zhou, T., Rui, Y.: Content-aware collaborative filtering for location recommendation based on human mobility data. In: ICDM 2015, pp. 261–270. IEEE (2015)
22.
go back to reference Liu, J., Inkpen, D.: Estimating user location in social media with stacked denoising auto-encoders. In: NAACL-HLT 2015, pp. 201–210 (2015) Liu, J., Inkpen, D.: Estimating user location in social media with stacked denoising auto-encoders. In: NAACL-HLT 2015, pp. 201–210 (2015)
23.
go back to reference Mahmud, J., Nichols, J., Drews, C.: Where is this tweet from? inferring home locations of twitter users. In: ICWSM 2012, vol. 12, pp. 511–514 (2012) Mahmud, J., Nichols, J., Drews, C.: Where is this tweet from? inferring home locations of twitter users. In: ICWSM 2012, vol. 12, pp. 511–514 (2012)
24.
go back to reference Mikolov, T., Sutskever, I., Chen, K., Corrado, G.S., Dean, J.: Distributed representations of words and phrases and their compositionality. In: NIPS 2013, pp. 3111–3119 (2013) Mikolov, T., Sutskever, I., Chen, K., Corrado, G.S., Dean, J.: Distributed representations of words and phrases and their compositionality. In: NIPS 2013, pp. 3111–3119 (2013)
25.
go back to reference Miura, Y., Taniguchi, M., Taniguchi, T., Ohkuma, T.: A simple scalable neural networks based model for geolocation prediction in twitter. In: WNUT 2016, pp. 235–239 (2016) Miura, Y., Taniguchi, M., Taniguchi, T., Ohkuma, T.: A simple scalable neural networks based model for geolocation prediction in twitter. In: WNUT 2016, pp. 235–239 (2016)
27.
go back to reference Rahimi, A., Baldwin, T., Cohn, T.: Continuous representation of location for geolocation and lexical dialectology using mixture density networks. In: EMNLP 2017, pp. 167–176. ACL (2017) Rahimi, A., Baldwin, T., Cohn, T.: Continuous representation of location for geolocation and lexical dialectology using mixture density networks. In: EMNLP 2017, pp. 167–176. ACL (2017)
28.
go back to reference Rahimi, A., Cohn, T., Baldwin, T.: Twitter user geolocation using a unified text and network prediction model. In: ACL-IJCNLP 2015, pp. 630–636. ACL (2015) Rahimi, A., Cohn, T., Baldwin, T.: Twitter user geolocation using a unified text and network prediction model. In: ACL-IJCNLP 2015, pp. 630–636. ACL (2015)
29.
go back to reference Rahimi, A., Cohn, T., Baldwin, T.: A neural model for user geolocation and lexical dialectology. In: ACL 2017 (2017) Rahimi, A., Cohn, T., Baldwin, T.: A neural model for user geolocation and lexical dialectology. In: ACL 2017 (2017)
30.
go back to reference Rahimi, A., Vu, D., Cohn, T., Baldwin, T.: Exploiting text and network context for geolocation of social media users. In: NAACL-HLT 2015, pp. 1362–1367. ACL (2015) Rahimi, A., Vu, D., Cohn, T., Baldwin, T.: Exploiting text and network context for geolocation of social media users. In: NAACL-HLT 2015, pp. 1362–1367. ACL (2015)
31.
go back to reference Roller, S., Speriosu, M., Rallapalli, S., Wing, B., Baldridge, J.: Supervised text-based geolocation using language models on an adaptive grid. In: EMNLP-CONLL 2012, pp. 1500–1510. ACL (2012) Roller, S., Speriosu, M., Rallapalli, S., Wing, B., Baldridge, J.: Supervised text-based geolocation using language models on an adaptive grid. In: EMNLP-CONLL 2012, pp. 1500–1510. ACL (2012)
34.
go back to reference Wing, B., Baldridge, J.: Hierarchical discriminative classification for text-based geolocation. In: EMNLP 2014, pp. 336–348. ACL (2014) Wing, B., Baldridge, J.: Hierarchical discriminative classification for text-based geolocation. In: EMNLP 2014, pp. 336–348. ACL (2014)
Metadata
Title
Leveraging Local Interactions for Geolocating Social Media Users
Authors
Mohammad Ebrahimi
Elaheh ShafieiBavani
Raymond Wong
Fang Chen
Copyright Year
2018
DOI
https://doi.org/10.1007/978-3-319-93040-4_63

Premium Partner