Skip to main content

2021 | OriginalPaper | Buchkapitel

Microblog User Location Inference Based on POI and Query Likelihood Model

verfasst von : Yimin Liu, Xiangyang Luo, Han Li

Erschienen in: Information and Communications Security

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Location inference of microblog users is of great significance for disaster monitoring, public opinion tracing and tracking, and extensive location-based services. However due to the noisy content of microblog text and the ambiguity of geographic location, it is quite difficult to infer user location based only on user-generated text. This paper proposes a microblog user location inference algorithm based on POI and query likelihood model, named PaQL. First, the POI (Point of Interest) model of each region is constructed based on the electronic map. Then, from the word segmentation results of the user’s blog texts, the POIs with stronger location orientation are extracted as user features. Next, the inverse region frequency of POIs is calculated, based on which the correlation between users and the candidate regions is calculated based on the query likelihood model. Finally, the candidate region with the highest correlation is considered as the user’s inferred location. The location inference experiment is conducted on the provincial-level data set (3,862k blogs of 154k users) and the city-level data set (3,086k blogs of 103k users) of Sina Weibo platform. The results show that: Compared with three existing typical algorithms, GP-FLIW, GP-LIWTF and WC-EFS, which are only based on user text, the precision of provincial-level inference is improved by 7.80%, 4.99% and 1.41%, respectively, and the city-level inference precision is improved by 10.67%, 8.38% and 3.72%, respectively. Moreover, the proposed algorithm also outperforms the existing methods in terms of recall and \({F}_{1}\).

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Zola, P., Ragno, C., Cortez, P.: A google trends spatial clustering approach for a worldwide twitter user geolocation. Information Processing & Management 57(6), 102312 (2020). Zola, P., Ragno, C., Cortez, P.: A google trends spatial clustering approach for a worldwide twitter user geolocation. Information Processing & Management 57(6), 102312 (2020).
2.
Zurück zum Zitat Tumasjan, A., Sprenger, T.O., Sandner, P.G., Welpe I.M.: Predicting elections with twitter: what 140 characters reveal about political sentiment. In: 4th International Conference on Weblogs and Social Media, pp. 178–185. AAAI, Washington DC, USA (2010). Tumasjan, A., Sprenger, T.O., Sandner, P.G., Welpe I.M.: Predicting elections with twitter: what 140 characters reveal about political sentiment. In: 4th International Conference on Weblogs and Social Media, pp. 178–185. AAAI, Washington DC, USA (2010).
3.
Zurück zum Zitat Carmela, C., Agostino, F., Clara, P.: Bursty event detection in twitter streams. ACM Trans. Knowl. Discov. Data 13(4), 1–28 (2019) Carmela, C., Agostino, F., Clara, P.: Bursty event detection in twitter streams. ACM Trans. Knowl. Discov. Data 13(4), 1–28 (2019)
4.
Zurück zum Zitat Lan, L., Malbasa, V., Vucetic, S.: Spatial scan for disease mapping on a mobile population. In: 28th AAAI conference on Artificial Intelligence, pp. 431–437. AAAI, Québec, Canada (2014). Lan, L., Malbasa, V., Vucetic, S.: Spatial scan for disease mapping on a mobile population. In: 28th AAAI conference on Artificial Intelligence, pp. 431–437. AAAI, Québec, Canada (2014).
5.
Zurück zum Zitat Heba, A., John, K., Gireeja, R., Horvitz, E.: To buy or not to buy: computing value of spatiotemporal information. ACM Transactions on Spatial Algorithms and Systems 5(40), 1–25 (2019) Heba, A., John, K., Gireeja, R., Horvitz, E.: To buy or not to buy: computing value of spatiotemporal information. ACM Transactions on Spatial Algorithms and Systems 5(40), 1–25 (2019)
7.
Zurück zum Zitat Chen, Z.Y., James, C., Kyumin, L.: You are where you tweet: a content-based approach to geo-locating twitter users. In: 19th ACM International Conference on Information and Knowledge Management, pp. 759–768. ACM, Toronto, Canada (2010). Chen, Z.Y., James, C., Kyumin, L.: You are where you tweet: a content-based approach to geo-locating twitter users. In: 19th ACM International Conference on Information and Knowledge Management, pp. 759–768. ACM, Toronto, Canada (2010).
8.
Zurück zum Zitat Ryoo, K.M., Moon, S.S.: Inferring twitter user locations with 10km accuracy. In: 23rd International Conference on World Wide Web, pp. 643–648. ACM, Seoul, Korea (2014). Ryoo, K.M., Moon, S.S.: Inferring twitter user locations with 10km accuracy. In: 23rd International Conference on World Wide Web, pp. 643–648. ACM, Seoul, Korea (2014).
9.
Zurück zum Zitat Rahimi, A., Vu, D., Cohn, T., Baldwin, T.: Exploiting text and network context for geolocation of social media users. In: 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 1362–1367. NAACL, Denver, USA (2015). Rahimi, A., Vu, D., Cohn, T., Baldwin, T.: Exploiting text and network context for geolocation of social media users. In: 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 1362–1367. NAACL, Denver, USA (2015).
10.
Zurück zum Zitat Huang, B., Carley, K.M.: A hierarchical location prediction neural network for twitter user geolocation. In: 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, pp. 4731–4741. ACL, Hong Kong, China (2019). Huang, B., Carley, K.M.: A hierarchical location prediction neural network for twitter user geolocation. In: 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, pp. 4731–4741. ACL, Hong Kong, China (2019).
11.
Zurück zum Zitat Longbo, K., Zhi, L., Yan, H.: Spot: Locating social media users based on social network context. Proceedings of the VLDB Endowment 7(13), 1681–1684 (2014)CrossRef Longbo, K., Zhi, L., Yan, H.: Spot: Locating social media users based on social network context. Proceedings of the VLDB Endowment 7(13), 1681–1684 (2014)CrossRef
12.
Zurück zum Zitat Tian, H.C., Zhang, M., Luo, X.Y., Liu, F.L., Qiao, Y.Q.: Twitter user location inference based on representation learning and label propagation. In: The Web Conference 2020, pp. 2648–2654. ACM, Taipei, Taiwan (2020). Tian, H.C., Zhang, M., Luo, X.Y., Liu, F.L., Qiao, Y.Q.: Twitter user location inference based on representation learning and label propagation. In: The Web Conference 2020, pp. 2648–2654. ACM, Taipei, Taiwan (2020).
13.
Zurück zum Zitat Miura, Y., Taniguchi, M., Taniguchi, T., Ohkuma, T.: Unifying text, metadata, and user network representations with a neural network for geolocation prediction. In: 55th Annual Meeting of the Association for Computational Linguistics, pp. 1260–1272. ACL, Vancouver, Canada (2017). Miura, Y., Taniguchi, M., Taniguchi, T., Ohkuma, T.: Unifying text, metadata, and user network representations with a neural network for geolocation prediction. In: 55th Annual Meeting of the Association for Computational Linguistics, pp. 1260–1272. ACL, Vancouver, Canada (2017).
14.
Zurück zum Zitat Rahimi, A., Cohn, T., Baldwin, T.: Semi-supervised user geolocation via graph convolutional networks. In: 56th Annual Meeting of the Association for Computational Linguistics, pp. 2009–2019. ACL, Melbourne, Australia (2018). Rahimi, A., Cohn, T., Baldwin, T.: Semi-supervised user geolocation via graph convolutional networks. In: 56th Annual Meeting of the Association for Computational Linguistics, pp. 2009–2019. ACL, Melbourne, Australia (2018).
15.
Zurück zum Zitat Ryan, C., David, J., David, A.: Geotagging one hundred million twitter accounts with total variation minimization. In: 2014 IEEE International Conference on Big Data, pp. 393–401. IEEE, Washington DC, USA (2014). Ryan, C., David, J., David, A.: Geotagging one hundred million twitter accounts with total variation minimization. In: 2014 IEEE International Conference on Big Data, pp. 393–401. IEEE, Washington DC, USA (2014).
16.
Zurück zum Zitat Xin, Z., Han, J., Sun, A.: A survey of location prediction on twitter. IEEE Trans. Knowl. Data Eng. 30(9), 1–20 (2017) Xin, Z., Han, J., Sun, A.: A survey of location prediction on twitter. IEEE Trans. Knowl. Data Eng. 30(9), 1–20 (2017)
17.
Zurück zum Zitat Hecht, B., Hong, L.C., Suh, B.W., Chi, E.H.: Tweets from Justin Bieber’s heart: the dynamics of the location field in user profiles. In: 2011 CHI Conference on Human Factors in Computing Systems, pp. 237–246. ACM, Vancouver, Canada (2011). Hecht, B., Hong, L.C., Suh, B.W., Chi, E.H.: Tweets from Justin Bieber’s heart: the dynamics of the location field in user profiles. In: 2011 CHI Conference on Human Factors in Computing Systems, pp. 237–246. ACM, Vancouver, Canada (2011).
19.
Zurück zum Zitat Li, C., Weng, J.S., He, Q., Yao, Y.X., Datta, A., Sun, A., et al.: Twiner: Named entity recognition in targeted twitter stream. In: 35th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 721–730. ACM, Portland, USA (2012). Li, C., Weng, J.S., He, Q., Yao, Y.X., Datta, A., Sun, A., et al.: Twiner: Named entity recognition in targeted twitter stream. In: 35th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 721–730. ACM, Portland, USA (2012).
20.
Zurück zum Zitat Han, B., Cook, P., Baldwin, T.: Geolocation prediction in social media data by finding location indicative words. In: 24th International Conference on Computational Linguistics, pp. 1045–1062. ACM, Mumbai, India (2012). Han, B., Cook, P., Baldwin, T.: Geolocation prediction in social media data by finding location indicative words. In: 24th International Conference on Computational Linguistics, pp. 1045–1062. ACM, Mumbai, India (2012).
21.
Zurück zum Zitat Han, B., Cook, P., Baldwin, T.: Text-based twitter user geolocation prediction. Journal of Artificial Intelligence Research 49(1), 451–500 (2014)CrossRef Han, B., Cook, P., Baldwin, T.: Text-based twitter user geolocation prediction. Journal of Artificial Intelligence Research 49(1), 451–500 (2014)CrossRef
22.
Zurück zum Zitat Chi, L.H., Lim, K.H., Alam, N., Butler, C.: Geolocation prediction in twitter using location indicative words and textual features. In: 2nd Workshop on Noisy User-Generated Text, pp. 227–234. COLING, Osaka, Japan (2016). Chi, L.H., Lim, K.H., Alam, N., Butler, C.: Geolocation prediction in twitter using location indicative words and textual features. In: 2nd Workshop on Noisy User-Generated Text, pp. 227–234. COLING, Osaka, Japan (2016).
23.
Zurück zum Zitat Tian, H.C.: Research on social network user location prediction technology, University of Information Engineering (2019). Tian, H.C.: Research on social network user location prediction technology, University of Information Engineering (2019).
24.
Zurück zum Zitat Jay, M., Croft, Bruce, W.: A language modeling approach to information retrieval. In: 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 275–281. ACM, Melbourne, Australia (1998). Jay, M., Croft, Bruce, W.: A language modeling approach to information retrieval. In: 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 275–281. ACM, Melbourne, Australia (1998).
25.
Zurück zum Zitat Salton, G., Buckley, C.: Term-weighting approaches in automatic text retrieval. Inf. Process. Manage. 24(5), 513–523 (1988)CrossRef Salton, G., Buckley, C.: Term-weighting approaches in automatic text retrieval. Inf. Process. Manage. 24(5), 513–523 (1988)CrossRef
Metadaten
Titel
Microblog User Location Inference Based on POI and Query Likelihood Model
verfasst von
Yimin Liu
Xiangyang Luo
Han Li
Copyright-Jahr
2021
DOI
https://doi.org/10.1007/978-3-030-86890-1_26

Premium Partner