Skip to main content
Top
Published in: Journal of Geographical Systems 1/2024

11-01-2024 | Original Article

CHTopoNER model-based method for recognizing Chinese place names from social media information

Authors: Mengwei Zhang, Xingui Liu, Zheng Zhang, Yue Qiu, Zhipeng Jiang, Pengyu Zhang

Published in: Journal of Geographical Systems | Issue 1/2024

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Chinese toponym recognition is crucial in named entity recognition and has significant implications for improving geographic information systems. Based on the real-time nature of social media and rich geographical data contained in social media, it is important to identify Chinese toponyms, including compound toponyms, informal toponyms, and other forms of social media content, for automatic geospatial information extraction. However, the strong word-building ability, diverse features, and ambiguity of Chinese toponyms combined with the linguistic irregularities of social media pose significant challenges for accurately locating toponym boundaries and resolving ambiguities. Furthermore, existing Chinese toponym recognition methods often ignore the fusion of local and global features during feature extraction, resulting in semantic information loss. Therefore, we used the Chinese-roberta-wwm-ext pre-trained language model to encode input text and obtain character-level information. An improved SoftLexicon-based statistical method was employed to acquire word-level semantic information, which was then integrated with character-level semantic information. A two-channel neural network layer comprising a bi-directional long short-term memory and an inception-dilated convolutional neural network was utilized to extract global and local features from text. Additionally, a conditional random field was applied to establish label constraints. The proposed deep neural network model, called CHTopoNER, is designed to identify various forms of Chinese toponyms in irregular Chinese social media content. Its effectiveness was validated on four publicly available annotated toponym datasets and a custom social media dataset. CHTopoNER surpasses state-of-the-art Chinese toponym recognition models and achieves promising results for extracting various types of toponyms and spatial location terms.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
go back to reference Akbik A, Bergmann T, Blythe D et al (2019) FLAIR: an easy-to-use framework for state-of-the-art NLP. In: Proceedings of the 2019 conference of the North American chapter of the association for computational linguistics (demonstrations), pp 54–59 Akbik A, Bergmann T, Blythe D et al (2019) FLAIR: an easy-to-use framework for state-of-the-art NLP. In: Proceedings of the 2019 conference of the North American chapter of the association for computational linguistics (demonstrations), pp 54–59
go back to reference Amada I, Asai A, Shindo H et al (2020) LUKE: deep contextualized entity representations with entity-aware self-attention. arXiv preprint arXiv:2010.01057 Amada I, Asai A, Shindo H et al (2020) LUKE: deep contextualized entity representations with entity-aware self-attention. arXiv preprint arXiv:​2010.​01057
go back to reference Amitay E, Har’El N, Sivan R et al (2004) Web-a-where: geotagging web content. In: Proceedings of the 27th annual international ACM SIGIR conference on research and development in information retrieval, pp 273–280 Amitay E, Har’El N, Sivan R et al (2004) Web-a-where: geotagging web content. In: Proceedings of the 27th annual international ACM SIGIR conference on research and development in information retrieval, pp 273–280
go back to reference Bo C, Weihong LI, Haoxin T (2019) Chinese hierarchical address segmentation based on BiLSTM-CRF. Geogr Inf Sci 21(8):1143–1151 Bo C, Weihong LI, Haoxin T (2019) Chinese hierarchical address segmentation based on BiLSTM-CRF. Geogr Inf Sci 21(8):1143–1151
go back to reference Chen W, Zhang Y, Isahara H (2006) Chinese named entity recognition with conditional random fields. In: Proceedings of the 5th SIGHAN workshop on Chinese language processing, pp 118–121 Chen W, Zhang Y, Isahara H (2006) Chinese named entity recognition with conditional random fields. In: Proceedings of the 5th SIGHAN workshop on Chinese language processing, pp 118–121
go back to reference Chen Y, Ouyang Y, Li W et al (2010) Using deep belief nets for Chinese named entity categorization. In: Proceedings of the 2010 named entities workshop, pp 102–109 Chen Y, Ouyang Y, Li W et al (2010) Using deep belief nets for Chinese named entity categorization. In: Proceedings of the 2010 named entities workshop, pp 102–109
go back to reference Collobert R, Weston J, Bottou L et al (2011) Natural language processing (almost) from scratch. J Mach Learn Res 12:2493–2537 Collobert R, Weston J, Bottou L et al (2011) Natural language processing (almost) from scratch. J Mach Learn Res 12:2493–2537
go back to reference DeLozier G, Baldridge J, London L (2015) Gazetteer-independent toponym resolution using geographic word profiles. In: 29th AAAI conference on artificial intelligence, vol 29 DeLozier G, Baldridge J, London L (2015) Gazetteer-independent toponym resolution using geographic word profiles. In: 29th AAAI conference on artificial intelligence, vol 29
go back to reference Devlin J, Chang MW, Lee K et al (2018) Bert: pretraining of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 Devlin J, Chang MW, Lee K et al (2018) Bert: pretraining of deep bidirectional transformers for language understanding. arXiv preprint arXiv:​1810.​04805
go back to reference Di L, Ling X, Guangwen W (2021) Design of Chinese named entity recognition algorithm based on BiLSTM-CRF model. In: 2021 IEEE conference on telecommunications, optics and computer science (TOCS), pp 37–41 Di L, Ling X, Guangwen W (2021) Design of Chinese named entity recognition algorithm based on BiLSTM-CRF model. In: 2021 IEEE conference on telecommunications, optics and computer science (TOCS), pp 37–41
go back to reference Du P, Liu Y (2011) Recognition of Chinese place names based on ontology. Xibei Shifan Daxue Xuebao J Northwest Norm Univ 47(6):87–93 Du P, Liu Y (2011) Recognition of Chinese place names based on ontology. Xibei Shifan Daxue Xuebao J Northwest Norm Univ 47(6):87–93
go back to reference Fernández NJ, Periñán-Pascual C (2021) nLORE: a linguistically rich deep-learning system for locative-reference extraction in tweets. In: Intelligent environments 2021: workshop proceedings of the 17th international conference on intelligent environments, vol 29. IOS Press, pp 243 Fernández NJ, Periñán-Pascual C (2021) nLORE: a linguistically rich deep-learning system for locative-reference extraction in tweets. In: Intelligent environments 2021: workshop proceedings of the 17th international conference on intelligent environments, vol 29. IOS Press, pp 243
go back to reference Finkel JR, Grenager T, Manning CD (2005) Incorporating non-local information into information extraction systems by Gibbs sampling. In: Proceedings of the 43rd Annual meeting of the association for computational linguistics (ACL’05), pp 363–370 Finkel JR, Grenager T, Manning CD (2005) Incorporating non-local information into information extraction systems by Gibbs sampling. In: Proceedings of the 43rd Annual meeting of the association for computational linguistics (ACL’05), pp 363–370
go back to reference Goodchild MF (2007) Citizens as voluntary sensors: spatial data infrastructure in the world of web 2.0. Int J Spat Data Infrastruct Res 2(2):24–32 Goodchild MF (2007) Citizens as voluntary sensors: spatial data infrastructure in the world of web 2.0. Int J Spat Data Infrastruct Res 2(2):24–32
go back to reference Goyal P, Dollar P, Girshick RB et al (2017) Accurate, large minibatch SGD: training ImageNet in 1 h. arXiv: computer vision and pattern recognition Goyal P, Dollar P, Girshick RB et al (2017) Accurate, large minibatch SGD: training ImageNet in 1 h. arXiv: computer vision and pattern recognition
go back to reference Graves A, Mohamed A, Hinton G (2013) Speech recognition with deep recurrent neural networks. In: 2013 IEEE international conference on acoustics, speech, and signal processing, pp 6645–6649 Graves A, Mohamed A, Hinton G (2013) Speech recognition with deep recurrent neural networks. In: 2013 IEEE international conference on acoustics, speech, and signal processing, pp 6645–6649
go back to reference Grishman R, Sundheim BM (1996) A brief history. In: COLING, volume 1. Message understanding conference: 16th international conference on computational linguistics Grishman R, Sundheim BM (1996) A brief history. In: COLING, volume 1. Message understanding conference: 16th international conference on computational linguistics
go back to reference Hill LL (2009) Georeferencing: the geographic associations of information. MIT Press, Cambridge Hill LL (2009) Georeferencing: the geographic associations of information. MIT Press, Cambridge
go back to reference Hoffer E, Hubara I, Soudry D (2017) Train longer, generalize better: closing the generalization gap in large batch training of neural networks. Adv Neural Inf Process Syst 30:1731–1741 Hoffer E, Hubara I, Soudry D (2017) Train longer, generalize better: closing the generalization gap in large batch training of neural networks. Adv Neural Inf Process Syst 30:1731–1741
go back to reference Hu YH, Ge L (2007) A supervised machine learning approach to toponym disambiguation. The geospatial web: how geobrowsers, social software and the web 2.0 are Shaping the network society. Springer, Cham, pp 117–128CrossRef Hu YH, Ge L (2007) A supervised machine learning approach to toponym disambiguation. The geospatial web: how geobrowsers, social software and the web 2.0 are Shaping the network society. Springer, Cham, pp 117–128CrossRef
go back to reference Hu X, Zhou Z, Sun Y, Kersten J, Klan F, Fan H, Wiegmann M (2022b) GazPNE2: a general place name extractor for microblogs fusing gazetteers and pretrained transformer models. IEEE Internet Things J 9(17):16259–16271CrossRef Hu X, Zhou Z, Sun Y, Kersten J, Klan F, Fan H, Wiegmann M (2022b) GazPNE2: a general place name extractor for microblogs fusing gazetteers and pretrained transformer models. IEEE Internet Things J 9(17):16259–16271CrossRef
go back to reference Kamalloo E, Rafiei D (2018) A coherent unsupervised model for toponym resolution. In: Proceedings of the 2018 world wide web conference, pp 1287–1296 Kamalloo E, Rafiei D (2018) A coherent unsupervised model for toponym resolution. In: Proceedings of the 2018 world wide web conference, pp 1287–1296
go back to reference Keskar NS, Mudigere D, Nocedal J et al (2016) On large-batch training for deep learning: generalization gap and sharp minima. arXiv preprint arXiv:1609.04836 Keskar NS, Mudigere D, Nocedal J et al (2016) On large-batch training for deep learning: generalization gap and sharp minima. arXiv preprint arXiv:​1609.​04836
go back to reference Lafferty J, McCallum A, Pereira FCN (2001) Conditional random fields: Probabilistic models for segmenting and labeling sequence data Lafferty J, McCallum A, Pereira FCN (2001) Conditional random fields: Probabilistic models for segmenting and labeling sequence data
go back to reference Levow GA (2006) The 3 international Chinese language processing bakeoff: word segmentation and named entity recognition. In: Proceedings of the Fifth SIGHAN workshop on Chinese language processing, pp 108–117 Levow GA (2006) The 3 international Chinese language processing bakeoff: word segmentation and named entity recognition. In: Proceedings of the Fifth SIGHAN workshop on Chinese language processing, pp 108–117
go back to reference Lieberman MD, Samet H Sankaranarayanan J (2010) Geotagging with local lexicons to build indexes for textually-specified spatial data. In: 2010 IEEE 26th International conference on data engineering, pp 201–212 Lieberman MD, Samet H Sankaranarayanan J (2010) Geotagging with local lexicons to build indexes for textually-specified spatial data. In: 2010 IEEE 26th International conference on data engineering, pp 201–212
go back to reference Mengjun K, Qingyun DU, Mingjun W (2015) A new method of Chinese address extraction based on address tree model. Acta Geod Cartogr Sin 44(1):99 Mengjun K, Qingyun DU, Mingjun W (2015) A new method of Chinese address extraction based on address tree model. Acta Geod Cartogr Sin 44(1):99
go back to reference Pennington J, Socher R, Manning CD (2014) Glove: global vectors for word representation. In: Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP), pp 1532–1543 Pennington J, Socher R, Manning CD (2014) Glove: global vectors for word representation. In: Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP), pp 1532–1543
go back to reference Qin Y, Lin Y, Takanobu R et al (2020) ERICA: improving entity and relation understanding for pretrained language models via contrastive learning. arXiv preprint arXiv:2012.15022 Qin Y, Lin Y, Takanobu R et al (2020) ERICA: improving entity and relation understanding for pretrained language models via contrastive learning. arXiv preprint arXiv:​2012.​15022
go back to reference Roberts K, Bejan CA, Harabagiu S (2010) Toponym disambiguation using events. In: 23rd international FLAIRS conference, vol 10 Roberts K, Bejan CA, Harabagiu S (2010) Toponym disambiguation using events. In: 23rd international FLAIRS conference, vol 10
go back to reference Si S, Danhao Z (2017) Research on Chinese place name recognition based on deep learning. Trans Beijing Inst Technol 37(11):54–59 Si S, Danhao Z (2017) Research on Chinese place name recognition based on deep learning. Trans Beijing Inst Technol 37(11):54–59
go back to reference Smith LN (2017) Cyclical learning rates for training neural networks. In: 2017 IEEE winter conference on applications of computer vision, pp 464–472 Smith LN (2017) Cyclical learning rates for training neural networks. In: 2017 IEEE winter conference on applications of computer vision, pp 464–472
go back to reference Wolf T, Debut L, Sanh V et al (2020) Transformers: state-of-the-art natural language processing. In: Proceedings of the 2020 conference on empirical methods in natural language processing: system demonstrations, pp 38–45 Wolf T, Debut L, Sanh V et al (2020) Transformers: state-of-the-art natural language processing. In: Proceedings of the 2020 conference on empirical methods in natural language processing: system demonstrations, pp 38–45
go back to reference Xueying Z, Chuju Z, Guonian LÜ (2010) Design and analysis of a classification scheme of geographical named entities. Geo Inf Sci 12(2):220–227 Xueying Z, Chuju Z, Guonian LÜ (2010) Design and analysis of a classification scheme of geographical named entities. Geo Inf Sci 12(2):220–227
go back to reference Yu B, Wei J (2020) IDCNN-CRF-based domain named entity recognition method. In: 2020 2nd international conference on civil aviation safety and information technology ICCASIT, pp 542–546 Yu B, Wei J (2020) IDCNN-CRF-based domain named entity recognition method. In: 2020 2nd international conference on civil aviation safety and information technology ICCASIT, pp 542–546
Metadata
Title
CHTopoNER model-based method for recognizing Chinese place names from social media information
Authors
Mengwei Zhang
Xingui Liu
Zheng Zhang
Yue Qiu
Zhipeng Jiang
Pengyu Zhang
Publication date
11-01-2024
Publisher
Springer Berlin Heidelberg
Published in
Journal of Geographical Systems / Issue 1/2024
Print ISSN: 1435-5930
Electronic ISSN: 1435-5949
DOI
https://doi.org/10.1007/s10109-023-00433-w

Other articles of this Issue 1/2024

Journal of Geographical Systems 1/2024 Go to the issue

Premium Partner