Skip to main content

2020 | OriginalPaper | Buchkapitel

Enhanced Entity Mention Recognition and Disambiguation Technologies for Chinese Knowledge Base Q&A

verfasst von : Gang Wu, Wenfang Wu, Hangxu Ji, Xianxian Hou, Li Xia

Erschienen in: Semantic Technology

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Entity linking, which usually involves mention recognition and entity disambiguation, is an important task in knowledge base question and answer (KBQA). However, due to the diversity of Chinese grammatical structure, the complexity of Chinese natural language expressions and the lack of contextual information, there are still many challenges in the task of the Chinese KBQA. We discussed two subtasks of the entity linking separately. For the mention recognition part, in order to get the only topic entity mention of the question, we proposed a topic entity mention recognition algorithm based on sequence annotation. The algorithm combines a variety of feature vectors based on word embedding, and uses model BiGRU-CRF model to perform sequence labeling modeling. We also proposed an entity disambiguation algorithm based on a similarity calculation with extended information. The algorithm not only realized the information expansion by crawling the candidate entity for related problems, but also made full use of contextual information by combining lexical level similarity and sentence semantic similarity. In addition, the experimental results show that the proposed entity linking solution possesses huge advantages compared to several baseline systems.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
1
Alibaba Co.
 
2
Yun Ma is the creator of Alibaba Co.
 
4
Smartisan Technology Co., Ltd., commonly known as Smartisan, is a Chinese multinational technology company headquartered in Beijing and Chengdu.
 
Literatur
1.
Zurück zum Zitat Basile, P., Caputo, A.: Entity linking for tweets. Encycl. Seman. Comput. Rob. Intell. 01(01), 1630020 (2017)CrossRef Basile, P., Caputo, A.: Entity linking for tweets. Encycl. Seman. Comput. Rob. Intell. 01(01), 1630020 (2017)CrossRef
3.
Zurück zum Zitat Gutmann, B., Kersting, K.: TildeCRF: conditional random fields for logical sequences. In: European Conference on Machine Learning (2006) Gutmann, B., Kersting, K.: TildeCRF: conditional random fields for logical sequences. In: European Conference on Machine Learning (2006)
4.
Zurück zum Zitat Hachey, B., Radford, W., Nothman, J., Honnibal, M., Curran, J.R.: Evaluating entity linking with wikipedia. Artif. Intell. 194(3), 130–150 (2013)MathSciNetCrossRef Hachey, B., Radford, W., Nothman, J., Honnibal, M., Curran, J.R.: Evaluating entity linking with wikipedia. Artif. Intell. 194(3), 130–150 (2013)MathSciNetCrossRef
5.
Zurück zum Zitat Han, X., Le, S., Zhao, J.: Collective entity linking in web text: a graph-based method (2011) Han, X., Le, S., Zhao, J.: Collective entity linking in web text: a graph-based method (2011)
6.
Zurück zum Zitat Hancock, J.M.: Jaccard Distance (Jaccard Index, Jaccard Similarity Coefficient) (2014) Hancock, J.M.: Jaccard Distance (Jaccard Index, Jaccard Similarity Coefficient) (2014)
7.
Zurück zum Zitat Hkiri, A.O.E., Mallat, S., Zrigui, M.: Improving coverage of rule based NER systems. In: International Conference on Information & Communication Technology & Accessibility (2016) Hkiri, A.O.E., Mallat, S., Zrigui, M.: Improving coverage of rule based NER systems. In: International Conference on Information & Communication Technology & Accessibility (2016)
8.
Zurück zum Zitat Hoffart, J., et al.: Robust disambiguation of named entities in text. In: Conference on Empirical Methods in Natural Language Processing (2015) Hoffart, J., et al.: Robust disambiguation of named entities in text. In: Conference on Empirical Methods in Natural Language Processing (2015)
10.
Zurück zum Zitat Lafferty, J., McCallum, A., Pereira, F.C.: Conditional random fields: probabilistic models for segmenting and labeling sequence data (2001) Lafferty, J., McCallum, A., Pereira, F.C.: Conditional random fields: probabilistic models for segmenting and labeling sequence data (2001)
11.
Zurück zum Zitat Pilz, A., Paaß, G.: From names to entities using thematic context distance (2011) Pilz, A., Paaß, G.: From names to entities using thematic context distance (2011)
12.
Zurück zum Zitat Ratinov, L.A., Dan, R., Downey, D., Anderson, M.: Local and global algorithms for disambiguation to wikipedia. In: Meeting of the Association for Computational Linguistics: Human Language Technologies (2011) Ratinov, L.A., Dan, R., Downey, D., Anderson, M.: Local and global algorithms for disambiguation to wikipedia. In: Meeting of the Association for Computational Linguistics: Human Language Technologies (2011)
13.
Zurück zum Zitat Shen, W., Wang, J., Han, J.: Entity linking with a knowledge base: issues, techniques, and solutions. IEEE Trans. Knowl. Data Eng. 27(2), 443–460 (2015)CrossRef Shen, W., Wang, J., Han, J.: Entity linking with a knowledge base: issues, techniques, and solutions. IEEE Trans. Knowl. Data Eng. 27(2), 443–460 (2015)CrossRef
14.
Zurück zum Zitat Wei, Z., Yan, C.S., Jian, S., Tan, C.L.: Entity linking with effective acronym expansion, instance selection and topic modeling. In: International Joint Conference on Artificial Intelligence (2011) Wei, Z., Yan, C.S., Jian, S., Tan, C.L.: Entity linking with effective acronym expansion, instance selection and topic modeling. In: International Joint Conference on Artificial Intelligence (2011)
15.
Zurück zum Zitat Yao, H., Liu, H., Zhang, P.: A novel sentence similarity model with word embedding based on convolutional neural network: sentence similarity model with word embedding based on convolutional neural network. Concurrency Comput. Pract. Experience 30, e4415 (2018)CrossRef Yao, H., Liu, H., Zhang, P.: A novel sentence similarity model with word embedding based on convolutional neural network: sentence similarity model with word embedding based on convolutional neural network. Concurrency Comput. Pract. Experience 30, e4415 (2018)CrossRef
16.
Zurück zum Zitat Zheng, Z., Li, F., Huang, M., Zhu, X.: Learning to link entities with knowledge base. In: Human Language Technologies: The Conference of the North American Chapter of the Association for Computational Linguistics (2010) Zheng, Z., Li, F., Huang, M., Zhu, X.: Learning to link entities with knowledge base. In: Human Language Technologies: The Conference of the North American Chapter of the Association for Computational Linguistics (2010)
Metadaten
Titel
Enhanced Entity Mention Recognition and Disambiguation Technologies for Chinese Knowledge Base Q&A
verfasst von
Gang Wu
Wenfang Wu
Hangxu Ji
Xianxian Hou
Li Xia
Copyright-Jahr
2020
DOI
https://doi.org/10.1007/978-3-030-41407-8_7

Neuer Inhalt