Skip to main content

2018 | OriginalPaper | Buchkapitel

A Method of Chinese Named Entity Recognition Based on CNN-BILSTM-CRF Model

verfasst von : Sun Long, Rao Yuan, Lu Yi, Li Xue

Erschienen in: Data Science

Verlag: Springer Singapore

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The main task of naming entity recognition is to identify the person names, location names, organization names, meaningful time, dates and other quantitative phrases and also classifying them into different categories. Yet, there are lots of field terms, meaningful entities, complicated location names and complex name of organizations contained in all kinds of these field terms is the most important mission in digging out the text now. The extraction of Chinese is more difficult than English entities owing to the lack of definite boundary and size characteristics of Chinese words. Therefore, we would train and test the marked corpus which is based on the CNN-BILSTM-CRF neutral network model in order to make good use of CNN to get the presentation character of words and label the words by using BILSTM and CRF. It makes the extraction of entity in the common came true. The experimental results show that the accuracy rate, recall rate and F value that we got in an unaddressed artificial features condition are 98.81%, 90.70% and 91.57% respectively which is better than the results we got by using the method of BILSTM+CRF and conditional random field (CRF).

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Ma, X., Liu, Z., Hovy, E.: Unsupervised ranking model for entity coreference resolution. In: Proceedings of NAACL-2016, San Diego, California, USA, June 2016 Ma, X., Liu, Z., Hovy, E.: Unsupervised ranking model for entity coreference resolution. In: Proceedings of NAACL-2016, San Diego, California, USA, June 2016
2.
Zurück zum Zitat Volk, M., Clematide, S.: Learn-filter-apply-forget. Mixed approaches to named entity recognition. In: Proceedings of NLDB 2001, pp. 153–163 (2001) Volk, M., Clematide, S.: Learn-filter-apply-forget. Mixed approaches to named entity recognition. In: Proceedings of NLDB 2001, pp. 153–163 (2001)
3.
Zurück zum Zitat Grishman, R., Sundhenim, B.: Message understanding conference-6: a brief history. In: Proceeding of the 16th International Conference on Computational Linguistics (COLING 1996), vol. 8 (1996) Grishman, R., Sundhenim, B.: Message understanding conference-6: a brief history. In: Proceeding of the 16th International Conference on Computational Linguistics (COLING 1996), vol. 8 (1996)
4.
Zurück zum Zitat Bikel, D., Miller, S., Schwartz, R., Weischedel, R.: High-performance learning name-finder. In: Proceedings of the 5th Conference on Applied Natural Language Processing (1997) Bikel, D., Miller, S., Schwartz, R., Weischedel, R.: High-performance learning name-finder. In: Proceedings of the 5th Conference on Applied Natural Language Processing (1997)
5.
Zurück zum Zitat Borthwick, J., Sterling, E., Agichtein, R., Grishman, N.Y.U.: Description of the MENE named entity system as used in MUC-7. MUC-7, Washington D.C, pp. 145–150 (1998) Borthwick, J., Sterling, E., Agichtein, R., Grishman, N.Y.U.: Description of the MENE named entity system as used in MUC-7. MUC-7, Washington D.C, pp. 145–150 (1998)
6.
Zurück zum Zitat Isozaki, H., Kazawa, H.: Efficient support vector classifiers for named entity recognition Isozaki, H., Kazawa, H.: Efficient support vector classifiers for named entity recognition
7.
Zurück zum Zitat Klein, D., Smarr, J., Nguyen, H., et al.: Named entity recognition with character-level models. In: Proceedings of the Seventh Conference on Natural Language Learning at HLT-NAACL 2003-Volume 4. Association for Computational Linguistics, pp. 180–183 (2003) Klein, D., Smarr, J., Nguyen, H., et al.: Named entity recognition with character-level models. In: Proceedings of the Seventh Conference on Natural Language Learning at HLT-NAACL 2003-Volume 4. Association for Computational Linguistics, pp. 180–183 (2003)
8.
Zurück zum Zitat Wu, Y., Zhao, J., Xu, B., et al.: Chinese named entity recognition based on multiple features. In: Proceedings of the Conference on Human Language Technology and Empirical Methods in Natural Language Processing. Association for Computational Linguistics, pp. 427–434 (2005) Wu, Y., Zhao, J., Xu, B., et al.: Chinese named entity recognition based on multiple features. In: Proceedings of the Conference on Human Language Technology and Empirical Methods in Natural Language Processing. Association for Computational Linguistics, pp. 427–434 (2005)
9.
Zurück zum Zitat Feng, Y.: A rapid algorithm to chinese named entity recognition based on single character hints. J. Chin. Inf. Process. 22(1), 104–110 (2008). (In Chinese) Feng, Y.: A rapid algorithm to chinese named entity recognition based on single character hints. J. Chin. Inf. Process. 22(1), 104–110 (2008). (In Chinese)
10.
Zurück zum Zitat Hao, W.: Named entity extraction model based on hierarchical pattern matching. New Technol. Libr. Inf. Serv. (5), pp. 62–68 (2007) Hao, W.: Named entity extraction model based on hierarchical pattern matching. New Technol. Libr. Inf. Serv. (5), pp. 62–68 (2007)
11.
Zurück zum Zitat Li, L., Jin, L., Jiang, Y., Huang, D.: Recognizing biomedical named entities based on the sentence vector/twin word embeddings conditioned bidirectional LSTM. In: Sun, M., Huang, X., Lin, H., Liu, Z., Liu, Y. (eds.) CCL/NLP-NABD -2016. LNCS (LNAI), vol. 10035, pp. 165–176. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-47674-2_15CrossRef Li, L., Jin, L., Jiang, Y., Huang, D.: Recognizing biomedical named entities based on the sentence vector/twin word embeddings conditioned bidirectional LSTM. In: Sun, M., Huang, X., Lin, H., Liu, Z., Liu, Y. (eds.) CCL/NLP-NABD -2016. LNCS (LNAI), vol. 10035, pp. 165–176. Springer, Cham (2016). https://​doi.​org/​10.​1007/​978-3-319-47674-2_​15CrossRef
12.
Zurück zum Zitat Mikolov, T., Karafiat, M., Burget, L., Cernocky, J., Khudanpur, S.: Recurrent neural network based language model. In: INTERSPEECH (2010) Mikolov, T., Karafiat, M., Burget, L., Cernocky, J., Khudanpur, S.: Recurrent neural network based language model. In: INTERSPEECH (2010)
13.
Zurück zum Zitat Graves, A., Schmidhuber, J.: Framewise phoneme classification with bidirectional LSTM and other neural network architectures. Neural Netw. 18, 602–610 (2005)CrossRef Graves, A., Schmidhuber, J.: Framewise phoneme classification with bidirectional LSTM and other neural network architectures. Neural Netw. 18, 602–610 (2005)CrossRef
14.
Zurück zum Zitat Graves, A., Mohamed, A., Hinton, G.: Speech recognition with deep recurrent neural networks (2013) Graves, A., Mohamed, A., Hinton, G.: Speech recognition with deep recurrent neural networks (2013)
15.
Zurück zum Zitat Ratnaparkhi, A.: A maximum entropy model for part-of-speech tagging. In: Proceedings of EMNLP (1996) Ratnaparkhi, A.: A maximum entropy model for part-of-speech tagging. In: Proceedings of EMNLP (1996)
16.
Zurück zum Zitat McCallum, A., Freitag, D., Pereira, F.: Maximum entropy Markov models for information extraction and segmentation. In: Proceedings of ICML (2000) McCallum, A., Freitag, D., Pereira, F.: Maximum entropy Markov models for information extraction and segmentation. In: Proceedings of ICML (2000)
17.
Zurück zum Zitat Lafferty, J., McCallum, A., Pereira, F.: Conditional random fields: probabilistic models for segmenting and labeling sequence data. In: Proceedings of ICML (2001) Lafferty, J., McCallum, A., Pereira, F.: Conditional random fields: probabilistic models for segmenting and labeling sequence data. In: Proceedings of ICML (2001)
18.
Zurück zum Zitat Santos, C.D., Zadrozny, B.: Learning character-level representations for part-of-speech tagging. In: Proceedings of ICML-2014, pp. 1818–1826 (2014) Santos, C.D., Zadrozny, B.: Learning character-level representations for part-of-speech tagging. In: Proceedings of ICML-2014, pp. 1818–1826 (2014)
Metadaten
Titel
A Method of Chinese Named Entity Recognition Based on CNN-BILSTM-CRF Model
verfasst von
Sun Long
Rao Yuan
Lu Yi
Li Xue
Copyright-Jahr
2018
Verlag
Springer Singapore
DOI
https://doi.org/10.1007/978-981-13-2206-8_15

Premium Partner