skip to main content
10.1145/3022227.3022299acmconferencesArticle/Chapter ViewAbstractPublication PagesicuimcConference Proceedingsconference-collections
short-paper

Two-stage approach to named entity recognition using Wikipedia and DBpedia

Published:05 January 2017Publication History

ABSTRACT

In natural language understanding, extraction of named entity (NE) mentions in given text and classification of the mentions into pre-defined NE types are important processes. Most NE recognition (NER) relies on resources such as a training corpus or NE dictionary, but collecting them manually is laborious and time-consuming. This paper proposes a two-stage approach based on nothing but Wikipedia and DBpedia to implement NER. This paper also addresses technical problems in developing Korean NER. In experiments, the proposed method can recognize NEs in short question sentences with 14.2% errors.

References

  1. S. Bird, E. Klein, and E. Loper. Natural language processing with Python. O'Reilly Media, Inc., 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. L. Dong, F. Wei, M. Zhou, and K. Xu. Question answering over Freebase with multi-column convolutional neural networks. In Proceedings of ACL, 2015.Google ScholarGoogle ScholarCross RefCross Ref
  3. J. R. Finkel, T. Grenager, and C. Manning. Incorporating non-local information into information extraction systems by gibbs sampling. In Proceedings of ACL, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. A. Graves, A.-r. Mohamed, and G. Hinton. Speech recognition with deep recurrent neural networks. In Proceedings of IEEE ICASSP, 2013.Google ScholarGoogle ScholarCross RefCross Ref
  5. Y. Kim, Y. Hamn, J. Kim, D. Hwang, and K.-S. Choi. A non-morphological approach for dbpedia URI spotting within korean text. In Proceedings of HCLT, 2014.Google ScholarGoogle Scholar
  6. D. Kingma and J. Ba. Adam: A method for stochastic optimization. In Proceedings of ICLR, 2015.Google ScholarGoogle Scholar
  7. J. Nothman, N. Ringland, W. Radford, T. Murphy, and J. R. Curran. Learning multilingual named entity recognition from Wikipedia. Artif. Intell., 194:151--175, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. M. Schuster and K. K. Paliwal. Bidirectional recurrent neural networks. IEEE Trans Sig. Process., 45:2673--2681, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. W. Shen, J. Wang, and J. Han. Entity linking with a knowledge base: Issues, techniques, and solutions. IEEE Trans. Knowl. Data Eng., 27:443--460, 2015.Google ScholarGoogle ScholarCross RefCross Ref
  10. S. K. H. Song, Yeongkil; Jeong. A semi-automatic construction method of a named entity dictionary based on wikipedia. J. Korea Inst. Inf. Commun. Eng., 42, 2015.Google ScholarGoogle Scholar
  11. N. Srivastava, G. Hinton, A. Krizhevsky, I. Sutskever, and R. Salakhutdinov. Dropout: A simple way to prevent neural networks from overfitting. J. Mach. Learn. Res., 15:1929--1958, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. I. Sutskever, O. Vinyals, and Q. V. Le. Sequence to sequence learning with neural networks. In Proceedings of ACL, 2014.Google ScholarGoogle Scholar
  13. W. Yin, M. Yu, B. Xiang, B. Zhou, and H. Schütze. Simple question answering by attentive convolutional neural network. ArXiv e-prints, 2016.Google ScholarGoogle Scholar

Index Terms

  1. Two-stage approach to named entity recognition using Wikipedia and DBpedia

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in
          • Published in

            cover image ACM Conferences
            IMCOM '17: Proceedings of the 11th International Conference on Ubiquitous Information Management and Communication
            January 2017
            746 pages
            ISBN:9781450348881
            DOI:10.1145/3022227

            Copyright © 2017 ACM

            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 5 January 2017

            Permissions

            Request permissions about this article.

            Request Permissions

            Check for updates

            Qualifiers

            • short-paper

            Acceptance Rates

            IMCOM '17 Paper Acceptance Rate113of366submissions,31%Overall Acceptance Rate213of621submissions,34%

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader