Skip to main content
Top

2018 | OriginalPaper | Chapter

Unsupervised Slot Filler Refinement via Entity Community Construction

Authors : Zengzhuang Xu, Rui Song, Bowei Zou, Yu Hong

Published in: Natural Language Processing and Chinese Computing

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Given an entity (query), slot filling aims to find and extract the values (slot fillers) of its specific attributes (slot types) from a large-scale of document collections. Most existing work of slot filling models slot fillers separately and only considers direct relations between slot fillers and query, ignoring other slot fillers in context. In this paper we propose an unsupervised slot filler refinement approach via entity community construction to filter out the incorrect fillers collaboratively. The community-based framework mainly consists of (1) filler community generated by a point-wise mutual information-based hierarchical clustering, and (2) query community constructed by a co-occurrence graph model.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Footnotes
1
queryID: SF13_ENG_038, in KBP 2013 ESF data set.
 
3
per:{cause_of_death, date_of_birth, date_of_death, age, charges} and
org:{date_founded, date_dissolved, number_of_employees_members, website}.
 
5
With slot type per:parents.
 
Literature
1.
go back to reference Surdeanu, M.: Overview of the TAC2013 knowledge base population evaluation: English slot filling and temporal slot filling. In: Proceedings of the Sixth Text Analysis Conference (TAC) (2013) Surdeanu, M.: Overview of the TAC2013 knowledge base population evaluation: English slot filling and temporal slot filling. In: Proceedings of the Sixth Text Analysis Conference (TAC) (2013)
2.
3.
go back to reference Ji, H., Grishman, R., Dang, H.T., Griffitt, K., Ellis, J.: Overview of the TAC 2010 knowledge base population track. In: Proceedings of the Third Text Analysis Conference (TAC) (2010) Ji, H., Grishman, R., Dang, H.T., Griffitt, K., Ellis, J.: Overview of the TAC 2010 knowledge base population track. In: Proceedings of the Third Text Analysis Conference (TAC) (2010)
4.
go back to reference Sammons, M., Song, Y., Wang, R., Kundu, G., Tsai, C.T., Upadhyay, S., Ancha, S., Mayhew, S., Roth, D.: Overview of UI-CCQ systems for event argument extraction, entity discovery and linking, and slot filler validation. In: Proceedings of the Seventh Text Analysis Conference (TAC) (2014) Sammons, M., Song, Y., Wang, R., Kundu, G., Tsai, C.T., Upadhyay, S., Ancha, S., Mayhew, S., Roth, D.: Overview of UI-CCQ systems for event argument extraction, entity discovery and linking, and slot filler validation. In: Proceedings of the Seventh Text Analysis Conference (TAC) (2014)
5.
go back to reference Yu, D., Huang, H., Cassidy, T., Ji, H., Wang, C., Zhi, S., Han, J., Voss, C.R., Magdon-Ismail, M.: The wisdom of minority: unsupervised slot filling validation based on multi-dimensional truth-finding. In: Proceedings of the 25th International Conference on Computational Linguistics (COLING), pp. 1567–1578 (2014) Yu, D., Huang, H., Cassidy, T., Ji, H., Wang, C., Zhi, S., Han, J., Voss, C.R., Magdon-Ismail, M.: The wisdom of minority: unsupervised slot filling validation based on multi-dimensional truth-finding. In: Proceedings of the 25th International Conference on Computational Linguistics (COLING), pp. 1567–1578 (2014)
6.
go back to reference Rajani, N.F., Viswanathan, V., Bentor, Y., Mooney, R.J.: Stacked ensembles of information extractors for knowledge-base population. In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (ACL), pp. 177–187 (2015) Rajani, N.F., Viswanathan, V., Bentor, Y., Mooney, R.J.: Stacked ensembles of information extractors for knowledge-base population. In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (ACL), pp. 177–187 (2015)
7.
go back to reference Xu, S., Zhang, C., Niu, Z., Mei, R., Chen, J., Zhang, J., Fu, H.: Bit’s slot-filling method for TAC-KBP 2013. In: Proceedings of the Sixth Text Analysis Conference (TAC) (2013) Xu, S., Zhang, C., Niu, Z., Mei, R., Chen, J., Zhang, J., Fu, H.: Bit’s slot-filling method for TAC-KBP 2013. In: Proceedings of the Sixth Text Analysis Conference (TAC) (2013)
8.
go back to reference Nguyen, T.H., He, Y., Pershina, M., Li, X., Grishman, R.: New York University 2014 knowledge base population systems. In: Proceedings of the Seventh Text Analysis Conference (TAC) (2014) Nguyen, T.H., He, Y., Pershina, M., Li, X., Grishman, R.: New York University 2014 knowledge base population systems. In: Proceedings of the Seventh Text Analysis Conference (TAC) (2014)
9.
go back to reference Białecki, A., Muir, R., Ingersoll, G., Imagination, L.: Apache Lucene 4. In: SIGIR 2012 Workshop on Open Source Information Retrieval (2012) Białecki, A., Muir, R., Ingersoll, G., Imagination, L.: Apache Lucene 4. In: SIGIR 2012 Workshop on Open Source Information Retrieval (2012)
10.
go back to reference Angeli, G., Premkumar, M.J., Manning, C.D.: Leveraging linguistic structure for open domain information extraction. In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (ACL), pp. 344–354 (2015) Angeli, G., Premkumar, M.J., Manning, C.D.: Leveraging linguistic structure for open domain information extraction. In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (ACL), pp. 344–354 (2015)
11.
12.
go back to reference Prim, R.C.: Shortest connection networks and some generalizations. Bell Labs Tech. J. 36(6), 1389–1401 (1957)CrossRef Prim, R.C.: Shortest connection networks and some generalizations. Bell Labs Tech. J. 36(6), 1389–1401 (1957)CrossRef
13.
go back to reference Pakhira, M.K.: A fast k-means algorithm using cluster shifting to produce compact and separate clusters (research note). Int. J. Eng.-Trans. A: Basics 28(1), 35–43 (2015) Pakhira, M.K.: A fast k-means algorithm using cluster shifting to produce compact and separate clusters (research note). Int. J. Eng.-Trans. A: Basics 28(1), 35–43 (2015)
14.
go back to reference Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent dirichlet allocation. J. Mach. Learn. Res. 3(Jan), 993–1022 (2003)MATH Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent dirichlet allocation. J. Mach. Learn. Res. 3(Jan), 993–1022 (2003)MATH
15.
go back to reference Rao, C.R.: A review of canonical coordinates and an alternative to correspondence analysis using Hellinger distance. Qüestiió: quaderns d’estadística i investigació operativa 19(1), 23–63 (1995) Rao, C.R.: A review of canonical coordinates and an alternative to correspondence analysis using Hellinger distance. Qüestiió: quaderns d’estadística i investigació operativa 19(1), 23–63 (1995)
16.
17.
go back to reference Clauset, A., Newman, M.E., Moore, C.: Finding community structure in very large networks. Phys. Rev. E 70(6), 066111 (2004)CrossRef Clauset, A., Newman, M.E., Moore, C.: Finding community structure in very large networks. Phys. Rev. E 70(6), 066111 (2004)CrossRef
18.
go back to reference Roth, B., Barth, T., Wiegand, M., Singh, M., Klakow, D.: Effective slot filling based on shallow distant supervision methods. In: Proceedings of the Sixth Text Analysis Conference (TAC) (2013) Roth, B., Barth, T., Wiegand, M., Singh, M., Klakow, D.: Effective slot filling based on shallow distant supervision methods. In: Proceedings of the Sixth Text Analysis Conference (TAC) (2013)
19.
go back to reference Yu, D., Li, H., Cassidy, T., Li, Q., Huang, H., Chen, Z., Ji, H., Zhang, Y., Roth, D.: RPI-BLENDER TAC-KBP2013 knowledge base population system. In: Theory and Applications of Categories (2013) Yu, D., Li, H., Cassidy, T., Li, Q., Huang, H., Chen, Z., Ji, H., Zhang, Y., Roth, D.: RPI-BLENDER TAC-KBP2013 knowledge base population system. In: Theory and Applications of Categories (2013)
20.
go back to reference Angeli, G., Chaganty, A.T., Chang, A.X., Reschke, K., Tibshirani, J., Wu, J., Bastani, O., Siilats, K., Manning, C.D.: Stanford’s 2013 KBP system. In: Proceedings of the Sixth Text Analysis Conference (TAC) (2013) Angeli, G., Chaganty, A.T., Chang, A.X., Reschke, K., Tibshirani, J., Wu, J., Bastani, O., Siilats, K., Manning, C.D.: Stanford’s 2013 KBP system. In: Proceedings of the Sixth Text Analysis Conference (TAC) (2013)
21.
go back to reference Powers, D.M.W.: Evaluation: from precision, recall and F-measure to ROC, informedness, markedness and correlation. J. Mach. Learn. Technol. 2(1), 37–63 (2011)MathSciNet Powers, D.M.W.: Evaluation: from precision, recall and F-measure to ROC, informedness, markedness and correlation. J. Mach. Learn. Technol. 2(1), 37–63 (2011)MathSciNet
22.
go back to reference Griffiths, T.: Gibbs Sampling in the Generative Model of Latent Dirichlet Allocation (2002) Griffiths, T.: Gibbs Sampling in the Generative Model of Latent Dirichlet Allocation (2002)
23.
go back to reference Griffiths, T.L., Steyvers, M.: Finding scientific topics. Proc. Nat. Acad. Sci. 101(Suppl. 1), 5228–5235 (2004)CrossRef Griffiths, T.L., Steyvers, M.: Finding scientific topics. Proc. Nat. Acad. Sci. 101(Suppl. 1), 5228–5235 (2004)CrossRef
24.
go back to reference Lewis, J., Ossowski, S., Hicks, J., Errami, M., Garner, H.R.: Text similarity: an alternative way to search medline. Bioinformatics 22(18), 2298–2304 (2006)CrossRef Lewis, J., Ossowski, S., Hicks, J., Errami, M., Garner, H.R.: Text similarity: an alternative way to search medline. Bioinformatics 22(18), 2298–2304 (2006)CrossRef
Metadata
Title
Unsupervised Slot Filler Refinement via Entity Community Construction
Authors
Zengzhuang Xu
Rui Song
Bowei Zou
Yu Hong
Copyright Year
2018
DOI
https://doi.org/10.1007/978-3-319-73618-1_54

Premium Partner