Skip to main content

2018 | OriginalPaper | Buchkapitel

Unsupervised Slot Filler Refinement via Entity Community Construction

verfasst von : Zengzhuang Xu, Rui Song, Bowei Zou, Yu Hong

Erschienen in: Natural Language Processing and Chinese Computing

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Given an entity (query), slot filling aims to find and extract the values (slot fillers) of its specific attributes (slot types) from a large-scale of document collections. Most existing work of slot filling models slot fillers separately and only considers direct relations between slot fillers and query, ignoring other slot fillers in context. In this paper we propose an unsupervised slot filler refinement approach via entity community construction to filter out the incorrect fillers collaboratively. The community-based framework mainly consists of (1) filler community generated by a point-wise mutual information-based hierarchical clustering, and (2) query community constructed by a co-occurrence graph model.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
1
queryID: SF13_ENG_038, in KBP 2013 ESF data set.
 
3
per:{cause_of_death, date_of_birth, date_of_death, age, charges} and
org:{date_founded, date_dissolved, number_of_employees_members, website}.
 
5
With slot type per:parents.
 
Literatur
1.
Zurück zum Zitat Surdeanu, M.: Overview of the TAC2013 knowledge base population evaluation: English slot filling and temporal slot filling. In: Proceedings of the Sixth Text Analysis Conference (TAC) (2013) Surdeanu, M.: Overview of the TAC2013 knowledge base population evaluation: English slot filling and temporal slot filling. In: Proceedings of the Sixth Text Analysis Conference (TAC) (2013)
2.
Zurück zum Zitat Fortunato, S.: Community detection in graphs. Phys. Rep. 486(3), 75–174 (2009)MathSciNet Fortunato, S.: Community detection in graphs. Phys. Rep. 486(3), 75–174 (2009)MathSciNet
3.
Zurück zum Zitat Ji, H., Grishman, R., Dang, H.T., Griffitt, K., Ellis, J.: Overview of the TAC 2010 knowledge base population track. In: Proceedings of the Third Text Analysis Conference (TAC) (2010) Ji, H., Grishman, R., Dang, H.T., Griffitt, K., Ellis, J.: Overview of the TAC 2010 knowledge base population track. In: Proceedings of the Third Text Analysis Conference (TAC) (2010)
4.
Zurück zum Zitat Sammons, M., Song, Y., Wang, R., Kundu, G., Tsai, C.T., Upadhyay, S., Ancha, S., Mayhew, S., Roth, D.: Overview of UI-CCQ systems for event argument extraction, entity discovery and linking, and slot filler validation. In: Proceedings of the Seventh Text Analysis Conference (TAC) (2014) Sammons, M., Song, Y., Wang, R., Kundu, G., Tsai, C.T., Upadhyay, S., Ancha, S., Mayhew, S., Roth, D.: Overview of UI-CCQ systems for event argument extraction, entity discovery and linking, and slot filler validation. In: Proceedings of the Seventh Text Analysis Conference (TAC) (2014)
5.
Zurück zum Zitat Yu, D., Huang, H., Cassidy, T., Ji, H., Wang, C., Zhi, S., Han, J., Voss, C.R., Magdon-Ismail, M.: The wisdom of minority: unsupervised slot filling validation based on multi-dimensional truth-finding. In: Proceedings of the 25th International Conference on Computational Linguistics (COLING), pp. 1567–1578 (2014) Yu, D., Huang, H., Cassidy, T., Ji, H., Wang, C., Zhi, S., Han, J., Voss, C.R., Magdon-Ismail, M.: The wisdom of minority: unsupervised slot filling validation based on multi-dimensional truth-finding. In: Proceedings of the 25th International Conference on Computational Linguistics (COLING), pp. 1567–1578 (2014)
6.
Zurück zum Zitat Rajani, N.F., Viswanathan, V., Bentor, Y., Mooney, R.J.: Stacked ensembles of information extractors for knowledge-base population. In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (ACL), pp. 177–187 (2015) Rajani, N.F., Viswanathan, V., Bentor, Y., Mooney, R.J.: Stacked ensembles of information extractors for knowledge-base population. In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (ACL), pp. 177–187 (2015)
7.
Zurück zum Zitat Xu, S., Zhang, C., Niu, Z., Mei, R., Chen, J., Zhang, J., Fu, H.: Bit’s slot-filling method for TAC-KBP 2013. In: Proceedings of the Sixth Text Analysis Conference (TAC) (2013) Xu, S., Zhang, C., Niu, Z., Mei, R., Chen, J., Zhang, J., Fu, H.: Bit’s slot-filling method for TAC-KBP 2013. In: Proceedings of the Sixth Text Analysis Conference (TAC) (2013)
8.
Zurück zum Zitat Nguyen, T.H., He, Y., Pershina, M., Li, X., Grishman, R.: New York University 2014 knowledge base population systems. In: Proceedings of the Seventh Text Analysis Conference (TAC) (2014) Nguyen, T.H., He, Y., Pershina, M., Li, X., Grishman, R.: New York University 2014 knowledge base population systems. In: Proceedings of the Seventh Text Analysis Conference (TAC) (2014)
9.
Zurück zum Zitat Białecki, A., Muir, R., Ingersoll, G., Imagination, L.: Apache Lucene 4. In: SIGIR 2012 Workshop on Open Source Information Retrieval (2012) Białecki, A., Muir, R., Ingersoll, G., Imagination, L.: Apache Lucene 4. In: SIGIR 2012 Workshop on Open Source Information Retrieval (2012)
10.
Zurück zum Zitat Angeli, G., Premkumar, M.J., Manning, C.D.: Leveraging linguistic structure for open domain information extraction. In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (ACL), pp. 344–354 (2015) Angeli, G., Premkumar, M.J., Manning, C.D.: Leveraging linguistic structure for open domain information extraction. In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (ACL), pp. 344–354 (2015)
11.
12.
Zurück zum Zitat Prim, R.C.: Shortest connection networks and some generalizations. Bell Labs Tech. J. 36(6), 1389–1401 (1957)CrossRef Prim, R.C.: Shortest connection networks and some generalizations. Bell Labs Tech. J. 36(6), 1389–1401 (1957)CrossRef
13.
Zurück zum Zitat Pakhira, M.K.: A fast k-means algorithm using cluster shifting to produce compact and separate clusters (research note). Int. J. Eng.-Trans. A: Basics 28(1), 35–43 (2015) Pakhira, M.K.: A fast k-means algorithm using cluster shifting to produce compact and separate clusters (research note). Int. J. Eng.-Trans. A: Basics 28(1), 35–43 (2015)
14.
Zurück zum Zitat Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent dirichlet allocation. J. Mach. Learn. Res. 3(Jan), 993–1022 (2003)MATH Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent dirichlet allocation. J. Mach. Learn. Res. 3(Jan), 993–1022 (2003)MATH
15.
Zurück zum Zitat Rao, C.R.: A review of canonical coordinates and an alternative to correspondence analysis using Hellinger distance. Qüestiió: quaderns d’estadística i investigació operativa 19(1), 23–63 (1995) Rao, C.R.: A review of canonical coordinates and an alternative to correspondence analysis using Hellinger distance. Qüestiió: quaderns d’estadística i investigació operativa 19(1), 23–63 (1995)
16.
Zurück zum Zitat Girvan, M., Newman, M.E.: Community structure in social and biological networks. Proc. Nat. Acad. Sci. 99(12), 7821–7826 (2002)MathSciNetCrossRefMATH Girvan, M., Newman, M.E.: Community structure in social and biological networks. Proc. Nat. Acad. Sci. 99(12), 7821–7826 (2002)MathSciNetCrossRefMATH
17.
Zurück zum Zitat Clauset, A., Newman, M.E., Moore, C.: Finding community structure in very large networks. Phys. Rev. E 70(6), 066111 (2004)CrossRef Clauset, A., Newman, M.E., Moore, C.: Finding community structure in very large networks. Phys. Rev. E 70(6), 066111 (2004)CrossRef
18.
Zurück zum Zitat Roth, B., Barth, T., Wiegand, M., Singh, M., Klakow, D.: Effective slot filling based on shallow distant supervision methods. In: Proceedings of the Sixth Text Analysis Conference (TAC) (2013) Roth, B., Barth, T., Wiegand, M., Singh, M., Klakow, D.: Effective slot filling based on shallow distant supervision methods. In: Proceedings of the Sixth Text Analysis Conference (TAC) (2013)
19.
Zurück zum Zitat Yu, D., Li, H., Cassidy, T., Li, Q., Huang, H., Chen, Z., Ji, H., Zhang, Y., Roth, D.: RPI-BLENDER TAC-KBP2013 knowledge base population system. In: Theory and Applications of Categories (2013) Yu, D., Li, H., Cassidy, T., Li, Q., Huang, H., Chen, Z., Ji, H., Zhang, Y., Roth, D.: RPI-BLENDER TAC-KBP2013 knowledge base population system. In: Theory and Applications of Categories (2013)
20.
Zurück zum Zitat Angeli, G., Chaganty, A.T., Chang, A.X., Reschke, K., Tibshirani, J., Wu, J., Bastani, O., Siilats, K., Manning, C.D.: Stanford’s 2013 KBP system. In: Proceedings of the Sixth Text Analysis Conference (TAC) (2013) Angeli, G., Chaganty, A.T., Chang, A.X., Reschke, K., Tibshirani, J., Wu, J., Bastani, O., Siilats, K., Manning, C.D.: Stanford’s 2013 KBP system. In: Proceedings of the Sixth Text Analysis Conference (TAC) (2013)
21.
Zurück zum Zitat Powers, D.M.W.: Evaluation: from precision, recall and F-measure to ROC, informedness, markedness and correlation. J. Mach. Learn. Technol. 2(1), 37–63 (2011)MathSciNet Powers, D.M.W.: Evaluation: from precision, recall and F-measure to ROC, informedness, markedness and correlation. J. Mach. Learn. Technol. 2(1), 37–63 (2011)MathSciNet
22.
Zurück zum Zitat Griffiths, T.: Gibbs Sampling in the Generative Model of Latent Dirichlet Allocation (2002) Griffiths, T.: Gibbs Sampling in the Generative Model of Latent Dirichlet Allocation (2002)
23.
Zurück zum Zitat Griffiths, T.L., Steyvers, M.: Finding scientific topics. Proc. Nat. Acad. Sci. 101(Suppl. 1), 5228–5235 (2004)CrossRef Griffiths, T.L., Steyvers, M.: Finding scientific topics. Proc. Nat. Acad. Sci. 101(Suppl. 1), 5228–5235 (2004)CrossRef
24.
Zurück zum Zitat Lewis, J., Ossowski, S., Hicks, J., Errami, M., Garner, H.R.: Text similarity: an alternative way to search medline. Bioinformatics 22(18), 2298–2304 (2006)CrossRef Lewis, J., Ossowski, S., Hicks, J., Errami, M., Garner, H.R.: Text similarity: an alternative way to search medline. Bioinformatics 22(18), 2298–2304 (2006)CrossRef
Metadaten
Titel
Unsupervised Slot Filler Refinement via Entity Community Construction
verfasst von
Zengzhuang Xu
Rui Song
Bowei Zou
Yu Hong
Copyright-Jahr
2018
DOI
https://doi.org/10.1007/978-3-319-73618-1_54