Published in: Journal of Combinatorial Optimization, Issue 1/2019

29 December 2017

Optimizing model parameter for entity summarization across knowledge graphs

Authors: Jihong Yan, Chen Xu, Na Li, Ming Gao, Aoying Zhou


Abstract

Knowledge graphs, which belong to the category of semantic networks, are considered a new method of knowledge representation for health care data. They establish a semantic explanation model for human perception and health care information processing. Each knowledge graph is composed of a massive number of entities and relationships. However, searching for and visualizing the entities and attributes that interest users is arduous, since an entity has many attributes across different knowledge graphs. This naturally raises the problem of how to summarize an entity based on multiple knowledge graphs. We propose a three-stage algorithm to solve the problem of entity summarization across knowledge graphs, consisting of candidate generation, knowledge graph linkage, and entity summarization. We propose an unsupervised framework that links different knowledge graphs based on the semantic and structural information of entities. To further reduce the computational cost, we employ a word embedding technique to find semantically similar entities and filter out some pairs of unmatched entities. Finally, we model entity summarization as a personalized ranking problem in a knowledge graph. We conduct a set of experiments to evaluate our proposed method on four real datasets: historical user query data, two English knowledge graphs (YAGO and DBpedia), and an English corpus. Experimental results demonstrate the effectiveness of our proposed method in comparison with baselines.
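The embedding-based filtering step mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not specify the similarity measure, the threshold, or any data layout, so everything below is an assumption made for illustration: cosine similarity as the measure, a threshold of 0.8, the hypothetical helper names `cosine` and `filter_candidate_pairs`, and hand-made three-dimensional toy vectors standing in for learned word embeddings.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def filter_candidate_pairs(emb_a, emb_b, threshold=0.8):
    """Keep only cross-graph entity pairs whose embeddings are similar enough.

    emb_a, emb_b: dicts mapping entity name -> embedding vector (assumed layout).
    threshold: illustrative cut-off, not a value taken from the paper.
    Returns (name_a, name_b, score) tuples sorted by descending score.
    """
    pairs = []
    for name_a, vec_a in emb_a.items():
        for name_b, vec_b in emb_b.items():
            score = cosine(vec_a, vec_b)
            if score >= threshold:
                pairs.append((name_a, name_b, score))
    pairs.sort(key=lambda p: p[2], reverse=True)
    return pairs


# Toy embeddings (made up for this example, not real data).
graph_a = {"Albert_Einstein": [0.9, 0.1, 0.0], "Paris": [0.0, 0.2, 0.9]}
graph_b = {"Einstein": [0.8, 0.2, 0.1], "Paris_France": [0.1, 0.1, 0.95]}

candidates = filter_candidate_pairs(graph_a, graph_b, threshold=0.8)
for name_a, name_b, score in candidates:
    print(f"{name_a} <-> {name_b}: {score:.3f}")
```

In this toy setting, only the two plausible matches survive the filter, so a later and more expensive linkage step would compare two pairs instead of four. The exhaustive double loop is kept for clarity; at the scale of real knowledge graphs an approximate nearest-neighbor index would likely be needed.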


Metadata
Title
Optimizing model parameter for entity summarization across knowledge graphs
Authors
Jihong Yan
Chen Xu
Na Li
Ming Gao
Aoying Zhou
Publication date
29 December 2017
Publisher
Springer US
Published in
Journal of Combinatorial Optimization / Issue 1/2019
Print ISSN: 1382-6905
Electronic ISSN: 1573-2886
DOI
https://doi.org/10.1007/s10878-017-0225-y
