Skip to main content
Erschienen in: The VLDB Journal 6/2015

01.12.2015 | Regular Paper

Fast rule mining in ontological knowledge bases with AMIE\(+\)

verfasst von: Luis Galárraga, Christina Teflioudi, Katja Hose, Fabian M. Suchanek

Erschienen in: The VLDB Journal | Ausgabe 6/2015

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Recent advances in information extraction have led to huge knowledge bases (KBs), which capture knowledge in a machine-readable format. Inductive logic programming (ILP) can be used to mine logical rules from these KBs, such as “If two persons are married, then they (usually) live in the same city.” While ILP is a mature field, mining logical rules from KBs is difficult, because KBs make an open-world assumption. This means that absent information cannot be taken as counterexamples. Our approach AMIE (Galárraga et al. in WWW, 2013) has shown how rules can be mined effectively from KBs even in the absence of counterexamples. In this paper, we show how this approach can be optimized to mine even larger KBs with more than 12M statements. Extensive experiments show how our new approach, AMIE\(+\), extends to areas of mining that were previously beyond reach.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
2
RDF schema has only positive rules and no disjointness constraints or similar concepts.
 
4
In these cases, the pruning precision in Table 6 was computed by comparing the output of AMIE\(+\) to the output of AMIE on the mined subset.
 
5
We used the YAGO3 [28] types because the type signatures in older versions of YAGO were too general. For example, the relation livesIn is defined from person to location in YAGO2s, whereas in YAGO3 it is defined from person to city.
 
Literatur
1.
Zurück zum Zitat Abedjan Z., Naumann F.: Synonym analysis for predicate expansion. In: ESWC (2013) Abedjan Z., Naumann F.: Synonym analysis for predicate expansion. In: ESWC (2013)
2.
Zurück zum Zitat Abedjan, Z., Lorey, J., Naumann, F.: Reconciling ontologies and the web of data. In: CIKM (2012) Abedjan, Z., Lorey, J., Naumann, F.: Reconciling ontologies and the web of data. In: CIKM (2012)
3.
Zurück zum Zitat Adé, H., Raedt, L., Bruynooghe, M.: Declarative bias for specific-to-general ilp systems. Mach. Learn. 20, 119–154 (1995) Adé, H., Raedt, L., Bruynooghe, M.: Declarative bias for specific-to-general ilp systems. Mach. Learn. 20, 119–154 (1995)
4.
Zurück zum Zitat Agrawal, R., Imieliński, T., Swami, A.: Mining association rules between sets of items in large databases. In: SIGMOD (1993) Agrawal, R., Imieliński, T., Swami, A.: Mining association rules between sets of items in large databases. In: SIGMOD (1993)
5.
Zurück zum Zitat Agrawal, R., Mannila, H., Srikant, R., Toivonen, H., Verkamo, A.I.: Fast discovery of association rules. In: Advances in Knowledge Discovery and Data Mining (1996) Agrawal, R., Mannila, H., Srikant, R., Toivonen, H., Verkamo, A.I.: Fast discovery of association rules. In: Advances in Knowledge Discovery and Data Mining (1996)
6.
Zurück zum Zitat Auer, S., Bizer, C., Kobilarov, G., Lehmann, J., Cyganiak, R., Ives, Z.G.: DBpedia: a nucleus for a Web of open data. In: ISWC (2007) Auer, S., Bizer, C., Kobilarov, G., Lehmann, J., Cyganiak, R., Ives, Z.G.: DBpedia: a nucleus for a Web of open data. In: ISWC (2007)
7.
Zurück zum Zitat Carlson, A., Betteridge, J., Kisiel, B., Settles, B., Jr., E.R.H., Mitchell, T.M.: Toward an architecture for never-ending language learning. In: AAAI (2010) Carlson, A., Betteridge, J., Kisiel, B., Settles, B., Jr., E.R.H., Mitchell, T.M.: Toward an architecture for never-ending language learning. In: AAAI (2010)
8.
Zurück zum Zitat Chasseur, C., Patel, J.M.: Design and evaluation of storage organizations for read-optimized main memory databases. Proc. VLDB Endow. 6(13), 1474–1485 (2013)CrossRef Chasseur, C., Patel, J.M.: Design and evaluation of storage organizations for read-optimized main memory databases. Proc. VLDB Endow. 6(13), 1474–1485 (2013)CrossRef
9.
Zurück zum Zitat Chi, Y., Muntz, R.R., Nijssen, S., Kok, J.N.: Frequent subtree mining: an overview. Fundam. Inf. 66(1–2), 26–37 (2004)MathSciNet Chi, Y., Muntz, R.R., Nijssen, S., Kok, J.N.: Frequent subtree mining: an overview. Fundam. Inf. 66(1–2), 26–37 (2004)MathSciNet
10.
Zurück zum Zitat Cimiano, P., Hotho, A., Staab, S.: Comparing conceptual, divisive and agglomerative clustering for learning taxonomies from text. In: ECAI (2004) Cimiano, P., Hotho, A., Staab, S.: Comparing conceptual, divisive and agglomerative clustering for learning taxonomies from text. In: ECAI (2004)
11.
Zurück zum Zitat d’Amato, C., Bryl, V., Serafini, L.: Data-driven logical reasoning. In: URSW (2012) d’Amato, C., Bryl, V., Serafini, L.: Data-driven logical reasoning. In: URSW (2012)
12.
Zurück zum Zitat d’Amato, C., Fanizzi, N., Esposito, F.: Inductive learning for the semantic web: what does it buy? Semant. Web 1(1,2), 53–59 (2010) d’Amato, C., Fanizzi, N., Esposito, F.: Inductive learning for the semantic web: what does it buy? Semant. Web 1(1,2), 53–59 (2010)
13.
Zurück zum Zitat David, J., Guillet, F., Briand, H.: Association rule ontology matching approach. Int. J. Semant. Web Inf. Syst. 3(2), 27–49 (2007)CrossRef David, J., Guillet, F., Briand, H.: Association rule ontology matching approach. Int. J. Semant. Web Inf. Syst. 3(2), 27–49 (2007)CrossRef
14.
Zurück zum Zitat Dehaspe, L., Toironen, H.: Discovery of relational association rules. In: Relational Data Mining. Springer, New York (2000) Dehaspe, L., Toironen, H.: Discovery of relational association rules. In: Relational Data Mining. Springer, New York (2000)
15.
Zurück zum Zitat Dehaspe, L., Toivonen, H.: Discovery of frequent DATALOG patterns. Data Min. Knowl. Discov. 3(1), 7–36 (1999)CrossRef Dehaspe, L., Toivonen, H.: Discovery of frequent DATALOG patterns. Data Min. Knowl. Discov. 3(1), 7–36 (1999)CrossRef
16.
Zurück zum Zitat Dong, X., Gabrilovich, E., Heitz, G., Horn, W., Lao, N., Murphy, K., Strohmann, T., Sun, S., Zhang, W.: Knowledge vault: a web-scale approach to probabilistic knowledge fusion. In: KDD (2014) Dong, X., Gabrilovich, E., Heitz, G., Horn, W., Lao, N., Murphy, K., Strohmann, T., Sun, S., Zhang, W.: Knowledge vault: a web-scale approach to probabilistic knowledge fusion. In: KDD (2014)
17.
Zurück zum Zitat Galárraga, L.A., Teflioudi, C., Hose, K., Suchanek, F.M.: AMIE: association rule mining under incomplete evidence in ontological knowledge bases. In: WWW (2013) Galárraga, L.A., Teflioudi, C., Hose, K., Suchanek, F.M.: AMIE: association rule mining under incomplete evidence in ontological knowledge bases. In: WWW (2013)
18.
Zurück zum Zitat Goethals, B., Van den Bussche, J.: Relational association rules: getting WARMER. In: Pattern Detection and Discovery, vol. 2447. Springer, Berlin (2002) Goethals, B., Van den Bussche, J.: Relational association rules: getting WARMER. In: Pattern Detection and Discovery, vol. 2447. Springer, Berlin (2002)
19.
Zurück zum Zitat Grice, P.: Logic and conversation. J. Syntax Semant. 3, 41–58 (1975) Grice, P.: Logic and conversation. J. Syntax Semant. 3, 41–58 (1975)
20.
Zurück zum Zitat Grimnes, G.A., Edwards, P., Preece, A.D.: Learning meta-descriptions of the FOAF network. In: ISWC (2004) Grimnes, G.A., Edwards, P., Preece, A.D.: Learning meta-descriptions of the FOAF network. In: ISWC (2004)
21.
Zurück zum Zitat Hellmann, S., Lehmann, J., Auer, S.: Learning of OWL class descriptions on very large knowledge bases. Int. J. Semant. Web Inf. Syst. 5(2), 25–48 (2009)CrossRef Hellmann, S., Lehmann, J., Auer, S.: Learning of OWL class descriptions on very large knowledge bases. Int. J. Semant. Web Inf. Syst. 5(2), 25–48 (2009)CrossRef
22.
Zurück zum Zitat Huang, Y., Tresp, V., Bundschus, M., Rettinger, A., Kriegel, H.P.: Multivariate prediction for learning on the semantic web. In: ILP (2011) Huang, Y., Tresp, V., Bundschus, M., Rettinger, A., Kriegel, H.P.: Multivariate prediction for learning on the semantic web. In: ILP (2011)
23.
Zurück zum Zitat Jozefowska, J., Lawrynowicz, A., Lukaszewski, T.: The role of semantics in mining frequent patterns from knowledge bases in description logics with rules. Theory Pract. Log. Program. 10(3), 251–289 (2010)MATHMathSciNetCrossRef Jozefowska, J., Lawrynowicz, A., Lukaszewski, T.: The role of semantics in mining frequent patterns from knowledge bases in description logics with rules. Theory Pract. Log. Program. 10(3), 251–289 (2010)MATHMathSciNetCrossRef
24.
Zurück zum Zitat Kuramochi, M., Karypis, G.: Frequent subgraph discovery. In: ICDM. IEEE Computer Society (2001) Kuramochi, M., Karypis, G.: Frequent subgraph discovery. In: ICDM. IEEE Computer Society (2001)
25.
Zurück zum Zitat Lehmann, J.: DL-learner: learning concepts In Description logics. J. Mach. Learn. Res. (JMLR) 10, 2639–2642 (2009)MATH Lehmann, J.: DL-learner: learning concepts In Description logics. J. Mach. Learn. Res. (JMLR) 10, 2639–2642 (2009)MATH
26.
Zurück zum Zitat Lisi, F.A.: Building rules on top of ontologies for the semantic web with inductive logic programming. TPLP 8(3), 271–300 (2008)MATHMathSciNet Lisi, F.A.: Building rules on top of ontologies for the semantic web with inductive logic programming. TPLP 8(3), 271–300 (2008)MATHMathSciNet
27.
Zurück zum Zitat Maedche, A., Zacharias, V.: Clustering ontology-based metadata in the semantic web. In: PKDD (2002) Maedche, A., Zacharias, V.: Clustering ontology-based metadata in the semantic web. In: PKDD (2002)
28.
Zurück zum Zitat Mahdisoltani, F., Biega, J., Suchanek, F.M.: Yago3: a knowledge base from multilingual wikipedias. In: CIDR (2015) Mahdisoltani, F., Biega, J., Suchanek, F.M.: Yago3: a knowledge base from multilingual wikipedias. In: CIDR (2015)
29.
Zurück zum Zitat Mamer, T., Bryant, C., McCall, J.: L-modified ilp evaluation functions for positive-only biological grammar learning. In: Zelezny, F., Lavrac, N. (eds.) Inductive logic programming, No. 5194 in LNAI. Springer, Berlin (2008) Mamer, T., Bryant, C., McCall, J.: L-modified ilp evaluation functions for positive-only biological grammar learning. In: Zelezny, F., Lavrac, N. (eds.) Inductive logic programming, No. 5194 in LNAI. Springer, Berlin (2008)
30.
Zurück zum Zitat McGuinness, D.L., Fikes, R., Rice, J., Wilder, S.: An environment for merging and testing large ontologies. In: KR (2000) McGuinness, D.L., Fikes, R., Rice, J., Wilder, S.: An environment for merging and testing large ontologies. In: KR (2000)
31.
Zurück zum Zitat Muggleton, S.: Inverse entailment and progol. New Gener. Comput. 13(3&4), 245–286 (1995)CrossRef Muggleton, S.: Inverse entailment and progol. New Gener. Comput. 13(3&4), 245–286 (1995)CrossRef
32.
Zurück zum Zitat Muggleton, S.: Learning from positive data. In: ILP (1997) Muggleton, S.: Learning from positive data. In: ILP (1997)
33.
Zurück zum Zitat Nakashole, N., Sozio, M., Suchanek, F., Theobald, M.: Query-time reasoning in uncertain rdf knowledge bases with soft and hard rules. In: Workshop on Very Large Data Search (VLDS) at VLDB (2012) Nakashole, N., Sozio, M., Suchanek, F., Theobald, M.: Query-time reasoning in uncertain rdf knowledge bases with soft and hard rules. In: Workshop on Very Large Data Search (VLDS) at VLDB (2012)
34.
Zurück zum Zitat Nebot, V., Berlanga, R.: Finding association rules in semantic web data. Knowl Based Syst. 25(1), 51–62 (2012)CrossRef Nebot, V., Berlanga, R.: Finding association rules in semantic web data. Knowl Based Syst. 25(1), 51–62 (2012)CrossRef
35.
Zurück zum Zitat Nickel, M., Tresp, V., Kriegel, H.P.: Factorizing yago: scalable machine learning for linked data. In: WWW (2012) Nickel, M., Tresp, V., Kriegel, H.P.: Factorizing yago: scalable machine learning for linked data. In: WWW (2012)
36.
Zurück zum Zitat Noy, N.F., Musen, M.A.: PROMPT: algorithm and tool for automated ontology merging and alignment. In: AAAI/IAAI. AAAI Press (2000) Noy, N.F., Musen, M.A.: PROMPT: algorithm and tool for automated ontology merging and alignment. In: AAAI/IAAI. AAAI Press (2000)
37.
Zurück zum Zitat Richardson, M., Domingos, P.: Markov logic networks. Mach. Learn. 62(1–2), 107–136 (2006)CrossRef Richardson, M., Domingos, P.: Markov logic networks. Mach. Learn. 62(1–2), 107–136 (2006)CrossRef
38.
Zurück zum Zitat Schoenmackers, S., Etzioni, O., Weld, D.S., Davis, J.: Learning first-order Horn clauses from web text. In: EMNLP (2010) Schoenmackers, S., Etzioni, O., Weld, D.S., Davis, J.: Learning first-order Horn clauses from web text. In: EMNLP (2010)
39.
Zurück zum Zitat Suchanek, F.M., Abiteboul, S., Senellart, P.: PARIS: probabilistic alignment of relations, instances, and schema. PVLDB 5(3), 157–168 (2011) Suchanek, F.M., Abiteboul, S., Senellart, P.: PARIS: probabilistic alignment of relations, instances, and schema. PVLDB 5(3), 157–168 (2011)
40.
Zurück zum Zitat Suchanek, F.M., Kasneci, G., Weikum, G.: Yago: a core of semantic knowledge. In: WWW (2007) Suchanek, F.M., Kasneci, G., Weikum, G.: Yago: a core of semantic knowledge. In: WWW (2007)
41.
Zurück zum Zitat Tan, P.N., Kumar, V., Srivastava, J.: Selecting the right interestingness measure for association patterns. In: KDD (2002) Tan, P.N., Kumar, V., Srivastava, J.: Selecting the right interestingness measure for association patterns. In: KDD (2002)
43.
Zurück zum Zitat Völker, J., Niepert, M.: Statistical schema induction. In: ESWC (2011) Völker, J., Niepert, M.: Statistical schema induction. In: ESWC (2011)
45.
Zurück zum Zitat Zeng, Q., Patel, J., Page, D.: QuickFOIL: scalable inductive logic programming. In: VLDB (2014) Zeng, Q., Patel, J., Page, D.: QuickFOIL: scalable inductive logic programming. In: VLDB (2014)
Metadaten
Titel
Fast rule mining in ontological knowledge bases with AMIE
verfasst von
Luis Galárraga
Christina Teflioudi
Katja Hose
Fabian M. Suchanek
Publikationsdatum
01.12.2015
Verlag
Springer Berlin Heidelberg
Erschienen in
The VLDB Journal / Ausgabe 6/2015
Print ISSN: 1066-8888
Elektronische ISSN: 0949-877X
DOI
https://doi.org/10.1007/s00778-015-0394-1

Weitere Artikel der Ausgabe 6/2015

The VLDB Journal 6/2015 Zur Ausgabe

Premium Partner