Skip to main content
Erschienen in: Journal of Intelligent Information Systems 1/2017

24.09.2015

Classifying and querying very large taxonomies with bit-vector encoding

verfasst von: Hassan Aït-Kaci, Samir Amir

Erschienen in: Journal of Intelligent Information Systems | Ausgabe 1/2017

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In this article, we address the question of how efficiently Semantic Web (SW) reasoners perform in processing (classifying and querying) taxonomies of enormous size and whether it is possible to improve on existing implementations. We use a bit-vector encoding technique to implement taxonomic concept classification and Boolean-query answering. We describe the technique we have used, which achieves high performance, and discuss implementation issues. We compare the performance of our implementation with those of the best existing SW reasoning systems over several very large taxonomies under the exact same conditions for so-called TBox reasoning. The results show that our system is among the best for concept classification and several orders-of-magnitude more efficient in terms of response time for query answering. We present these results in detail and comment them. We also discuss pragmatic issues such as cycle detection and decoding.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Anhänge
Nur mit Berechtigung zugänglich
Fußnoten
1
In Semantic Web lingo, a Knowledge Base (KB) is defined as a formal ontology consisting of two parts (or “boxes”): (1) a Terminological Box (abbreviated as TBox); and, (2) an Assertional Box (abbreviated as ABox). The TBox contains the formal axioms that define the structure and semantic properties of the actual instance data; which instance data constitute the ABox. In Database lingo, the TBox corresponds to the schema and the ABox to the actual data.
 
11
Description Logics in the \(\mathcal {EL}\)-family are weaker versions that provide existential roles (∃r.C) but no universal roles (∀r.C).
 
12
To the best of our knowledge, this is the latest best bound as of 2011. However, these algorithms are not implementable due to prohibitive size of constants. For more recent work on parallelizing Strassen’s algorithm, see (Ballard et al. 2014). This, however, requires special harware (GPGPUs).
 
13
op. cit., pages 125–126.
 
14
See Section 4.2.
 
19
See Section 3.3 for a discussion concernining this point.
 
20
We use “ &” to denote “and,” and “ |” to denote “or.”
 
21
We implemented such a facility—see Appendix.
 
23
The ordering on bit-vector codes is simply defined as c 1c 2 iff c 1=c 1 & c 2.
 
24
In the same manner as we have noticed that SnoRocket does.
 
27
Recall that a bit vector is written with its lowest bit to the right.
 
28
In what follows, we shall use the “dot” notation of object-oriented methods to denote all functions or operations on codes.
 
29
Note that ij implies necessarily that i<j (by Condition (1) and since n<m).
 
30
Actually, the java.util.TreeSet does maintain a doubly-linked list for its elements in order to ensure its two ordered iterators (ascending and descending). But this structure is not made public and one cannot splice in new elements from a given found element. But it is a simple matter to modify the source code of java.util.TreeSet.java and adapt it to what is needed.
 
31
In fact, they see that only as a possible a posteriori optimization, but one that would cause their optimal spanning-tree finding algorithm to be incorrect if applied incrementally while it is executed.
 
Literatur
Zurück zum Zitat Aït-Kaci, H., & Amir, S. (2013). Classifying and querying very large taxonomies—a comparative study to the best of our knowledge. \(\mathcal {CEDAR}\) Technical Report Number 2, \(\mathcal {CEDAR}\) Project, LIRIS, Département d’Informatique, Université Claude Bernard Lyon 1, Villeurbanne, France. [Available online http://cedar.liris.cnrs.fr/papers/ctr2.pdf]. Aït-Kaci, H., & Amir, S. (2013). Classifying and querying very large taxonomies—a comparative study to the best of our knowledge. \(\mathcal {CEDAR}\) Technical Report Number 2, \(\mathcal {CEDAR}\) Project, LIRIS, Département d’Informatique, Université Claude Bernard Lyon 1, Villeurbanne, France. [Available online http://​cedar.​liris.​cnrs.​fr/​papers/​ctr2.​pdf].
Zurück zum Zitat Amir, S., & Aït-Kaci, H. (2013). CEDAR: a fast taxonomic reasoner based on lattice operations. In Proceedings of the ISWC 2013 Posters & Demonstrations Track, Sydney, Australia (pp. 9–12). Amir, S., & Aït-Kaci, H. (2013). CEDAR: a fast taxonomic reasoner based on lattice operations. In Proceedings of the ISWC 2013 Posters & Demonstrations Track, Sydney, Australia (pp. 9–12).
Zurück zum Zitat Amir, S., & Aït-Kaci, H. (2014a). CEDAR: efficient reasoning for the semantic web. In Tenth International Conference on Signal-Image Technology and Internet-Based Systems, SITIS 2014 (pp. 157–163), Marrakech, Morocco. Amir, S., & Aït-Kaci, H. (2014a). CEDAR: efficient reasoning for the semantic web. In Tenth International Conference on Signal-Image Technology and Internet-Based Systems, SITIS 2014 (pp. 157–163), Marrakech, Morocco.
Zurück zum Zitat Amir, S., & Aït-Kaci, H. (2014b). Design and implementation of an efficient semantic web reasoner. CEDAR Technical Report Number 12, \(\mathcal {CEDAR}\) Project, LIRIS, Département d’Informatique, Université Claude Bernard Lyon Vol. 1, Villeurbanne, France. [Available online http://cedar.liris.cnrs.fr/papers/ctr12.pdf]. Amir, S., & Aït-Kaci, H. (2014b). Design and implementation of an efficient semantic web reasoner. CEDAR Technical Report Number 12, \(\mathcal {CEDAR}\) Project, LIRIS, Département d’Informatique, Université Claude Bernard Lyon Vol. 1, Villeurbanne, France. [Available online http://​cedar.​liris.​cnrs.​fr/​papers/​ctr12.​pdf].
Zurück zum Zitat Baader, F., Brandt, S., & Lutz, C. (2005). Pushing the \(\mathcal {EL}\) envelope. In L. P. Kaelbling, & A. Saffiotti (Eds.), Proceedings of the 19th International Joint Conference on Artificial Intelligence (pp. 364–369). Edinburgh: IJCAI’05, Morgan Kaufmann Publishers. [Available online http://www.ijcai.org/papers/0372.pdf] Baader, F., Brandt, S., & Lutz, C. (2005). Pushing the \(\mathcal {EL}\) envelope. In L. P. Kaelbling, & A. Saffiotti (Eds.), Proceedings of the 19th International Joint Conference on Artificial Intelligence (pp. 364–369). Edinburgh: IJCAI’05, Morgan Kaufmann Publishers. [Available online http://​www.​ijcai.​org/​papers/​0372.​pdf]
Zurück zum Zitat Kazakov, Y. (2009). Consequence-driven reasoning for horn \(\mathcal {SHIQ}\) ontologies. In C. Boutilier (Ed.), Proceedings of the 21st international conference on artificial intelligence (pp. 2040–2045). Pasadena: IJCAI’09, Association for the Advancement of Artificial Intelligence. [Available online http://ijcai.org/papers09/Papers/IJCAI09-336.pdf]. Kazakov, Y. (2009). Consequence-driven reasoning for horn \(\mathcal {SHIQ}\) ontologies. In C. Boutilier (Ed.), Proceedings of the 21st international conference on artificial intelligence (pp. 2040–2045). Pasadena: IJCAI’09, Association for the Advancement of Artificial Intelligence. [Available online http://​ijcai.​org/​papers09/​Papers/​IJCAI09-336.​pdf].
Zurück zum Zitat Kazakov, Y., Krötzsch, M., & Simančík, F. (2011). Unchain my \(\mathcal {EL}\) reasoner. In R. Rosati, S. Rudolph, & M. Zakharyaschev (Eds.), Proceedings of the 24th international workshop on description logics. Barcelona: DL’11, CEUR Workshop Proceedings. [Available online http://ceur-ws.org/Vol-745/paper_54.pdf]. Kazakov, Y., Krötzsch, M., & Simančík, F. (2011). Unchain my \(\mathcal {EL}\) reasoner. In R. Rosati, S. Rudolph, & M. Zakharyaschev (Eds.), Proceedings of the 24th international workshop on description logics. Barcelona: DL’11, CEUR Workshop Proceedings. [Available online http://​ceur-ws.​org/​Vol-745/​paper_​54.​pdf].
Zurück zum Zitat Lawley, M. J., & Bousquet, C. (2010). Fast classification in Protegé́: Snorocket as an OWL 2 EL reasoner. In T. Meyer, M. A. Orgun, & K. Taylor (Eds.), Proceedings of the 2nd Australasian ontology workshop: Advances in ontologies. (pp. 45–50). Adelaide: AOW’10, ACS. [Available online http://krr.meraka.org.za/~aow2010/Lawley-etal.pdf]. Lawley, M. J., & Bousquet, C. (2010). Fast classification in Protegé́: Snorocket as an OWL 2 EL reasoner. In T. Meyer, M. A. Orgun, & K. Taylor (Eds.), Proceedings of the 2nd Australasian ontology workshop: Advances in ontologies. (pp. 45–50). Adelaide: AOW’10, ACS. [Available online http://​krr.​meraka.​org.​za/​~aow2010/​Lawley-etal.​pdf].
Metadaten
Titel
Classifying and querying very large taxonomies with bit-vector encoding
verfasst von
Hassan Aït-Kaci
Samir Amir
Publikationsdatum
24.09.2015
Verlag
Springer US
Erschienen in
Journal of Intelligent Information Systems / Ausgabe 1/2017
Print ISSN: 0925-9902
Elektronische ISSN: 1573-7675
DOI
https://doi.org/10.1007/s10844-015-0383-2

Weitere Artikel der Ausgabe 1/2017

Journal of Intelligent Information Systems 1/2017 Zur Ausgabe

Premium Partner