Skip to main content
Top

2014 | OriginalPaper | Chapter

Utilizing Coverage Lists as a Pruning Mechanism for Concept Discovery

Authors : Alev Mutlu, Abdullah Dogan, Pinar Karagoz

Published in: Information Sciences and Systems 2014

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Inductive logic programming (ILP)-based concept discovery systems lack computational efficiency due to the evaluation of the large search spaces they build. One way to tackle this issue is employing pruning mechanisms. In this work, we propose a two-phase pruning mechanism for concept discovery systems that employ an Apriori-like refinement operator and evaluate the goodness of the concept descriptors based on their support value. The first step, which is novel in this work, is computationally inexpensive and prunes the search space based on the coverages of the concept descriptors. The second step employs a widely employed pruning mechanism based on the support value of the concept descriptors. The experimental results show that the first step leaves a search space reduced by 4–22 % to be evaluated by the second step, which is more costly.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Footnotes
1
Dunur is a relatedness of two people due to marriage such that A is dunur of B if a child of A is married to a child of B.
 
2
It is a symmetric relation.
 
3
Elti is a relatedness of two people due to marriage such that A is elti of B if A’s husband is brother of B’s husband.
 
4
See footnote 2.
 
Literature
1.
go back to reference S. Dzeroski, Multi-relational data mining: an introduction. SIGKDD Explor. 5(1), 1–16 (2003)CrossRef S. Dzeroski, Multi-relational data mining: an introduction. SIGKDD Explor. 5(1), 1–16 (2003)CrossRef
2.
go back to reference S. Muggleton, Inductive Logic Programming, The MIT Encyclopedia of the Cognitive Sciences (MITECS) (MIT Press, Cambridge, 1999) S. Muggleton, Inductive Logic Programming, The MIT Encyclopedia of the Cognitive Sciences (MITECS) (MIT Press, Cambridge, 1999)
3.
go back to reference H. Blockeel, L. Dehaspe, B. Demoen, G. Janssens, H. Vandecasteele, Improving the efficiency of inductive logic programming through the use of query packs. J. Artif. Intell. Res. 16, 135–166 (2002)MATH H. Blockeel, L. Dehaspe, B. Demoen, G. Janssens, H. Vandecasteele, Improving the efficiency of inductive logic programming through the use of query packs. J. Artif. Intell. Res. 16, 135–166 (2002)MATH
4.
go back to reference A. Mutlu, P. Senkul, Y. Kavurucu, Improving the scalability of ILP-based multi-relational concept discovery system through parallelization. Knowl. Based Syst. 24, 352–368 (2012)CrossRef A. Mutlu, P. Senkul, Y. Kavurucu, Improving the scalability of ILP-based multi-relational concept discovery system through parallelization. Knowl. Based Syst. 24, 352–368 (2012)CrossRef
5.
go back to reference V.S. Costa, A. Srinivasan, R. Camacho, H. Blockeel, B. Demoen, G. Janssens, J. Struyf, H. Vandecasteele, W.V. Laer, Query transformations for improving the efficiency of ILP systems. J. Mach. Learn. Res. 4, 465–491 (2003) V.S. Costa, A. Srinivasan, R. Camacho, H. Blockeel, B. Demoen, G. Janssens, J. Struyf, H. Vandecasteele, W.V. Laer, Query transformations for improving the efficiency of ILP systems. J. Mach. Learn. Res. 4, 465–491 (2003)
6.
go back to reference A. Srinivasan, A study of two sampling methods for analyzing large datasets with ILP. Data Min. Knowl. Disc. 3(1), 95–123 (1999)CrossRef A. Srinivasan, A study of two sampling methods for analyzing large datasets with ILP. Data Min. Knowl. Disc. 3(1), 95–123 (1999)CrossRef
7.
go back to reference Y. Kavurucu, P. Senkul, I.H. Toroslu, Concept discovery on relational databases: new techniques for search space pruning and rule quality improvement. Knowl. Based Syst. 23(8), 743–756 (2010)CrossRef Y. Kavurucu, P. Senkul, I.H. Toroslu, Concept discovery on relational databases: new techniques for search space pruning and rule quality improvement. Knowl. Based Syst. 23(8), 743–756 (2010)CrossRef
8.
go back to reference B. Tausend, Representing biases for inductive logic programming, in ECML. Volume 784 of Lecture Notes in Computer Science, ed. by F. Bergadano, L.D. Raedt (Springer, Heidelberg, 1994), pp. 427–430 B. Tausend, Representing biases for inductive logic programming, in ECML. Volume 784 of Lecture Notes in Computer Science, ed. by F. Bergadano, L.D. Raedt (Springer, Heidelberg, 1994), pp. 427–430
9.
go back to reference L. Dehaspe, L.D. Raedt, Mining association rules in multiple relations, in ILP. Volume 1297 of Lecture Notes in Computer Science, ed. by N. Lavrac, S. Dzeroski (Springer, Heidelberg, 1997), pp. 125–132 L. Dehaspe, L.D. Raedt, Mining association rules in multiple relations, in ILP. Volume 1297 of Lecture Notes in Computer Science, ed. by N. Lavrac, S. Dzeroski (Springer, Heidelberg, 1997), pp. 125–132
10.
go back to reference A. Mutlu, P. Senkul, Improving hash table hit ratio of an ilp-based concept discovery system with memoization capabilities, in ISCIS, ed. by E. Gelenbe, R. Lent (Springer, Heidelberg, 2012), pp. 261–269 A. Mutlu, P. Senkul, Improving hash table hit ratio of an ilp-based concept discovery system with memoization capabilities, in ISCIS, ed. by E. Gelenbe, R. Lent (Springer, Heidelberg, 2012), pp. 261–269
11.
go back to reference D. Maier, The Theory of Relational Databases (Computer Science Press, Rockville, 1983)MATH D. Maier, The Theory of Relational Databases (Computer Science Press, Rockville, 1983)MATH
12.
go back to reference A. Srinivasan, S. Muggleton, R. King, M. Sternberg, Theories for mutagenicity: a study of first-order and feature based induction, Technical report, PRG-TR-8-95 Oxford University Computing Laboratory (1995) A. Srinivasan, S. Muggleton, R. King, M. Sternberg, Theories for mutagenicity: a study of first-order and feature based induction, Technical report, PRG-TR-8-95 Oxford University Computing Laboratory (1995)
13.
go back to reference A. Srinivasan, R.D. King, S.H. Muggleton, M. Sternberg, The predictive toxicology evaluation challenge, in IJCAI-97: Proceedings of the 15th International Joint Conference on Artificial Intelligence, pp. 1–6 (1997) A. Srinivasan, R.D. King, S.H. Muggleton, M. Sternberg, The predictive toxicology evaluation challenge, in IJCAI-97: Proceedings of the 15th International Joint Conference on Artificial Intelligence, pp. 1–6 (1997)
14.
go back to reference M.J. Pazzani, C. Brunk, G. Silverstein, A knowledge-intensive approach to learning relational concepts, in Proceedings of the Eighth Intl. Workshop on Machine Learning (ML’91), pp. 432–436 (1991) M.J. Pazzani, C. Brunk, G. Silverstein, A knowledge-intensive approach to learning relational concepts, in Proceedings of the Eighth Intl. Workshop on Machine Learning (ML’91), pp. 432–436 (1991)
15.
go back to reference A.K. Akobeng, Understanding diagnostic tests 1: sensitivity, specificity and predictive values. Acta Paediatr. 96(3), 338–341 (2007)CrossRef A.K. Akobeng, Understanding diagnostic tests 1: sensitivity, specificity and predictive values. Acta Paediatr. 96(3), 338–341 (2007)CrossRef
16.
go back to reference R. Parikh, A. Mathai, S. Parikh, G.C. Sekhar, R. Thomas, Understanding and using sensitivity, specificity and predictive values. Ind. J. Ophthalmol. 56(1), 45 (2008)CrossRef R. Parikh, A. Mathai, S. Parikh, G.C. Sekhar, R. Thomas, Understanding and using sensitivity, specificity and predictive values. Ind. J. Ophthalmol. 56(1), 45 (2008)CrossRef
Metadata
Title
Utilizing Coverage Lists as a Pruning Mechanism for Concept Discovery
Authors
Alev Mutlu
Abdullah Dogan
Pinar Karagoz
Copyright Year
2014
DOI
https://doi.org/10.1007/978-3-319-09465-6_28

Premium Partner