
2021 | OriginalPaper | Chapter

Constraint-Adaptive Rule Mining in Large Databases

Authors: Meng Li, Ya-Lin Zhang, Qitao Shi, Xinxing Yang, Qing Cui, Longfei Li, Jun Zhou

Published in: Database Systems for Advanced Applications

Publisher: Springer International Publishing


Abstract

Decision rules are widely used for their interpretability, efficiency, and stability in various applications, especially financial tasks such as fraud detection and loan assessment. In many scenarios, decision rules must be generated under specific constraints. However, previous methods take no account of these constraints, so their performance, efficiency, and adaptivity are far from satisfactory in these scenarios, especially when the constraints are relatively tight. To deal with this problem, we propose a constraint-adaptive rule mining algorithm named CARM (Constraint Adaptive Rule Mining), a novel decision-tree-based model. An adaptive criterion is designed and applied when building the trees to provide a practical balance between purity and constraint fitness, so that the constraints are better met. In addition, a rule extraction and pruning process is applied to satisfy the constraints and further alleviate overfitting, and an iterative covering framework is proposed to improve coverage. Experiments on both public and business data sets show that the proposed method achieves better performance, competitive efficiency, and low rule complexity compared with other methods.
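The abstract does not spell out CARM's adaptive criterion or pruning procedure, so the following is only a minimal sketch of the general idea of iterative covering under a constraint, not the authors' algorithm. It assumes a single hypothetical constraint (a minimum rule precision, `min_precision`), binary labels, and rules limited to one threshold test of the form `row[feature] > threshold`. All names (`best_rule`, `iterative_covering`, `max_rules`) are illustrative.

```python
from typing import List, Optional, Sequence, Tuple

Row = Sequence[float]
Rule = Tuple[int, float]  # (feature index, threshold); fires when row[feature] > threshold


def rule_fires(rule: Rule, row: Row) -> bool:
    feature, threshold = rule
    return row[feature] > threshold


def best_rule(rows: List[Row], labels: List[int], min_precision: float) -> Optional[Rule]:
    """Among single-threshold rules that meet the precision constraint,
    return the one covering the most positive samples, or None if none qualifies."""
    best, best_hits = None, 0
    for feature in range(len(rows[0])):
        for threshold in sorted({row[feature] for row in rows}):
            covered = [y for row, y in zip(rows, labels) if row[feature] > threshold]
            if not covered:
                continue
            hits = sum(covered)  # positives covered by this candidate rule
            if hits / len(covered) >= min_precision and hits > best_hits:
                best, best_hits = (feature, threshold), hits
    return best


def iterative_covering(rows: List[Row], labels: List[int],
                       min_precision: float, max_rules: int = 5) -> List[Rule]:
    """Assumed covering loop: mine one rule, drop the samples it covers, repeat."""
    rows, labels = list(rows), list(labels)
    rules: List[Rule] = []
    while rows and len(rules) < max_rules:
        rule = best_rule(rows, labels, min_precision)
        if rule is None:  # no remaining rule satisfies the constraint
            break
        rules.append(rule)
        remaining = [(r, y) for r, y in zip(rows, labels) if not rule_fires(rule, r)]
        rows = [r for r, _ in remaining]
        labels = [y for _, y in remaining]
    return rules
```

Each round mines rules only from samples that earlier rules left uncovered, which is how a covering scheme can raise overall coverage while every individual rule still satisfies the constraint. A tighter `min_precision` leaves fewer admissible rules per round.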


Metadata
Copyright Year
2021
DOI
https://doi.org/10.1007/978-3-030-73200-4_41
