Skip to main content
Erschienen in: International Journal of Machine Learning and Cybernetics 6/2020

06.02.2020 | Original Article

Incremental attribute reduction with rough set for dynamic datasets with simultaneously increasing samples and attributes

verfasst von: Lianjie Dong, Degang Chen

Erschienen in: International Journal of Machine Learning and Cybernetics | Ausgabe 6/2020

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Attribute reduction with rough set is a popular data analysis methodology for data dimensionality reduction. For dynamic datasets, the existing research has mainly focused on incremental attribute reduction with increasing samples (rows) or attributes (columns), but there is hardly any further research on attribute reduction for dynamic datasets with simultaneously increasing samples and attributes. This paper presents a novel incremental algorithm for attribute reduction with rough set. Firstly, the definition of discernibility relation is proposed based on the improved discernibility matrix. Then, the incremental mechanisms of samples and attributes are studied in terms of discernibility relation under a unified framework. On the basis of two incremental mechanisms, a unified incremental mechanism is introduced for dynamic datasets with simultaneously increasing samples and attributes, and the incremental algorithm is developed according to the unified incremental mechanism. The proposed algorithm has the solid mathematical foundation, which is also suitable for datasets with massive samples and attributes. Finally, compared experimentally with other algorithms, the efficiency of the developed incremental algorithm is demonstrated in terms of running time.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Weitere Produktempfehlungen anzeigen
Literatur
1.
Zurück zum Zitat Bazan J (2000) Chapter 17 A comparison of dynamic and non-dynamic rough set methods for extracting laws from decision tables. Rough sets in knowledge discovery 1. Physica-Verlag HD Bazan J (2000) Chapter 17 A comparison of dynamic and non-dynamic rough set methods for extracting laws from decision tables. Rough sets in knowledge discovery 1. Physica-Verlag HD
2.
Zurück zum Zitat Bang W, Bien Z (2007) Incremental inductive learning algorithm in the framework of rough set theory and its application. Int J Fuzzy Syst 1:25–36 Bang W, Bien Z (2007) Incremental inductive learning algorithm in the framework of rough set theory and its application. Int J Fuzzy Syst 1:25–36
3.
Zurück zum Zitat Chen DG, Yang YY (2014) Attribute reduction for heterogeneous data based on the combination of classical and fuzzy rough set models. IEEE Trans Fuzzy Syst 22(5):1325–1334 Chen DG, Yang YY (2014) Attribute reduction for heterogeneous data based on the combination of classical and fuzzy rough set models. IEEE Trans Fuzzy Syst 22(5):1325–1334
4.
Zurück zum Zitat Chen DG (2013) Fuzzy rough set theory and method. Beijing, China Chen DG (2013) Fuzzy rough set theory and method. Beijing, China
5.
Zurück zum Zitat Chen DG, Zhao SY, Zhang L, Yang YY, Zhang X (2012) Sample pair selection for attribute reduction with rough set. IEEE Trans Knowl Data Eng 24(11):2080–2093 Chen DG, Zhao SY, Zhang L, Yang YY, Zhang X (2012) Sample pair selection for attribute reduction with rough set. IEEE Trans Knowl Data Eng 24(11):2080–2093
6.
Zurück zum Zitat Chen DG, Zhang L, Zhao SY, Hu QH, Zhu P (2012) A novel algorithm for finding reducts with fuzzy rough sets. IEEE Trans Fuzzy Syst 20(2):385–389 Chen DG, Zhang L, Zhao SY, Hu QH, Zhu P (2012) A novel algorithm for finding reducts with fuzzy rough sets. IEEE Trans Fuzzy Syst 20(2):385–389
7.
Zurück zum Zitat Guan L (2009) An incremental updating algorithm of attribute reduction set in decision tables. In: International conference on fuzzy systems and knowledge discovery. Tianjin, China, vol 6, pp 421–425 Guan L (2009) An incremental updating algorithm of attribute reduction set in decision tables. In: International conference on fuzzy systems and knowledge discovery. Tianjin, China, vol 6, pp 421–425
8.
Zurück zum Zitat Hao C, Li JH, Fan M, Liu WQ, Tsang Eric CC (2017) Optimal scale selection in dynamic multi-scale decision tables based on sequential three-wan decisions. Inf Sci 415:213–232 Hao C, Li JH, Fan M, Liu WQ, Tsang Eric CC (2017) Optimal scale selection in dynamic multi-scale decision tables based on sequential three-wan decisions. Inf Sci 415:213–232
9.
Zurück zum Zitat Hu F, Dai J, Wang G (2007) Incremental algorithms for attribute reduction in the decision table. Control Decis 22(3):268–277MATH Hu F, Dai J, Wang G (2007) Incremental algorithms for attribute reduction in the decision table. Control Decis 22(3):268–277MATH
10.
Zurück zum Zitat Hu F, Wang G, Huang H, Wu Y (2005) Incremental attribute reduction based on elementary sets. In: Proceedings of 10th International Conference on Rough Sets. Fussy Sets. Data Mining and Granular Computing. Regina. pp 185–193 Hu F, Wang G, Huang H, Wu Y (2005) Incremental attribute reduction based on elementary sets. In: Proceedings of 10th International Conference on Rough Sets. Fussy Sets. Data Mining and Granular Computing. Regina. pp 185–193
11.
Zurück zum Zitat Hu QH, Yu DR, Xie ZX, Li XD (2007) EROS: ensemble rough subspaces. Pattern Recognit 40(12):3728–3739MATH Hu QH, Yu DR, Xie ZX, Li XD (2007) EROS: ensemble rough subspaces. Pattern Recognit 40(12):3728–3739MATH
12.
Zurück zum Zitat Hu QH, Zhang L, Chen DG, Pedrycz W, Yu DR (2010) Gaussian kernel based fuzzy rough sets: model, uncertainty measures and applications. Int J Approx Reason 51(4):453–471MATH Hu QH, Zhang L, Chen DG, Pedrycz W, Yu DR (2010) Gaussian kernel based fuzzy rough sets: model, uncertainty measures and applications. Int J Approx Reason 51(4):453–471MATH
13.
Zurück zum Zitat Hu QH, Yu DR, Pedrycz W, Chen DG (2011) Kernelized fuzzy rough sets and their applications. IEEE Trans Knowl Data Eng 23(11):1649–1667 Hu QH, Yu DR, Pedrycz W, Chen DG (2011) Kernelized fuzzy rough sets and their applications. IEEE Trans Knowl Data Eng 23(11):1649–1667
14.
Zurück zum Zitat Blaszczynski J, Slowinski R (2003) Incremental induction of decision rules from dominance-based rough approximations. Electron Notes Theor Comput Sci 84(4):40–51MATH Blaszczynski J, Slowinski R (2003) Incremental induction of decision rules from dominance-based rough approximations. Electron Notes Theor Comput Sci 84(4):40–51MATH
15.
Zurück zum Zitat Jing Y, Li T, Fujita H, Wang B, Cheng C (2018) An incremental attribute reduction method for dynamic data mining. Inf Sci 465:202–218MathSciNet Jing Y, Li T, Fujita H, Wang B, Cheng C (2018) An incremental attribute reduction method for dynamic data mining. Inf Sci 465:202–218MathSciNet
16.
Zurück zum Zitat Jing Y, Li T, Huang J, Zhang Y (2016) An incremental attribute reduction approach based on knowledge granularity under the attribute generalization. Int J Approx Reason 76:80–95MathSciNetMATH Jing Y, Li T, Huang J, Zhang Y (2016) An incremental attribute reduction approach based on knowledge granularity under the attribute generalization. Int J Approx Reason 76:80–95MathSciNetMATH
17.
Zurück zum Zitat Luo C, Li T, Chen H, Liu D (2012) An incremental approach for updating approximations based on set-valued ordered information systems. In: International conference on rough sets and current trends in computing, Springer, Berlin, pp 363–369 Luo C, Li T, Chen H, Liu D (2012) An incremental approach for updating approximations based on set-valued ordered information systems. In: International conference on rough sets and current trends in computing, Springer, Berlin, pp 363–369
18.
Zurück zum Zitat Li JH, Aswani Kumar C, Mei CL, Wang XZ (2017) Comparison of reduction in formal decision contexts. Int J Approx Reason 80:100–122MathSciNetMATH Li JH, Aswani Kumar C, Mei CL, Wang XZ (2017) Comparison of reduction in formal decision contexts. Int J Approx Reason 80:100–122MathSciNetMATH
19.
Zurück zum Zitat Li WT, Pedrycz W, Xue XP, Xu WH, Fan BJ (2019) Fuzziness and incremental information of disjoint regions in double-quantitative decision-theoretic rough set model. Int J Mach Learn Cybernet 10:2669–2690 Li WT, Pedrycz W, Xue XP, Xu WH, Fan BJ (2019) Fuzziness and incremental information of disjoint regions in double-quantitative decision-theoretic rough set model. Int J Mach Learn Cybernet 10:2669–2690
20.
Zurück zum Zitat Li T, Ruan D, Geert W, Song J, Xu Y (2007) A rough sets based characteristic relation approach for dynamic attribute generalization in data mining. Knowl-Based Syst 20(5):485–494 Li T, Ruan D, Geert W, Song J, Xu Y (2007) A rough sets based characteristic relation approach for dynamic attribute generalization in data mining. Knowl-Based Syst 20(5):485–494
21.
Zurück zum Zitat Liang J, Wang F, Dang C, Qian Y (2014) A group incremental approach to feature selection applying rough set technique. IEEE Trans Knowl Data Eng 26(2):294–308 Liang J, Wang F, Dang C, Qian Y (2014) A group incremental approach to feature selection applying rough set technique. IEEE Trans Knowl Data Eng 26(2):294–308
22.
Zurück zum Zitat Liu ZT (1999) An incremental arithmetic for the smallest reduction of attributes. Acta Electron Sinica 27(11):96–98 Liu ZT (1999) An incremental arithmetic for the smallest reduction of attributes. Acta Electron Sinica 27(11):96–98
23.
Zurück zum Zitat Miao D, Zhao Y, Yao Y, Li H, Xu F (2009) Relative reducts in consistent and inconsistent decision tables of the Pawlak rough set model. Inf Sci 179(24):4140–4150MathSciNetMATH Miao D, Zhao Y, Yao Y, Li H, Xu F (2009) Relative reducts in consistent and inconsistent decision tables of the Pawlak rough set model. Inf Sci 179(24):4140–4150MathSciNetMATH
24.
Zurück zum Zitat Modrzejewski M (1993) Feature selection using rough sets theory. Machine learning: ECML-93. In: European conference on machine learning, Vienna, Austria, pp 213–226 Modrzejewski M (1993) Feature selection using rough sets theory. Machine learning: ECML-93. In: European conference on machine learning, Vienna, Austria, pp 213–226
25.
Zurück zum Zitat Orlowska M, Orlowski M (1992) Maintenance of knowledge in dynamic information systems. Intelligent decision support. Springer, Netherlands, pp 315–329 Orlowska M, Orlowski M (1992) Maintenance of knowledge in dynamic information systems. Intelligent decision support. Springer, Netherlands, pp 315–329
26.
Zurück zum Zitat Pawlak Z (1982) Rough sets. Int J Comput Inform Sci 11(5):341–356MATH Pawlak Z (1982) Rough sets. Int J Comput Inform Sci 11(5):341–356MATH
27.
Zurück zum Zitat Pawlak Z (1991) Rough sets: theoretical aspects of reasoning about data. Kluwer Academic Publishers, BerlinMATH Pawlak Z (1991) Rough sets: theoretical aspects of reasoning about data. Kluwer Academic Publishers, BerlinMATH
28.
Zurück zum Zitat Riza LS, Janusz A, Bergmeir C, Cornelis C, Herrera F, Slezak D, Benitez JM (2014) Implementing algorithms of rough set theory and fuzzy rough set theory in the R package ‘‘RoughSets’’. Inf Sci 287:68–89 Riza LS, Janusz A, Bergmeir C, Cornelis C, Herrera F, Slezak D, Benitez JM (2014) Implementing algorithms of rough set theory and fuzzy rough set theory in the R package ‘‘RoughSets’’. Inf Sci 287:68–89
29.
Zurück zum Zitat Rahman M, Islam M (2014) FIMUS: a framework for imputing missing values using co-appearance, correlation and similarity analysis. Knowl Based Syst 56:311–327 Rahman M, Islam M (2014) FIMUS: a framework for imputing missing values using co-appearance, correlation and similarity analysis. Knowl Based Syst 56:311–327
30.
Zurück zum Zitat Skowron A, Rauszer C (1992) The discernibility matrices and functions in information systems. Intell Decis Support 11:331–362 Skowron A, Rauszer C (1992) The discernibility matrices and functions in information systems. Intell Decis Support 11:331–362
31.
Zurück zum Zitat Susmaga R (1998) Experiments in incremental computation of reducts. Methodol Appl 18:530–553MATH Susmaga R (1998) Experiments in incremental computation of reducts. Methodol Appl 18:530–553MATH
32.
Zurück zum Zitat Swiniarski RW, Skowron A (2003) Rough set methods in feature selection and recognition. Pattern Recogn Lett 24(6):833–849MATH Swiniarski RW, Skowron A (2003) Rough set methods in feature selection and recognition. Pattern Recogn Lett 24(6):833–849MATH
33.
Zurück zum Zitat Shan N, Ziarko W (2010) Data-based acquisition and incremental modification of classification rules. Comput Intell 11(2):357–370 Shan N, Ziarko W (2010) Data-based acquisition and incremental modification of classification rules. Comput Intell 11(2):357–370
34.
Zurück zum Zitat Shu W, Shen H (2014) Updating attribute reduction in incomplete decision systems with the variation of attribute set. Int J Approx Reason 55(3):867–884MathSciNetMATH Shu W, Shen H (2014) Updating attribute reduction in incomplete decision systems with the variation of attribute set. Int J Approx Reason 55(3):867–884MathSciNetMATH
35.
Zurück zum Zitat Stawicki S, Ślęzak D (2013) Recent advances in decision bireducts: complexity, heuristics and streams. Rough sets and knowledge technology. Springer, Berlin Stawicki S, Ślęzak D (2013) Recent advances in decision bireducts: complexity, heuristics and streams. Rough sets and knowledge technology. Springer, Berlin
36.
Zurück zum Zitat Teng S, Liu M, Yang A, Zhang J, Nian Y, He M (2016) Efficient attribute reduction from the viewpoint of discernibility. Inf Sci 326:297–314MathSciNetMATH Teng S, Liu M, Yang A, Zhang J, Nian Y, He M (2016) Efficient attribute reduction from the viewpoint of discernibility. Inf Sci 326:297–314MathSciNetMATH
37.
Zurück zum Zitat Wang CZ, Huang Y, Shao MW, Fan XD (2019) Fuzzy rough set-based attribute reduction using distance measures. Knowl-Based Syst 164:205–212 Wang CZ, Huang Y, Shao MW, Fan XD (2019) Fuzzy rough set-based attribute reduction using distance measures. Knowl-Based Syst 164:205–212
39.
Zurück zum Zitat Wang CZ, Shi YP, Fan XD, Shao MW (2019) Attribute reduction based on k-nearest neighborhood rough sets. Int J Approx Reason 106:18–31MathSciNetMATH Wang CZ, Shi YP, Fan XD, Shao MW (2019) Attribute reduction based on k-nearest neighborhood rough sets. Int J Approx Reason 106:18–31MathSciNetMATH
41.
Zurück zum Zitat Wang F, Liang J, Qian Y (2013) Attribute reduction: a dimension incremental strategy. Knowl-Based Syst 39(2):95–108 Wang F, Liang J, Qian Y (2013) Attribute reduction: a dimension incremental strategy. Knowl-Based Syst 39(2):95–108
42.
Zurück zum Zitat Wang Q, Li JH, Wei L, Qian T (2020) Optimal granule level selection: a granule description accuracy viewpoint. Int J Approx Reason 116:85–105MathSciNetMATH Wang Q, Li JH, Wei L, Qian T (2020) Optimal granule level selection: a granule description accuracy viewpoint. Int J Approx Reason 116:85–105MathSciNetMATH
43.
Zurück zum Zitat Yang M (2007) An incremental updating algorithm for attributes reduction based on the improved discernibility matrix. Chin J Comput 30(3):815–822MathSciNet Yang M (2007) An incremental updating algorithm for attributes reduction based on the improved discernibility matrix. Chin J Comput 30(3):815–822MathSciNet
44.
Zurück zum Zitat Yao YY, Zhao Y (2008) Attribute reduction in decision-theoretic rough set models. Inf Sci 178(17):3356–3373MathSciNetMATH Yao YY, Zhao Y (2008) Attribute reduction in decision-theoretic rough set models. Inf Sci 178(17):3356–3373MathSciNetMATH
45.
Zurück zum Zitat Yang Y, Chen D, Wang H (2017) Active sample selection based incremental algorithm for attribute reduction with rough sets. IEEE Trans Fuzzy Syst 25(4):825–838 Yang Y, Chen D, Wang H (2017) Active sample selection based incremental algorithm for attribute reduction with rough sets. IEEE Trans Fuzzy Syst 25(4):825–838
46.
Zurück zum Zitat Zhao SY, Tsang ECC, Chen DG (2010) The model of fuzzy variable precision rough sets. IEEE Trans Fuzzy Syst 17(2):451–471 Zhao SY, Tsang ECC, Chen DG (2010) The model of fuzzy variable precision rough sets. IEEE Trans Fuzzy Syst 17(2):451–471
47.
Zurück zum Zitat Zhao SY, Chen H, Li CP, Zhai MY, Du XY (2013) RFRR:robust fuzzy rough reduction. IEEE Trans Fuzzy Syst 21(5):825–841 Zhao SY, Chen H, Li CP, Zhai MY, Du XY (2013) RFRR:robust fuzzy rough reduction. IEEE Trans Fuzzy Syst 21(5):825–841
48.
Zurück zum Zitat Zhao SY, Tsang ECC, Chen DG, Wang XZ (2010) Building a rule-based classifier-a fuzzy-rough set approach. IEEE Trans Knowl Data Eng 22(5):624–638 Zhao SY, Tsang ECC, Chen DG, Wang XZ (2010) Building a rule-based classifier-a fuzzy-rough set approach. IEEE Trans Knowl Data Eng 22(5):624–638
49.
Zurück zum Zitat Zhao SY, Chen H, Li CP, Du XY, Sun H (2015) A novel approach to building a robust fuzzy rough classifier. IEEE Trans Fuzzy Syst 23(4):769–786 Zhao SY, Chen H, Li CP, Du XY, Sun H (2015) A novel approach to building a robust fuzzy rough classifier. IEEE Trans Fuzzy Syst 23(4):769–786
Metadaten
Titel
Incremental attribute reduction with rough set for dynamic datasets with simultaneously increasing samples and attributes
verfasst von
Lianjie Dong
Degang Chen
Publikationsdatum
06.02.2020
Verlag
Springer Berlin Heidelberg
Erschienen in
International Journal of Machine Learning and Cybernetics / Ausgabe 6/2020
Print ISSN: 1868-8071
Elektronische ISSN: 1868-808X
DOI
https://doi.org/10.1007/s13042-020-01065-y

Weitere Artikel der Ausgabe 6/2020

International Journal of Machine Learning and Cybernetics 6/2020 Zur Ausgabe

Neuer Inhalt