Skip to main content
Erschienen in: International Journal of Machine Learning and Cybernetics 5/2020

02.05.2019 | Original Article

Similarity-based attribute reduction in rough set theory: a clustering perspective

verfasst von: Xiuyi Jia, Ya Rao, Lin Shang, Tongjun Li

Erschienen in: International Journal of Machine Learning and Cybernetics | Ausgabe 5/2020

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Attribute reduction is one of the most important research issues in the rough set theory. The purpose of attribute reduction is to find a minimal attribute subset that satisfies some specific criteria, while the minimal attribute subset is called attribute reduct. In this paper, we define a similarity-based attribute reduct based on a clustering perspective. Each decision class is treated as a cluster, and the defined similarity-based attribute reduct can maintain or increase the discriminating ability of different clusters in the case of removing redundant attributes. In view of this, firstly, we define the intra-class similarity for objects in the same decision class and the inter-class similarity for objects between different decision classes. Secondly, we define a similarity-based attribute reduct by maximizing intra-class similarity and minimizing inter-class similarity in the rough set model. Thirdly, by considering the heuristic search strategy, we also design a corresponding reduction method for the proposed attribute reduct. The experimental results indicate that compared with other representative attribute reducts, our proposed attribute reduct can significantly improve the classification performance.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Weitere Produktempfehlungen anzeigen
Literatur
2.
Zurück zum Zitat Chen D, Zhao S, Zhang L, Yang Y, Zhang X (2012) Sample pair selection for attribute reduction with rough set. IEEE Trans Knowl Data Eng 24(11):2080–2093CrossRef Chen D, Zhao S, Zhang L, Yang Y, Zhang X (2012) Sample pair selection for attribute reduction with rough set. IEEE Trans Knowl Data Eng 24(11):2080–2093CrossRef
3.
Zurück zum Zitat Dai J, Wang W, Tian H, Liu L (2013) Attribute selection based on a new conditional entropy for incomplete decision systems. Knowl Based Syst 39(2):207–213CrossRef Dai J, Wang W, Tian H, Liu L (2013) Attribute selection based on a new conditional entropy for incomplete decision systems. Knowl Based Syst 39(2):207–213CrossRef
4.
Zurück zum Zitat Dai JH, Hu H, Wu WZ, Qian YH, Huang DB (2018) Maximal discernibility pairs based approach to attribute reduction in fuzzy rough sets. IEEE Trans Fuzzy Syst 26(4):2174–2187CrossRef Dai JH, Hu H, Wu WZ, Qian YH, Huang DB (2018) Maximal discernibility pairs based approach to attribute reduction in fuzzy rough sets. IEEE Trans Fuzzy Syst 26(4):2174–2187CrossRef
5.
Zurück zum Zitat Feng L, Luo F, Fang D, Yuan Y (2012) Approaches for attribute core and attribute reduction based on an improved extended positive region. J Shandong Univ (Nat Sci) 47(1):72–76MathSciNet Feng L, Luo F, Fang D, Yuan Y (2012) Approaches for attribute core and attribute reduction based on an improved extended positive region. J Shandong Univ (Nat Sci) 47(1):72–76MathSciNet
6.
Zurück zum Zitat Hall M, Frank E, Holmes G, Pfahringer B, Reutemann P, Witten IH (2009) The WEKA data mining software: an update. ACM SIGKDD Explor Newsl 11(1):10–18CrossRef Hall M, Frank E, Holmes G, Pfahringer B, Reutemann P, Witten IH (2009) The WEKA data mining software: an update. ACM SIGKDD Explor Newsl 11(1):10–18CrossRef
7.
Zurück zum Zitat Hu Q, Zhao H, Xie Z, Yu D (2007) Consistency based attribute reduction. In: Zhou ZH, Li H, Yang Q (eds) Advances in knowledge discovery and data mining. PAKDD 2007. Lecture notes in computer science, vol 4426. Springer, Berlin, Heidelberg, pp 96–107 Hu Q, Zhao H, Xie Z, Yu D (2007) Consistency based attribute reduction. In: Zhou ZH, Li H, Yang Q (eds) Advances in knowledge discovery and data mining. PAKDD 2007. Lecture notes in computer science, vol 4426. Springer, Berlin, Heidelberg, pp 96–107
8.
Zurück zum Zitat Jia X, Shang L, Zhou B, Yao Y (2016) Generalized attribute reduct in rough set theory. Knowl Based Syst 91:204–218CrossRef Jia X, Shang L, Zhou B, Yao Y (2016) Generalized attribute reduct in rough set theory. Knowl Based Syst 91:204–218CrossRef
9.
Zurück zum Zitat Jia X, Li W, Shang L (2019) A multiphase cost-sensitive learning method based on the multiclass three-way decision-theoretic rough set model. Inf Sci 485:248–262CrossRef Jia X, Li W, Shang L (2019) A multiphase cost-sensitive learning method based on the multiclass three-way decision-theoretic rough set model. Inf Sci 485:248–262CrossRef
10.
Zurück zum Zitat Li J, Kumar CA, Mei C, Wang X (2017) Comparison of reduction in formal decision contexts. Int J Approx Reason 80:100–122MathSciNetCrossRef Li J, Kumar CA, Mei C, Wang X (2017) Comparison of reduction in formal decision contexts. Int J Approx Reason 80:100–122MathSciNetCrossRef
11.
Zurück zum Zitat Liu K, Yang X, Yu H, Mi J, Wang P, Chen X (2019) Rough set based semi-supervised feature selection via ensemble selector. Knowl Based Syst 165:282–296CrossRef Liu K, Yang X, Yu H, Mi J, Wang P, Chen X (2019) Rough set based semi-supervised feature selection via ensemble selector. Knowl Based Syst 165:282–296CrossRef
12.
Zurück zum Zitat Min F, Zhang ZH, Dong J (2018) Ant colony optimization with partial-complete searching for attribute reduction. J Comput Sci 25:170–182MathSciNetCrossRef Min F, Zhang ZH, Dong J (2018) Ant colony optimization with partial-complete searching for attribute reduction. J Comput Sci 25:170–182MathSciNetCrossRef
13.
14.
Zurück zum Zitat Pawalk Z (1991) Rough sets: theoretical aspects of reasoning about data. Springer, Netherlands Pawalk Z (1991) Rough sets: theoretical aspects of reasoning about data. Springer, Netherlands
15.
Zurück zum Zitat Qian J, Lv P, Yue XD, Liu CH, Jing ZJ (2015) Hierarchical attribute reduction algorithms for big data using mapreduce. Knowl Based Syst 73:18–31CrossRef Qian J, Lv P, Yue XD, Liu CH, Jing ZJ (2015) Hierarchical attribute reduction algorithms for big data using mapreduce. Knowl Based Syst 73:18–31CrossRef
16.
Zurück zum Zitat Qian Y, Liang J, Pedrycz W, Dang C (2010) Positive approximation: an accelerator for attribute reduction in rough set theory. Artif Intell 174(9–10):597–618MathSciNetCrossRef Qian Y, Liang J, Pedrycz W, Dang C (2010) Positive approximation: an accelerator for attribute reduction in rough set theory. Artif Intell 174(9–10):597–618MathSciNetCrossRef
17.
Zurück zum Zitat Stanfill C, Waltz D (1986) Toward memory-based reasoning. Commun ACM 29(12):1213–1228CrossRef Stanfill C, Waltz D (1986) Toward memory-based reasoning. Commun ACM 29(12):1213–1228CrossRef
18.
Zurück zum Zitat Sun SB, Zhang MC, Hu CX (2011) VPRS-based attribute reduction approaches in incomplete information systems. J Inf Comput Sci 8(9):1515–1523 Sun SB, Zhang MC, Hu CX (2011) VPRS-based attribute reduction approaches in incomplete information systems. J Inf Comput Sci 8(9):1515–1523
19.
Zurück zum Zitat Thangavel K, Pethalakshmi A (2009) Dimensionality reduction based on rough set theory: a review. Appl Soft Comput 9(1):1–12CrossRef Thangavel K, Pethalakshmi A (2009) Dimensionality reduction based on rough set theory: a review. Appl Soft Comput 9(1):1–12CrossRef
20.
Zurück zum Zitat Wang C, Wu C, Chen D (2008) A systematic study on attribute reduction with rough sets based on general binary relations. Inf Sci 178(9):2237–2261MathSciNetCrossRef Wang C, Wu C, Chen D (2008) A systematic study on attribute reduction with rough sets based on general binary relations. Inf Sci 178(9):2237–2261MathSciNetCrossRef
21.
Zurück zum Zitat Wang C, Chen D, Wu C, Hu Q (2011) Data compression with homomorphism in covering information systems. Int J Approx Reason 52(4):519–525MathSciNetCrossRef Wang C, Chen D, Wu C, Hu Q (2011) Data compression with homomorphism in covering information systems. Int J Approx Reason 52(4):519–525MathSciNetCrossRef
22.
Zurück zum Zitat Xu F, Miao D, Wei L (2009) Fuzzy-rough attribute reduction via mutual information with an application to cancer classification. Comput Math Appl 57(6):1010–1017CrossRef Xu F, Miao D, Wei L (2009) Fuzzy-rough attribute reduction via mutual information with an application to cancer classification. Comput Math Appl 57(6):1010–1017CrossRef
23.
Zurück zum Zitat Xu S, Yang X, Yu H, Yu DJ, Yang J, Tsang EC (2016) Multi-label learning with label-specific feature reduction. Knowl Based Syst 104:52–61CrossRef Xu S, Yang X, Yu H, Yu DJ, Yang J, Tsang EC (2016) Multi-label learning with label-specific feature reduction. Knowl Based Syst 104:52–61CrossRef
24.
Zurück zum Zitat Yang T, Li QG, Zhou BL (2013) Related family: a new method for attribute reduction of covering information systems. Inf Sci 228:175–191MathSciNetCrossRef Yang T, Li QG, Zhou BL (2013) Related family: a new method for attribute reduction of covering information systems. Inf Sci 228:175–191MathSciNetCrossRef
25.
Zurück zum Zitat Yang X, Liang S, Yu H, Gao S, Qian Y (2019) Pseudo-label neighborhood rough set: measures and attribute reductions. Int J Approx Reason 105:112–129MathSciNetCrossRef Yang X, Liang S, Yu H, Gao S, Qian Y (2019) Pseudo-label neighborhood rough set: measures and attribute reductions. Int J Approx Reason 105:112–129MathSciNetCrossRef
26.
Zurück zum Zitat Yang XB, Yao YY (2018) Ensemble selector for attribute reduction. Appl Soft Comput 70:1–11CrossRef Yang XB, Yao YY (2018) Ensemble selector for attribute reduction. Appl Soft Comput 70:1–11CrossRef
27.
28.
Zurück zum Zitat Yao Y, Zhao Y (2009) Discernibility matrix simplification for constructing attribute reducts. Inf Sci 179(7):867–882MathSciNetCrossRef Yao Y, Zhao Y (2009) Discernibility matrix simplification for constructing attribute reducts. Inf Sci 179(7):867–882MathSciNetCrossRef
29.
Zurück zum Zitat Zhang WX, Mi JS, Wu WZ (2003) Approaches to knowledge reductions in inconsistent systems. Int J Intell Syst 18(9):989–1000CrossRef Zhang WX, Mi JS, Wu WZ (2003) Approaches to knowledge reductions in inconsistent systems. Int J Intell Syst 18(9):989–1000CrossRef
30.
Zurück zum Zitat Zhao Y, Yao Y, Luo F (2007) Data analysis based on discernibility and indiscernibility. Inf Sci 177(22):4959–4976CrossRef Zhao Y, Yao Y, Luo F (2007) Data analysis based on discernibility and indiscernibility. Inf Sci 177(22):4959–4976CrossRef
Metadaten
Titel
Similarity-based attribute reduction in rough set theory: a clustering perspective
verfasst von
Xiuyi Jia
Ya Rao
Lin Shang
Tongjun Li
Publikationsdatum
02.05.2019
Verlag
Springer Berlin Heidelberg
Erschienen in
International Journal of Machine Learning and Cybernetics / Ausgabe 5/2020
Print ISSN: 1868-8071
Elektronische ISSN: 1868-808X
DOI
https://doi.org/10.1007/s13042-019-00959-w

Weitere Artikel der Ausgabe 5/2020

International Journal of Machine Learning and Cybernetics 5/2020 Zur Ausgabe

Neuer Inhalt