Skip to main content
Erschienen in: Granular Computing 2/2020

11.12.2018 | Original Paper

Neighborhood attribute reduction approach to partially labeled data

verfasst von: Keyu Liu, Eric C. C. Tsang, Jingjing Song, Hualong Yu, Xiangjian Chen, Xibei Yang

Erschienen in: Granular Computing | Ausgabe 2/2020

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Presently, from the viewpoint of rough set, most of the attribute reductions are only suitable for analyzing samples with complete labels. However, in many real-world applications, it is difficult to acquire the detailed labels of all samples, it follows that many attribute reductions may be ineffective for data with both labeled and unlabeled samples, i.e., partially labeled data. To fill such a gap, the attribute reduction is explored by neighborhood rough set over partially labeled data. First, two different measurements are combined for evaluating the importance of attribute, which comes from the labeled and unlabeled samples, respectively. Second, a heuristic algorithm is re-designed using such combined importance for computing reduct. Finally, by considering several different ratios of missing labels over UCI datasets, the experimental results demonstrate that the reducts derived by our approach not only reduce the degree of uncertainty, but also offer us better classification performance. Therefore, the main contribution of this paper is to construct an effective attribute reduction strategy for partially labeled data. Moreover, this research also suggests new applications for considering attribute reduction problems in complex data.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Chen SM, Chang YC (2011) Weighted fuzzy rule interpolation based on GA-based weight-learning techniques. IEEE Trans Fuzzy Syst 19:729–744 Chen SM, Chang YC (2011) Weighted fuzzy rule interpolation based on GA-based weight-learning techniques. IEEE Trans Fuzzy Syst 19:729–744
Zurück zum Zitat Chen SM, Chen JH (2009) Fuzzy risk analysis based on ranking generalized fuzzy numbers with different heights and different spreads. Expert Syst Appl 36:6320–6334 Chen SM, Chen JH (2009) Fuzzy risk analysis based on ranking generalized fuzzy numbers with different heights and different spreads. Expert Syst Appl 36:6320–6334
Zurück zum Zitat Chen SM, Tanuwijaya K (2011) Fuzzy forecasting based on high-order fuzzy logical relationships and automatic clustering techniques. Expert Syst Appl 38:15425–15437 Chen SM, Tanuwijaya K (2011) Fuzzy forecasting based on high-order fuzzy logical relationships and automatic clustering techniques. Expert Syst Appl 38:15425–15437
Zurück zum Zitat Chen SM, Lee SH, Lee CH (2001) A new method for generating fuzzy rules from numerical data for handling classification problems. Appl Artif Intell 15:645–664 Chen SM, Lee SH, Lee CH (2001) A new method for generating fuzzy rules from numerical data for handling classification problems. Appl Artif Intell 15:645–664
Zurück zum Zitat Chen DG, Zhao SY, Zhang L, Yang YP, Zhang X (2012) Sample pair selection for attribute reduction with rough set. IEEE Trans Knowl Data Eng 24:2080–2093 Chen DG, Zhao SY, Zhang L, Yang YP, Zhang X (2012) Sample pair selection for attribute reduction with rough set. IEEE Trans Knowl Data Eng 24:2080–2093
Zurück zum Zitat Dai JH, Wang WT, Xu Q (2013) An uncertainty measure for incomplete decision tables and its applications. IEEE Trans Cybern 43:1277–1289 Dai JH, Wang WT, Xu Q (2013) An uncertainty measure for incomplete decision tables and its applications. IEEE Trans Cybern 43:1277–1289
Zurück zum Zitat Dai JH, Hu QH, Zhang JH, Hu H, Zheng NG (2017) Attribute selection for partially labeled categorical data by rough set approach. IEEE Trans Cybern 47:2460–2471 Dai JH, Hu QH, Zhang JH, Hu H, Zheng NG (2017) Attribute selection for partially labeled categorical data by rough set approach. IEEE Trans Cybern 47:2460–2471
Zurück zum Zitat Dou HL, Yang XB, Song XN, Yu HL, Wu WZ, Yang JY (2016) Decision-theoretic rough set: a multicost strategy. Knowl Based Syst 91:71–83 Dou HL, Yang XB, Song XN, Yu HL, Wu WZ, Yang JY (2016) Decision-theoretic rough set: a multicost strategy. Knowl Based Syst 91:71–83
Zurück zum Zitat Dubois D, Prade H (1990) Rough fuzzy sets and fuzzy rough sets. Int J Gen Syst 17:191–209MATH Dubois D, Prade H (1990) Rough fuzzy sets and fuzzy rough sets. Int J Gen Syst 17:191–209MATH
Zurück zum Zitat Fisher RA (1921) On the “probable error” of a coefficient of correlation. Metron 1:3–32 Fisher RA (1921) On the “probable error” of a coefficient of correlation. Metron 1:3–32
Zurück zum Zitat Hu QH, Yu DR, Xie ZX, Liu JF (2006) Fuzzy probabilistic approximation spaces and their information measures. IEEE Trans Fuzzy Syst 14:191–201 Hu QH, Yu DR, Xie ZX, Liu JF (2006) Fuzzy probabilistic approximation spaces and their information measures. IEEE Trans Fuzzy Syst 14:191–201
Zurück zum Zitat Hu QH, Liu JF, Yu DR (2008a) Mixed feature selection based on granulation and approximation. Knowl Based Syst 21:294–304 Hu QH, Liu JF, Yu DR (2008a) Mixed feature selection based on granulation and approximation. Knowl Based Syst 21:294–304
Zurück zum Zitat Hu QH, Yu DR, Xie ZX (2008b) Neighborhood classifiers. Expert Syst Appl 34:866–876 Hu QH, Yu DR, Xie ZX (2008b) Neighborhood classifiers. Expert Syst Appl 34:866–876
Zurück zum Zitat Hu QH, Zhang L, Chen DG, Pedrycz W, Yu DR (2010) Gaussian kernel based fuzzy rough sets: model, uncertainty measures and applications. Int J Approx Reason 51:453–471MATH Hu QH, Zhang L, Chen DG, Pedrycz W, Yu DR (2010) Gaussian kernel based fuzzy rough sets: model, uncertainty measures and applications. Int J Approx Reason 51:453–471MATH
Zurück zum Zitat Hu J, Li TR, Wang HJ, Fujita H (2016) Hierarchical cluster ensemble model based on knowledge granulation. Knowl Based Syst 91:179–188 Hu J, Li TR, Wang HJ, Fujita H (2016) Hierarchical cluster ensemble model based on knowledge granulation. Knowl Based Syst 91:179–188
Zurück zum Zitat Huang B, Li HX (2018) Distance-based information granularity in neighborhood-based granular space. Granul Comput 3:93–110 Huang B, Li HX (2018) Distance-based information granularity in neighborhood-based granular space. Granul Comput 3:93–110
Zurück zum Zitat Ju HR, Li HX, Yang XB, Zhou XZ, Huang B (2017) Cost-sensitive rough set: a multi-granulation approach. Knowl Based Syst 123:137–153 Ju HR, Li HX, Yang XB, Zhou XZ, Huang B (2017) Cost-sensitive rough set: a multi-granulation approach. Knowl Based Syst 123:137–153
Zurück zum Zitat Mi JS, Wu WZ, Zhang WX (2004) Approaches to knowledge reduction based on variable precision rough set model. Inf Sci 159:255–272MathSciNetMATH Mi JS, Wu WZ, Zhang WX (2004) Approaches to knowledge reduction based on variable precision rough set model. Inf Sci 159:255–272MathSciNetMATH
Zurück zum Zitat Min F, Xu J (2016) Semi-greedy heuristics for feature selection with test cost constraints. Granul Comput 1:199–211 Min F, Xu J (2016) Semi-greedy heuristics for feature selection with test cost constraints. Granul Comput 1:199–211
Zurück zum Zitat Pawlak Z (1992) Rough sets: theoretical aspects of reasoning about data. Kluwer Academic Publishers, DordrechtMATH Pawlak Z (1992) Rough sets: theoretical aspects of reasoning about data. Kluwer Academic Publishers, DordrechtMATH
Zurück zum Zitat Pedrycz W, Chen SM (2011) Granular computing and intelligent systems: design with information granules of higher order and higher type. Springer, Heidelberg Pedrycz W, Chen SM (2011) Granular computing and intelligent systems: design with information granules of higher order and higher type. Springer, Heidelberg
Zurück zum Zitat Pedrycz W, Chen SM (2015) Granular computing and decision-making: interactive and iterative approaches. Springer, Heidelberg Pedrycz W, Chen SM (2015) Granular computing and decision-making: interactive and iterative approaches. Springer, Heidelberg
Zurück zum Zitat Peter JF, Skowron A, Synak P, Ramanna S (2003) Rough sets and information granulation. In: Proceedings 10th international fuzzy systems association world congress, Istanbul, Turkey, pp 370–377 Peter JF, Skowron A, Synak P, Ramanna S (2003) Rough sets and information granulation. In: Proceedings 10th international fuzzy systems association world congress, Istanbul, Turkey, pp 370–377
Zurück zum Zitat Polkowski L, Artiemjew P (2015) Granular computing in decision approximation: an application of rough mereology. Springer, HeidelbergMATH Polkowski L, Artiemjew P (2015) Granular computing in decision approximation: an application of rough mereology. Springer, HeidelbergMATH
Zurück zum Zitat Skowron A, Stepaniuk J (1996) Tolerance approximation spaces. Fundamenta Informaticae 27:245–253MathSciNetMATH Skowron A, Stepaniuk J (1996) Tolerance approximation spaces. Fundamenta Informaticae 27:245–253MathSciNetMATH
Zurück zum Zitat Swiniarski W, Skowron A (2003) Rough set methods in feature selection and recognition. Pattern Recognit Lett 24:83–849MATH Swiniarski W, Skowron A (2003) Rough set methods in feature selection and recognition. Pattern Recognit Lett 24:83–849MATH
Zurück zum Zitat Wang GY (2017) Dgcc: data-driven granular cognitive computing. Granul Comput 2:343–355 Wang GY (2017) Dgcc: data-driven granular cognitive computing. Granul Comput 2:343–355
Zurück zum Zitat Wang HY, Chen SM (2008) Evaluating students’ answerscripts using fuzzy numbers associated with degrees of confidence. IEEE Trans Fuzzy Syst 16:403–415 Wang HY, Chen SM (2008) Evaluating students’ answerscripts using fuzzy numbers associated with degrees of confidence. IEEE Trans Fuzzy Syst 16:403–415
Zurück zum Zitat Wang CZ, Shao MW, He Q, Qian YH, Qi YL (2016) Feature subset selection based on fuzzy neighborhood rough sets. Knowl Based Syst 111:173–179 Wang CZ, Shao MW, He Q, Qian YH, Qi YL (2016) Feature subset selection based on fuzzy neighborhood rough sets. Knowl Based Syst 111:173–179
Zurück zum Zitat Wang GY, Yang J, Xu J (2017) Granular computing: from granularity optimization to multi-granularity joint problem solving. Granul Comput 2:105–120 Wang GY, Yang J, Xu J (2017) Granular computing: from granularity optimization to multi-granularity joint problem solving. Granul Comput 2:105–120
Zurück zum Zitat Wang CZ, Hu QH, Wang XZ, Chen DG, Qian YH, Dong Z (2018) Feature selection based on neighborhood discrimination index. IEEE Trans Neural Netw Learn Syst 29:2986–2999MathSciNet Wang CZ, Hu QH, Wang XZ, Chen DG, Qian YH, Dong Z (2018) Feature selection based on neighborhood discrimination index. IEEE Trans Neural Netw Learn Syst 29:2986–2999MathSciNet
Zurück zum Zitat Wei W, Liang JY, Wang JH, Qian YH (2013) Decision-relative discernibility matrices in the sense of entropies. Int J Gen Syst 42:721–738MathSciNetMATH Wei W, Liang JY, Wang JH, Qian YH (2013) Decision-relative discernibility matrices in the sense of entropies. Int J Gen Syst 42:721–738MathSciNetMATH
Zurück zum Zitat Wojna A (2005) Analogy-based reasoning in classifier construction. Trans Rough Sets IV 3700:277–374MathSciNetMATH Wojna A (2005) Analogy-based reasoning in classifier construction. Trans Rough Sets IV 3700:277–374MathSciNetMATH
Zurück zum Zitat Wu WZ, Qian YH, Li TJ, Gu SM (2016) On rule acquisition in incomplete multi-scale decision tables. Inf Sci 378:282–302MathSciNetMATH Wu WZ, Qian YH, Li TJ, Gu SM (2016) On rule acquisition in incomplete multi-scale decision tables. Inf Sci 378:282–302MathSciNetMATH
Zurück zum Zitat Xu SP, Yang XB, Yu HL, Yu DJ, Yang JY, Tsang ECC (2016) Multi-label learning with label-specific feature reduction. Knowl Based Syst 104:52–61 Xu SP, Yang XB, Yu HL, Yu DJ, Yang JY, Tsang ECC (2016) Multi-label learning with label-specific feature reduction. Knowl Based Syst 104:52–61
Zurück zum Zitat Xu WH, Li WT, Zhang XT (2017) Generalized multigranulation rough sets and optimal granularity selection. Granul Comput 2:271–288 Xu WH, Li WT, Zhang XT (2017) Generalized multigranulation rough sets and optimal granularity selection. Granul Comput 2:271–288
Zurück zum Zitat Yang XB, Yao YY (2018) Ensemble selector for attribute reduction. Appl Soft Comput 70:1–11 Yang XB, Yao YY (2018) Ensemble selector for attribute reduction. Appl Soft Comput 70:1–11
Zurück zum Zitat Yang XB, Song XN, Dou HL, Yang JY (2011a) Multi-granulation rough set: from crisp to fuzzy case. Ann Fuzzy Math Inform 1:55–70MathSciNetMATH Yang XB, Song XN, Dou HL, Yang JY (2011a) Multi-granulation rough set: from crisp to fuzzy case. Ann Fuzzy Math Inform 1:55–70MathSciNetMATH
Zurück zum Zitat Yang XB, Zhang M, Dou HL, Yang JY (2011b) Neighborhood systems-based rough sets in incomplete information system. Knowl Based Syst 24:858–867 Yang XB, Zhang M, Dou HL, Yang JY (2011b) Neighborhood systems-based rough sets in incomplete information system. Knowl Based Syst 24:858–867
Zurück zum Zitat Yang XB, Qi YS, Song XN, Yang JY (2013) Test cost sensitive multigranulation rough set: model and minimal cost selection. Inf Sci 250:184–199MathSciNetMATH Yang XB, Qi YS, Song XN, Yang JY (2013) Test cost sensitive multigranulation rough set: model and minimal cost selection. Inf Sci 250:184–199MathSciNetMATH
Zurück zum Zitat Yang XB, Qi Y, Yu HL, Song XN, Yang JY (2014) Updating multigranulation rough approximations with increasing of granular structures. Knowl Based Syst 64:59–69 Yang XB, Qi Y, Yu HL, Song XN, Yang JY (2014) Updating multigranulation rough approximations with increasing of granular structures. Knowl Based Syst 64:59–69
Zurück zum Zitat Yang XB, Liang SC, Yu HL, Gao S, Qian YH (2019) Pseudo-label neighborhood rough set: measures and attribute reductions. Int J Approx Reason 105:112–129MathSciNetMATH Yang XB, Liang SC, Yu HL, Gao S, Qian YH (2019) Pseudo-label neighborhood rough set: measures and attribute reductions. Int J Approx Reason 105:112–129MathSciNetMATH
Zurück zum Zitat Zhang X, Mei CL, Chen DG, Li JH (2016) Feature selection in mixed data: a method using a novel fuzzy rough set-based information entropy. Pattern Recognit 56:1–15MATH Zhang X, Mei CL, Chen DG, Li JH (2016) Feature selection in mixed data: a method using a novel fuzzy rough set-based information entropy. Pattern Recognit 56:1–15MATH
Zurück zum Zitat Zhi HL, Li JH (2018) Granule description based on positive and negative attributes. Granul Comput 3:1–14MathSciNet Zhi HL, Li JH (2018) Granule description based on positive and negative attributes. Granul Comput 3:1–14MathSciNet
Zurück zum Zitat Zhou ZH, Li M (2010) Semi-supervised learning by disagreement. Knowl Inf Syst 24:415–439MathSciNet Zhou ZH, Li M (2010) Semi-supervised learning by disagreement. Knowl Inf Syst 24:415–439MathSciNet
Zurück zum Zitat Zhu P, Wen QY (2012) Information-theoretic measures associated with rough set approximations. Inf Sci 212:33–43MathSciNetMATH Zhu P, Wen QY (2012) Information-theoretic measures associated with rough set approximations. Inf Sci 212:33–43MathSciNetMATH
Metadaten
Titel
Neighborhood attribute reduction approach to partially labeled data
verfasst von
Keyu Liu
Eric C. C. Tsang
Jingjing Song
Hualong Yu
Xiangjian Chen
Xibei Yang
Publikationsdatum
11.12.2018
Verlag
Springer International Publishing
Erschienen in
Granular Computing / Ausgabe 2/2020
Print ISSN: 2364-4966
Elektronische ISSN: 2364-4974
DOI
https://doi.org/10.1007/s41066-018-00151-5

Weitere Artikel der Ausgabe 2/2020

Granular Computing 2/2020 Zur Ausgabe