Skip to main content

2016 | OriginalPaper | Buchkapitel

DualPOS: A Semi-supervised Attribute Selection Approach for Symbolic Data Based on Rough Set Theory

verfasst von : Jianhua Dai, Huifeng Han, Hu Hu, Qinghua Hu, Jinghong Zhang, Wentao Wang

Erschienen in: Web-Age Information Management

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Rough set theory, supplying an effective model for representation of uncertain knowledge, has been widely used in knowledge engineering and data mining. Especially, rough set theory has been used as an attribute selection method with much success. However, current rough set approaches for attribute reduction are unsuitable for semi-supervised learning as no enough labeled data can guarantee to calculate the dependency degree. We propose a new attribute selection strategy based on rough sets, called DualPOS. It provides mutual function mechanism of multi-attributes, and generates the most consistent one as a candidate. Experiments are carried out to test the performances of classification and clustering of the proposed algorithm. The results show that DualPOS is valid for attribute selection in semi-supervised learning.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Blum, A.L., Langley, P.: Selection of relevant features and examples in machine learning. Artif. Intell. 97(1C2), 245–271 (1997)MathSciNetCrossRefMATH Blum, A.L., Langley, P.: Selection of relevant features and examples in machine learning. Artif. Intell. 97(1C2), 245–271 (1997)MathSciNetCrossRefMATH
2.
Zurück zum Zitat Guyon, I., Elisseeff, A.: An introduction to variable and feature selection. J. Mach. Learn. Res. 3, 1157–1182 (2003)MATH Guyon, I., Elisseeff, A.: An introduction to variable and feature selection. J. Mach. Learn. Res. 3, 1157–1182 (2003)MATH
3.
Zurück zum Zitat Bae, C., Yeh, W.C., Chung, Y.Y., Liu, S.L.: Feature selection with intelligent dynamic swarm and rough set. Expert Syst. Appl. 37(10), 7026–7032 (2010)CrossRef Bae, C., Yeh, W.C., Chung, Y.Y., Liu, S.L.: Feature selection with intelligent dynamic swarm and rough set. Expert Syst. Appl. 37(10), 7026–7032 (2010)CrossRef
6.
Zurück zum Zitat Revett, K., Iantovics, B.: A survey of electronic fetal monitoring: a computational perspective. Stud. Comput. Intell. 486, 135–141 (2014) Revett, K., Iantovics, B.: A survey of electronic fetal monitoring: a computational perspective. Stud. Comput. Intell. 486, 135–141 (2014)
7.
Zurück zum Zitat Skowron, A., Rauszer, C.: The discernibility matrices and functions in information systems. In: Slowiński, R. (ed.) Intelligent Decision Support. Theory and Decision Library, vol. 11, pp. 331–362. Springer, Netherlands (1992)CrossRef Skowron, A., Rauszer, C.: The discernibility matrices and functions in information systems. In: Slowiński, R. (ed.) Intelligent Decision Support. Theory and Decision Library, vol. 11, pp. 331–362. Springer, Netherlands (1992)CrossRef
8.
Zurück zum Zitat Vafaie, H., Imam, I.F.: Feature selection methods: genetic algorithms vs. greedy-like search. In: Proceedings of the International Conference on Fuzzy and Intelligent Control Systems, pp. 39–43 (1994) Vafaie, H., Imam, I.F.: Feature selection methods: genetic algorithms vs. greedy-like search. In: Proceedings of the International Conference on Fuzzy and Intelligent Control Systems, pp. 39–43 (1994)
9.
Zurück zum Zitat Hu, X., Cercone, N.: Learning in relational databases: a rough set approach. Comput. Intell. 11(2), 323–338 (1995)CrossRef Hu, X., Cercone, N.: Learning in relational databases: a rough set approach. Comput. Intell. 11(2), 323–338 (1995)CrossRef
10.
Zurück zum Zitat Hu, X.: Knowledge discovery in databases: an attribute-oriented rough set approach. Ph.D. thesis, Citeseer (1995) Hu, X.: Knowledge discovery in databases: an attribute-oriented rough set approach. Ph.D. thesis, Citeseer (1995)
11.
Zurück zum Zitat Susmaga, R.: Reducts and constructs in attribute reduction. Fundamenta Informaticae 61(2), 159–181 (2004)MathSciNetMATH Susmaga, R.: Reducts and constructs in attribute reduction. Fundamenta Informaticae 61(2), 159–181 (2004)MathSciNetMATH
12.
Zurück zum Zitat Dai, J., Wang, W., Xu, Q.: An uncertainty measure for incomplete decision tables and its applications. IEEE Trans. Cybern. 43(4), 1277–1289 (2013)CrossRef Dai, J., Wang, W., Xu, Q.: An uncertainty measure for incomplete decision tables and its applications. IEEE Trans. Cybern. 43(4), 1277–1289 (2013)CrossRef
13.
Zurück zum Zitat Dai, J., Wang, W., Tian, H., Liu, L.: Attribute selection based on a new conditional entropy for incomplete decision systems. Knowl.-Based Syst. 39, 207–213 (2013)CrossRef Dai, J., Wang, W., Tian, H., Liu, L.: Attribute selection based on a new conditional entropy for incomplete decision systems. Knowl.-Based Syst. 39, 207–213 (2013)CrossRef
14.
Zurück zum Zitat Dai, J., Xu, Q., Wang, W., Tian, H.: Conditional entropy for incomplete decision systems and its application in data mining. Int. J. Gen. Syst. 41(7), 713–728 (2012)MathSciNetCrossRefMATH Dai, J., Xu, Q., Wang, W., Tian, H.: Conditional entropy for incomplete decision systems and its application in data mining. Int. J. Gen. Syst. 41(7), 713–728 (2012)MathSciNetCrossRefMATH
15.
Zurück zum Zitat Dai, J., Xu, Q.: Attribute selection based on information gain ratio in fuzzy rough set theory with application to tumor classification. Appl. Soft Comput. 13(1), 211–221 (2013)CrossRef Dai, J., Xu, Q.: Attribute selection based on information gain ratio in fuzzy rough set theory with application to tumor classification. Appl. Soft Comput. 13(1), 211–221 (2013)CrossRef
16.
Zurück zum Zitat Dai, J., Li, Y.X., Liu, Q.: Hybrid genetic algorithm for reduct of attributes in decision system based on rough set theory. Wuhan Univ. J. Nat. Sci. 7(3), 285–289 (2002)CrossRef Dai, J., Li, Y.X., Liu, Q.: Hybrid genetic algorithm for reduct of attributes in decision system based on rough set theory. Wuhan Univ. J. Nat. Sci. 7(3), 285–289 (2002)CrossRef
17.
Zurück zum Zitat Dai, J., Chen, W., Gu, H., Pan, Y.: Particle swarm algorithm for minimal attribute reduction of decision data tables. In: Proceedings First International Multi-Symposiums on Computer and Computational Sciences (IMSCCS 2006), Hangzhou, China, I, pp. 572–575, April 2006 Dai, J., Chen, W., Gu, H., Pan, Y.: Particle swarm algorithm for minimal attribute reduction of decision data tables. In: Proceedings First International Multi-Symposiums on Computer and Computational Sciences (IMSCCS 2006), Hangzhou, China, I, pp. 572–575, April 2006
18.
Zurück zum Zitat Bazan, J.G., Nguyen, H.S., Nguyen, S.H., Synak, P., Wróblewski, J.: Rough set algorithms in classification problem. In: Polkowski, L., Tsumoto, S., Lin, T.Y. (eds.) Rough Set Methods and Applications. Studies in Fuzziness and Soft Computing, vol. 56, pp. 49–88. Springer, Heidelberg (2000)CrossRef Bazan, J.G., Nguyen, H.S., Nguyen, S.H., Synak, P., Wróblewski, J.: Rough set algorithms in classification problem. In: Polkowski, L., Tsumoto, S., Lin, T.Y. (eds.) Rough Set Methods and Applications. Studies in Fuzziness and Soft Computing, vol. 56, pp. 49–88. Springer, Heidelberg (2000)CrossRef
19.
Zurück zum Zitat Wroblewski, J.: Finding minimal reducts using genetic algorithms. In: Proccedings of the 2nd Annual Join Conference on Infromation Science, pp. 186–189 (1995) Wroblewski, J.: Finding minimal reducts using genetic algorithms. In: Proccedings of the 2nd Annual Join Conference on Infromation Science, pp. 186–189 (1995)
20.
Zurück zum Zitat Zhu, X., Goldberg, A.B.: Introduction to semi-supervised learning. Synth. Lect. Artif. Intell. Mach. Learn. 3(1), 1–130 (2009)CrossRefMATH Zhu, X., Goldberg, A.B.: Introduction to semi-supervised learning. Synth. Lect. Artif. Intell. Mach. Learn. 3(1), 1–130 (2009)CrossRefMATH
21.
Zurück zum Zitat Pawlak, Z., Sowinski, R.: Rough set approach to multi-attribute decision analysis. Eur. J. Oper. Res. 72(3), 443–459 (1994)CrossRefMATH Pawlak, Z., Sowinski, R.: Rough set approach to multi-attribute decision analysis. Eur. J. Oper. Res. 72(3), 443–459 (1994)CrossRefMATH
22.
Zurück zum Zitat Pawlak, Z., Grzymala-Busse, J., Slowinski, R., Ziarko, W.: Rough sets. Commun. ACM 38(11), 88–95 (1995)CrossRef Pawlak, Z., Grzymala-Busse, J., Slowinski, R., Ziarko, W.: Rough sets. Commun. ACM 38(11), 88–95 (1995)CrossRef
23.
24.
Zurück zum Zitat Dai, J., Wang, W., Xu, Q., Tian, H.: Uncertainty measurement for interval-valued decision systems based on extended conditional entropy. Knowl.-Based Syst. 27, 443–450 (2012)CrossRef Dai, J., Wang, W., Xu, Q., Tian, H.: Uncertainty measurement for interval-valued decision systems based on extended conditional entropy. Knowl.-Based Syst. 27, 443–450 (2012)CrossRef
25.
Zurück zum Zitat Fayyad, U., Irani, K.: Multi-interval discretization of continuous-valued attributes for classification learning. In: Proceedings of the 13th International Join Conference on Artificial Intelligence, pp. 1022–1027 (1993) Fayyad, U., Irani, K.: Multi-interval discretization of continuous-valued attributes for classification learning. In: Proceedings of the 13th International Join Conference on Artificial Intelligence, pp. 1022–1027 (1993)
26.
Zurück zum Zitat Jain, A., Zongker, D.: Feature selection: evaluation, application, and small sample performance. IEEE Trans. Pattern Anal. Mach. Intell. 19(2), 153–158 (1997)CrossRef Jain, A., Zongker, D.: Feature selection: evaluation, application, and small sample performance. IEEE Trans. Pattern Anal. Mach. Intell. 19(2), 153–158 (1997)CrossRef
27.
Zurück zum Zitat Zhu, H., Zhou, M.: Efficient role transfer based on kuhn-munkres algorithm. IEEE Trans. Syst. Man Cybern. Part A Syst. Hum. 42(2), 491–496 (2012)MathSciNetCrossRef Zhu, H., Zhou, M.: Efficient role transfer based on kuhn-munkres algorithm. IEEE Trans. Syst. Man Cybern. Part A Syst. Hum. 42(2), 491–496 (2012)MathSciNetCrossRef
Metadaten
Titel
DualPOS: A Semi-supervised Attribute Selection Approach for Symbolic Data Based on Rough Set Theory
verfasst von
Jianhua Dai
Huifeng Han
Hu Hu
Qinghua Hu
Jinghong Zhang
Wentao Wang
Copyright-Jahr
2016
DOI
https://doi.org/10.1007/978-3-319-39958-4_31

Neuer Inhalt