Skip to main content
Top
Published in: International Journal of Machine Learning and Cybernetics 4/2018

19-09-2016 | Original Article

Streamwise feature selection: a rough set method

Authors: Mohammad Masoud Javidi, Sadegh Eskandari

Published in: International Journal of Machine Learning and Cybernetics | Issue 4/2018

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Traditional feature selection methods assume that the entire input feature set is available from the beginning. However, streaming features (SF) is an integral part of many real-world applications. In this scenario, the number of training examples is fixed while the number of features grows with time as new features stream in. A critical challenge for streamwise feature selection (SFS) is the unavailability of the entire feature set before learning starts. Several efforts have been made to address the SFS problem, however they all need some prior knowledge about the entire feature set. In this paper, the SFS problem is considered from the rough sets (RS) perspective. The main motivation for this consideration is that RS-based data mining does not require any domain knowledge other than the given dataset. The proposed method uses the significance analysis concepts in RS theory to control the unknown feature space in SFS problems. This algorithm is evaluated extensively on several high-dimensional datasets in terms of compactness, classification accuracy, and running time. Experimental results demonstrate that the algorithm achieves better results than existing SFS algorithms.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Show more products
Literature
1.
go back to reference Bishop CM (2006) Pattern recognition and machine learning (information science and statistics). Springer-Verlag New York Inc., SecaucusMATH Bishop CM (2006) Pattern recognition and machine learning (information science and statistics). Springer-Verlag New York Inc., SecaucusMATH
2.
go back to reference Theodoridis S, Koutroumbas K (2009) Pattern recognition. Academic Press, CambridgeMATH Theodoridis S, Koutroumbas K (2009) Pattern recognition. Academic Press, CambridgeMATH
3.
go back to reference Guyon I, Elliseff A (2003) An introduction to variable and feature selection. J Mach Learn Res 3:1157–1182MATH Guyon I, Elliseff A (2003) An introduction to variable and feature selection. J Mach Learn Res 3:1157–1182MATH
4.
go back to reference Wang J, Zhao P, Hoi S, Jin R (2014) Online feature selection and its applications. IEEE Trans Knowl Data Eng 26(3):698–710CrossRef Wang J, Zhao P, Hoi S, Jin R (2014) Online feature selection and its applications. IEEE Trans Knowl Data Eng 26(3):698–710CrossRef
5.
go back to reference Wu X, Yu K, Ding W, Wang H, Zhu X (2013) Online feature selection with streaming features. IEEE Trans Pattern Anal Mach Intell 35:1178–1192CrossRef Wu X, Yu K, Ding W, Wang H, Zhu X (2013) Online feature selection with streaming features. IEEE Trans Pattern Anal Mach Intell 35:1178–1192CrossRef
6.
go back to reference Ungar L, Zhou J, Foster D, Stine B (2005) Streaming feature selection using IIC. In: Proceedings of the 10th International Conference on Articial Intelligence and Statistics Ungar L, Zhou J, Foster D, Stine B (2005) Streaming feature selection using IIC. In: Proceedings of the 10th International Conference on Articial Intelligence and Statistics
7.
go back to reference He YL, Liu JNK, Hu YH, Wang XZ (2015) OWA operator based link prediction ensemble for social network. Expert Syst Appl 42(1):21–50CrossRef He YL, Liu JNK, Hu YH, Wang XZ (2015) OWA operator based link prediction ensemble for social network. Expert Syst Appl 42(1):21–50CrossRef
8.
go back to reference Perkins S, Lacker K, Theiler J (2003) Grafting: fast, incremental feature selection by gradient descent in function space. J Mach Learn Res 3:1333–1356MathSciNetMATH Perkins S, Lacker K, Theiler J (2003) Grafting: fast, incremental feature selection by gradient descent in function space. J Mach Learn Res 3:1333–1356MathSciNetMATH
9.
go back to reference Perkins S, Theiler J (2003) Online feature selection using grafting. In: International Conference on Machine Learning. ACM Press, pp 592–599 Perkins S, Theiler J (2003) Online feature selection using grafting. In: International Conference on Machine Learning. ACM Press, pp 592–599
10.
go back to reference Pudil P, Novoviov J, Kittler J (1994) Floating search methods in feature selection. Pattern Recogn Lett 15(11):1119–1125CrossRef Pudil P, Novoviov J, Kittler J (1994) Floating search methods in feature selection. Pattern Recogn Lett 15(11):1119–1125CrossRef
11.
go back to reference Wang F, Liang J, Qian Y (2013) Attribute reduction: a dimension incremental strategy. Knowl Based Sys 39:95–108CrossRef Wang F, Liang J, Qian Y (2013) Attribute reduction: a dimension incremental strategy. Knowl Based Sys 39:95–108CrossRef
12.
go back to reference Hedar AR, Wang J, Fukushima M (2008) Tabu search for attribute reduction in rough set theory. Soft Comput 12(9):909–918CrossRefMATH Hedar AR, Wang J, Fukushima M (2008) Tabu search for attribute reduction in rough set theory. Soft Comput 12(9):909–918CrossRefMATH
13.
go back to reference Li HR, Zhang WX (2005) Applying indiscernibility attribute sets to knowledge reduction. In: AI 2005: advances in artificial intelligence, vol 3809. Springer, Berlin, Heidelberg, pp 816–821. doi:10.1007/11589990_87 Li HR, Zhang WX (2005) Applying indiscernibility attribute sets to knowledge reduction. In: AI 2005: advances in artificial intelligence, vol 3809. Springer, Berlin, Heidelberg, pp 816–821. doi:10.​1007/​11589990_​87
14.
go back to reference Li K, Liu YS (2002) Rough set based attribute reduction approach in data mining. In: Proceedings of International Conference on Machine Learning and Cybernetics, vol. 1, pp 60–63 Li K, Liu YS (2002) Rough set based attribute reduction approach in data mining. In: Proceedings of International Conference on Machine Learning and Cybernetics, vol. 1, pp 60–63
15.
go back to reference Parthalain N, Shen Q, Jensen R (2010) A distance measure approach to exploring the rough set boundary region for attribute reduction. IEEE Trans Knowl Data Eng 22(3):305–317CrossRef Parthalain N, Shen Q, Jensen R (2010) A distance measure approach to exploring the rough set boundary region for attribute reduction. IEEE Trans Knowl Data Eng 22(3):305–317CrossRef
17.
go back to reference Weihua X, Yuan L, Xiuwu L (2012) Approaches to attribute reductions based on rough set and matrix computation in inconsistent ordered information systems. Knowl Based Syst 27:78–91CrossRef Weihua X, Yuan L, Xiuwu L (2012) Approaches to attribute reductions based on rough set and matrix computation in inconsistent ordered information systems. Knowl Based Syst 27:78–91CrossRef
18.
19.
go back to reference Wang XZ, Ashfag RAR, Fu AM (2015) Fuzziness based sample categorization for classifier performance improvement. J Intell Fuzzy Sys 29(3):1185–1196MathSciNetCrossRef Wang XZ, Ashfag RAR, Fu AM (2015) Fuzziness based sample categorization for classifier performance improvement. J Intell Fuzzy Sys 29(3):1185–1196MathSciNetCrossRef
20.
go back to reference He YL, Wang XZ, Huang JZX (2016) Fuzzy nonlinear regression analysis using a random weight network. Inf Sci 364–365:222–240CrossRef He YL, Wang XZ, Huang JZX (2016) Fuzzy nonlinear regression analysis using a random weight network. Inf Sci 364–365:222–240CrossRef
22.
go back to reference Wentao L, Weihua X (2015) Double-quantitative decision-theoretic rough set. Inf Sci 316:54–67CrossRef Wentao L, Weihua X (2015) Double-quantitative decision-theoretic rough set. Inf Sci 316:54–67CrossRef
24.
go back to reference Swiniarski RW, Skowron A (2003) Rough set methods in feature selection and recognition. Pattern Recogn Lett 24(6):833–849CrossRefMATH Swiniarski RW, Skowron A (2003) Rough set methods in feature selection and recognition. Pattern Recogn Lett 24(6):833–849CrossRefMATH
25.
go back to reference Jensen R, Shen Q (2001) A rough set-aided system for sorting WWW bookmarks. In: Proceedings of the First Asia-Pacific Conference on Web Intelligence: Research and Development. WI’01. London, UK Jensen R, Shen Q (2001) A rough set-aided system for sorting WWW bookmarks. In: Proceedings of the First Asia-Pacific Conference on Web Intelligence: Research and Development. WI’01. London, UK
26.
go back to reference Jensen R, Shen Q (2004) Semantics-preserving dimensionality reduction: rough and fuzzy-rough based approaches. IEEE Trans Knowl Data Eng 16(16):1457–1471CrossRef Jensen R, Shen Q (2004) Semantics-preserving dimensionality reduction: rough and fuzzy-rough based approaches. IEEE Trans Knowl Data Eng 16(16):1457–1471CrossRef
28.
29.
go back to reference Dubois D, Prade H (1992) Putting rough sets and fuzzy sets together. In: Słowinski´ R (ed) Intelligent decision support. Theory and decision library, vol 11. Springer, Netherlands, pp 203–232CrossRef Dubois D, Prade H (1992) Putting rough sets and fuzzy sets together. In: Słowinski´ R (ed) Intelligent decision support. Theory and decision library, vol 11. Springer, Netherlands, pp 203–232CrossRef
30.
go back to reference Yong L, Wenliang H, Yunliang J, Zhiyong Z (2014) Quick attribute reduct algorithm for neighborhood rough set model. Inf Sci 271:65–81MathSciNetCrossRefMATH Yong L, Wenliang H, Yunliang J, Zhiyong Z (2014) Quick attribute reduct algorithm for neighborhood rough set model. Inf Sci 271:65–81MathSciNetCrossRefMATH
31.
go back to reference Kumar SU, Inbarani HH (2015) A novel neighborhood rough set based classification approach for medical diagnosis. Proc Comput Sci 47:351–359CrossRef Kumar SU, Inbarani HH (2015) A novel neighborhood rough set based classification approach for medical diagnosis. Proc Comput Sci 47:351–359CrossRef
32.
36.
go back to reference Quinlan JR (1993) C4.5: programs for machine learning. Morgan Kaufmann Publishers Inc., San Francisco Quinlan JR (1993) C4.5: programs for machine learning. Morgan Kaufmann Publishers Inc., San Francisco
37.
go back to reference Chang CC, Lin CJ (2011) Libsvm: a library for support vector machines. ACM Trans Intell Sys Technol 2(3):1–27CrossRef Chang CC, Lin CJ (2011) Libsvm: a library for support vector machines. ACM Trans Intell Sys Technol 2(3):1–27CrossRef
38.
go back to reference Qian Y, Liang J (2008) Combination entropy and combination granulation in rough set theory. Int J Uncertain Fuzziness Knowl Based Sys 16(2):179–193MathSciNetCrossRefMATH Qian Y, Liang J (2008) Combination entropy and combination granulation in rough set theory. Int J Uncertain Fuzziness Knowl Based Sys 16(2):179–193MathSciNetCrossRefMATH
Metadata
Title
Streamwise feature selection: a rough set method
Authors
Mohammad Masoud Javidi
Sadegh Eskandari
Publication date
19-09-2016
Publisher
Springer Berlin Heidelberg
Published in
International Journal of Machine Learning and Cybernetics / Issue 4/2018
Print ISSN: 1868-8071
Electronic ISSN: 1868-808X
DOI
https://doi.org/10.1007/s13042-016-0595-y

Other articles of this Issue 4/2018

International Journal of Machine Learning and Cybernetics 4/2018 Go to the issue