Skip to main content
Top
Published in: International Journal of Machine Learning and Cybernetics 3/2013

01-06-2013 | Original Article

Minimizing data consumption with sequential online feature selection

Authors: Thomas Rückstieß, Christian Osendorfer, Patrick van der Smagt

Published in: International Journal of Machine Learning and Cybernetics | Issue 3/2013

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

In most real-world information processing problems, data is not a free resource. Its acquisition is often expensive and time-consuming. We investigate how such cost factors can be included in supervised classification tasks by deriving classification as a sequential decision process and making it accessible to reinforcement learning. Depending on previously selected features and the internal belief of the classifier, a next feature is chosen by a sequential online feature selection that learns which features are most informative at each time step. Experiments on toy datasets and a handwritten digits classification task show significant reduction in required data for correct classification, while a medical diabetes prediction task illustrates variable feature cost minimization as a further property of our algorithm.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Show more products
Footnotes
2
A partially observable MDP is a MDP with limited access to its states, i.e., the agent does not receive the full state information but only an incomplete observation based on the current state.
 
3
These costs represent a rough estimate of the time in minutes it takes to acquire the feature on a real patient. The estimates are based on oral communication with a local GP.
 
4
with the exception of the 5 rfa experiment, which only has 8 features in total. All of them carry information and an optimal static FS method would have to choose all 8.
 
Literature
1.
go back to reference Bazzani L, Freitas N, Larochelle H, Murino V, Ting JA (2011) Learning attentional policies for tracking and recognition in video with deep networks. In: Getoor L, Scheffer T (eds.) Proceedings of the 28th international conference on machine learning (ICML-11). ICML ’11, pp 937–944 Bazzani L, Freitas N, Larochelle H, Murino V, Ting JA (2011) Learning attentional policies for tracking and recognition in video with deep networks. In: Getoor L, Scheffer T (eds.) Proceedings of the 28th international conference on machine learning (ICML-11). ICML ’11, pp 937–944
2.
go back to reference Deisenroth M, Rasmussen C, Peters J (2009) Gaussian process dynamic programming. Neurocomputing 72(7–9):1508–1524CrossRef Deisenroth M, Rasmussen C, Peters J (2009) Gaussian process dynamic programming. Neurocomputing 72(7–9):1508–1524CrossRef
3.
go back to reference Dulac-Arnold G, Denoyer L, Preux P, Gallinari P (2011) Datum-wise classification: a sequential approach to sparsity. In: Proceedings of the European conference of machine learning (ECML 2011). Springer, pp 375–390 Dulac-Arnold G, Denoyer L, Preux P, Gallinari P (2011) Datum-wise classification: a sequential approach to sparsity. In: Proceedings of the European conference of machine learning (ECML 2011). Springer, pp 375–390
4.
go back to reference Ernst D, Geurts P, Wehenkel L (2005) Tree-based batch mode reinforcement learning. J Mach Learn Res 6(1):503MathSciNetMATH Ernst D, Geurts P, Wehenkel L (2005) Tree-based batch mode reinforcement learning. J Mach Learn Res 6(1):503MathSciNetMATH
7.
go back to reference Hochreiter S, Schmidhuber J (1997) Long short-term memory. Neural Comput 9(8):1735–1780CrossRef Hochreiter S, Schmidhuber J (1997) Long short-term memory. Neural Comput 9(8):1735–1780CrossRef
8.
go back to reference Hüsken M, Stagge P (2003) Recurrent neural networks for time series classification. Neurocomputing 50:223–235CrossRefMATH Hüsken M, Stagge P (2003) Recurrent neural networks for time series classification. Neurocomputing 50:223–235CrossRefMATH
9.
go back to reference LeCun Y, Bottou L, Bengio Y, Haffner P (1998) Gradient-based learning applied to document recognition. Proc IEEE 86(11):2278–2324CrossRef LeCun Y, Bottou L, Bengio Y, Haffner P (1998) Gradient-based learning applied to document recognition. Proc IEEE 86(11):2278–2324CrossRef
10.
go back to reference Lin L (1992) Self-improving reactive agents based on reinforcement learning, planning and teaching. Mach Learn 8(3):293–321 Lin L (1992) Self-improving reactive agents based on reinforcement learning, planning and teaching. Mach Learn 8(3):293–321
11.
go back to reference Liu F, Su J (2004) Reinforcement learning-based feature learning for object tracking. In: Proceedings of the 17th international conference on pattern recognition, vol 2. IEEE, pp 748–751 Liu F, Su J (2004) Reinforcement learning-based feature learning for object tracking. In: Proceedings of the 17th international conference on pattern recognition, vol 2. IEEE, pp 748–751
12.
go back to reference Liu H, Motoda H (2008) Computational methods of feature selection. Chapman & Hall, London Liu H, Motoda H (2008) Computational methods of feature selection. Chapman & Hall, London
13.
go back to reference Monahan G (1982) A survey of partially observable Markov decision processes: theory, models, and algorithms. Manag Sci 1–16 Monahan G (1982) A survey of partially observable Markov decision processes: theory, models, and algorithms. Manag Sci 1–16
14.
go back to reference Neumann G, Peters J (2009) Fitted q-iteration by advantage weighted regression. Adv Neural Inf Process Syst 21:1177–1184 Neumann G, Peters J (2009) Fitted q-iteration by advantage weighted regression. Adv Neural Inf Process Syst 21:1177–1184
15.
go back to reference Neumann G, Pfeiffer M, Hauser H (2006) Batch reinforcement learning methods for point to point movements. Tech rep Graz University of Technology Neumann G, Pfeiffer M, Hauser H (2006) Batch reinforcement learning methods for point to point movements. Tech rep Graz University of Technology
16.
go back to reference Norouzi E, Nili Ahmadabadi M, Nadjar Araabi B (2010) Attention control with reinforcement learning for face recognition under partial occlusion. Mach Vis Appl 1–12 Norouzi E, Nili Ahmadabadi M, Nadjar Araabi B (2010) Attention control with reinforcement learning for face recognition under partial occlusion. Mach Vis Appl 1–12
17.
go back to reference Paletta L, Fritz G, Seifert C (2005) Q-learning of sequential attention for visual object recognition from informative local descriptors. In: Proceedings of the 22nd international conference on machine learning, vol 22, p 649 Paletta L, Fritz G, Seifert C (2005) Q-learning of sequential attention for visual object recognition from informative local descriptors. In: Proceedings of the 22nd international conference on machine learning, vol 22, p 649
18.
go back to reference Perkins S, Theiler J (2003) Online feature selection using grafting. In: Proceedings of the 20th international conference on machine learning (ICML), pp 592–599 Perkins S, Theiler J (2003) Online feature selection using grafting. In: Proceedings of the 20th international conference on machine learning (ICML), pp 592–599
19.
go back to reference Riedmiller M (2005) NNeural fitted Q iteration—first experiences with a data efficient neural reinforcement learning method. In: Lecture notes in computer science, vol 3720 Riedmiller M (2005) NNeural fitted Q iteration—first experiences with a data efficient neural reinforcement learning method. In: Lecture notes in computer science, vol 3720
20.
go back to reference Saar-Tsechansky M, Provost F (2007) Handling missing values when applying classification models. J Mach Learn Res 8(1625–1657):9 Saar-Tsechansky M, Provost F (2007) Handling missing values when applying classification models. J Mach Learn Res 8(1625–1657):9
21.
go back to reference Schmidhuber J, Huber R (1991) Learning to generate artificial fovea trajectories for target detection. Int J Neural Syst 2(1):135–141CrossRef Schmidhuber J, Huber R (1991) Learning to generate artificial fovea trajectories for target detection. Int J Neural Syst 2(1):135–141CrossRef
22.
go back to reference Timmer S, Riedmiller M (2007) Fitted q iteration with cmacs. In: IEEE international symposium on approximate dynamic programming and reinforcement learning, 2007. ADPRL 2007, IEEE pp 1–8 Timmer S, Riedmiller M (2007) Fitted q iteration with cmacs. In: IEEE international symposium on approximate dynamic programming and reinforcement learning, 2007. ADPRL 2007, IEEE pp 1–8
23.
go back to reference Vijayakumar S, Schaal S (2000) Locally weighted projection regression: An o (n) algorithm for incremental real time learning in high dimensional space. In: Proceedings of the seventeenth international conference on machine learning (ICML 2000), Citeseer 1:288–293 Vijayakumar S, Schaal S (2000) Locally weighted projection regression: An o (n) algorithm for incremental real time learning in high dimensional space. In: Proceedings of the seventeenth international conference on machine learning (ICML 2000), Citeseer 1:288–293
24.
go back to reference Williams R, Peng J (1990) An efficient gradient-based algorithm for on-line training of recurrent network trajectories. Neural Comput 2(4):490–501CrossRef Williams R, Peng J (1990) An efficient gradient-based algorithm for on-line training of recurrent network trajectories. Neural Comput 2(4):490–501CrossRef
25.
go back to reference Wu X, Yu K, Wang H, Ding W (2010) Online streaming feature selection. In: Fürnkranz J, Joachims T (eds.) Proceedings of the 27th international conference on machine learning (ICML-10), pp 1159–1166 Wu X, Yu K, Wang H, Ding W (2010) Online streaming feature selection. In: Fürnkranz J, Joachims T (eds.) Proceedings of the 27th international conference on machine learning (ICML-10), pp 1159–1166
Metadata
Title
Minimizing data consumption with sequential online feature selection
Authors
Thomas Rückstieß
Christian Osendorfer
Patrick van der Smagt
Publication date
01-06-2013
Publisher
Springer-Verlag
Published in
International Journal of Machine Learning and Cybernetics / Issue 3/2013
Print ISSN: 1868-8071
Electronic ISSN: 1868-808X
DOI
https://doi.org/10.1007/s13042-012-0092-x

Other articles of this Issue 3/2013

International Journal of Machine Learning and Cybernetics 3/2013 Go to the issue