Skip to main content
Top

2018 | OriginalPaper | Chapter

Fast Supervised Selection of Prototypes for Metric-Based Learning

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

A crucial factor for successful learning is the finding of more convenient representations for a problem, such that subsequent processing can be delivered to linear or non-linear modeling methods. Similarity functions are a flexible way to express knowledge about a problem and to capture meaningful relations of data in input space. In this paper we use similarity functions to find an alternative data representation which is then reduced by selecting a subset of relevant prototypes, in a supervised way. The idea is tested in a set of modelling problems, characterized by a mixture of data types and different amounts of missing values. The results demonstrate competitive or better performance than traditional methods in terms of prediction error and sparsity of the representation.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Footnotes
1
For example, by the presence of missing values, by the feature semantics, etc.
 
2
Such variables are increasingly common, especially when they refer to a time periodicity, such as the month in a year.
 
3
It is not difficult to check that this is equivalent to the replacement of the missing similarities by the average of the non-missing ones. Therefore, the conjecture is that the missing values, if known, would not change the overall similarity significantly.
 
4
This property is not used in this work but it is interesting in other contexts, such as optimization.
 
5
The experiments were run on a HP laptop with 2GB of RAM and an Intel(R) Core(TM)2 Duo CPU T7500 at 2.20GHz.
 
6
See the caption of Table 1 for a description.
 
Literature
1.
go back to reference Osborne, H., Bridge, D. Models of similarity for case-based reasoning. In: Interdisciplinary Workshop on Similarity and Categorisation, pp. 173–179 (1997) Osborne, H., Bridge, D. Models of similarity for case-based reasoning. In: Interdisciplinary Workshop on Similarity and Categorisation, pp. 173–179 (1997)
2.
go back to reference Tibshirani, R.: Regression Shrinkage and Selection via the lasso. J. R. Stat. Soc. Ser. B. Wiley 58(1), 26788 (1996)MathSciNetMATH Tibshirani, R.: Regression Shrinkage and Selection via the lasso. J. R. Stat. Soc. Ser. B. Wiley 58(1), 26788 (1996)MathSciNetMATH
3.
go back to reference Baeza-Yates, R., Ribeiro, B.: Modern Information Retrieval. ACM Press, New York (1999) Baeza-Yates, R., Ribeiro, B.: Modern Information Retrieval. ACM Press, New York (1999)
4.
go back to reference Shawe-Taylor, J., Cristianini, N.: Kernel Methods for Pattern Analysis. Camb. Univ Press, Cambridge (2004)CrossRef Shawe-Taylor, J., Cristianini, N.: Kernel Methods for Pattern Analysis. Camb. Univ Press, Cambridge (2004)CrossRef
6.
go back to reference Pekalska, E.: The Dissimilarity representations in pattern recognition. Concepts, theory and applications. (Ph.D. Thesis) Delft University of Technology (2005) Pekalska, E.: The Dissimilarity representations in pattern recognition. Concepts, theory and applications. (Ph.D. Thesis) Delft University of Technology (2005)
8.
9.
go back to reference Gower, J.C.: A general coefficient of similarity and some of its properties. Biometrika 27(4), 857–871 (1971)CrossRef Gower, J.C.: A general coefficient of similarity and some of its properties. Biometrika 27(4), 857–871 (1971)CrossRef
10.
go back to reference Sokal, R.R., Michener, C.D.: Principles of Numerical Taxonomy. W.H. Freeman, San Francisco (1963) Sokal, R.R., Michener, C.D.: Principles of Numerical Taxonomy. W.H. Freeman, San Francisco (1963)
11.
go back to reference Dixon, J.K.: Pattern recognition with partly missing data. IEEE Trans. Syst. Man Cybernet. 9, 617–621 (1979)CrossRef Dixon, J.K.: Pattern recognition with partly missing data. IEEE Trans. Syst. Man Cybernet. 9, 617–621 (1979)CrossRef
12.
go back to reference Gower, J.C., Legendre, P.: Metric and Euclidean Properties of Dissimilarity Coefficients. J. Classification 3, 5–48 (1986)MathSciNetCrossRef Gower, J.C., Legendre, P.: Metric and Euclidean Properties of Dissimilarity Coefficients. J. Classification 3, 5–48 (1986)MathSciNetCrossRef
13.
go back to reference Pavoine, S., Vallet, J., Dufour, A.B., Gachet, S., Daniel, H.: On the challenge of treating various types of variables: application for improving the measurement of functional diversity. Oikos 118(3), 391–402 (2009)CrossRef Pavoine, S., Vallet, J., Dufour, A.B., Gachet, S., Daniel, H.: On the challenge of treating various types of variables: application for improving the measurement of functional diversity. Oikos 118(3), 391–402 (2009)CrossRef
14.
go back to reference Caputo, B., Sim, K., Furesjo, F., Smola, A.: Appearance-based object recognition using SVMs: which kernel should I use? In: NIPS Workshop on Statistical methods for Computational Experiments in Visual Processing and Computer Vision (2002) Caputo, B., Sim, K., Furesjo, F., Smola, A.: Appearance-based object recognition using SVMs: which kernel should I use? In: NIPS Workshop on Statistical methods for Computational Experiments in Visual Processing and Computer Vision (2002)
15.
go back to reference van Buuren, S., Groothuis-Oudshoorn, K.: mice: multivariate imputation by chained equations in R. J. Stat. Softw. 45(3), 1–67 (2011)CrossRef van Buuren, S., Groothuis-Oudshoorn, K.: mice: multivariate imputation by chained equations in R. J. Stat. Softw. 45(3), 1–67 (2011)CrossRef
16.
go back to reference Ripley, B.: Pattern Recognition and Neural Networks. Camb. Univ Press, Cambridge (1996)CrossRef Ripley, B.: Pattern Recognition and Neural Networks. Camb. Univ Press, Cambridge (1996)CrossRef
17.
go back to reference Ravindra Babu, T., Narasimha Murty, M.: Comparison of genetic algorithm based prototype selection schemes. Pattern Recognit. 34, 523–525 (2001)CrossRef Ravindra Babu, T., Narasimha Murty, M.: Comparison of genetic algorithm based prototype selection schemes. Pattern Recognit. 34, 523–525 (2001)CrossRef
18.
go back to reference Belanche, L.l., Hernández, J.: Similarity networks for heterogeneous data. In: Proceedings of the ESANN: European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning (2012) Belanche, L.l., Hernández, J.: Similarity networks for heterogeneous data. In: Proceedings of the ESANN: European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning (2012)
20.
go back to reference Kuncheva, L., Bezdek, J.: Nearest prototype classification: clustering, genetic algorithms, or random search? IEEE Trans. Syst. Man Cybern. Part C 28(1), 160–164 (1998)CrossRef Kuncheva, L., Bezdek, J.: Nearest prototype classification: clustering, genetic algorithms, or random search? IEEE Trans. Syst. Man Cybern. Part C 28(1), 160–164 (1998)CrossRef
21.
go back to reference Lipowezky, U.: Selection of the optimal prototype subset for 1-NN classification. Pattern Recognit. Lett. 19, 907–918 (1998)CrossRef Lipowezky, U.: Selection of the optimal prototype subset for 1-NN classification. Pattern Recognit. Lett. 19, 907–918 (1998)CrossRef
Metadata
Title
Fast Supervised Selection of Prototypes for Metric-Based Learning
Author
Lluís A. Belanche
Copyright Year
2018
DOI
https://doi.org/10.1007/978-3-030-01421-6_55

Premium Partner