Skip to main content
Erschienen in: Pattern Analysis and Applications 3-4/2008

01.09.2008 | Theoretical Advances

Distance-based discriminant analysis method and its applications

verfasst von: Serhiy Kosinov, Thierry Pun

Erschienen in: Pattern Analysis and Applications | Ausgabe 3-4/2008

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

This paper proposes a method of finding a discriminative linear transformation that enhances the data’s degree of conformance to the compactness hypothesis and its inverse. The problem formulation relies on inter-observation distances only, which is shown to improve non-parametric and non-linear classifier performance on benchmark and real-world data sets. The proposed approach is suitable for both binary and multiple-category classification problems, and can be applied as a dimensionality reduction technique. In the latter case, the number of necessary discriminative dimensions can be determined exactly. Also considered is a kernel-based extension of the proposed discriminant analysis method which overcomes the linearity assumption of the sought discriminative transformation imposed by the initial formulation. This enhancement allows the proposed method to be applied to non-linear classification problems and has an additional benefit of being able to accommodate indefinite kernels.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Anhänge
Nur mit Berechtigung zugänglich
Fußnoten
1
Here and in several other places we will use shorthand \(\prod_{i < j}^{N_X}\) to designate double product \(\prod_{i=1}^{N_X} \prod_{j=i+1}^{N_X}.\)
 
2
For instance, in contrast to the ACM technique, the DDA formulation applies naturally to the cases where there is no strict class separability, whereas the ACM method fails because the version space becomes an empty set.
 
3
The similar notation will be used further on, where a dash over a variable name will signify that the variable either depends on or is itself a supporting point.
 
4
A more detailed analysis may demonstrate that resorting to the Taylor series approximation might break conformance to the majorization requirements in the strict sense. However, the empirical evidence proved otherwise (see section 7), confirming the technique as an alternative of preference.
 
5
The elements g ij of matrix G not affected by the first two rules of (23) are assumed to have been initially set to zero.
 
6
A word of caution is in order as for the choice of k = 1, which corresponds to an ill-posed combinatorial problem [6].
 
Literatur
1.
Zurück zum Zitat Arkadev A, Braverman E (1966) Computers and pattern recognition. Thompson, Washington, DC Arkadev A, Braverman E (1966) Computers and pattern recognition. Thompson, Washington, DC
2.
Zurück zum Zitat Bahlmann C, Haasdonk B, Burkhardt H (2002) On-line handwriting recognition with support vector machines—a kernel approach. In: Eighth International Workshop on Frontiers in Handwriting Recognition. Ontario, Canada Bahlmann C, Haasdonk B, Burkhardt H (2002) On-line handwriting recognition with support vector machines—a kernel approach. In: Eighth International Workshop on Frontiers in Handwriting Recognition. Ontario, Canada
3.
Zurück zum Zitat Bartlett P (1997) For valid generalization, the size of the weights is more important than the size of the network. Adv Neural Inform Process Syst 9:134–140 Bartlett P (1997) For valid generalization, the size of the weights is more important than the size of the network. Adv Neural Inform Process Syst 9:134–140
4.
Zurück zum Zitat Bertero M, Boccacci P (1998) Introduction to inverse problems in imaging. Institute of Physics Publishing Bertero M, Boccacci P (1998) Introduction to inverse problems in imaging. Institute of Physics Publishing
5.
Zurück zum Zitat Blake CL, Merz CJ (1998) UCI repository of machine learning databases Blake CL, Merz CJ (1998) UCI repository of machine learning databases
6.
Zurück zum Zitat Borg I, Groenen PJF (1997) Modern multidimensional scaling. Springer, New YorkMATH Borg I, Groenen PJF (1997) Modern multidimensional scaling. Springer, New YorkMATH
7.
Zurück zum Zitat Marco Bressan, Vitria J (2003) Nonparametric discriminant analysis and nearest neighbor classification. Pattern Recogn Lett 24(15):2743–2749CrossRef Marco Bressan, Vitria J (2003) Nonparametric discriminant analysis and nearest neighbor classification. Pattern Recogn Lett 24(15):2743–2749CrossRef
8.
Zurück zum Zitat Chow GC (1960) Tests of equality between sets of coefficients in two linear regressions. Econometrica 28(3) Chow GC (1960) Tests of equality between sets of coefficients in two linear regressions. Econometrica 28(3)
9.
Zurück zum Zitat Commandeur J, Groenen PJF, Meulman J (1999) A distance-based variety of non-linear multivariate data analysis, including weights for objects and variables. Psychometrika 64(2):169–186CrossRefMathSciNet Commandeur J, Groenen PJF, Meulman J (1999) A distance-based variety of non-linear multivariate data analysis, including weights for objects and variables. Psychometrika 64(2):169–186CrossRefMathSciNet
10.
Zurück zum Zitat Cristianini N, Shawe-Taylor J (2000) An introduction to support vector machines and other kernel-based learning methods. Cambridge University Press, Cambridge Cristianini N, Shawe-Taylor J (2000) An introduction to support vector machines and other kernel-based learning methods. Cambridge University Press, Cambridge
11.
Zurück zum Zitat Dennis DeCoste, Bernhard Schölkopf (2002) Training invariant support vector machines. Mach Learn 46(1–3):161–190MATH Dennis DeCoste, Bernhard Schölkopf (2002) Training invariant support vector machines. Mach Learn 46(1–3):161–190MATH
12.
Zurück zum Zitat Dietterich TG (2000) Ensemble methods in machine learning. In: Kittler J, Roli F (eds) First international workshop on multiple classifier systems. Springer, Heidelberg, pp 1–15 Dietterich TG (2000) Ensemble methods in machine learning. In: Kittler J, Roli F (eds) First international workshop on multiple classifier systems. Springer, Heidelberg, pp 1–15
13.
Zurück zum Zitat Dinkelbach W (1967) On nonlinear fractional programming. Manage Sci A(13):492–498MathSciNet Dinkelbach W (1967) On nonlinear fractional programming. Manage Sci A(13):492–498MathSciNet
14.
Zurück zum Zitat Duda RO, Hart PE (1973) Pattern classification and scene analysis. Wiley, New YorkMATH Duda RO, Hart PE (1973) Pattern classification and scene analysis. Wiley, New YorkMATH
15.
Zurück zum Zitat Dunn D, Higgins WE, Wakeley J (1994) Texture segmentation using 2-d gabor elementary functions. IEEE Trans Pattern Anal Mach Intell 16(2):130–149CrossRef Dunn D, Higgins WE, Wakeley J (1994) Texture segmentation using 2-d gabor elementary functions. IEEE Trans Pattern Anal Mach Intell 16(2):130–149CrossRef
16.
Zurück zum Zitat Fisher RA (1936) The use of multiple measures in taxonomic problems. Ann Eugenics 7:179–188 Fisher RA (1936) The use of multiple measures in taxonomic problems. Ann Eugenics 7:179–188
17.
Zurück zum Zitat Fix E, Hodges J (1951) Discriminatory analysis: nonparametric discrimination: consistency properties. Technical Report 4, USAF School of Aviation Medicine Fix E, Hodges J (1951) Discriminatory analysis: nonparametric discrimination: consistency properties. Technical Report 4, USAF School of Aviation Medicine
18.
Zurück zum Zitat Fletcher R (1987) Practical methods of optimization. Wiley, ChichesterMATH Fletcher R (1987) Practical methods of optimization. Wiley, ChichesterMATH
19.
Zurück zum Zitat Fogel I, Sagi D (1989) Gabor filters as texture discriminator. Cybernetics 61:103–113 Fogel I, Sagi D (1989) Gabor filters as texture discriminator. Cybernetics 61:103–113
20.
Zurück zum Zitat Fukunaga K (1990) Introduction to statistical pattern recognition, 2nd edn. Academic, New YorkMATH Fukunaga K (1990) Introduction to statistical pattern recognition, 2nd edn. Academic, New YorkMATH
21.
Zurück zum Zitat Fukunaga K, Mantock J (1983) Nonparametric discriminant analysis. IEEE Trans Pattern Anal Mach Intell 5(6):671–678MATHCrossRef Fukunaga K, Mantock J (1983) Nonparametric discriminant analysis. IEEE Trans Pattern Anal Mach Intell 5(6):671–678MATHCrossRef
22.
Zurück zum Zitat Gentle J (1998) Numerical linear algebra for applications in statistics. Springer, BerlinMATH Gentle J (1998) Numerical linear algebra for applications in statistics. Springer, BerlinMATH
23.
Zurück zum Zitat Haasdonk B (2005) Feature space interpretation of SVMs with indefinite kernels. IEEE Trans Pattern Anal Mach Intell 27(4):482–492CrossRef Haasdonk B (2005) Feature space interpretation of SVMs with indefinite kernels. IEEE Trans Pattern Anal Mach Intell 27(4):482–492CrossRef
24.
Zurück zum Zitat Haasdonk B, Keysers D (2002) Tangent distance kernels for support vector machines. In: Proceedings of the 16th ICPR, pp 864–868 Haasdonk B, Keysers D (2002) Tangent distance kernels for support vector machines. In: Proceedings of the 16th ICPR, pp 864–868
25.
Zurück zum Zitat Haasdonk B, Bahlmann C (2004) Learning with distance substitution kernels. In: 26th Pattern Recognition Symposium of the German Association for Pattern Recognition (DAGM 2004). Springer, Tübingen, Germany Haasdonk B, Bahlmann C (2004) Learning with distance substitution kernels. In: 26th Pattern Recognition Symposium of the German Association for Pattern Recognition (DAGM 2004). Springer, Tübingen, Germany
26.
Zurück zum Zitat Ju Han, Kai-Kuang Ma (2007) Rotation-invariant and scale-invariant gabor features for texture image retrieval. Image Vis Comput 25(9):1474–1481CrossRef Ju Han, Kai-Kuang Ma (2007) Rotation-invariant and scale-invariant gabor features for texture image retrieval. Image Vis Comput 25(9):1474–1481CrossRef
27.
Zurück zum Zitat Trevor Hastie, Robert Tibshirani (1996) Discriminant adaptive nearest neighbor classification. IEEE Trans Pattern Anal Mach Intell 18(6):607–616CrossRef Trevor Hastie, Robert Tibshirani (1996) Discriminant adaptive nearest neighbor classification. IEEE Trans Pattern Anal Mach Intell 18(6):607–616CrossRef
28.
Zurück zum Zitat Heiser W (1995) Convergent computation by iterative majorization: theory and applications in multidimensional data analysis. Recent advances in descriptive multivariate analysis, pp. 157–189 Heiser W (1995) Convergent computation by iterative majorization: theory and applications in multidimensional data analysis. Recent advances in descriptive multivariate analysis, pp. 157–189
31.
Zurück zum Zitat Krogh A, Hertz JA (1992) A simple weight decay can improve generalization. In: Moody JE, Hanson SJ, Lippmann RP (eds) Advances in neural information processing systems, Vol 4. Morgan Kaufmann, San Francisco, pp 950–957 Krogh A, Hertz JA (1992) A simple weight decay can improve generalization. In: Moody JE, Hanson SJ, Lippmann RP (eds) Advances in neural information processing systems, Vol 4. Morgan Kaufmann, San Francisco, pp 950–957
32.
Zurück zum Zitat Lawrence S, Giles C (2000) Overfitting and neural networks: conjugate gradient and backpropagation. In: Proceedings of the IEEE international conference on neural networks. IEEE Press, pp 114–119 Lawrence S, Giles C (2000) Overfitting and neural networks: conjugate gradient and backpropagation. In: Proceedings of the IEEE international conference on neural networks. IEEE Press, pp 114–119
33.
Zurück zum Zitat De Leeuw J (1993) Fitting distances by least squares. Technical Report 130, Interdiviional Program in Statistics. UCLA, Los Angeles De Leeuw J (1993) Fitting distances by least squares. Technical Report 130, Interdiviional Program in Statistics. UCLA, Los Angeles
34.
Zurück zum Zitat Leibe B, Schiele B (2003) Analyzing appearance and contour based methods for object categorization. In: International conference on computer vision and pattern recognition (CVPR’03). Madison, WI, pp 409–415 Leibe B, Schiele B (2003) Analyzing appearance and contour based methods for object categorization. In: International conference on computer vision and pattern recognition (CVPR’03). Madison, WI, pp 409–415
35.
Zurück zum Zitat Li Y, Shapiro LG (2004) Object recognition for content-based image retrieval. In: Lecture Notes in Computer Science. Springer, Heidelberg Li Y, Shapiro LG (2004) Object recognition for content-based image retrieval. In: Lecture Notes in Computer Science. Springer, Heidelberg
36.
Zurück zum Zitat Hsuan-Tien Lin, Chih-Jen Lin (2003) A study on sigmoid kernels for SVM and the training of non-PSD kernels by SMO-type methods. Technical report, Department of Computer Science and Information Engineering, National Taiwan University, Taipei, Taiwan. Available at http://www.csie.ntu.edu.tw/∼cjlin/papers/tanh.pdf Hsuan-Tien Lin, Chih-Jen Lin (2003) A study on sigmoid kernels for SVM and the training of non-PSD kernels by SMO-type methods. Technical report, Department of Computer Science and Information Engineering, National Taiwan University, Taipei, Taiwan. Available at http://www.csie.ntu.edu.tw/∼cjlin/papers/tanh.pdf
37.
Zurück zum Zitat Mary X (2003) Sous-espaces hilbertiens, sous-dualités et applications. PhD thesis, Institut national des sciences appliquees de rouen - Insa rouen, ASI-PSI Mary X (2003) Sous-espaces hilbertiens, sous-dualités et applications. PhD thesis, Institut national des sciences appliquees de rouen - Insa rouen, ASI-PSI
38.
Zurück zum Zitat David Masip, Ludmila I Kuncheva, Jordi Vitrià (2005) An ensemble-based method for linear feature extraction for two-class problems. Pattern Anal Appl 8(3):227–237CrossRefMathSciNet David Masip, Ludmila I Kuncheva, Jordi Vitrià (2005) An ensemble-based method for linear feature extraction for two-class problems. Pattern Anal Appl 8(3):227–237CrossRefMathSciNet
39.
Zurück zum Zitat Tom Mitchell (1997) Machine learning. McGraw-Hill, New York Tom Mitchell (1997) Machine learning. McGraw-Hill, New York
40.
Zurück zum Zitat Moré JJ, Sorensen DC (1983) Computing a trust region step. SIAM J Sci Stat Comput 4(3):553–572MATHCrossRef Moré JJ, Sorensen DC (1983) Computing a trust region step. SIAM J Sci Stat Comput 4(3):553–572MATHCrossRef
41.
Zurück zum Zitat Moreno PJ, Ho PP, Vasconcelos N (2004) A kullback-leibler divergence based kernel for svm classification in multimedia applications. In: Thrun S, Saul L, Schölkopf B (eds) Advances in neural information processing systems, Vol 16. MIT, Cambridge Moreno PJ, Ho PP, Vasconcelos N (2004) A kullback-leibler divergence based kernel for svm classification in multimedia applications. In: Thrun S, Saul L, Schölkopf B (eds) Advances in neural information processing systems, Vol 16. MIT, Cambridge
42.
Zurück zum Zitat Nesterov Y, Nemirovskii A (1994) Interior Point Polynomial Methods in Convex Programming: Theory and Applications. Society for Industrial and Applied Mathematics, Philadelphia Nesterov Y, Nemirovskii A (1994) Interior Point Polynomial Methods in Convex Programming: Theory and Applications. Society for Industrial and Applied Mathematics, Philadelphia
43.
Zurück zum Zitat Ong CS, Smola AJ, Williamson RC (2002) Hyperkernels. In: Neural information processing systems, Vol 15. MIT, Cambridge Ong CS, Smola AJ, Williamson RC (2002) Hyperkernels. In: Neural information processing systems, Vol 15. MIT, Cambridge
44.
Zurück zum Zitat Ong CS, Mary X, Canu S, Smola AJ (2004) Learning with non-positive kernels. In: ICML ’04: Proceedings of the twenty-first international conference on Machine learning. ACM Ong CS, Mary X, Canu S, Smola AJ (2004) Learning with non-positive kernels. In: ICML ’04: Proceedings of the twenty-first international conference on Machine learning. ACM
45.
Zurück zum Zitat R. Paredes, E. Vidal (2000) A class-dependent weighted dissimilarity measure for nearest neighbor classification problems. Pattern Recogn Lett 21(12):1027–1036MATHCrossRef R. Paredes, E. Vidal (2000) A class-dependent weighted dissimilarity measure for nearest neighbor classification problems. Pattern Recogn Lett 21(12):1027–1036MATHCrossRef
46.
Zurück zum Zitat Rojas M, Santos S, Sorensen D (2000) A new matrix-free algorithm for the large-scale trust-region subproblem. SIAM J Optim 11(3):611–646MATHCrossRefMathSciNet Rojas M, Santos S, Sorensen D (2000) A new matrix-free algorithm for the large-scale trust-region subproblem. SIAM J Optim 11(3):611–646MATHCrossRefMathSciNet
47.
Zurück zum Zitat Roweis ST, Saul LK (2000) Nonlinear dimensionality reduction by locally linear embedding. Science 290:2323–2326CrossRef Roweis ST, Saul LK (2000) Nonlinear dimensionality reduction by locally linear embedding. Science 290:2323–2326CrossRef
48.
Zurück zum Zitat Schölkopf B (2001) The kernel trick for distances. In: Leen TK, Dietterich TG, Tresp V (eds) Advances in neural information processing systems, Vol 13. MIT, Cambridge, pp 301–307 Schölkopf B (2001) The kernel trick for distances. In: Leen TK, Dietterich TG, Tresp V (eds) Advances in neural information processing systems, Vol 13. MIT, Cambridge, pp 301–307
49.
Zurück zum Zitat Shen L, Bai L (2006) A review on gabor wavelets for face recognition. Pattern Anal Appl 9(2–3):273–292MathSciNet Shen L, Bai L (2006) A review on gabor wavelets for face recognition. Pattern Anal Appl 9(2–3):273–292MathSciNet
50.
Zurück zum Zitat Smith JR, Chang S-F (1996) Tools and techniques for color image retrieval. In: Storage and Retrieval for Image and Video Databases (SPIF), pp 426–437 Smith JR, Chang S-F (1996) Tools and techniques for color image retrieval. In: Storage and Retrieval for Image and Video Databases (SPIF), pp 426–437
51.
Zurück zum Zitat Squire D. McG, Müller W, Müller H, Raki J (1999) Content-based query of image databases, inspirations from text retrieval: inverted files, frequency-based weights and relevance feedback. In: The 11th Scandinavian Conference on Image Analysis. Kangerlussuaq, Greenland, pp 143–149 Squire D. McG, Müller W, Müller H, Raki J (1999) Content-based query of image databases, inspirations from text retrieval: inverted files, frequency-based weights and relevance feedback. In: The 11th Scandinavian Conference on Image Analysis. Kangerlussuaq, Greenland, pp 143–149
52.
Zurück zum Zitat Tenenbaum JB, de Silva V, Langford JC (2000) A global geometric framework for nonlinear dimensionality reduction. Science 290:2319–2323CrossRef Tenenbaum JB, de Silva V, Langford JC (2000) A global geometric framework for nonlinear dimensionality reduction. Science 290:2319–2323CrossRef
53.
Zurück zum Zitat Theodoridis S, Koutroumbas K (1999) Pattern recognition. Academic, London Theodoridis S, Koutroumbas K (1999) Pattern recognition. Academic, London
54.
Zurück zum Zitat Torkkola K, Campbell W (2000) Mutual information in learning feature transformations. In: Proceedings 17th international conference on machine learning, pp 1015–1022 Torkkola K, Campbell W (2000) Mutual information in learning feature transformations. In: Proceedings 17th international conference on machine learning, pp 1015–1022
55.
Zurück zum Zitat Trafalis TB, Malyscheff AM (2002) An analytic center machine. Mach Learn 46(1–3):203–223MATHCrossRef Trafalis TB, Malyscheff AM (2002) An analytic center machine. Mach Learn 46(1–3):203–223MATHCrossRef
56.
Zurück zum Zitat van Deun K, Groenen PJF (2003) Majorization algorithms for inspecting circles, ellipses, squares, rectangles, and rhombi. Technical report, Econometric Institute Report EI 2003-35 van Deun K, Groenen PJF (2003) Majorization algorithms for inspecting circles, ellipses, squares, rectangles, and rhombi. Technical report, Econometric Institute Report EI 2003-35
57.
Zurück zum Zitat Vapnik VN (1995) The nature of statistical learning theory. Springer, New YorkMATH Vapnik VN (1995) The nature of statistical learning theory. Springer, New YorkMATH
58.
Zurück zum Zitat Vapnik VN (1998) Statistical learning theory. Wiley, New-YorkMATH Vapnik VN (1998) Statistical learning theory. Wiley, New-YorkMATH
59.
Zurück zum Zitat Watanabe H, Yamaguchi T, Katagiri S (1997) Discriminative metric design for robust pattern recognition. IEEE Trans Signal Process 45(11):2655–2661CrossRef Watanabe H, Yamaguchi T, Katagiri S (1997) Discriminative metric design for robust pattern recognition. IEEE Trans Signal Process 45(11):2655–2661CrossRef
60.
Zurück zum Zitat Webb A (1995) Multidimensional scaling by iterative majorization using radial basis functions. Pattern Recogn 28(5):753–759CrossRef Webb A (1995) Multidimensional scaling by iterative majorization using radial basis functions. Pattern Recogn 28(5):753–759CrossRef
61.
Zurück zum Zitat Weiss S, Kulikowski C (1991) Computer systems that learn. Morgan Kaufmann, San Francisco Weiss S, Kulikowski C (1991) Computer systems that learn. Morgan Kaufmann, San Francisco
62.
Zurück zum Zitat Zhou X, Huang T (2001) Comparing discriminating transformations and SVM for learning during multimedia retrieval. In: Proceedings of the 9th ACM international conference on multimedia. Ottawa, Canada, pp 137–146 Zhou X, Huang T (2001) Comparing discriminating transformations and SVM for learning during multimedia retrieval. In: Proceedings of the 9th ACM international conference on multimedia. Ottawa, Canada, pp 137–146
63.
Zurück zum Zitat Zhou X, Huang T (2001) Small sample learning during multimedia retrieval using BiasMap. In: IEEE computer vision and pattern recognition (CVPR’01), Hawaii Zhou X, Huang T (2001) Small sample learning during multimedia retrieval using BiasMap. In: IEEE computer vision and pattern recognition (CVPR’01), Hawaii
Metadaten
Titel
Distance-based discriminant analysis method and its applications
verfasst von
Serhiy Kosinov
Thierry Pun
Publikationsdatum
01.09.2008
Verlag
Springer-Verlag
Erschienen in
Pattern Analysis and Applications / Ausgabe 3-4/2008
Print ISSN: 1433-7541
Elektronische ISSN: 1433-755X
DOI
https://doi.org/10.1007/s10044-007-0082-x

Weitere Artikel der Ausgabe 3-4/2008

Pattern Analysis and Applications 3-4/2008 Zur Ausgabe