skip to main content
10.1145/1273496.1273514acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicmlConference Proceedingsconference-collections
Article

Local similarity discriminant analysis

Published:20 June 2007Publication History

ABSTRACT

We propose a local, generative model for similarity-based classification. The method is applicable to the case that only pairwise similarities between samples are available. The classifier models the local class-conditional distribution using a maximum entropy estimate and empirical moment constraints. The resulting exponential class conditional-distributions are combined with class prior probabilities and misclassification costs to form the local similarity discriminant analysis (local SDA) classifier. We compare the performance of local SDA to a non-local version, to the local nearest centroid classifier, the nearest centroid classifier, k-NN, and to the recently-developed potential support vector machine (PSVM). Results show that local SDA is competitive with k-NN and the computationally-demanding PSVM while offering the advantages of a generative classifier.

References

  1. Belongie, S., Malik, J., & Puzicha, J. (2002). Shape matching and object recognition using shape contexts. IEEE Trans. on Pattern Analysis and Machine Intelligence, 24, 509--522. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Bicego, M., Murino, V., Pelillo, M., & Torsello, A. (2006). Special issue on similarity-based classification. Pattern Recognition, 39. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Cost, S., & Salzberg, S. (1993). A weighted nearest neighbor algorithm for learning with symbolic features. Machine Learning, 10, 57--78. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Cover, T., & Thomas, J. (1991). Elements of information theory. New York: John Wiley and Sons. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Devroye, L., Gyorfi, L., & Lugosi, G. (1996). A probabilistic theory of pattern recognition. New York: Springer-Verlag Inc.Google ScholarGoogle Scholar
  6. Duin, R. P. W., Pekalska, E., & de Ridder, D. (1999). Relational discriminant analysis. Pattern Recognition Letters, 20, 1175--1181. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Friedlander, M. P., & Gupta, M. R. (2006). On minimizing distortion and relative entropy. IEEE Trans. on Information Theory, 52, 238--245. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Gati, I., & Tversky, A. (1984). Weighting common and distinctive features in perceptual and conceptual judgments. Cognitive Psychology, 341--370.Google ScholarGoogle Scholar
  9. Goldfarb, L. (1985). A new approach to pattern recognition. Progress in Pattern Recognition, 2, 241--402.Google ScholarGoogle Scholar
  10. Graepel, T., Herbrich, R., & Obermayer, K. (1999). Classification on pairwise proximity data. Advances in Neural Information Processing Systems 11, 438--444. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Gupta, M. R., Cazzanti, L., & Koppal, A. J. (2007). Maximum entropy generative models for similarity-based learning. IEEE Intl. Symp. on Information Theory, to appear.Google ScholarGoogle ScholarCross RefCross Ref
  12. Hastie, T., Tibshirani, R., & Friedman, J. (2001). The elements of statistical learning. New York: Springer-Verlag.Google ScholarGoogle Scholar
  13. Hochreiter, S., Mozer, M. C., & Obermayer, K. (2003). Coulomb classifiers: Generalizing support vector machines via an analogy to electrostatic systems. Advances in Neural Information Processing Systems 15, 545--552.Google ScholarGoogle Scholar
  14. Hochreiter, S., & Obermayer, K. (2006). Support vector machines for dyadic data. Neural Computation, 18, 1472--1510. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Hoffmann, T., & Buhmann, J. (1997). Pairwise data clustering by deterministic annealing. IEEE Trans. on Pattern Analysis and Machine Intelligence, 19. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Jacobs, D. W., Weinshall, D., & Gdalyahu, Y. (2000). Classification with nonmetric distances: Image retrieval and class representation. IEEE Trans. on Pattern Analysis and Machine Intelligence, 22, 583--600. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Jaynes, E. T. (1982). On the rationale for maximum entropy methods. Proc. of the IEEE, 70, 939--952.Google ScholarGoogle ScholarCross RefCross Ref
  18. Jaynes, E. T. (2003). Probability theory: the logic of science. Cambridge University Press.Google ScholarGoogle Scholar
  19. Mitani, Y., & Hamamoto, Y. (2000). Classifier design based on the use of nearest neighbor samples. Proc. of the Intl. Conf. on Pattern Recognition, 769--772.Google ScholarGoogle ScholarCross RefCross Ref
  20. Mitani, Y., & Hamamoto, Y. (2006). A local mean-based nonparametric classifier. Pattern Recognition Letters, 27, 1151--1159. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Newman, D. J., Hettich, S., Blake, C. L., & Merz, C. J. (1998). UCI repository of machine learning databases.Google ScholarGoogle Scholar
  22. Pekalska, E., Paclíc, P., & Duin, R. P. W. (2001). A generalized kernel approach to dissimilarity-based classification. Journal of Machine Learning Research, 175--211. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Rosch, E. (1973). Natural categories. Cognitive Psychology, 328--350.Google ScholarGoogle Scholar
  24. Simard, P., Cun, Y. L., & Denker, J. (1993). Efficient pattern recognition using a new transformation distance. Advances in Neural Information Processing Systems 5, 50--68. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Stanfill, C., & Waltz, D. (1986). Toward memory-based reasoning. Communications of the ACM, 29, 1213--1228. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Tversky, A. (1977). Features of similarity. Psychological Review, 327--352.Google ScholarGoogle Scholar
  27. Tversky, A., & Gati, I. (1978). Studies of similarity. In E. Rosch and B. Lloyd (Eds.), Cognition and categorization. Hillsdale, N.J.: Earlbaum.Google ScholarGoogle Scholar
  28. Van Campenhout, J., & Cover, T. (1981). Maximum entropy and conditional probability. IEEE Trans. on Information Theory, 27, 483--489.Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Weinshall, D., Jacobs, D. W., & Gdalyahu, Y. (1999). Classification in non-metric spaces. Advances in Neural Information Processing Systems 11, 838--844. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Zhang, H., Berg, A. C., Maire, M., & Malik, J. (2006). SVM-KNN: discriminative nearest neighbor classification for visual category recognition. Proc. of the IEEE Conf. on Computer Vision and Pattern Recognition, 2126--2136. Google ScholarGoogle ScholarDigital LibraryDigital Library
  1. Local similarity discriminant analysis

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Other conferences
        ICML '07: Proceedings of the 24th international conference on Machine learning
        June 2007
        1233 pages
        ISBN:9781595937933
        DOI:10.1145/1273496

        Copyright © 2007 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 20 June 2007

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • Article

        Acceptance Rates

        Overall Acceptance Rate140of548submissions,26%

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader