skip to main content
10.1145/1273496.1273503acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicmlConference Proceedingsconference-collections
Article

The rendezvous algorithm: multiclass semi-supervised learning with Markov random walks

Published:20 June 2007Publication History

ABSTRACT

We consider the problem of multiclass classification where both labeled and unlabeled data points are given. We introduce and demonstrate a new approach for estimating a distribution over the missing labels where data points are viewed as nodes of a graph, and pairwise similarities are used to derive a transition probability matrix P for a Markov random walk between them. The algorithm associates each point with a particle which moves between points according to P. Labeled points are set to be absorbing states of the Markov random walk, and the probability of each particle to be absorbed by the different labeled points, as the number of steps increases, is then used to derive a distribution over the associated missing label. A computationally efficient algorithm to implement this is derived and demonstrated on both real and artificial data sets, including a numerical comparison with other methods.

References

  1. Azran, A., & Ghahramani, Z. (2006). Spectral Methods for Automatic Multiscale Data Clustering. IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Belkin, M., & Niyogi, P. (2003). Laplacian eigenmaps for dimensionality reduction and data representation. Neural Computation. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Belkin, M., Niyogi, P., & Sindhwani, V. (2005). On manifold regularization. International Workshop on Artificial Intelligence and Statistics (AISTATS).Google ScholarGoogle Scholar
  4. Bengio, Y., Delalleau, O., & Roux, N. L. (2006). Label propagation and quadratic criterion. Semi-Supervised Learning, Eds. O. Chapelle, B. Schlkopf and A. Zien. MIT Press.Google ScholarGoogle Scholar
  5. Burges, C., & Platt, J. (2006). Semi-supervised learning with conditional harmonic mixing. Semi-Supervised Learning, Eds. O. Chapelle, B. Schlkopf and A. Zien. MIT Press.Google ScholarGoogle Scholar
  6. Chapelle, O., Schölkopf, B., & Zien, A. (Eds.). (2006). Semi-Supervised Learning. Cambridge, MA: MIT Press.Google ScholarGoogle Scholar
  7. Chapelle, O., & Zien, A. (2005). Semi-supervised classification by low density separation. Proceedings of the Tenth International Workshop on Artificial Intelligence and Statistics.Google ScholarGoogle Scholar
  8. Joachims, T. (1999). Transductive inference for text classification using support vector machines. International Conference on Machine Learning (ICML). Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Joachims, T. (2003). Transductive learning via spectral graph partitioning. International Conference on Machine Learning (ICML).Google ScholarGoogle Scholar
  10. Meila, M. (2001). The multicut lemma. UW Statistics Technical Report 417.Google ScholarGoogle Scholar
  11. Meila, M., & Shi, J. (2001). A random walks view of spectral segmentation. Tenth International Workshop on Artificial Intelligence and Statistics (AISTATS).Google ScholarGoogle Scholar
  12. Seeger, M. (2002). Learning with Labeled and Unlabeled Data (Technical Report). University of Edinbrough.Google ScholarGoogle Scholar
  13. Shi, J., & Malik, J. (2000). Normalized Cuts and Image Segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Szummer, M., & Jaakkola, T. (2002). Partially Labeled Classification with Markov Random Walks. Advances in Neural Information Processing Systems (NIPS), 14.Google ScholarGoogle Scholar
  15. Weinberger, K. Q., & Saul, L. K. (2004). Unsupervised learning of image manifolds by semidefinite programming. IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Weston, J., Leslie, C. S., Ie, E., Zhou, D., Elisseeff, A., & Noble, W. S. (2005). Semi-supervised protein classification using cluster kernels. Bioinformatics. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Zhou, D., & Schlkopf, B. (2004). Learning from labeled and unlabeled data using random walks. Pattern Recognition, Proceedings of the 26th DAGM Symposium.Google ScholarGoogle Scholar
  18. Zhou, D., & Schlkopf, B. (2006). Discrete regularization. Semi-Supervised Learning, Eds. O. Chapelle, B. Schlkopf and A. Zien. MIT Press.Google ScholarGoogle Scholar
  19. Zhu, X. (2006). Semi-Supervised Learning Literature Survey (Technical Report 1530). Computer Science, University of Wisconsin-Madison.Google ScholarGoogle Scholar
  20. Zhu, X., Ghahramani, Z., & Lafferty, J. (2003). Semi-supervised learning using gaussian fields and harmonic functions. Advances in Neural Information Processing Systems (NIPS), 16.Google ScholarGoogle Scholar
  21. Zhu, X., Lafferty, J., Ghahramani, Z., & kandola, J. (2004). Nonparametric transforms of graph kernels for semi-supervised learning. International Conference on Machine Learning (ICML).Google ScholarGoogle Scholar
  1. The rendezvous algorithm: multiclass semi-supervised learning with Markov random walks

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Other conferences
        ICML '07: Proceedings of the 24th international conference on Machine learning
        June 2007
        1233 pages
        ISBN:9781595937933
        DOI:10.1145/1273496

        Copyright © 2007 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 20 June 2007

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • Article

        Acceptance Rates

        Overall Acceptance Rate140of548submissions,26%

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader