skip to main content
article

Approximation algorithms for classification problems with pairwise relationships: metric labeling and Markov random fields

Published:01 September 2002Publication History
Skip Abstract Section

Abstract

In a traditional classification problem, we wish to assign one of k labels (or classes) to each of n objects, in a way that is consistent with some observed data that we have about the problem. An active line of research in this area is concerned with classification when one has information about pairwise relationships among the objects to be classified; this issue is one of the principal motivations for the framework of Markov random fields, and it arises in areas such as image processing, biometry, and document analysis. In its most basic form, this style of analysis seeks to find a classification that optimizes a combinatorial function consisting of assignment costs---based on the individual choice of label we make for each object---and separation costs---based on the pair of choices we make for two "related" objects.We formulate a general classification problem of this type, the metric labeling problem; we show that it contains as special cases a number of standard classification frameworks, including several arising from the theory of Markov random fields. From the perspective of combinatorial optimization, our problem can be viewed as a substantial generalization of the multiway cut problem, and equivalent to a type of uncapacitated quadratic assignment problem.We provide the first nontrivial polynomial-time approximation algorithms for a general family of classification problems of this type. Our main result is an O(log k log log k)-approximation algorithm for the metric labeling problem, with respect to an arbitrary metric on a set of k labels, and an arbitrary weighted graph of relationships on a set of objects. For the special case in which the labels are endowed with the uniform metric---all distances are the same---our methods provide a 2-approximation algorithm.

References

  1. bartal Y. 1996. Approximations of metric spaces, and its algorithmic applications. In Proc. IEEE Symposium on Foundations of Computer Science. Google ScholarGoogle Scholar
  2. Bartal, Y. 1998. On approximating arbitrary metrics by tree metrics. In Proceedings of the ACM Symposium on Theory of Computing. ACM, New York. Google ScholarGoogle Scholar
  3. Besag, J. 1974. Spatial interaction and the statistical analysis of lattice systems. J. Roy. Stat. Soc. B 36.Google ScholarGoogle Scholar
  4. Besag, J. 1986. On the statistical analysis of dirty pictures. J. Roy. Stat. Soc. B 48.Google ScholarGoogle Scholar
  5. Boykov, Y., Veksler, O., and Zabih, R. 1998. Markov random fields with efficient approximations. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. IEEE computer Society Press, Los Alamtos, Calif. Google ScholarGoogle Scholar
  6. Boykov, Y., Veksler, O., and Zabih, R. 1999. Fast approximate energy minimization via graph cuts. In Proceedings of the International Conference on Computer Vision.Google ScholarGoogle Scholar
  7. Boykov, Y., Veksler, O., and Zabih, R. 2001. Fast approximate energy minimization via graph cuts. IEEE Trans. Patt. Anal. Mach. Intel. 23, 11 (Nov.), 1222--1239. Google ScholarGoogle Scholar
  8. Breiman, L., Friedman, J. H., Olshen, R. A., and Stone, C. J. 1984. Classification and Regression Trees. Wadsworth & Brooks/Cole.Google ScholarGoogle Scholar
  9. Calinescu, G., Karloff, H., and Rabani, Y. 1998. Improved approximation algorithms for multiway cut. In Proceedings of the ACM Symposium on Theory of Computing. ACM, New York. Google ScholarGoogle Scholar
  10. Chakrabarti, S., Dom, B., and Indyk, P. 1998. Enhanced hypertext categorization using hyperlinks. In Proceedings of ACM SIGMOD. ACM, New York. Google ScholarGoogle Scholar
  11. Charikar, M., Chekuri, C., Goel, A., Guha, S., and Plotkin, S. 1998. Approximating arbitrary metrics by O(n log n) trees. In Proceedings of the IEEE Symposium on Foundations of Computer Science. IEEE Computer Society Press, Los Alamitos, Calif.Google ScholarGoogle Scholar
  12. Chellappa, R., and Jain, A. 1993. Markov Random Fields: Theory and Applications. Academic Press, Orlando, Fla.Google ScholarGoogle Scholar
  13. Cohen, F. S. 1986. Markov random fields for image modeling and analysis. In Modeling and Application of Stochastic Processes, U. B. Desai, Ed. Kluwer. Google ScholarGoogle Scholar
  14. Cunningham, W. H., and Tang, L. 1994. Optimal 3-terminal cuts and linear programming. In Proceedings of the MPS Conference on Integer Programming and Combinatorial Optimization.Google ScholarGoogle Scholar
  15. Dahlhaus, E., Johnson, D., Papadimitriou, C., Seymour, P., and Yannakakis, M. 1994. The complexity of multi-terminal cuts. SIAM J. Comput. 23. Google ScholarGoogle Scholar
  16. Dietterich, T. G. 1997. Machine learning research: Four current directions. AI Mag. 18.Google ScholarGoogle Scholar
  17. Dubes, R., and Jain, A. 1989. Random field models in image analysis. J. Appl. Stat. 16.Google ScholarGoogle Scholar
  18. Erdös, P. L., and Székely, L. A. 1992. Evolutionary trees: An integer multicommodity max-flow min-cut theorem. Adv. Appl. Math 13. Google ScholarGoogle Scholar
  19. Ferrari, P., Frigessi, A., and de Sá, P. 1995. Fast approximate maximum a posteriori restoration of multi-color images. J. Roy. Stat. Soc. B 57.Google ScholarGoogle Scholar
  20. Geman, S., and Geman, D. 1984. Stochastic relaxation, gibbs distributions, and the bayesian restoration of images. IEEE Trans. Patt. Anal. Mach. Intell. 6.Google ScholarGoogle Scholar
  21. Greig, D., Porteous, B., and Seheult, A. 1989. Exact maximum a posteriori estimation for binary images. J. Roy. Stat. Soc. B 48.Google ScholarGoogle Scholar
  22. Ishikawa, H., and Geiger, D. 1998. Segmentation by grouping junctions. In IEEE Conference on Computer Vision Pattern Recognition. pp. 125--131. Google ScholarGoogle Scholar
  23. Karger, D. R., Klein, P., Stein, C., Thorup, M., and Young, N. E. 1999. Rounding algorithms for a geometric embedding of multiway cut. In Proceedings of the ACM Symposium on Theory of Computing. ACM, New York. Google ScholarGoogle Scholar
  24. Karzanov, A. V. 1998. Minimum 0-extension of graph metrics. Europ. J. Combinat. 19. Google ScholarGoogle Scholar
  25. Kinderman, R., and Snell, J. 1980. Markov Random Fields and their Applications. AMS Press.Google ScholarGoogle Scholar
  26. Li, S. 1995. Markov Random Field Modeling in Computer Vision. Springer-Verlag. New York. Google ScholarGoogle Scholar
  27. Pardalos, P., and Wolkowicz, H. 1994. Quadratic assignment and related problems. In DIMACS Series in Discrete Mathematics and Theoretical Computer Science, vol. 16.Google ScholarGoogle Scholar
  28. Potts, R. 1952. Some generalized order-disorder transformations. Proc. Cambridge Phil. Soc. 48.Google ScholarGoogle Scholar
  29. Queyranne, M. 1986. Performance ratio of polynomial heuristics for triangle inequality quadratic assignment problems. Oper. Res. Lett. 4.Google ScholarGoogle Scholar
  30. Roy, S., and Cox, I. 1998. A maximum-flow formulation of the n-camera stereo correspondence problem. In Proceedings of the 6th International Conference on Computer Vision. Google ScholarGoogle Scholar
  31. Woods, J. W. 1972. Two-dimensional discrete Markovian fields. IEEE Trans. Inf. Theory 18.Google ScholarGoogle Scholar

Index Terms

  1. Approximation algorithms for classification problems with pairwise relationships: metric labeling and Markov random fields

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    Full Access

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader