article

Approximation algorithms for classification problems with pairwise relationships: metric labeling and Markov random fields

Authors:
Jon Kleinberg

Cornell University, Ithaca, New York

Cornell University, Ithaca, New York
View Profile

,
Éva Tardos

Cornell University, Ithaca, New York

Cornell University, Ithaca, New York
View Profile

Authors Info & Claims

Journal of the ACM Volume 49 Issue 5pp 616–639https://doi.org/10.1145/585265.585268

Published:01 September 2002Publication History

Journal of the ACM

Abstract

In a traditional classification problem, we wish to assign one of k labels (or classes) to each of n objects, in a way that is consistent with some observed data that we have about the problem. An active line of research in this area is concerned with classification when one has information about pairwise relationships among the objects to be classified; this issue is one of the principal motivations for the framework of Markov random fields, and it arises in areas such as image processing, biometry, and document analysis. In its most basic form, this style of analysis seeks to find a classification that optimizes a combinatorial function consisting of assignment costs---based on the individual choice of label we make for each object---and separation costs---based on the pair of choices we make for two "related" objects.We formulate a general classification problem of this type, the metric labeling problem; we show that it contains as special cases a number of standard classification frameworks, including several arising from the theory of Markov random fields. From the perspective of combinatorial optimization, our problem can be viewed as a substantial generalization of the multiway cut problem, and equivalent to a type of uncapacitated quadratic assignment problem.We provide the first nontrivial polynomial-time approximation algorithms for a general family of classification problems of this type. Our main result is an O(log k log log k)-approximation algorithm for the metric labeling problem, with respect to an arbitrary metric on a set of k labels, and an arbitrary weighted graph of relationships on a set of objects. For the special case in which the labels are endowed with the uniform metric---all distances are the same---our methods provide a 2-approximation algorithm.

References

bartal Y. 1996. Approximations of metric spaces, and its algorithmic applications. In Proc. IEEE Symposium on Foundations of Computer Science. Google Scholar
Bartal, Y. 1998. On approximating arbitrary metrics by tree metrics. In Proceedings of the ACM Symposium on Theory of Computing. ACM, New York. Google Scholar
Besag, J. 1974. Spatial interaction and the statistical analysis of lattice systems. J. Roy. Stat. Soc. B 36.Google Scholar
Besag, J. 1986. On the statistical analysis of dirty pictures. J. Roy. Stat. Soc. B 48.Google Scholar
Boykov, Y., Veksler, O., and Zabih, R. 1998. Markov random fields with efficient approximations. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. IEEE computer Society Press, Los Alamtos, Calif. Google Scholar
Boykov, Y., Veksler, O., and Zabih, R. 1999. Fast approximate energy minimization via graph cuts. In Proceedings of the International Conference on Computer Vision.Google Scholar
Boykov, Y., Veksler, O., and Zabih, R. 2001. Fast approximate energy minimization via graph cuts. IEEE Trans. Patt. Anal. Mach. Intel. 23, 11 (Nov.), 1222--1239. Google Scholar
Breiman, L., Friedman, J. H., Olshen, R. A., and Stone, C. J. 1984. Classification and Regression Trees. Wadsworth & Brooks/Cole.Google Scholar
Calinescu, G., Karloff, H., and Rabani, Y. 1998. Improved approximation algorithms for multiway cut. In Proceedings of the ACM Symposium on Theory of Computing. ACM, New York. Google Scholar
Chakrabarti, S., Dom, B., and Indyk, P. 1998. Enhanced hypertext categorization using hyperlinks. In Proceedings of ACM SIGMOD. ACM, New York. Google Scholar
Charikar, M., Chekuri, C., Goel, A., Guha, S., and Plotkin, S. 1998. Approximating arbitrary metrics by O(n log n) trees. In Proceedings of the IEEE Symposium on Foundations of Computer Science. IEEE Computer Society Press, Los Alamitos, Calif.Google Scholar
Chellappa, R., and Jain, A. 1993. Markov Random Fields: Theory and Applications. Academic Press, Orlando, Fla.Google Scholar
Cohen, F. S. 1986. Markov random fields for image modeling and analysis. In Modeling and Application of Stochastic Processes, U. B. Desai, Ed. Kluwer. Google Scholar
Cunningham, W. H., and Tang, L. 1994. Optimal 3-terminal cuts and linear programming. In Proceedings of the MPS Conference on Integer Programming and Combinatorial Optimization.Google Scholar
Dahlhaus, E., Johnson, D., Papadimitriou, C., Seymour, P., and Yannakakis, M. 1994. The complexity of multi-terminal cuts. SIAM J. Comput. 23. Google Scholar
Dietterich, T. G. 1997. Machine learning research: Four current directions. AI Mag. 18.Google Scholar
Dubes, R., and Jain, A. 1989. Random field models in image analysis. J. Appl. Stat. 16.Google Scholar
Erdös, P. L., and Székely, L. A. 1992. Evolutionary trees: An integer multicommodity max-flow min-cut theorem. Adv. Appl. Math 13. Google Scholar
Ferrari, P., Frigessi, A., and de Sá, P. 1995. Fast approximate maximum a posteriori restoration of multi-color images. J. Roy. Stat. Soc. B 57.Google Scholar
Geman, S., and Geman, D. 1984. Stochastic relaxation, gibbs distributions, and the bayesian restoration of images. IEEE Trans. Patt. Anal. Mach. Intell. 6.Google Scholar
Greig, D., Porteous, B., and Seheult, A. 1989. Exact maximum a posteriori estimation for binary images. J. Roy. Stat. Soc. B 48.Google Scholar
Ishikawa, H., and Geiger, D. 1998. Segmentation by grouping junctions. In IEEE Conference on Computer Vision Pattern Recognition. pp. 125--131. Google Scholar
Karger, D. R., Klein, P., Stein, C., Thorup, M., and Young, N. E. 1999. Rounding algorithms for a geometric embedding of multiway cut. In Proceedings of the ACM Symposium on Theory of Computing. ACM, New York. Google Scholar
Karzanov, A. V. 1998. Minimum 0-extension of graph metrics. Europ. J. Combinat. 19. Google Scholar
Kinderman, R., and Snell, J. 1980. Markov Random Fields and their Applications. AMS Press.Google Scholar
Li, S. 1995. Markov Random Field Modeling in Computer Vision. Springer-Verlag. New York. Google Scholar
Pardalos, P., and Wolkowicz, H. 1994. Quadratic assignment and related problems. In DIMACS Series in Discrete Mathematics and Theoretical Computer Science, vol. 16.Google Scholar
Potts, R. 1952. Some generalized order-disorder transformations. Proc. Cambridge Phil. Soc. 48.Google Scholar
Queyranne, M. 1986. Performance ratio of polynomial heuristics for triangle inequality quadratic assignment problems. Oper. Res. Lett. 4.Google Scholar
Roy, S., and Cox, I. 1998. A maximum-flow formulation of the n-camera stereo correspondence problem. In Proceedings of the 6th International Conference on Computer Vision. Google Scholar
Woods, J. W. 1972. Two-dimensional discrete Markovian fields. IEEE Trans. Inf. Theory 18.Google Scholar

Index Terms

Approximation algorithms for classification problems with pairwise relationships: metric labeling and Markov random fields
1. Theory of computation
  1. Design and analysis of algorithms

Recommendations

Approximation Algorithms for Classification Problems with Pairwise Relationships: Metric Labeling and Markov Random Fields
FOCS '99: Proceedings of the 40th Annual Symposium on Foundations of Computer Science

In a traditional classification problem, we wish to assign one of k labels (or classes) to each of n objects, in a way that is consistent with some observed data that we have about the problem. An active line of research in this area is concerned with ...
Read More
The Hardness of Metric Labeling

The metric labeling problem is an elegant and powerful mathematical model capturing a wide range of classification problems. The input to the problem consists of a set $L$ of labels and a weighted graph $G=(V,E)$. Additionally, a metric distance ...
Read More
Semi-supervised genetic programming for classification
GECCO '11: Proceedings of the 13th annual conference on Genetic and evolutionary computation

Learning from unlabeled data provides innumerable advantages to a wide range of applications where there is a huge amount of unlabeled data freely available. Semi-supervised learning, which builds models from a small set of labeled examples and a ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

Published in

Journal of the ACM Volume 49, Issue 5
September 2002
137 pages
ISSN:0004-5411
EISSN:1557-735X
DOI:10.1145/585265
Issue’s Table of Contents

Copyright © 2002 ACM
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 1 September 2002
Published in jacm Volume 49, Issue 5

Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
Approximation algorithms
Markov random fields
classification
metric labeling
Qualifiers
- article
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 214
  Total Citations
  View Citations
- 2,584
  Total Downloads
- Downloads (Last 12 months)43
- Downloads (Last 6 weeks)6
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Approximation algorithms for classification problems with pairwise relationships: metric labeling and Markov random fields

Journal of the ACM

Abstract

References

Cited By

Index Terms

Recommendations

Approximation Algorithms for Classification Problems with Pairwise Relationships: Metric Labeling and Markov Random Fields

The Hardness of Metric Labeling

Semi-supervised genetic programming for classification

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Approximation algorithms for classification problems with pairwise relationships: metric labeling and Markov random fields

Journal of the ACM

Abstract

References

Cited By

Index Terms

Recommendations

Approximation Algorithms for Classification Problems with Pairwise Relationships: Metric Labeling and Markov Random Fields

The Hardness of Metric Labeling

Semi-supervised genetic programming for classification

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media