Efficient Submodular Function Minimization for Computer Vision

Pushmeet Kohli

doi:10.1017/CBO9781139177801.011

10 - Efficient Submodular Function Minimization for Computer Vision

from Part 4 - Tractability in Some Specific Areas

Published online by Cambridge University Press: 05 February 2014

Edited by

Youssef Hamadi and

Pushmeet Kohli: Affiliation:
Microsoft Research
Lucas Bordeaux: Affiliation:
Microsoft Research
Youssef Hamadi: Affiliation:
Microsoft Research
Pushmeet Kohli: Affiliation:
Microsoft Research

Book contents

Get access

Summary

Markov Random Fields have been successfully applied to many computer vision problems such as image segmentation, 3D reconstruction, and stereo. The problem of estimating the Maximum a Posteriori (MAP) solution of models such as Markov Random Fields (MRF) can be formulated as a function minimization problem. This has made function minimization an indispensable tool in computer vision. The problem of minimizing a function of discrete variables is, in general, NP-hard. However, functions belonging to certain classes of functions, such as submodular functions, can be minimized in polynomial time. In this chapter, we discuss examples of popular models used in computer vision for which the MAP inference problem results in a tractable function minimization problem. We also discuss how algorithms used in computer vision overcome challenges introduced by the scale and form of function minimization problems encountered in computer vision.

Labeling Problems in Computer Vision

Many problems in computer vision and scene understanding can be formulated in terms of finding the most probable values of certain hidden or unobserved variables. These variables encode some property of the scene and can be continuous or discrete. These problems are commonly referred to as labelling problems as they involve assigning a label to the hidden variables. Labelling problems occur in many forms, from lattice based problems of dense stereo and image segmentation discussed in [6, 40] to the use of pictorial structures for object recognition as done by [10]. Some examples of problems which can be formulated in this manner are shown in Figure 10.1.

Type: Chapter
Information: Tractability
Practical Approaches to Hard Problems
, pp. 285 - 303

DOI: https://doi.org/10.1017/CBO9781139177801.011 [Opens in a new window]

Publisher: Cambridge University Press

Print publication year: 2014

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Book contents

10 - Efficient Submodular Function Minimization for Computer Vision

Summary

Access options

Save book to Kindle

Save book to Dropbox

Save book to Google Drive