Skip to main content Accessibility help
×
Hostname: page-component-848d4c4894-5nwft Total loading time: 0 Render date: 2024-05-19T02:20:19.210Z Has data issue: false hasContentIssue false

10 - Efficient Submodular Function Minimization for Computer Vision

from Part 4 - Tractability in Some Specific Areas

Published online by Cambridge University Press:  05 February 2014

Pushmeet Kohli
Affiliation:
Microsoft Research
Lucas Bordeaux
Affiliation:
Microsoft Research
Youssef Hamadi
Affiliation:
Microsoft Research
Pushmeet Kohli
Affiliation:
Microsoft Research
Get access

Summary

Markov Random Fields have been successfully applied to many computer vision problems such as image segmentation, 3D reconstruction, and stereo. The problem of estimating the Maximum a Posteriori (MAP) solution of models such as Markov Random Fields (MRF) can be formulated as a function minimization problem. This has made function minimization an indispensable tool in computer vision. The problem of minimizing a function of discrete variables is, in general, NP-hard. However, functions belonging to certain classes of functions, such as submodular functions, can be minimized in polynomial time. In this chapter, we discuss examples of popular models used in computer vision for which the MAP inference problem results in a tractable function minimization problem. We also discuss how algorithms used in computer vision overcome challenges introduced by the scale and form of function minimization problems encountered in computer vision.

Labeling Problems in Computer Vision

Many problems in computer vision and scene understanding can be formulated in terms of finding the most probable values of certain hidden or unobserved variables. These variables encode some property of the scene and can be continuous or discrete. These problems are commonly referred to as labelling problems as they involve assigning a label to the hidden variables. Labelling problems occur in many forms, from lattice based problems of dense stereo and image segmentation discussed in [6, 40] to the use of pictorial structures for object recognition as done by [10]. Some examples of problems which can be formulated in this manner are shown in Figure 10.1.

Type
Chapter
Information
Tractability
Practical Approaches to Hard Problems
, pp. 285 - 303
Publisher: Cambridge University Press
Print publication year: 2014

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Save book to Kindle

To save this book to your Kindle, first ensure coreplatform@cambridge.org is added to your Approved Personal Document E-mail List under your Personal Document Settings on the Manage Your Content and Devices page of your Amazon account. Then enter the ‘name’ part of your Kindle email address below. Find out more about saving to your Kindle.

Note you can select to save to either the @free.kindle.com or @kindle.com variations. ‘@free.kindle.com’ emails are free but can only be saved to your device when it is connected to wi-fi. ‘@kindle.com’ emails can be delivered even when you are not connected to wi-fi, but note that service fees apply.

Find out more about the Kindle Personal Document Service.

Available formats
×

Save book to Dropbox

To save content items to your account, please confirm that you agree to abide by our usage policies. If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account. Find out more about saving content to Dropbox.

Available formats
×

Save book to Google Drive

To save content items to your account, please confirm that you agree to abide by our usage policies. If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account. Find out more about saving content to Google Drive.

Available formats
×