Computer Vision: Models, Learning, and Inference

Simon J. D. Prince

doi:10.1017/CBO9780511996504

Chapter 20: Models for visual words

pp. 483-504

Simon J. D. Prince, University College London


Summary

In most of the models in this book, the observed data are treated as continuous. Hence, for generative models the data likelihood is usually based on the normal distribution. In this chapter, we explore generative models that treat the observed data as discrete. The data likelihoods are now based on the categorical distribution; they describe the probability of observing the different possible values of the discrete variable.
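For reference, the categorical likelihood mentioned above can be written out as follows. This is a sketch in generic notation: the symbols $K$ (number of possible values), $\lambda_k$ (probability of value $k$), $I$ (number of observations), and $N_k$ (number of times value $k$ is observed) are assumptions introduced here and do not appear in the summary itself.

```latex
% Categorical distribution over a discrete variable x taking values 1..K
\Pr(x = k) = \lambda_k,
\qquad \lambda_k \ge 0,
\qquad \sum_{k=1}^{K} \lambda_k = 1 .

% Likelihood of I independent observations x_1, ..., x_I,
% where N_k counts how many observations took the value k
\Pr(x_1, \ldots, x_I) = \prod_{i=1}^{I} \lambda_{x_i}
                      = \prod_{k=1}^{K} \lambda_k^{N_k} .
```

The second line shows why frequencies are a natural summary of discrete data: under independence, the likelihood depends on the observations only through the counts $N_k$.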

As a motivating example for the models in this chapter, consider the problem of scene classification (Figure 20.1). We are given example training images of different scene categories (e.g., office, coastline, forest, mountain) and we are asked to learn a model that can classify new examples. Studying the scenes in Figure 20.1 demonstrates how challenging a problem this is. Different images of the same scene may have very little in common with one another, yet we must somehow learn to identify them as the same. In this chapter, we will also discuss object recognition, which has many of the same characteristics; the appearance of an object such as a tree, bicycle, or chair can vary dramatically from one image to another, and we must somehow capture this variation.

The key to modeling these complex scenes is to encode the image as a collection of visual words, and use the frequencies with which these words occur as the substrate for further calculations. We start this chapter by describing this transformation.
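As a rough illustration of the encoding described above, the sketch below turns a set of local image descriptors into a histogram of visual word frequencies. It assumes a dictionary of prototype descriptors is already available and that each descriptor is assigned to its nearest prototype under Euclidean distance. The function name `visual_word_histogram`, the array shapes, and the toy data are all hypothetical and are not taken from the chapter.

```python
import numpy as np

def visual_word_histogram(descriptors, dictionary):
    """Count how often each visual word occurs in one image.

    descriptors: (I, D) array, one D-dimensional descriptor per image patch.
    dictionary:  (K, D) array, one prototype descriptor per visual word
                 (assumed to have been learned beforehand, e.g. by clustering).
    Returns a length-K integer array of word frequencies.
    """
    # Squared Euclidean distance from every descriptor to every prototype.
    diffs = descriptors[:, None, :] - dictionary[None, :, :]
    sq_dists = np.sum(diffs ** 2, axis=2)
    # Each descriptor is replaced by the index of its nearest prototype.
    words = np.argmin(sq_dists, axis=1)
    # The image is then summarized by the frequency of each word.
    return np.bincount(words, minlength=dictionary.shape[0])

# Toy example with made-up 2D "descriptors" and a three-word dictionary.
dictionary = np.array([[0.0, 0.0], [1.0, 1.0], [5.0, 5.0]])
descriptors = np.array([[0.1, -0.1], [0.9, 1.2], [1.1, 0.8], [4.8, 5.1]])
hist = visual_word_histogram(descriptors, dictionary)
print(hist)
```

The resulting histogram discards where in the image each word occurred, which is one reason such a representation can tolerate large variation in appearance between images of the same scene.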

About the book

Chapter DOI https://doi.org/10.1017/CBO9780511996504.028
Book DOI https://doi.org/10.1017/CBO9780511996504
Subjects Computer Science, Engineering, Image Processing and Machine Vision, Robotics, Vision, and Graphics
Format: Hardback
- Publication date: 18 June 2012
- ISBN: 9781107011793
Format: Digital
- Publication date: 05 August 2012
- ISBN: 9780511996504