We continue our treatment of Markov decision processes (MDPs) and focus in this chapter on methods for determining optimal actions or policies. We derive two popular methods known as value iteration and policy iteration, and establish their convergence properties. We also examine the Bellman optimality principle in the context of value and policy learning. In a later section, we extend the discussion to the more challenging case of partially observable MDPs (POMDPs), where the successive states of the MDP are unobservable to the agent, and the agent is only able to sense measurements emitted randomly by the MDP from the various states. We will define POMDPs and explain that they can be reformulated as belief-MDPs with continuous (rather than discrete) states. This reformulation complicates the solution of the value iteration recursion. Nevertheless, we will show that the successive value iterates share a useful property, namely, that they are piecewise linear and convex. This property can be exploited by computational methods to reduce the complexity of solving the value iteration for POMDPs.
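To make the value iteration method concrete, the following is a minimal sketch on a toy MDP. All numbers, and the names `P`, `R`, and `value_iteration`, are invented for this illustration and are not taken from the chapter; the recursion simply applies the Bellman optimality operator until the value updates become negligible.

```python
import numpy as np

# Toy 2-state, 2-action MDP (all numbers are made up for this sketch).
# P[a, s, s'] = probability of landing in state s' after taking action a in state s.
P = np.array([
    [[0.9, 0.1],    # action 0
     [0.4, 0.6]],
    [[0.2, 0.8],    # action 1
     [0.1, 0.9]],
])
# R[s, a] = expected one-step reward for taking action a in state s.
R = np.array([[1.0, 0.0],
              [0.0, 2.0]])
gamma = 0.9  # discount factor; gamma < 1 makes the Bellman operator a contraction

def value_iteration(P, R, gamma, tol=1e-10):
    """Repeatedly apply the Bellman optimality operator until convergence."""
    V = np.zeros(R.shape[0])
    while True:
        # Q[s, a] = R(s, a) + gamma * sum_{s'} P(s' | s, a) V(s')
        Q = R + gamma * (P @ V).T
        V_new = Q.max(axis=1)
        if np.max(np.abs(V_new - V)) < tol:
            # Greedy policy with respect to the (near-)optimal value function.
            return V_new, Q.argmax(axis=1)
        V = V_new

V_star, pi_star = value_iteration(P, R, gamma)
```

Because the discounted Bellman operator is a contraction with modulus `gamma`, the iterates converge geometrically to the unique optimal value function regardless of the initialization, which is the convergence property established in the chapter.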