Inference and Learning from Data: Inference

Ali H. Sayed

doi:10.1017/9781009218245

Chapter 44: Markov Decision Processes

pp. 1807-1852

Ali H. Sayed

, École Polytechnique Fédérale de Lausanne

Get access

Add bookmark
Cite
Share

Summary

Markov decision processes (MDPs) are at the core of reinforcement learning theory. Similar to Markov chains, MDPs involve an underlying Markovian process that evolves from one state to another, with the probability of visiting a new state being dependent on the most recent state. Different from Markov chains, MDPs involve both agents and actions taken by these agents. As a result, the next state is dependent on which action was chosen at the state preceding it. MDPs therefore provide a powerful framework to explore state spaces and to learn from actions and rewards.

About the book

Chapter DOI https://doi.org/10.1017/9781009218245.019
Book DOI https://doi.org/10.1017/9781009218245
Subjects Communications and Signal Processing,Computer Science,Engineering,Machine Learning and Pattern Recognition
Format: Hardback
- Publication date: 02 March 2023
- ISBN: 9781009218269
Format: Digital
- Publication date: 17 March 2023
- ISBN: 9781009218245
Find out more details about this book

Access options

Review the options below to login to check your access.

Purchase options

eTextbook

US$110.00

Hardback

US$110.00

Have an access code?

To redeem an access code, please log in with your personal login.

If you believe you should have access to this content, please contact your institutional librarian or consult our FAQ page for further information about accessing our content.

Also available to purchase from these educational ebook suppliers

Inference and Learning from Data Inference