
SEMI-MARKOV DECISION PROCESSES

NONSTANDARD CRITERIA

Published online by Cambridge University Press: 22 October 2007

M. Baykal-Gürsoy
Affiliation:
Department of Industrial and Systems Engineering, Rutgers University, Piscataway, NJ. E-mail: gursoy@rci.rutgers.edu
K. Gürsoy
Affiliation:
Department of Management Science, Kean University, Union, NJ

Abstract

Considered are semi-Markov decision processes (SMDPs) with finite state and action spaces. We study two criteria: (i) the expected average reward per unit time subject to a sample path constraint on the average cost per unit time, and (ii) the expected time-average variability. Under a certain condition, for communicating SMDPs, we construct (randomized) stationary policies that are ε-optimal for each criterion; the policy is optimal for the first criterion under the unichain assumption, and it is optimal and pure for a specific variability function in the second criterion. For general multichain SMDPs, similar results are obtained by using a state space decomposition approach.
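
To make the first criterion concrete, the following is a minimal sketch, not the authors' construction. It assumes a fixed stationary policy that induces a unichain embedded Markov chain, in which case the standard renewal-reward argument gives the long-run average per unit time as a ratio of stationary expectations. All names and numbers (`P`, `reward`, `cost`, `tau`, `alpha`) are hypothetical and chosen only for illustration.

```python
def stationary_distribution(P, iters=10_000, tol=1e-12):
    """Stationary distribution of the embedded chain P (list of rows).

    Iterates the 'lazy' chain (I + P) / 2, which has the same stationary
    distribution as P but also converges when P is periodic.
    """
    n = len(P)
    pi = [1.0 / n] * n
    for _ in range(iters):
        step = [sum(pi[i] * P[i][j] for i in range(n)) for j in range(n)]
        nxt = [0.5 * a + 0.5 * b for a, b in zip(pi, step)]
        if max(abs(a - b) for a, b in zip(nxt, pi)) < tol:
            return nxt
        pi = nxt
    return pi


def average_per_unit_time(P, payoff, tau):
    """Long-run average payoff per unit time under a fixed stationary policy.

    payoff[i]: expected payoff earned per visit to state i (assumed given).
    tau[i]:    expected sojourn time per visit to state i (assumed given).
    """
    pi = stationary_distribution(P)
    earned = sum(p * r for p, r in zip(pi, payoff))
    elapsed = sum(p * t for p, t in zip(pi, tau))
    return earned / elapsed


# Hypothetical two-state example under one fixed policy.
P = [[0.5, 0.5], [0.25, 0.75]]   # embedded transition probabilities
reward = [3.0, 6.0]              # expected reward per visit
cost = [1.0, 4.0]                # expected cost per visit
tau = [1.0, 2.0]                 # expected sojourn times
alpha = 2.0                      # hypothetical bound on average cost

reward_rate = average_per_unit_time(P, reward, tau)
cost_rate = average_per_unit_time(P, cost, tau)
feasible = cost_rate <= alpha    # does the policy meet the cost bound?
```

In the unichain case the sample-path average cost converges almost surely to the same ratio, so comparing `cost_rate` with `alpha` is one plausible way to check the constraint for a given policy; the multichain case treated in the paper needs more care, since the limit can depend on the recurrent class reached.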

Information

Type
Research Article
Copyright
Copyright © Cambridge University Press 2007
