Document Zbl 1039.93067

Cavazos-Cadena, Rolando; Montes-de-Oca, Raúl

Stationary optimal policies in a class of multichain positive dynamic programs with finite state space and risk-sensitive criterion. (English) Zbl 1039.93067

Appl. Math. 28, No. 1, 93-109 (2001).

This paper deals with Markov decision processes with finite state space, compact action sets and a nonnegative reward. For an exponential utility function with risk-sensitive coefficient, the performance of a decision policy is measured by the corresponding risk-sensitive expected total reward criterion. Under some mild conditions of continuity, the authors derive the risk-sensitive optimality equation when the value function is finite. Moreover, if the number of ergodic classes depends continuously on the policy, when a stationary policy is used to drive the system, then there exists an optimal stationary policy.

Reviewer: Makiko Nisio (Kobe)

MSC:

93E20	Optimal stochastic control
90C40	Markov and semi-Markov decision processes

Keywords:

Markov decision processes; risk-sensitive optimality; stationary policy; utility function; structural stability; ergodic class; invariant distribution

Cite Review PDF

Full Text: DOI