×

The numerical exploitation of periodicity in Markov decision processes. (English) Zbl 0526.90095


MSC:

90C40 Markov and semi-Markov decision processes
65K05 Numerical mathematical programming methods
Full Text: DOI

References:

[1] Bartmann D (1979) A method of bisection for discounted Markov decision problems. Z Oper Res 23:275–287 · Zbl 0412.90074 · doi:10.1007/BF01954692
[2] Carton D (1963) Une application de l’algorithme de Howard pour des phénomènes saisonniers. Proc. 3rd International Conference Operation Research Oslo, pp 683–691
[3] Hendrikx M, van Nunen J, Wessels J (1980) Some notes on iterative optimization of structured Markov decision processes with discounted rewards, Memorandum COSOR 80-20. Dept. Math. Comp. Sci., Eindhoven University of Technology · Zbl 0556.90089
[4] Nunen J van (1976) A set of successive approximation methods for discounted Markovian decision problems. Z Oper Res 20:203–208 · Zbl 0357.90074 · doi:10.1007/BF01920264
[5] Nunen J. van, Wessels J (1979) Successive approximations for Markov decision processes and Markov games with unbounded rewards. Math Operationsforsch Statist, Ser Optimization 10:431–455 · Zbl 0421.90075
[6] Platzman L (1977) Improved conditions for convergence in undiscounted Markov renewal programming. Oper Res 25:529–533 · Zbl 0383.90108 · doi:10.1287/opre.25.3.529
[7] Porteus E (1975) Bounds and transformations for discounted finite Markov decision chains. Oper Res 23:761–784 · Zbl 0322.90073 · doi:10.1287/opre.23.4.761
[8] Riis J (1965) Discounted Markov programming in a periodic process. Oper Res 13:920–929 · doi:10.1287/opre.13.6.920
[9] Schweitzer P (1971) Iterative solution of the functional equations of undiscounted Markov renewal programming. J Math Anal Appl 34:495–501 · Zbl 0218.90070 · doi:10.1016/0022-247X(71)90094-1
[10] Su S, Deininger R (1972) Generalization of White’s method of successive approximations to periodic Markovian decision processes. Oper Res 20:318–326 · Zbl 0241.90064 · doi:10.1287/opre.20.2.318
[11] Wal J van der (1980) The method of value oriented successive approximations for the average reward Markov decision process. OR Spektrum 1:233–242 · Zbl 0443.90109 · doi:10.1007/BF01719500
[12] Wal J van der (1981) Stochastic dynamic programming. Math. Centre Tract 139. Mathematisch Centrum Amsterdam · Zbl 0462.90055
[13] Wessels J (1980) Markov decision processes: implementation aspects Memorandum COSOR 80-14. Dept. of Math. and Comp. Sci., Eindhoven University of Technology
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.