[1] |
Bartmann D (1979) A method of bisection for discounted Markov decision problems. Z Oper Res 23:275–287 · Zbl 0412.90074 · doi:10.1007/BF01954692 |
[2] |
Carton D (1963) Une application de l’algorithme de Howard pour des phénomènes saisonniers. Proc. 3rd International Conference Operation Research Oslo, pp 683–691 |
[3] |
Hendrikx M, van Nunen J, Wessels J (1980) Some notes on iterative optimization of structured Markov decision processes with discounted rewards, Memorandum COSOR 80-20. Dept. Math. Comp. Sci., Eindhoven University of Technology · Zbl 0556.90089 |
[4] |
Nunen J van (1976) A set of successive approximation methods for discounted Markovian decision problems. Z Oper Res 20:203–208 · Zbl 0357.90074 · doi:10.1007/BF01920264 |
[5] |
Nunen J van, Wessels J (1979) Successive approximations for Markov decision processes and Markov games with unbounded rewards. Math Operationsforsch Statist, Ser Optimization 10:431–455 · Zbl 0421.90075 |
[6] |
Platzman L (1977) Improved conditions for convergence in undiscounted Markov renewal programming. Oper Res 25:529–533 · Zbl 0383.90108 · doi:10.1287/opre.25.3.529 |
[7] |
Porteus E (1975) Bounds and transformations for discounted finite Markov decision chains. Oper Res 23:761–784 · Zbl 0322.90073 · doi:10.1287/opre.23.4.761 |
[8] |
Riis J (1965) Discounted Markov programming in a periodic process. Oper Res 13:920–929 · doi:10.1287/opre.13.6.920 |
[9] |
Schweitzer P (1971) Iterative solution of the functional equations of undiscounted Markov renewal programming. J Math Anal Appl 34:495–501 · Zbl 0218.90070 · doi:10.1016/0022-247X(71)90094-1 |
[10] |
Su S, Deininger R (1972) Generalization of White’s method of successive approximations to periodic Markovian decision processes. Oper Res 20:318–326 · Zbl 0241.90064 · doi:10.1287/opre.20.2.318 |
[11] |
Wal J van der (1980) The method of value oriented successive approximations for the average reward Markov decision process. OR Spektrum 1:233–242 · Zbl 0443.90109 · doi:10.1007/BF01719500 |
[12] |
Wal J van der (1981) Stochastic dynamic programming. Math. Centre Tract 139. Mathematisch Centrum Amsterdam · Zbl 0462.90055 |
[13] |
Wessels J (1980) Markov decision processes: implementation aspects, Memorandum COSOR 80-14. Dept. of Math. and Comp. Sci., Eindhoven University of Technology