van Dawen, R. Finite state dynamic programming with the total reward criterion. (English) Zbl 0602.90133 Z. Oper. Res., Ser. A 30, 1-14 (1986). Reviewer: E.Boylan MSC: 90C39 90C40 × Cite Format Result Cite Review PDF Full Text: DOI
van Nunen, J.; Stidham, S. jun. Action-dependent stopping times and Markov decision process with unbounded rewards. (English) Zbl 0471.90094 OR Spektrum 3, 145-152 (1981). MSC: 90C40 60K15 60G40 × Cite Format Result Cite Review PDF Full Text: DOI