Wrobel, A. (1984). "On Markovian Decision Models with a Finite Skeleton". Mathematical Methods of Operations Research. 28 (February): 17–27. S2CID 2545336. doi:10.1007/bf01919083.
Kearns, Michael; Mansour, Yishay; Ng, Andrew (2002). "A Sparse Sampling Algorithm for Near-Optimal Planning in Large Markov Decision Processes". Machine Learning. 49 (2–3): 193–208. doi:10.1023/A:1017932429737.