Huizhen Yu, Dimitri P. Bertsekas: On Boundedness of Q-Learning Iterates for Stochastic Shortest Path Problems. Math. Oper. Res. 38(2): 209-227 (2013)