'Q-Learning for MDPs with General Spaces: Convergence and Near Optimality via Quantization under Weak Continuity', by Ali Kara, Naci Saldi, Serdar Yüksel.
http://jmlr.org/papers/v24/21-1457.html
#quantization #quantized #mdps
'Provably Sample-Efficient Model-Free Algorithm for MDPs with Peak Constraints', by Qinbo Bai, Vaneet Aggarwal, Ather Gattami.
http://jmlr.org/papers/v24/21-0117.html
#mdps #markov #pcmdp