'Provably Sample-Efficient Model-Free Algorithm for MDPs with Peak Constraints', by Qinbo Bai, Vaneet Aggarwal, Ather Gattami.
http://jmlr.org/papers/v24/21-0117.html #mdps #markov #pcmdp
#mdps #markov #pcmdp