JMLR · @jmlr
677 followers · 280 posts · Server sigmoid.social

'q-Learning in Continuous Time', by Yanwei Jia, Xun Yu Zhou.

jmlr.org/papers/v24/22-0755.ht

#reinforcement #martingale #critic

Last updated 1 year ago

hobs · @hobs
408 followers · 763 posts · Server mstdn.social

If you play an infinite number of rounds, you'll have infinite payoff (in theory).
But if you paid $1k to play each round, your real-world expected value is negative rather than the infinite amount the math says it is.

Because...
As with any betting strategy, you don't have an infinite bankroll...
Nor infinite time.

As you keep tossing tails, you have exponential payoff growth, BUT you also have linear cost growth
And you can't buy time with $
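
To put rough numbers on the bankroll point, here is a minimal sketch. The post doesn't spell out the rules, so this assumes a St. Petersburg-style game: you keep tossing until the first head, and if that head lands on toss k you win $2^k. The function name `capped_expected_payoff`, the 200-toss cutoff, and the roughly $1M cap are illustrative assumptions, not part of the original post.

```python
def capped_expected_payoff(bankroll_cap: float, max_tosses: int = 200) -> float:
    """Expected payoff per round when the payer can hand over at most bankroll_cap.

    Assumed rules (not stated in the post): the first head lands on toss k
    with probability 1/2**k and pays $2**k, limited by the payer's bankroll.
    """
    total = 0.0
    for k in range(1, max_tosses + 1):
        prob = 0.5 ** k                       # chance the first head is on toss k
        payoff = min(2.0 ** k, bankroll_cap)  # the payout cannot exceed the bankroll
        total += prob * payoff
    return total


# Assumed numbers: the payer has about $1M (2**20), entry costs $1k per round.
cap = 2.0 ** 20
entry_fee = 1000.0
expected_payoff = capped_expected_payoff(cap)
net_per_round = expected_payoff - entry_fee

print(f"expected payoff per round: ${expected_payoff:.2f}")
print(f"net expected value per round: ${net_per_round:.2f}")
```

Without the cap every term in the sum contributes $1, so the total grows without bound. With the cap, only the first 20 terms contribute $1 each and the tail adds about $1 more, so under these assumptions a $1k entry fee leaves each round deeply negative in expectation.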

#martingale #infinity #ExpectedValue #Probability

Last updated 2 years ago