Published papers at TMLR · @tmlrpub
543 followers · 498 posts · Server sigmoid.social

Provably Convergent Policy Optimization via Metric-aware Trust Region Methods

Jun Song, Niao He, Lijun Ding, Chaoyue Zhao

Action editor: Amir-massoud Farahmand.

openreview.net/forum?id=jkTqJJ

#optimality #reinforcement #lagrangian

Last updated 1 year ago

New Submissions to TMLR · @tmlrsub
182 followers · 614 posts · Server sigmoid.social

Expectation of the maximum of Normal random variables with applications to reinforcement learning

openreview.net/forum?id=ZvM6TV

#reinforcement #expectation #optimality

Last updated 1 year ago

JMLR · @jmlr
678 followers · 282 posts · Server sigmoid.social

'Preconditioned Gradient Descent for Overparameterized Nonconvex Burer--Monteiro Factorization with Global Optimality Certification', by Gavin Zhang, Salar Fattahi, Richard Y. Zhang.

jmlr.org/papers/v24/22-0882.ht

#optimality #minimizer #overparameterization

Last updated 1 year ago

New Submissions to TMLR · @tmlrsub
168 followers · 474 posts · Server sigmoid.social

Provably Convergent Policy Optimization via Metric-aware Trust Region Methods

openreview.net/forum?id=jkTqJJ

#optimality #reinforcement #lagrangian

Last updated 2 years ago

JMLR · @jmlr
649 followers · 168 posts · Server sigmoid.social

'Reinforcement Learning for Joint Optimization of Multiple Rewards', by Mridul Agarwal, Vaneet Aggarwal.

jmlr.org/papers/v24/19-980.htm

#Rewards #reinforcement #optimality

Last updated 2 years ago

Published papers at TMLR · @tmlrpub
508 followers · 296 posts · Server sigmoid.social

Defense Against Reward Poisoning Attacks in Reinforcement Learning

Kiarash Banihashem, Adish Singla, Goran Radanovic

openreview.net/forum?id=goPsLn

#reinforcement #reward #optimality

Last updated 2 years ago

Tan Sing Kuang · @singkuangtan
2 followers · 42 posts · Server mstdn.social

I dislike modulo 2 addition in linear error correction codes. I prefer the monotone Boolean circuit instead. NOT operations are trouble makers. Without them life is much better. Therefore I used a monotone circuit in my deep learning model. vixra.org/abs/2212.0193

#optimality #ErrorCorrectionCode #monotonecircuit #deeplearning

Last updated 2 years ago

Stan Schymanski · @schymans
50 followers · 100 posts · Server mastodon.social

New : "Vegetation optimality explains the convergence of catchments on the Budyko curve", published today in HESS by Remko Nijzink and myself: hess.copernicus.org/articles/2
The paper also follows a strict approach, check it out!
@list_hydrocat

#openaccess #paper #openscience #research #science #hydrology #vegetation #optimality

Last updated 2 years ago

Julio R. Banga · @julio_r_banga
88 followers · 37 posts · Server mstdn.science

Hello everybody! I am trying to do research in computational biology (both in systems and synthetic bio). Big fan of principles in biology. Part of the . Looking forward to re-building my network, and expanding it, in this new place. Still trying to figure Mastodon-related things out...

#introduction #optimality #twittermigration

Last updated 2 years ago