FedSearch - Federated network search engine

Published papers at TMLR · @tmlrpub

543 followers · 498 posts · Server sigmoid.social

Open media

Provably Convergent Policy Optimization via Metric-aware Trust Region Methods

Jun Song, Niao He, Lijun Ding, Chaoyue Zhao

Action editor: Amir-massoud Farahmand.

https://openreview.net/forum?id=jkTqJJOGMS

#optimality #reinforcement #lagrangian

Last updated 2 years ago

Original post

New Submissions to TMLR · @tmlrsub

182 followers · 614 posts · Server sigmoid.social

Open media

Expectation of the maximum of Normal random variables with applications to reinforcement learning

https://openreview.net/forum?id=ZvM6TVPBBM

#reinforcement #expectation #optimality

Last updated 2 years ago

Original post

JMLR · @jmlr

678 followers · 282 posts · Server sigmoid.social

Open media

'Preconditioned Gradient Descent for Overparameterized Nonconvex Burer--Monteiro Factorization with Global Optimality Certification', by Gavin Zhang, Salar Fattahi, Richard Y. Zhang.

http://jmlr.org/papers/v24/22-0882.html

#optimality #minimizer #overparameterization

#optimality #minimizer #overparameterization

Last updated 2 years ago

Original post

New Submissions to TMLR · @tmlrsub

168 followers · 474 posts · Server sigmoid.social

Open media

Provably Convergent Policy Optimization via Metric-aware Trust Region Methods

https://openreview.net/forum?id=jkTqJJOGMS

#optimality #reinforcement #lagrangian

Last updated 3 years ago

Original post

JMLR · @jmlr

649 followers · 168 posts · Server sigmoid.social

Open media

'Reinforcement Learning for Joint Optimization of Multiple Rewards', by Mridul Agarwal, Vaneet Aggarwal.

http://jmlr.org/papers/v24/19-980.html

#rewards #reinforcement #optimality

#Rewards #reinforcement #optimality

Last updated 3 years ago

Original post

Published papers at TMLR · @tmlrpub

508 followers · 296 posts · Server sigmoid.social

Open media

Defense Against Reward Poisoning Attacks in Reinforcement Learning

Kiarash Banihashem, Adish Singla, Goran Radanovic

https://openreview.net/forum?id=goPsLn3RVo

#reinforcement #reward #optimality

Last updated 3 years ago

Original post

Tan Sing Kuang · @singkuangtan

2 followers · 42 posts · Server mstdn.social

Open media

I dislike modulo 2 addition in linear error correction codes. I prefer the monotone Boolean circuit instead. NOT operations are trouble makers. Without them life is much better. Therefore I used a monotone circuit in my deep learning model. https://vixra.org/abs/2212.0193
#deeplearning #monotonecircuit #ErrorCorrectionCode #optimality

#optimality #ErrorCorrectionCode #monotonecircuit #deeplearning

Last updated 3 years ago

Original post

Stan Schymanski · @schymans

50 followers · 100 posts · Server mastodon.social

Open media

New #openaccess #paper: "Vegetation optimality explains the convergence of catchments on the Budyko curve", published today in HESS by Remko Nijzink and myself: https://hess.copernicus.org/articles/26/6289/2022/
The paper also follows a strict #openscience approach, check it out!
#research #science #hydrology #vegetation #optimality @list_hydrocat

#openaccess #paper #openscience #research #science #hydrology #vegetation #optimality

Last updated 3 years ago

Original post

Julio R. Banga · @julio_r_banga

88 followers · 37 posts · Server mstdn.science

#introduction Hello everybody! I am trying to do research in computational biology (both in systems and synthetic bio). Big fan of #optimality principles in biology. Part of the #twittermigration. Looking forward to re-building my network, and expanding it, in this new place. Still trying to figure Mastodon-related things out...

#introduction #optimality #twittermigration

Last updated 3 years ago

Original post