FedSearch - Federated network search engine

FedSearch

Query documentation
API Documentation
Privacy
Server opt-out
Contact

David Meyer · @dmm

222 followers · 579 posts · Server mathstodon.xyz

Open media

Policy gradient methods in reinforcement learning are very cool and are used in a wide variety of machine learning applications including robotics, game playing, autonomous vehicles and many others, including incremental training of Large Language Models (LLMs).

A few of my notes on policy gradients are here: https://davidmeyer.github.io/ml/policy_gradient_methods_for_robotics.pdf. The LaTeX source is here: https://www.overleaf.com/read/kbgxbmhmrksb.

As always questions/comments/corrections/* greatly appreciated.

#machinelearning #Reinforcementlearning #policygradients #largelanguagemodelsa #TeXLaTeX