Reinforcement Learning

Reinforcement Learning howrar • 3w ago • 100%

Keynotes from the 2024 Reinforcement Learning Conference

www.youtube.com

Recordings for the [RLC](https://rl-conference.cc/) keynote talks have been released. Keynote speakers: - David Silver - Doina Precup (Not recorded) - Peter Stone - Finale Doshi-Velez - Sergey Levine - Emma Brunskill - Andrew Barto

Reinforcement Learning howrar • 1mo ago • 50%

OpenAI: Learning to Reason with LLMs

https://openai.com/index/learning-to-reason-with-llms/

OpenAI just put out a blog post about a new model trained via RL (I'm assuming this isn't the usual RLHF) to perform chain of thought reasoning before giving the user its answer. As usual, there's very little detail about how this is accomplished so it's hard for me to get excited about it, but the rest of you might find this interesting.

Reinforcement Learning howrar • 7mo ago • 50%

Introducing SIMA, a Scalable Instructable Multiworld Agent

deepmind.google