Light

tigerneil / reinforcementlearning.today Goto Github PK

View Code? Open in Web Editor NEW

11.0 6.0 2.0 97 KB

Made for a reading group at the Center for Safe AGI.

Home Page: http://reinforcementlearning.today

deepmind openai multi-agent reinforcement-learning causality huawei google-brain

reinforcementlearning.today's Introduction

Reinforcement learning today

2022-10-27: Today we share a paper:
- IN-CONTEXT REINFORCEMENT LEARNING WITH ALGORITHM DISTILLATION, by Michael Laskin, Luyu Wang, Junhyuk Oh, Emilio Parisotto, Stephen Spencer, Richie Steigerwald, DJ Strouse, Steven Hansen, Angelos Filos, Ethan Brooks, Maxime Gazeau, Himanshu Sahni, Satinder Singh, Volodymyr Mnih Details
  - download link
  - keywords: algorithm distillation
2022-10-11: Today we share a paper:
- Defining and Characterizing Reward Hacking, by Joar Skalse, Nikolaus H. R. Howe, Dmitrii Krasheninnikov, David Krueger Details
  - download link
  - keywords: reward hacking
2022-10-07: Today we share a paper:
- How RL Agents Behave When Their Actions Are Modified, by Eric D. Langlois, Tom Everitt Details
  - download link
  - keywords: modified action
2022-06-19: Today we share a paper:
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations, by Cong Lu, Philip J. Ball, Tim G. J. Rudner, Jack Parker-Holder, Michael A. Osborne, Yee Whye Teh Details
  - download link
  - keywords: offline
2021-11-06: Today we share a paper:
- Self-Consistent Models and Values, by Gregory Farquhar, Matteo Hessel, Kate Baumli Zita Marinho, Hado van Hasselt, Angelos Filos, David Silver
  - download link
  - keywords: model, self-consistent
2021-10-22: Today we share a paper:
- An Empirical Investigation of Representation Learning for Imitation, by Xin Chen, Sam Toyer, Cody Wild, Scott Emmons, Ian Fischer, Kuang-Huei Lee, Neel Alex, Steven H Wang, Ping Luo, Stuart Russell, Pieter Abbeel, Rohin Shah
  - download link
  - github repo
2021-10-20: Today we share a paper:
- Temporal Abstraction in Reinforcement Learning with the Successor Representation, by Marlos C. Machado, André Barreto, Doina Precup
  - download link
2021-10-19: Today we share a paper:
- REPRESENTATION LEARNING VIA INVARIANT CAUSAL MECHANISMS, by Jovana Mitrovic, Brian McWilliams, Jacob Walker, Lars Buesing, Charles Blundell
  - download link
  - keywords: causal, representation Learning, self-supervised Learning, contrastive Methods, causality
2021-10-16: Today we share a paper:
- Nash Equilibria in Finite-Horizon Multiagent Concurrent Games, by Senthil Rajasekaran, Moshe Y. Vardi
  - download link
  - keywords: multiagent, Nash, complexity
2021-10-13: Today we share a paper:
- Revisiting Rainbow: Promoting more Insightful and Inclusive Deep Reinforcement Learning Research, by Johan S. Obando-Ceron, Pablo Samuel Castro
  - download link
  - github repo
  - keywords: practice, algorithm, rainbow
2021-10-11: Today we share a paper:
- Batch size-invariance for policy optimization, by Jacob Hilton, Karl Cobbe, John Schulman
  - download link
  - keywords: policy, batch-size
2021-10-08: Today we share a paper;
- Recursively Summarizing Books with Human Feedback, by Jeff Wu, Long Ouyang, Daniel M. Ziegler, Nisan Stiennon, Ryan Lowe, Jan Leike, Paul Christiano
  - download link
  - keywords: alignment, summary, gpt-3, language-model
2021-10-06: Today we share a paper:
- Reinforcement Learning with Information-Theoretic Actuation, by Elliot Catt, Marcus Hutter, Joel Veness
  - download link
2021-09-28: Today we share a paper:
- Qatten: A General Framework for Cooperative Multiagent Reinforcement Learning, by Yaodong Yang, Jianye Hao, Ben Liao, Kun Shao, Guangyong Chen, Wulong Liu, Hongyao Tang
  - download link
2021-09-24: Today we share a paper:
- A Minimalist Approach to Offline Reinforcement Learning, by Scott Fujimoto, Shixiang Shane Gu
  - download link
2021-09-18: Today we share a paper:
- Negotiating team formation using deep reinforcement learning, by Yoram Bachrach, Richard Everett, Edward Hughes, Angeliki Lazaridou, Joel Z. Leibo, Marc Lanctot, Michael Johanson, Wojciech M. Czarnecki, Thore Graepel
  - download link
2021-09-14: Today we share two papers:
- CoBERL: Contrastive BERT for Reinforcement Learning, by Andrea Banino, Tim Scholtes, Adrià Puidomenech Badia, Jovana Mitrovic, Jacob Walker, Charles Blundell
  - download link
- Stabilizing Transformers for Reinforcement Learning, by Emilio Parisotto, H. F. Song, Jack W. Rae, Razvan Pascanu, Çaglar Gülçehre, Siddhant M. Jayakumar, Max Jaderberg, Raphael Lopez Kaufman, Aidan Clark, Seb Noury, M. Botvinick, N. Heess, R. Hadsell
  - download link
2021-07-03: Today we share a paper:
- Causally Correct Partial Models for Reinforcement Learning, by Danilo J. Rezende, Ivo Danihelka, George Papamakarios, Nan Rosemary Ke, Ray Jiang, Theophane Weber, Karol Gregor, Hamza Merzic, Fabio Viola, Jane Wang, Jovana Mitrovic, Frederic Besse, Ioannis Antonoglou, Lars Buesing Details
- download link
2020-08-02: Today we share a paper:
- Discovering Reinforcement Learning Algorithms, by Junhyuk Oh Matteo Hessel, Wojciech M. Czarnecki, Zhongwen Xu, Hado van Hasselt, Satinder Singh, David Silver Details
- download link
2020-07-06: Today we share a paper:
- An operator view of policy gradient methods, by Dibya Ghosh, Marlos C. Machado, and Nicolas Le Roux Details
- download link
2020-06-24: Today we share a paper:
- The Value-Improvement Path Towards Better Representations for Reinforcement Learning, by Will Dabney, Andre Barreto, Mark Rowland, Robert Dadashi, John Quan, Marc G. Bellemare, and David Silver Details
- download link
2020-06-23: Today we share a paper:
- An Optimistic Perspective on Offline Reinforcement Learning, by Rishabh Agarwal, Dale Schuurmans, and Mohammad Norouzi Details
- download link
2020-05-21: Today we share a paper:
- Deterministic Bellman Residual Minimization, by Ehsan Saleh and Nan Jiang Details
- download link
2020-05-12: Today we share a paper:
- Plan2Vec: Unsupervised Representation Learning by Latent Plans, by Ge Yang, Amy Zhang, Ari S. Morcos, Joelle Pineau, Pieter Abbeel, Roberto Calandra Details
- download link
2020-05-11: Today we share a paper:
- The Value of Abstraction, by Mark K. Ho, David Abel, Thomas L. Griffiths, Michael L. Littman Details
- download link
2020-05-05: Today we share a webconf：
- Leverage the Average: An Analysis of Regularization in RL Details
- Speaker: Matthieu Geist
- link
2020-04-27: Today we share a paper:
- Behaviour Suite for Reinforcement Learning, by Ian Osband, Yotam Doron, Matteo Hessel, John Aslanides, Eren Sezener, Andre Saraiva, Katrina McKinney, Tor Lattimore, Csaba Szepesvari, Satinder Singh, Benjamin Van Roy, Richard Sutton, David Silver, Hado Van Hasselt Details
- download link
- code repositary
2020-04-22: Today we share a webconf:
- RLTheory Seminars: a virtual seminar focuses on theoretical reinforcement learning.
- link
2020-04-19: Today we share a paper:
- Artificial and Computational Intelligence in Games: Revolutions in Computational Game AI, by Jialin Liu, Tom Schaul, Pieter Spronck, and Julian Togelius Details
- download link
2020-04-13: Today we share a paper:
- Leverage the Average: an Analysis of Regularization in RL, by Nino Vieillard, Tadashi Kozuno, Bruno Scherrer, Olivier Pietquin, Rémi Munos, Matthieu Geist Details
- download link
2020-04-08: Today we share a podcast of Csaba Szepesvari:
- Csaba Szepesvari of DeepMind shares his views on Bandits, Adversaries, PUCT in AlphaGo / AlphaZero / MuZero, AGI and RL, what is timeless, and more! Details
- link

reinforcementlearning.today's People

Contributors

Stargazers

Watchers

Forkers

jiaodaxiaozi agifoundation

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.