Light

chenxingqiang / recent_reinforcement_learning_view Goto Github PK

View Code? Open in Web Editor NEW

1.0 1.0 0.0 4 KB

强化学习专栏

recent_reinforcement_learning_view's Introduction

Recent_Reinforcement_Learning_View

强化学习专栏

强化学习专栏 62周目录

编外篇

第零周：数据科学，从计算到推理

第一部分强化学习理论探索专题

第一节理论和计算部分

第一周：强化学习基础概念
第二周：强化学习理论宗派
第三周：强化学习与监督学习
第四周：强化学习的实验环境
第五周：强化学习中的数学基础
第六周：强化学习中优化策略
第七周：强化学习中的实验环境构建

第二节推理部分

第八周：强化学习基本算法
第九周：最优价值算法 Q-learning 和 DQN 算法
第十周：基于策略梯度的算法
第十一周：稀疏回报求解和 Model-based 算法
第十二周：反向强化学习算法

第二部分强化学习应用场景专题

第十三周：强化学习在 AlphaZero 中的应用
第十四周：强化学习与推荐检索系统
第十五周：强化学习与无人驾驶
第十六周：强化学习与对战游戏
第十七周：强化学习与路径规划和飞行控制
第十八周：强化学习与动态规划
第十九周：强化学习与量化交易
第二十周：强化学习与自然语言处理
第二十一周：强化学习在 AutoML 中的应用
第二十二周：强化学习与机器人控制

第三部分强化学习编程实践专题

第一节背景介绍书籍

参考书籍 Deep reinforcement learning hands-on
第二十三周：What is Reinforcement Learning
第二十四周：OpenAI gym
第二十五周：OpenAI Gym API
第二十六周：DeepLearning with PyTorch
第二十七周：The Cross-Entropy Methods
第二十八周：Tabular Learning and the Bellman Equation
第二十九周：Deep Q-networks
第三十周： DQN extentions
第三十一周：stocks trading using RL
第三十二周：Policy Gradients: an alternative

第二节深度应用

第三十三周：The Actor-Critic Methods
第三十四周：Asynchronous Advantage Actor-Critic
第三十五周：Chatbot Training with RL
第三十六周：Web Navigation
第三十七周：Continuous Action Space
第三十八周：Trust regions--TRPO，PPO，and ACKTR
第三十九周：Black-box Optimizmization in RL
第四十周：Beyond Model-Free -- Imagination
第四十一周：An on Atari Breakout
第四十二周：AlphaGO Zero

第四部分强化学习前沿论文专题

第四十三周：开山鼻祖 DQN 系列
第四十四周：基于策略梯度的深度强化学习
第四十五周：分层 Deep Reinforcement Learning
第四十六周：Deep Reinforcement Learning 多任务和迁移学习
第四十七周：基于外部记忆模块的 Deep Reinforcement Learning
第四十八周：Deep Reinforcement Learning 中探索和利用问题
第四十九周：多 Agent Deep Reinforcement Learning 问题
第五十周：逆向深度强化学习专题
第五十一周：探索和监督学习
第五十二周：异步深度强化学习
第五十三周：强化学习与模仿学习

第五部分强化学习与深度学习交叉领域探索综述专题

第五十四周：强化学习与 GCN 交叉研究综述
第五十五周：强化学习与 CNN 交叉研究综述
第五十六周：强化学习与 RNN 交叉研究综述
第五十七周：强化学习与 AutoML 交叉研究综述

第六部分强化学习与对抗学习（GAN）交叉领域探索专题

第五十八周：强化学习与GAN交叉研究综述
第五十九周：强化学习与迁移学习热点综述
第六十周：强化学习与模仿学习热点综述
第六十一周：反向强化学习热点综述
第六十二周：强化学习未来发展方向综述

paper 每周共享共读

第一周：DEEP REINFORCEMENT LEARNING: AN OVERVIEW

recent_reinforcement_learning_view's People

Contributors

Stargazers

Watchers

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.