Comments (2)
Hello @AlexTo
It seems the BaseDeepQ implements the DoubleQ update, where the target is defined as follows:
Y = R(t+1) + γ * Q(S(t+1), argmax[Q(S(t+1), a; θt)]; θ′t). (4)
Just made a quick check: it looks correct, let us know if you find the formula is not implemented correctly.
van Hasselt, Hado, et al. “Deep Reinforcement Learning with Double Q-Learning.” ArXiv:1509.06461 [Cs], Dec. 2015, http://arxiv.org/abs/1509.06461.
from l2rpn-baselines.
Ah, I see, at first, I thought the formula is from Mnih, et al Human-level control through deep reinforcement learning. Nature, 518(7540), 529–533. https://doi.org/10.1038/nature14236
Looks like it is correct according to DoubleQ
Thank you
from l2rpn-baselines.
Related Issues (20)
- Failed to install via pypi on Windows HOT 1
- About the "DoubleDuelingDQN" baseline HOT 2
- Error while running the baselines HOT 3
- Tricks for fine-tuning the hyperparameters HOT 1
- Add issue template
- Add PPO with mazerl for example
- PPO with ACME framework
- GymEnvWithHeuristics: fix_action is missing something
- Issue when evaluating trained PPO_RLLIB agent HOT 1
- Make the environments compatible with new gym interface
- Rewrite the readme to use a recent baseline and not a deprecated one
- Have an example using DI-engine, maybe ?
- Issues with retraining from saved agent
- Evaluation with normalised observation and action space is improper for PPO_SB3
- Huge cleaning
- PPO_RLLIB code improvement
- "Impossible to use the RedispReward reward with an environment without generators cost" HOT 1
- '_missing_two_busbars_support_info' attribute is lost when using Train HOT 4
- encounter a litter error
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from l2rpn-baselines.