Lightweight, stable, efficient PyTorch implement of reinforcement learning
python3 DelayDDPG.py
All code is written in ↑ this file
You can see these gif/png ↓ in file "Result_GIF".
BipedalWalkerHardcore-V2-total:
LunarLanderContinuous-V2:
Plot LunarLanderContinuous-V2, TrainEpoch: 78, TimeUsed: 996s:
If you can understand Chinese, more details of DelayDDPG are described in Chinese in this website ↓
如果你能看得懂中文,那么我用中文写了对这个算法的详细介绍: