chuacheowhuan / pbt_marl_watered_down
My attempt to reproduce a watered-down version of PBT (population-based training) for MARL (multi-agent reinforcement learning) using DDPPO (decentralized and distributed proximal policy optimization) from ray[rllib].
License: MIT License