codes of ICLR 2024 paper "CPPO: Continual Learning for Reinforcement Learning with Human Feedback"
fanlyu / cppo Goto Github PK
View Code? Open in Web Editor NEWThis project forked from hitsz-hlt/cppo
ICLR 2024 CPPO: Continual Learning for Reinforcement Learning with Human Feedback