Coder Social home page Coder Social logo

qzj-debug's Projects

awesome-rlhf icon awesome-rlhf

A curated list of reinforcement learning with human feedback resources (continually updated)

corl icon corl

High-quality single-file implementations of SOTA Offline RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC

decision-transformer icon decision-transformer

Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.

dreamer icon dreamer

Reproduction of Dreamerv1 and v2 in pytorch for deepmind control suite

dreamerv2-pytorch icon dreamerv2-pytorch

Pytorch implementation of DreamerV2: Mastering Atari with Discrete World Models, based on the original implementation

dutd icon dutd

Official source code for the ICLR 2023 paper Dynamic Update-to-Data Ratio: Minimizing World Model Overfitting (DUTD).

leetcode-master icon leetcode-master

《代码随想录》LeetCode 刷题攻略:200道经典题目刷题顺序,共60w字的详细图解,视频难点剖析,50余张思维导图,支持C++,Java,Python,Go,JavaScript等多语言版本,从此算法学习不再迷茫!🔥🔥 来看看,你会发现相见恨晚!🚀

marl-algorithms icon marl-algorithms

Implementations of IQL, QMIX, VDN, COMA, QTRAN, MAVEN, CommNet, DyMA-CL, and G2ANet on SMAC, the decentralised micromanagement scenario of StarCraft II

mbpo icon mbpo

Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"

mbpo_pytorch icon mbpo_pytorch

A pytorch reprelication of the model-based reinforcement learning algorithm MBPO

mile icon mile

用于服务器与docker容器的文件同步

mopo icon mopo

Model-based Offline Policy Optimization re-implement all by pytorch

pybullet-gym icon pybullet-gym

Open-source implementations of OpenAI Gym MuJoCo environments for use with the OpenAI Gym Reinforcement Learning Research Platform.

rl-papers icon rl-papers

📚强化学习方向顶会文章(持续更新) | Top Conference Papers on Reinforcement Learning(RL),including: NeurIPS, AAAI, IJCAI, ICML, AAMAS, ICLR, ICRA, etc.

rl-plotter icon rl-plotter

:sparkles: A plotter for reinforcement learning (RL)

rliable icon rliable

[NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.

tamil icon tamil

Official repository for "Task-Aware Information Routing from Common Representation Space in Lifelong Learning"

uwdac icon uwdac

record experiment result and source code

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.