Coder Social home page Coder Social logo

Dai Huiao's Projects

bcq icon bcq

Author's PyTorch implementation of BCQ for continuous and discrete actions

bppo icon bppo

Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).

corl icon corl

High-quality single-file implementations of SOTA Offline RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC

cql icon cql

Code for conservative Q-learning

cql-1 icon cql-1

Conservative Q Learning on top of SAC

ddpm icon ddpm

PyTorch DDPM implementation

decision-transformer icon decision-transformer

Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.

deep_rl_zoo icon deep_rl_zoo

A collection of Deep RL algorithms implemented with PyTorch to solve Atari games and classic control tasks like CartPole, LunarLander, and MountainCar.

dm_control icon dm_control

DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.

federallearning icon federallearning

通过阅读Communication-Efficient Learning of Deep Networks from Decentralized Data与Robust and Communication-Efficient Federated Learning from Non-IID Data两篇论文,复现FedAvg与STC算法,完成LSTM模型+ Shakespeare数据集的字符预测任务

ga-ddpg icon ga-ddpg

6D Grasping Policy from Point Clouds

handover-sim icon handover-sim

A simulation environment and benchmark for human-to-robot object handovers

handover-sim2real icon handover-sim2real

Official code for CVPR'23 paper: Learning Human-to-Robot Handovers from Point Clouds

handtailor icon handtailor

HandTailor: Towards High-Precision Monocular 3D Hand Recovery

iris icon iris

Transformers are Sample Efficient World Models

machine_learning_python icon machine_learning_python

通过阅读网上的资料代码,进行自我加工,努力实现常用的机器学习算法。实现算法有KNN、Kmeans、EM、Perceptron、决策树、逻辑回归、svm、adaboost、朴素贝叶斯

maddpg_uncertainty icon maddpg_uncertainty

Code for the MADDPG algorithm from the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"

mcq icon mcq

Code for Mildly Conservative Q-learning for Offline Reinforcement Learning (NeurIPS 2022)

mpo icon mpo

PyTorch Implementation of the Maximum a Posteriori Policy Optimisation

nanogpt icon nanogpt

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.