daihuiao Goto Github PK

followers: 1.0 following: 16.0 repos: 48.0 gists: 0.0

Name: Dai Huiao

Type: User

Company: Tianjin University

Bio: student of Tianjin University@Tianjin University

Dai Huiao's Projects

bcq

Author's PyTorch implementation of BCQ for continuous and discrete actions

bppo

Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).

corl

High-quality single-file implementations of SOTA Offline RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC

decision-transformer

Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.

deep_rl_zoo

A collection of Deep RL algorithms implemented with PyTorch to solve Atari games and classic control tasks like CartPole, LunarLander, and MountainCar.

denoising-diffusion-pytorch

Implementation of Denoising Diffusion Probabilistic Model in Pytorch

diffusion-policies-for-offline-rl

dm_control

DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.

elegantrl_diffusion_policy

Cloud-native Deep Reinforcement Learning. 🔥

通过阅读Communication-Efficient Learning of Deep Networks from Decentralized Data与Robust and Communication-Efficient Federated Learning from Non-IID Data两篇论文，复现FedAvg与STC算法，完成LSTM模型+ Shakespeare数据集的字符预测任务

federated-learning-pytorch

Handy PyTorch implementation of a federated learning (especially for painless research)

ga-ddpg

6D Grasping Policy from Point Clouds

gym_dockauv

Gym Environment for AUV docking procedure

handover-sim

A simulation environment and benchmark for human-to-robot object handovers

handover-sim2real

Official code for CVPR'23 paper: Learning Human-to-Robot Handovers from Point Clouds

handtailor

HandTailor: Towards High-Precision Monocular 3D Hand Recovery

inac_pytorch

iris

Transformers are Sample Efficient World Models

machine-learning-in-numpy

纯python实现机器学习算法,非套用sk-learn

machine_learning_python

通过阅读网上的资料代码，进行自我加工，努力实现常用的机器学习算法。实现算法有KNN、Kmeans、EM、Perceptron、决策树、逻辑回归、svm、adaboost、朴素贝叶斯

maddpg_uncertainty

Code for the MADDPG algorithm from the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"

mcq

Code for Mildly Conservative Q-learning for Offline Reinforcement Learning (NeurIPS 2022)

mpo

PyTorch Implementation of the Maximum a Posteriori Policy Optimisation

nanogpt

The simplest, fastest repository for training/finetuning medium-sized GPTs.

daihuiao Goto Github PK

Dai Huiao's Projects

Recommend Projects

Recommend Topics

Recommend Org