daihuiao Goto Github PK
Name: Dai Huiao
Type: User
Company: Tianjin University
Bio: student of Tianjin University@Tianjin University
Name: Dai Huiao
Type: User
Company: Tianjin University
Bio: student of Tianjin University@Tianjin University
Author's PyTorch implementation of BCQ for continuous and discrete actions
Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).
High-quality single-file implementations of SOTA Offline RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC
Code for conservative Q-learning
Conservative Q Learning on top of SAC
本人郑重声明,所有内容都不是原创的(做个笔记)
PyTorch DDPM implementation
Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.
A collection of Deep RL algorithms implemented with PyTorch to solve Atari games and classic control tasks like CartPole, LunarLander, and MountainCar.
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.
Cloud-native Deep Reinforcement Learning. 🔥
通过阅读Communication-Efficient Learning of Deep Networks from Decentralized Data与Robust and Communication-Efficient Federated Learning from Non-IID Data两篇论文,复现FedAvg与STC算法,完成LSTM模型+ Shakespeare数据集的字符预测任务
Handy PyTorch implementation of a federated learning (especially for painless research)
6D Grasping Policy from Point Clouds
Gym Environment for AUV docking procedure
A simulation environment and benchmark for human-to-robot object handovers
Official code for CVPR'23 paper: Learning Human-to-Robot Handovers from Point Clouds
HandTailor: Towards High-Precision Monocular 3D Hand Recovery
Transformers are Sample Efficient World Models
纯python实现机器学习算法,非套用sk-learn
通过阅读网上的资料代码,进行自我加工,努力实现常用的机器学习算法。实现算法有KNN、Kmeans、EM、Perceptron、决策树、逻辑回归、svm、adaboost、朴素贝叶斯
Code for the MADDPG algorithm from the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
Code for Mildly Conservative Q-learning for Offline Reinforcement Learning (NeurIPS 2022)
PyTorch Implementation of the Maximum a Posteriori Policy Optimisation
The simplest, fastest repository for training/finetuning medium-sized GPTs.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.