Light

蒲源 photo

puyuan1996 Goto Github PK

followers: 31.0 following: 26.0 repos: 32.0 gists: 0.0

Name: 蒲源

Type: User

Company: China

Bio: 脚踏实地，仰望星空。Keep your feet on the ground, and your eyes on the stars.

Location: Shenzhen

蒲源's Projects

argsloader

Configuration Parsing and Management Based on ChainLoader

awesome-visual-rl

A curated list of reinforcement learning with vision (Visual RL) resources

di-1024

1024 + 深度强化学习（Deep Reinforcement Learning + 1024 Game)

di-card

di-engine

OpenDILab Decision AI Engine

di-engine-docs

DI-engine docs(Chinese and English)

dmc2gym

OpenAI Gym wrapper for the DeepMind Control Suite

genius-invokation-gym

原神七圣召唤模拟环境 Simulator of Genius Invocation

gobang

javascript gobang AI，JS五子棋AI，源码+教程，基于Alpha-Beta剪枝算法（不是神经网络）

gomoku_server_ui

An integrated example of front-end and back-end for a Gomoku game 五子棋前后端集成示例

iris

Transformers are Sample-Efficient World Models. ICLR 2023, notable top 5%.

katago

GTP engine and self-play learning in Go

katrain

Improve your Baduk skills by training with KataGo!

lc-sac

Implementation of LC-SAC method in PyTorch.

libmtl

A PyTorch Library for Multi-Task Learning

lightzero

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios

llm-reasoners

A library for advanced large language model reasoning

llm_tree_search

The official implementation of paper: Alphazero-like Tree-Search can guide large language model decoding and training

make-a-video-pytorch

Implementation of Make-A-Video, new SOTA text to video generator from Meta AI, in Pytorch

marl

Implementation for mSAC methods in PyTorch

mcts-dpo

This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.

ndrl-benchmark

pinecone-vercel-starter

Pinecone + Vercel AI SDK Starter

ppoxfamily

PPO x Family DRL Tutorial Course（决策智能入门级公开课：8节课帮你盘清算法理论，理顺代码逻辑，玩转决策AI应用实践）

puyuan1996.github.io

quiz-system

A concise quiz-system example using Vue.js

ray

An open source framework that provides a simple, universal API for building distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.

roma

Codes accompanying the paper "ROMA: Multi-Agent Reinforcement Learning with Emergent Roles" (ICML 2020 https://arxiv.org/abs/2003.08039)

sota-rl-algorithms

PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..

study

Explore a collection of code examples for learning C++ and Python.

1
2

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.