puyuan1996
Name: 蒲源
Type: User
Company: China
Bio: Keep your feet on the ground, and your eyes on the stars.
Location: Shenzhen
Configuration Parsing and Management Based on ChainLoader
A curated list of reinforcement learning with vision (Visual RL) resources
Deep Reinforcement Learning + 1024 Game
OpenDILab Decision AI Engine
DI-engine docs (Chinese and English)
OpenAI Gym wrapper for the DeepMind Control Suite
Simulation environment for Genshin Impact's Genius Invokation TCG
JavaScript Gobang (Gomoku) AI: source code and tutorial, based on the Alpha-Beta pruning algorithm (not a neural network)
An integrated example of front-end and back-end for a Gomoku game
Transformers are Sample-Efficient World Models. ICLR 2023, notable top 5%.
GTP engine and self-play learning in Go
Improve your Baduk skills by training with KataGo!
Implementation of LC-SAC method in PyTorch.
A PyTorch Library for Multi-Task Learning
[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios
A library for advanced large language model reasoning
The official implementation of paper: Alphazero-like Tree-Search can guide large language model decoding and training
Implementation of Make-A-Video, new SOTA text to video generator from Meta AI, in Pytorch
Implementation for mSAC methods in PyTorch
This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.
Pinecone + Vercel AI SDK Starter
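PPO x Family DRL Tutorial Course (an introductory open course on decision intelligence: 8 lessons that sort out the algorithm theory, straighten out the code logic, and cover practical decision-AI applications)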
A concise quiz-system example using Vue.js
An open source framework that provides a simple, universal API for building distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.
Codes accompanying the paper "ROMA: Multi-Agent Reinforcement Learning with Emergent Roles" (ICML 2020 https://arxiv.org/abs/2003.08039)
PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..
Explore a collection of code examples for learning C++ and Python.