lanseyege's Projects
blog
ē®ę³å¦ä¹ ē¬č®°
Inverse RL Algorithms (APP, MaxEnt, GAIL, VAIL)
Code for "Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks"
Paper list of multi-agent reinforcement learning (MARL)
Maximum Causal Entropy Inverse Reinforcement Learning
use Maximum Entropy Model do some experiments
A tiny autograd engine and a neural net library on top of it, potentially for educational purposes
machine learning model
Papers on Graph neural network(GNN)
PyTorch implementations of Generative Adversarial Networks.
A very simple generative adversarial network (GAN) in PyTorch
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
Build your neural network easy and fast
Reproduction of the Maximum Causal Entropy theory proposed by Brian in The Principle of Maximum Causal Entropy for Estimating Interacting Processes
Reverse-mode automatic differentiation in Rust (experiment)
PyTorch0.4 implementation of: actor critic / proximal policy optimization / acer / ddpg / twin dueling ddpg / soft actor critic / generative adversarial imitation learning / hindsight experience replay
Reinforcement Learning Algorithms
Applying Reinforcement Learning in Quantitative Trading
Soft attention mechanism for video caption generation
A simple A2C made from scratch in PyTorch. Accompanying comic at https://hackernoon.com/intuitive-rl-intro-to-advantage-actor-critic-a2c-4ff545978752