papers's Introduction

PART I

Base

Playing Atari with Deep Reinforcement Learning
Human-level control through deep reinforcement learning
Rainbow: Combining Improvements in Deep Reinforcement Learning

Double

Deep Reinforcement Learning with Double Q-learning (double)

Dueling

Dueling Network Architectures for Deep Reinforcement Learning (dueling)

Manipulate Experience Replay

PRIORITIZED EXPERIENCE REPLAY (PER)
Prioritized Sequence Experience Replay (PSER)
Hindsight Experience Replay

Distributed

Massively Parallel Methods for Deep Reinforcement Learning (Gorila)
Asynchronous Methods for Deep Reinforcement Learning (A3C)
REINFORCEMENT LEARNING THROUGH ASYNCHRONOUS ADVANTAGE ACTOR-CRITIC ON A GPU (GA3C)
EFFICIENT PARALLEL METHODS FOR DEEP REINFORCEMENT LEARNING (PAAC)
ACCELERATED METHODS FOR DEEP REINFORCEMENT LEARNING
IMPALA: Scalable Distributed Deep-RL with ImportanceWeighted Actor-Learner Architectures (IMPALA)

Distributional

Exploration

Deep Exploration via Bootstrapped DQN
Parameter Space Noise for Exploration
Noisy Networks for Exploration

Unclassified

Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor

PART II

DPG

Deterministic Policy Gradient Algorithms
CONTINUOUS CONTROL WITH DEEP REINFORCEMENT LEARNING
Distributed Distributional Deterministic Policy Gradients1

FIM & NGD & TRM

FLANKS

Data/Image Augmentation

CutOut - Improved Regularization of Convolutional Neural Networks with Cutout

NORMALIZATION

BN - Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift
LN - Layer Normalization
IN - Instance Normalization: The Missing Ingredient for Fast Stylization
GN - Group Normalization
BIN - Batch-Instance Normalization for Adaptively Style-Invariant Neural Networks
BRN - Batch Renormalization: Towards Reducing Minibatch Dependence in Batch-Normalized Models
CBN - Cross-Iteration Batch Normalization

ATTENTION

SE - Squeeze-and-Excitation Networks
CBAM - CBAM: Convolutional Block Attention Module

CNN

AlexNet - ImageNet Classification with Deep Convolutional Neural Networks
ZFNet - Visualizing and Understanding Convolutional Networks
dropout - Improving neural networks by preventing co-adaptation of feature detectors
maxout - Maxout Networks
Network In Network
VGG - Very Deep Convolutional Networks for Large-Scale Image Recognition

BACKBONES

CSP - CSPNet: A New Backbone that can Enhance Learning Capability of CNN

NECKS

SPP - Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition
FPN - Feature Pyramid Networks for Object Detection
PAN - Path Aggregation Network for Instance Segmentation

OBJECT DETECTION

Pose Estimation

Pose Machines: Articulated Pose Estimation via Inference Machines
Convolutional Pose Machines
Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields
OpenPose: Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields
PARE: Part Attention Regressor for 3D Human Body Estimation

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.

ddlau / papers Goto Github PK

papers's Introduction

PART I

Base

Double

Dueling

Manipulate Experience Replay

Distributed

Distributional

Exploration

Unclassified

PART II

DPG

FIM & NGD & TRM

FLANKS

Data/Image Augmentation

NORMALIZATION

ATTENTION

CNN

BACKBONES

NECKS

OBJECT DETECTION

Pose Estimation

papers's People

Contributors

Watchers

Recommend Projects

Recommend Topics

Recommend Org