ALEResearch-LFA

Research on ALE (the Arcade Learning Environment), mainly about Bpro features and their extensions.

We focus on linear function approximation, a simpler approach that relies on a fixed representation.

We aim to achieve performance comparable to that of DQN (Deep Q-Networks).
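To make the idea of linear function approximation over a fixed representation concrete, here is a minimal sketch, assuming sparse binary features and one weight table per action. The names `q_value` and `greedy_action` and the dictionary layout are hypothetical and are not taken from this repository.

```python
def q_value(weights, active_features, action):
    """Q(s, a) as the sum of the weights of the active binary features.

    weights: list with one dict per action, mapping feature index -> weight
             (this layout is an assumption made for illustration).
    active_features: indices of the features that are 1 for the state.
    """
    w = weights[action]
    return sum(w.get(i, 0.0) for i in active_features)


def greedy_action(weights, active_features, num_actions):
    """Return the action with the largest linear Q-value."""
    return max(range(num_actions),
               key=lambda a: q_value(weights, active_features, a))


# Two actions, a handful of non-zero weights.
weights = [{0: 0.5, 3: -1.0}, {0: 0.2, 7: 2.0}]
active = [0, 7]
best = greedy_action(weights, active, num_actions=2)
```

Because the features are binary, evaluating Q only touches the weights of the active features, so the cost depends on how many features are active rather than on the total size of the feature set.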

Bpro and its extensions exploit spatial invariance, temporal offsets and a simple way to detect sprites.
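As an illustration of how relative offsets give spatial invariance, the sketch below builds pairwise offset features from a set of active `(row, col, color)` cells. The input layout and the function names are assumptions for this example and may differ from what the branches actually do.

```python
from itertools import product


def pairwise_offset_features(active):
    """Build offset features from active (row, col, color) cells.

    Each feature is (color_a, color_b, d_row, d_col): a cell of color_a
    and a cell of color_b exist at that relative offset. Absolute
    positions are discarded, which is what makes the features
    translation invariant (ignoring screen borders).
    """
    features = set()
    for (r1, c1, k1), (r2, c2, k2) in product(active, repeat=2):
        features.add((k1, k2, r2 - r1, c2 - c1))
    return features


def translate(active, d_row, d_col):
    """Shift every active cell by the same amount (hypothetical helper)."""
    return {(r + d_row, c + d_col, k) for r, c, k in active}


screen = {(2, 3, 1), (5, 7, 4)}
shifted = translate(screen, 3, -2)
same = pairwise_offset_features(screen) == pairwise_offset_features(shifted)
```

Shifting the whole screen leaves every offset unchanged, so both screens produce the same feature set.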

The code in the master branch was not used to generate any research results. Instead, it lays the preliminary foundation for the real research. The BproVector branch has the fastest code for running the BPRO feature set. The bproTime branch has the code for running the BPROST feature set. The Blob branch has the code for running the Blob-BPROST feature set (blob is the name of our simple method for detecting sprites on screen). Other branches contain other experiments (e.g., we tried to implement tug-of-war hashing). Another interesting branch is blobThreeVersion1 (and blobThreeVersion2). Instead of computing pairwise offsets, it computes offsets between three pixels, and it still has spatial invariance built in.

The blob and bproTime branches have separate functions for generating temporal offsets (one pixel comes from the current screen and the other from the screen five frames ago) and spatial offsets (both pixels come from the current screen). The functions are called addTimeDimensionalOffsets and addRelativeFeaturesIndices, respectively. If you only want to run Blob-PROS or Blob-PROT (or B-PROS/B-PROT), you can comment out the function for the offsets you do not need. Both branches also contain a redundant function called addThreePointOffsetsIndices, which never gets called.
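The split between spatial and temporal offsets can be sketched as follows. This is a simplified illustration, not the repository's addTimeDimensionalOffsets or addRelativeFeaturesIndices. The class `OffsetFeatures`, its flags, the `(row, col, color)` input layout, and the sign convention of the offsets are all assumptions. Only the five-frame lag comes from the description above.

```python
from collections import deque

FRAME_LAG = 5  # the older screen is five frames back, as stated above


def spatial_offsets(current):
    """Offsets between pairs of cells that are both on the current screen."""
    return {("S", k1, k2, r2 - r1, c2 - c1)
            for (r1, c1, k1) in current
            for (r2, c2, k2) in current}


def temporal_offsets(current, past):
    """Offsets from a cell on the current screen to a cell on the past screen."""
    return {("T", k1, k2, r2 - r1, c2 - c1)
            for (r1, c1, k1) in current
            for (r2, c2, k2) in past}


class OffsetFeatures:
    """Keeps a short screen history and combines the two offset types."""

    def __init__(self, use_spatial=True, use_temporal=True):
        self.use_spatial = use_spatial
        self.use_temporal = use_temporal
        self.history = deque(maxlen=FRAME_LAG + 1)

    def step(self, current):
        self.history.append(current)
        features = set()
        if self.use_spatial:
            features |= spatial_offsets(current)
        # Temporal offsets need a screen from exactly FRAME_LAG frames ago.
        if self.use_temporal and len(self.history) == FRAME_LAG + 1:
            features |= temporal_offsets(current, self.history[0])
        return features


# Temporal-only variant: a single cell moving down one row per frame.
extractor = OffsetFeatures(use_spatial=False)
outputs = [extractor.step({(t, 0, 1)}) for t in range(6)]
```

Turning off one of the two flags plays the same role as commenting out one of the two functions: only the remaining kind of offset is generated.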

The latest research results are available at http://arxiv.org/abs/1512.01563

Branches whose names start with Adaptive contain our code for an attempted adaptive feature representation in ALE. This work is still in progress.

Contributors

yitaoliang
