Coder Social home page Coder Social logo

purpleyoung's Projects

interactive-multi-objective-reinforcement-learning icon interactive-multi-objective-reinforcement-learning

Multi-objective reinforcement learning deals with finding policies for tasks where there are multiple distinct criteria to optimize for. Since there may be trade-offs between the criteria, there does not necessarily exist a globally best policy; instead, the goal is to find Pareto optimal policies that are the best for certain preference functions. The Pareto Q-learning algorithm looks for all Pareto optimal policies at the same time. Introduced a variant of Pareto Q-learning that asks queries to a user, who is assumed to have an underlying preference function and also the scalarized Q-learning algorithm which reduces the dimensionality of multi-objective space by using scalarization function and ask user preferences by taking weights for scalarization. The goal is to find the optimal policy for that user’s preference function as quickly as possible. Used two benchmark problems i.e. Deep Sea Treasure and Resource Collection for experiments.

jstarcraft-ai icon jstarcraft-ai

目标是提供一个完整的Java机器学习(Machine Learning/ML)框架,作为人工智能在学术界与工业界的桥梁. 让相关领域的研发人员能够在各种软硬件环境/数据结构/算法/模型之间无缝切换. 涵盖了从数据处理到模型的训练与评估各个环节,支持硬件加速和并行计算,是最快最全的Java机器学习库.

jumpy icon jumpy

A Python interface for ND4J: A Numpy Array Wrapper for the JVM

knn-for-pattern-recognize- icon knn-for-pattern-recognize-

KNN形态识别 股票形态识别(如W双底)用图像识别的方法准确率高但速度慢(因要画图),用K-近邻方法以数值型数据计算快准确率基本符合要求(查准率70%左右),可用于对决策时间有要求的交易。 工作完成情况: 1、W双底识别模型查准确率约70% 2、模型文件上载到聚宽后可在回测中调用。

ktdx icon ktdx

Kotlin (Java) version of tdx client

leela-zero icon leela-zero

Go engine with no human-provided knowledge, modeled after the AlphaGo Zero paper.

madmario icon madmario

Interactive tutorial to build a learning Mario, for first-time RL learners

madrl icon madrl

Repo containing code for multi-agent deep reinforcement learning (MADRL).

mas_stock_prediction icon mas_stock_prediction

Simple Reinforcement Learning, Deep Reinforcement Learning, Federated, Multi Agent stock prediction code for this Google stocks dataset: https://www.kaggle.com/thevirusx3/google-stock-market-data

mini-alphastar icon mini-alphastar

A mini-scale reproduction code of the AlphaStar program. Note: the original AlphaStar is the AI proposed by DeepMind to play StarCraft II.

nd4j icon nd4j

Fast, Scientific and Numerical Computing for the JVM (NDArrays)

neuron_poker icon neuron_poker

Texas holdem OpenAi gym poker environment with reinforcement learning based on keras-rl. Includes virtual rendering and montecarlo for equity calculation.

numpywdl icon numpywdl

Implement Wide & Deep algorithm by using NumPy

phoenixgo icon phoenixgo

Go AI program which implements the AlphaGo Zero paper

py-mt4 icon py-mt4

🎰 用Python来写MT4的自动化交易脚本

python-binance icon python-binance

Binance Exchange API python implementation for automated trading

pytorch-a3c icon pytorch-a3c

Simple A3C implementation with pytorch + multiprocessing

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.