conerwei's Projects

a3c_trading

Trading with recurrent actor-critic reinforcement learning

alpha-zero-general

A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more

blog

A technical blog on Python machine learning algorithms, with original in-depth content and hands-on code!

chatglm2-6b

ChatGLM2-6B: An Open Bilingual Chat LLM

dalai

The simplest way to run LLaMA on your local machine

dqn-ddpg_stock_trading

Using DQN/DDPG for stock trading. Xiong, Z., Liu, X.Y., Zhong, S., Yang, H. and Walid, A., 2018. Practical deep reinforcement learning approach for stock trading, NeurIPS 2018 AI in Finance Workshop.

elegantrl

Scalable and Elastic Deep Reinforcement Learning Using PyTorch. Please star. 🔥

fastchat

The release repo for "Vicuna: An Open Chatbot Impressing GPT-4"

finrl

FinRL: Financial Reinforcement Learning Framework. Please star. 🔥

finrl-meta

FinRL-Meta: A Universe for Data-Driven Financial Reinforcement Learning. 🔥

gdrl

Grokking Deep Reinforcement Learning

glm-130b

GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)

gops

General Optimal control Problem Solver (GOPS), an easy-to-use PyTorch reinforcement learning solver package for industrial control.

gpt4all

gpt4all: a chatbot trained on a massive collection of clean assistant data including code, stories and dialogue

gpteacher

A collection of modular datasets generated by GPT-4: General-Instruct, Roleplay-Instruct, Code-Instruct, and Toolformer

gym-anytrading

The simplest, most flexible, and most comprehensive OpenAI Gym trading environment (Approved by OpenAI Gym)

jarvis

JARVIS, a system to connect LLMs with the ML community

leela-zero

Go engine with no human-provided knowledge, modeled after the AlphaGo Zero paper.

lightzero

LightZero: A lightweight and efficient MCTS/AlphaZero/MuZero algorithm toolkit.

lmflow

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Language Model for All.

mm-cot

Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)

open-assistant

OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
