Season's Projects

alpa

Training and serving large-scale neural networks with auto parallelization.

apex

A PyTorch Extension: Tools for easy mixed precision and distributed training in PyTorch

colossalai

Making large AI models cheaper, faster and more accessible

cream

This is a collection of our NAS and Vision Transformer work.

cutlass

CUDA Templates for Linear Algebra Subroutines

dask

Parallel computing with task scheduling

deepspeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

delta

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

dgl

Python package built to ease deep learning on graphs, on top of existing DL frameworks.

easylm

Large language models (LLMs) made easy: EasyLM is a one-stop solution for pre-training, fine-tuning, evaluating, and serving LLMs in JAX/Flax.

flax

Flax is a neural network library for JAX that is designed for flexibility.

gloo

Collective communications library with various primitives for multi-machine training.

horovod

Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.

internlm

InternLM has open-sourced a 7 billion parameter base model, a chat model tailored for practical scenarios and the training system.

internvl

[CVPR 2024] InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks (an open-source alternative to ViT-22B)

jax

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

llava

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

llm.c

LLM training in simple, raw C/CUDA

llvm-project

The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.

lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

mamba-mini

An efficient PyTorch implementation of selective scan in one file that works on both CPU and GPU, with the corresponding mathematical derivation. It is probably the code closest to selective_scan_cuda in mamba.
