Coder Social home page Coder Social logo

Mengchi Zhang's Projects

aitemplate icon aitemplate

AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.

asfermi icon asfermi

assembler for NVIDIA FERMI. Imported from Google Code

benchmark icon benchmark

TorchBench is a collection of open source benchmarks used to evaluate PyTorch performance.

chipkit icon chipkit

CHIPKIT: An agile, reusable open-source framework for rapid test chip development

ck-artifact-evaluation icon ck-artifact-evaluation

Public CK repository with materials and workflows to reproduce results from published papers or open competitions at ACM, IEEE and NeurIPS conferences and journals

cub icon cub

THIS REPOSITORY HAS MOVED TO github.com/nvidia/cub, WHICH IS AUTOMATICALLY MIRRORED HERE.

cuda-samples icon cuda-samples

Samples for CUDA Developers which demonstrates features in CUDA Toolkit

fairscale icon fairscale

PyTorch extensions for high performance and large scale training.

fbgemm icon fbgemm

FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/

galois icon galois

Galois: C++ library for multi-core and multi-node parallelization

genn icon genn

GeNN is a GPU-enhanced Neuronal Network simulation environment based on code generation for Nvidia CUDA.

gpgpu-sim_distribution icon gpgpu-sim_distribution

GPGPU-Sim provides a detailed simulation model of a contemporary GPU running CUDA and/or OpenCL workloads and now includes an integrated (and validated) energy model, GPUWattch.

gpgpu-sim_simulations icon gpgpu-sim_simulations

A repository that compliments gpgpu-sim, providing automated regression scripts, simulation launching utilities and the code + arguments for simulations that complete in a reasonable amount of time on GPGPU-Sim.

gpufs icon gpufs

GPUfs - File system support for NVIDIA GPUs

gpunet icon gpunet

GPUnet is a native GPU networking layer that provides a socket abstraction over Infiniband to GPU programs for NVIDIA GPUs.

isca-2021-script icon isca-2021-script

A collection of redistributable Python scripts to help organize ISCA 2021 (The 48th International Symposium on Computer Architecture).

llvm-project icon llvm-project

The LLVM Project is a collection of modular and reusable compiler and toolchain technologies. Note: the repository does not accept github pull requests at this moment. Please submit your patches at http://reviews.llvm.org.

maxas icon maxas

Assembler for NVIDIA Maxwell architecture

micrograd icon micrograd

A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API

mightypc icon mightypc

Mighty toolkit for conference Program Chairs.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.