brad-mengchi Goto Github PK
Name: Mengchi Zhang
Type: User
Company: Meta
Bio: Research Scientist at Meta
Location: Menlo Park
Name: Mengchi Zhang
Type: User
Company: Meta
Bio: Research Scientist at Meta
Location: Menlo Park
AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
assembler for NVIDIA FERMI. Imported from Google Code
TorchBench is a collection of open source benchmarks used to evaluate PyTorch performance.
CHIPKIT: An agile, reusable open-source framework for rapid test chip development
Public CK repository with materials and workflows to reproduce results from published papers or open competitions at ACM, IEEE and NeurIPS conferences and journals
THIS REPOSITORY HAS MOVED TO github.com/nvidia/cub, WHICH IS AUTOMATICALLY MIRRORED HERE.
Samples for CUDA Developers which demonstrates features in CUDA Toolkit
PyTorch extensions for high performance and large scale training.
FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/
Galois: C++ library for multi-core and multi-node parallelization
GeNN is a GPU-enhanced Neuronal Network simulation environment based on code generation for Nvidia CUDA.
Some simple attempts at building GENN models
GPGPU-Sim provides a detailed simulation model of a contemporary GPU running CUDA and/or OpenCL workloads and now includes an integrated (and validated) energy model, GPUWattch.
A repository that compliments gpgpu-sim, providing automated regression scripts, simulation launching utilities and the code + arguments for simulations that complete in a reasonable amount of time on GPGPU-Sim.
GPUfs - File system support for NVIDIA GPUs
GPUnet is a native GPU networking layer that provides a socket abstraction over Infiniband to GPU programs for NVIDIA GPUs.
A collection of redistributable Python scripts to help organize ISCA 2021 (The 48th International Symposium on Computer Architecture).
example LLVM pass
The LLVM Project is a collection of modular and reusable compiler and toolchain technologies. Note: the repository does not accept github pull requests at this moment. Please submit your patches at http://reviews.llvm.org.
Assembler for NVIDIA Maxwell architecture
A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API
Mighty toolkit for conference Program Chairs.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.