yeonan Goto Github PK
Name: Yeonan Ha
Type: User
Blog: https://yeonan.github.io
Name: Yeonan Ha
Type: User
Blog: https://yeonan.github.io
This is the top-level repository for the Accel-Sim framework.
Accelerating Recommender model training by leveraging popular choices -- VLDB 2022
Accelergy is an energy estimation infrastructure for accelerator energy estimations
A benchmarking suite for heterogeneous systems. The primary goal of this project is to improve and update aspects of existing benchmarking suites which are either insufficient or outdated.
Documentation and example programs for custom-developed instruction set architecture of the ANN Processor.
Official repository of the AWS EC2 FPGA Hardware and Software Development Kit
Programmable Neural Network Compression
Open source FPGA-based NIC and platform for in-network compute
CUDA GDB
TLB Benchmarks
Deep Learning Examples
http://vlsiarch.eecs.harvard.edu/research/recommendation/
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Set of datasets for the deep learning recommendation model (DLRM).
Open Source Specialized Computing Stack for Accelerating Deep Neural Networks.
Release of stream-specialization software/hardware stack.
This is a Tensor Train based compression library to compress sparse embedding tables used in large-scale machine learning models such as recommendation and natural language processing. We showed this library can reduce the total model size by up to 100x in Facebook’s open sourced DLRM model while achieving same model quality. Our implementation is faster than the state-of-the-art implementations. Existing the state-of-the-art library also decompresses the whole embedding tables on the fly therefore they do not provide memory reduction during runtime of the training. Our library decompresses only the requested rows therefore can provide 10,000 times memory footprint reduction per embedding table. The library also includes a software cache to store a portion of the entries in the table in decompressed format for faster lookup and process.
Flexible I/O Tester
FireSim: Easy-to-use, Scalable, FPGA-accelerated Cycle-accurate Hardware Simulation in the Cloud
Scalable Network Stack for FPGAs (TCP/IP, RoCEv2)
MLSys 2021 paper: MicroRec: efficient recommendation inference by hardware and data structure solutions
Introduction to FPGA emulation and digital design. This capstone project was part of the 2021 University of San Diego Shiley-Marcos School of Engineering & Computing Showcase.
FpgaNIC is an FPGA-based Versatile 100Gb SmartNIC for GPUs
Galois: C++ library for multi-core and multi-node parallelization
LonestarGPU: Irregular algorithms parallelized for GPUs
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.