edwinlim0919 / neural-speed Goto Github PK
View Code? Open in Web Editor NEWThis project forked from intel/neural-speed
An innovation library for efficient LLM inference via low-bit quantization and sparsity
Home Page: https://github.com/intel/neural-speed
License: Apache License 2.0