ksm26 / quantization-fundamentals-with-hugging-face
Learn linear quantization techniques using the Quanto library and downcasting methods with the Transformers library to compress and optimize generative AI models effectively.
Home Page: https://www.deeplearning.ai/short-courses/quantization-fundamentals-with-hugging-face/
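To make the term "linear quantization" in the description concrete, here is a minimal sketch in plain Python. It is not the Quanto API and is not taken from the repository; the function names `linear_quantize` and `linear_dequantize`, the 8-bit default, and the sample weights are all assumptions chosen for illustration. It maps floats to signed integers with a scale and zero point, then maps them back.

```python
# Illustrative sketch only: hypothetical helper names, not the Quanto API.

def linear_quantize(values, num_bits=8):
    """Asymmetric linear quantization of a list of floats to signed integers."""
    q_min = -(2 ** (num_bits - 1))      # -128 for 8 bits
    q_max = 2 ** (num_bits - 1) - 1     # 127 for 8 bits
    r_min, r_max = min(values), max(values)

    # The scale maps the real-valued range onto the integer range.
    # Fall back to 1.0 when all values are identical to avoid dividing by zero.
    scale = (r_max - r_min) / (q_max - q_min) if r_max != r_min else 1.0

    # The zero point is the integer that represents the real value 0.0.
    zero_point = int(round(q_min - r_min / scale))
    zero_point = max(q_min, min(q_max, zero_point))

    quantized = [
        max(q_min, min(q_max, int(round(v / scale + zero_point))))
        for v in values
    ]
    return quantized, scale, zero_point


def linear_dequantize(quantized, scale, zero_point):
    """Map quantized integers back to approximate float values."""
    return [scale * (q - zero_point) for q in quantized]


weights = [-1.5, -0.2, 0.0, 0.7, 2.3]   # made-up example values
q, scale, zp = linear_quantize(weights)
restored = linear_dequantize(q, scale, zp)

for original, approx in zip(weights, restored):
    print(f"{original:+.4f} -> {approx:+.4f}")
```

The restored values differ from the originals by a small rounding error, which is the precision traded for storing each weight in 8 bits instead of 32.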