Repo for testing out large language model quantization techniques.
Most notebooks are run in kaggle/google colab.
Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.
Please make sure to update tests as appropriate.