trendingtechnology / diffq Goto Github PK
View Code? Open in Web Editor NEWThis project forked from facebookresearch/diffq
DiffQ performs differentiable quantization using pseudo quantization noise. It can automatically tune the number of bits used per weight or group of weights, in order to achieve a given trade-off between model size and accuracy.
License: Other