A deep dive into model quantization: reducing model size while preserving accuracy. Covers symmetric and asymmetric approaches, with code.
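To make the symmetric versus asymmetric distinction concrete, here is a minimal sketch in plain Python. It is not taken from the post: the function names (`symmetric_params`, `asymmetric_params`, `quantize`, `dequantize`), the 8-bit width, and the sample weights are all assumptions chosen for illustration. Symmetric quantization is shown with a signed range and a zero-point fixed at 0; asymmetric quantization is shown with an unsigned range and a computed zero-point.

```python
def symmetric_params(values, num_bits=8):
    """Hypothetical helper: scale for symmetric quantization (zero-point is 0)."""
    qmax = 2 ** (num_bits - 1) - 1  # 127 for 8 bits; range is [-127, 127]
    max_abs = max(abs(v) for v in values)
    scale = max_abs / qmax if max_abs > 0 else 1.0
    return scale, 0, -qmax, qmax


def asymmetric_params(values, num_bits=8):
    """Hypothetical helper: scale and zero-point for asymmetric quantization."""
    qmin, qmax = 0, 2 ** num_bits - 1  # [0, 255] for 8 bits
    # Extend the range to include 0.0 so that zero is exactly representable.
    lo = min(min(values), 0.0)
    hi = max(max(values), 0.0)
    scale = (hi - lo) / (qmax - qmin) if hi > lo else 1.0
    zero_point = round(qmin - lo / scale)
    zero_point = max(qmin, min(qmax, zero_point))
    return scale, zero_point, qmin, qmax


def quantize(values, scale, zero_point, qmin, qmax):
    """Map floats to integers: round(v / scale) + zero_point, clamped to range."""
    return [max(qmin, min(qmax, round(v / scale) + zero_point)) for v in values]


def dequantize(q, scale, zero_point):
    """Map integers back to approximate floats."""
    return [(x - zero_point) * scale for x in q]


# Illustrative weights only; a skewed range is where the two schemes differ most.
weights = [-0.5, 0.0, 0.25, 1.5]
for name, params_fn in [("symmetric", symmetric_params),
                        ("asymmetric", asymmetric_params)]:
    scale, zp, qmin, qmax = params_fn(weights)
    q = quantize(weights, scale, zp, qmin, qmax)
    restored = dequantize(q, scale, zp)
    max_err = max(abs(a - b) for a, b in zip(weights, restored))
    print(name, "scale:", scale, "zero_point:", zp, "max error:", max_err)
```

With a skewed range like the one above, the asymmetric scheme spends its integer levels on the interval actually occupied by the values, so its step size is smaller, while the symmetric scheme keeps the arithmetic simpler by avoiding a zero-point.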