mlx/mlx/backend
Alex Barron c52d1600f0
Fused Affine Quantize/Dequantize ops (#1282)
* Add fast affine dequantize

* add full quantize kernel

* fused kernel with scale/bias computation

* fix docstring

* fix no jit error

* fix test

* test fix

* reduce fast api to only affine_quantize
2024-07-29 15:11:38 -07:00
accelerate  Custom transforms (#1246)                     2024-07-10 18:00:01 -07:00
common      Custom transforms (#1246)                     2024-07-10 18:00:01 -07:00
metal       Fused Affine Quantize/Dequantize ops (#1282)  2024-07-29 15:11:38 -07:00
no_cpu      Custom transforms (#1246)                     2024-07-10 18:00:01 -07:00
no_metal    Fused Affine Quantize/Dequantize ops (#1282)  2024-07-29 15:11:38 -07:00