mlx/python
Alex Barron c52d1600f0
Fused Affine Quantize/Dequantize ops (#1282)
* Add fast affine dequantize

* add full quantize kernel

* fused kernel with scale/bias computation

* fix docstring

* fix no jit error

* fix test

* test fix

* reduce fast api to only affine_quantize
2024-07-29 15:11:38 -07:00
mlx    feat: Added "tanh" option to GELU approximation (#1268)    2024-07-28 09:07:56 +02:00
src    Fused Affine Quantize/Dequantize ops (#1282)               2024-07-29 15:11:38 -07:00
tests  Fused Affine Quantize/Dequantize ops (#1282)               2024-07-29 15:11:38 -07:00