mlx/python at c52d1600f02fdb0ed76a2442aba7d870718bd65c - mlx

mirror of https://github.com/ml-explore/mlx.git synced 2025-12-16 01:49:05 +08:00


Alex Barron c52d1600f0 Fused Affine Quantize/Dequantize ops (#1282)

* Add fast affine dequantize

* add full quantize kernel

* fused kernel with scale/bias computation

* fix docstring

* fix no jit error

* fix test

* test fix

* reduce fast api to only affine_quantize

2024-07-29 15:11:38 -07:00

mlx

feat: Added "tanh" option to GELU approximation (#1268)

2024-07-28 09:07:56 +02:00

src

Fused Affine Quantize/Dequantize ops (#1282)

2024-07-29 15:11:38 -07:00

tests

Fused Affine Quantize/Dequantize ops (#1282)

2024-07-29 15:11:38 -07:00