mlx/python/tests/test_quantized.py at 3b2ffcefc3d6a93670819904a4cea210bb547f0b

mirror of https://github.com/ml-explore/mlx.git synced 2025-12-16 01:49:05 +08:00

Awni Hannun ec72b44417 Add quantize/dequantize for mxfp8 and nvfp4 (#2688)

* Add quantize/dequantize slow path for mxfp8 and nvfp4

* fast cuda kernel for mx/nv quantization

* fallback for cuda < 12.8 (#2697)

* format (#2700)

* fix (#2701)

* metal kernels

* docs

* fix jit

* add default bits and group sizes

* improve quant docs

* fix output type of mxfp4 matmuls

2025-10-28 16:23:12 -07:00

35 KiB
