mlx/python/tests
Awni Hannun ec72b44417 Add quantize/dequantize for mxfp8 and nvfp4 (#2688)
* Add quantize/dequantize slow path for mxfp8 and nvfp4

* fast cuda kernel for mx/nv quantization

* fallback for cuda < 12.8 (#2697)

* format (#2700)

* fix (#2701)

* metal kernels

* docs

* fix jit

* add default bits and group sizes

* improve quant docs

* fix output type of mxfp4 matmuls
2025-10-28 16:23:12 -07:00