mlx/python at 70560b6bd5324efbfd9f97c9076c884d21ffac34 - mlx

mirror of https://github.com/ml-explore/mlx.git synced 2025-12-16 01:49:05 +08:00

Files

Awni Hannun 70560b6bd5 Add mode parameter for quantization (#2499 )

* add mode parameter for quantization

* mxfp4 quantize/dequantize + start of optional biases

* mxfp4 works

* speedup

* cpu mxfp4

* fix

* fix test tol

* fix

* refactor

* add quant mode enum

2025-08-28 06:45:26 -07:00

mlx

Add mode parameter for quantization (#2499 )

2025-08-28 06:45:26 -07:00

scripts

nccl dep + default for cuda (#2526 )

2025-08-21 17:57:49 -07:00

src

Add mode parameter for quantization (#2499 )

2025-08-28 06:45:26 -07:00

tests

Add mode parameter for quantization (#2499 )

2025-08-28 06:45:26 -07:00