mirror of
https://github.com/ml-explore/mlx.git
synced 2025-09-20 20:18:15 +08:00

* add mode parameter for quantization * mxfp4 quantize/dequantize + start of optional biases * mxfp4 works * speedup * cpu mxfp4 * fix * fix test tol * fix * refactor * add quant mode enum