mlx/benchmarks/python/comparative/bench_mlx.py at e1bdf6a8d9ba140352f04dd5f714e419ae38c18f

mirror of https://github.com/ml-explore/mlx.git synced 2025-12-16 01:49:05 +08:00

Awni Hannun 7a34e46677 Quantize with groups of 32 (#511)

* allow quantize with group sizes of 32

* missing cpu dispatch

* remove print

* Fix qvm for group_size 32

---------

Co-authored-by: Angelos Katharopoulos <a_katharopoulos@apple.com>

2024-01-21 06:19:05 -08:00

11 KiB
