mlx/benchmarks/python
Awni Hannun 7a34e46677
Quantize with groups of 32 (#511)
* allow quantize with group sizes of 32

* missing cpu dispatch

* remove print

* Fix qvm for group_size 32

---------

Co-authored-by: Angelos Katharopoulos <a_katharopoulos@apple.com>
2024-01-21 06:19:05 -08:00
..
blas Update GEMM (#424) 2024-01-17 12:42:39 -08:00
comparative Quantize with groups of 32 (#511) 2024-01-21 06:19:05 -08:00
batch_matmul_bench.py Add isort pre-commit and run (#68) 2023-12-08 11:31:47 -08:00
llama_jax_bench.py copyright + ack 2023-11-30 11:12:53 -08:00
llama_mlx_bench.py copyright + ack 2023-11-30 11:12:53 -08:00
llama_torch_bench.py Add isort pre-commit and run (#68) 2023-12-08 11:31:47 -08:00
single_ops.py Add isort pre-commit and run (#68) 2023-12-08 11:31:47 -08:00
time_utils.py copyright + ack 2023-11-30 11:12:53 -08:00