mlx/benchmarks/python at 7a34e46677f56f9803032c0505281dc320a9d20b - mlx

mirror of https://github.com/ml-explore/mlx.git synced 2025-12-16 01:49:05 +08:00

Files

Awni Hannun 7a34e46677 Quantize with groups of 32 (#511 )

* allow quantize with group sizes of 32

* missing cpu dispatch

* remove print

* Fix qvm for group_size 32

---------

Co-authored-by: Angelos Katharopoulos <a_katharopoulos@apple.com>

2024-01-21 06:19:05 -08:00

blas

Update GEMM (#424 )

2024-01-17 12:42:39 -08:00

comparative

Quantize with groups of 32 (#511 )

2024-01-21 06:19:05 -08:00

batch_matmul_bench.py

Add isort pre-commit and run (#68 )

2023-12-08 11:31:47 -08:00

llama_jax_bench.py

2023-11-30 11:12:53 -08:00

llama_mlx_bench.py

2023-11-30 11:12:53 -08:00

llama_torch_bench.py

Add isort pre-commit and run (#68 )

2023-12-08 11:31:47 -08:00

single_ops.py

Add isort pre-commit and run (#68 )

2023-12-08 11:31:47 -08:00

time_utils.py

2023-11-30 11:12:53 -08:00