Mirror of https://github.com/ml-explore/mlx.git, synced 2025-06-24 09:21:16 +08:00
* allow quantize with group sizes of 32
* missing cpu dispatch
* remove print
* Fix qvm for group_size 32

---------

Co-authored-by: Angelos Katharopoulos <a_katharopoulos@apple.com>

| Name |
|---|
| blas |
| comparative |
| batch_matmul_bench.py |
| llama_jax_bench.py |
| llama_mlx_bench.py |
| llama_torch_bench.py |
| single_ops.py |
| time_utils.py |