mirror of
https://github.com/ml-explore/mlx.git
synced 2025-09-01 04:24:36 +08:00

* Allow arbitrary first dim on qmm_t and qmv * Allow arbitrary first dim on qmm and qvm * Specialized aligned vs unaligned case * Add more checks for valid quantizations