Mirror of https://github.com/ml-explore/mlx.git (synced 2025-10-04 05:18:09 +08:00)

* Allow arbitrary first dim on qmm_t and qmv
* Allow arbitrary first dim on qmm and qvm
* Specialized aligned vs unaligned case
* Add more checks for valid quantizations