Mirror of https://github.com/ml-explore/mlx.git (synced 2025-09-17 17:28:10 +08:00)

* Allow arbitrary first dim on qmm_t and qmv
* Allow arbitrary first dim on qmm and qvm
* Specialized aligned vs unaligned case
* Add more checks for valid quantizations