mlx/python
Alex Barron d15fa13daf
Batched Quantized Matmul + Fast Small QMV (#1503)
* add fast qmv for small dims

* fix test

* batched cpu

* add batched template param

* refactor metal quantized.cpp
2024-10-21 16:23:17 -07:00
mlx     fix submodule stubs (#1492)                         2024-10-15 16:23:37 -07:00
src     fix gumbel (#1495)                                  2024-10-17 13:52:39 -07:00
tests   Batched Quantized Matmul + Fast Small QMV (#1503)   2024-10-21 16:23:17 -07:00