mlx/python
Alex Barron 26be608470
Add split_k qvm for long context (#1564)
* Add splitk qvm

* configurable splitk

* tuning

* remove extra instantiation

* remove refactor

* separate test

* cpu tolerance
2024-11-05 11:25:19 -08:00
..
mlx improvements to scatter / gather (#1541) 2024-10-30 19:30:54 -07:00
src No extra reshape (#1557) 2024-11-02 19:07:20 -07:00
tests Add split_k qvm for long context (#1564) 2024-11-05 11:25:19 -08:00