Files
mlx-examples/llms/tests
Awni Hannun ee60e2a9d5 Kv cache (#643)
* in place kv_cache

* fix

* fix kv cache size

* partially fix kv cache dtype

* step kv cache

* multiple of step size

* more teests + kv cache

* more kv cache

* udpate all models to use kv cache
2024-05-08 08:18:13 -07:00
..
2024-05-08 08:18:13 -07:00