Files
mlx/python/src
Ethan a749a91c75 Support disable metal buffer cache to prevent performance degradation caused by large memory caching (#390)
* support disable metal buffer cache, due to large unused memory buffered when llm generated long context tokens

* Run format and add "cache_enabled" feature tests
2024-01-18 08:33:34 -08:00
..
2024-01-11 06:47:29 -08:00
2023-11-30 11:12:53 -08:00
2024-01-09 13:36:51 -08:00
2024-01-09 13:36:51 -08:00
2023-12-26 19:42:04 -08:00
2024-01-10 13:22:48 -08:00
2024-01-10 13:22:48 -08:00
2024-01-11 06:47:29 -08:00
2024-01-17 12:42:39 -08:00
2023-11-30 11:12:53 -08:00
2023-12-11 13:42:55 -08:00