mlx/python/src
Ethan a749a91c75
Support disable metal buffer cache to prevent performance degradation caused by large memory caching (#390)
* support disable metal buffer cache, due to large unused memory buffered when llm generated long context tokens

* Run format and add "cache_enabled" feature tests
2024-01-18 08:33:34 -08:00
..
array.cpp Added formatter structure and a boolean value formatter (#354) 2024-01-18 07:49:41 -08:00
CMakeLists.txt feat: Add numpy constants (#428) 2024-01-11 06:47:29 -08:00
constants.cpp feat: Add numpy constants (#428) 2024-01-11 06:47:29 -08:00
device.cpp Fix stub generation, change graph exporting for arrows to go to outputs (#455) 2024-01-14 14:06:16 -08:00
fft.cpp copyright + ack 2023-11-30 11:12:53 -08:00
indexing.cpp Scatter vjp (#394) 2024-01-09 13:36:51 -08:00
indexing.h Scatter vjp (#394) 2024-01-09 13:36:51 -08:00
linalg.cpp linalg.norm (#187) 2023-12-26 19:42:04 -08:00
load.cpp GGUF support (#350) 2024-01-10 13:22:48 -08:00
load.h GGUF support (#350) 2024-01-10 13:22:48 -08:00
metal.cpp Support disable metal buffer cache to prevent performance degradation caused by large memory caching (#390) 2024-01-18 08:33:34 -08:00
mlx.cpp feat: Add numpy constants (#428) 2024-01-11 06:47:29 -08:00
ops.cpp Update GEMM (#424) 2024-01-17 12:42:39 -08:00
random.cpp make behaviour of dtype arguments consistent and compliant to numpy (#379) 2024-01-05 09:37:46 -08:00
stream.cpp copyright + ack 2023-11-30 11:12:53 -08:00
transforms.cpp Fix stub generation, change graph exporting for arrows to go to outputs (#455) 2024-01-14 14:06:16 -08:00
utils.h Mlx array accessor (#128) 2023-12-11 13:42:55 -08:00