mlx/src at a098bc92e0083bf3613559ba3990ca52ed8631c6 - mlx

mirror of https://github.com/ml-explore/mlx.git synced 2025-06-27 11:21:28 +08:00

History

Brian Keene 19fb69e2ed Add memory_efficient_threshold kwarg to sdpa kernel (#1319 ) Allows opt-in to memory efficient GPU shader at proscribed sequence length. Otherwise, utilizes aggregate MLX primitives for best latency.		2024-08-12 12:57:09 -07:00
..
array.cpp	Move to nanobind v2 (#1316 )	2024-08-08 17:17:46 -07:00
buffer.h	Fix buffer protocol buffer size designation (#1010 )	2024-04-19 06:06:13 -07:00
CMakeLists.txt	Comms (#1097 )	2024-05-23 17:04:02 -07:00
constants.cpp	Array api (#1289 )	2024-07-26 10:40:49 -07:00
convert.cpp	fix creating array from bf16 tensors in jax / torch (#1305 )	2024-08-01 16:20:51 -07:00
convert.h	list based indexing (#1150 )	2024-05-22 15:52:05 -07:00
device.cpp	Switch to nanobind (#839 )	2024-03-18 20:12:25 -07:00
distributed.cpp	Add docs for the distributed namespace (#1184 )	2024-06-06 11:37:00 -07:00
fast.cpp	Add memory_efficient_threshold kwarg to sdpa kernel (#1319 )	2024-08-12 12:57:09 -07:00
fft.cpp	Switch to nanobind (#839 )	2024-03-18 20:12:25 -07:00
indexing.cpp	Fixes for large arrays with a few ops (#1299 )	2024-07-30 17:18:39 -07:00
indexing.h	Switch to nanobind (#839 )	2024-03-18 20:12:25 -07:00
linalg.cpp	CPU mx.linalg.cholesky_inverse and mx.linalg.tri_inv (#1307 )	2024-08-08 15:18:02 -07:00
load.cpp	Do not store iostream in shared_ptr (#872 )	2024-03-22 06:54:45 -07:00
load.h	Switch to nanobind (#839 )	2024-03-18 20:12:25 -07:00
metal.cpp	Reset peak memory (#1074 )	2024-05-03 17:12:51 -07:00
mlx.cpp	Comms (#1097 )	2024-05-23 17:04:02 -07:00
ops.cpp	Add "edge" mode to mx.pad (#1309 )	2024-08-06 11:23:10 -07:00
random.cpp	Einsum (#1269 )	2024-07-25 09:36:44 -07:00
stream.cpp	Cpp docs (#1036 )	2024-04-26 12:56:05 -07:00
transforms.cpp	docs fixes (#1263 )	2024-07-11 15:59:07 -07:00
trees.cpp	Properly handle negative axes in python vmap (#944 )	2024-04-02 18:07:23 -07:00
trees.h	Properly handle negative axes in python vmap (#944 )	2024-04-02 18:07:23 -07:00
utils.cpp	fix conversion to array (#1070 )	2024-05-06 16:02:49 -07:00
utils.h	fix conversion to array (#1070 )	2024-05-06 16:02:49 -07:00