Files
mlx/python/tests
Brian Keene 19fb69e2ed Add memory_efficient_threshold kwarg to sdpa kernel (#1319)
Allows opt-in to memory efficient GPU shader at proscribed sequence
length.  Otherwise, utilizes aggregate MLX primitives for best latency.
2024-08-12 12:57:09 -07:00
..
2024-07-26 10:40:49 -07:00
2024-07-10 18:00:01 -07:00
2024-06-14 09:52:26 -07:00
2024-07-26 10:40:49 -07:00
2024-07-25 09:36:44 -07:00
2024-06-11 14:35:12 -07:00
2024-01-08 16:39:08 -08:00
2024-01-30 13:11:01 -08:00
2024-02-25 08:39:55 -08:00
2024-08-06 11:23:10 -07:00
2024-08-05 20:12:27 -07:00