mlx/python at 1086dc4db070d4f7d211f2bb760f560b98f969f7 - mlx

mirror of https://github.com/ml-explore/mlx.git synced 2025-12-11 15:06:42 +08:00

Files

Brian Keene 19fb69e2ed Add memory_efficient_threshold kwarg to sdpa kernel (#1319 )

Allows opt-in to memory efficient GPU shader at proscribed sequence
length.  Otherwise, utilizes aggregate MLX primitives for best latency.

2024-08-12 12:57:09 -07:00

2024-08-06 11:23:10 -07:00

2024-08-12 12:57:09 -07:00

2024-08-12 12:57:09 -07:00