mlx/mlx/backend/metal/kernels
Jagrit Digani 89d327075f Enabling fused attention for head dim 128 (#1899)
* Share KV smem

* Fix bfloat error

* Unroll O = S @ V loop

* Perf upgrade

* Remove commented out function

* Add -Wno-c++17-extensions flag to metal flags

* Add -Wno-c++17-extensions flag to metal extension flags
2025-02-26 10:02:06 -08:00