Files
mlx/benchmarks/python
Jagrit Digani 9adcd1a650 Support fused masking in Attention (#1924)
* Update API to allow mask='causal' in fast::sdpa

* Add fallback

* Update steel::AttnParams

* Fix typo

* WIP, basic causal

* Update tests

* Update benchmarking

* Update masking loop limits

* Add bool masking and update tests

* Update additive mask

* Update benchmarks

* Update benchmarks

* Update tests

* Update for bfloat error

* Update early exit

* Add random seed to tests
2025-03-20 11:01:32 -07:00
..
2024-01-17 12:42:39 -08:00
2024-11-04 22:25:16 -08:00
2024-04-27 06:24:57 -07:00
2024-07-25 09:36:44 -07:00
2025-03-10 06:05:26 -07:00