mlx/benchmarks
Jagrit Digani 9adcd1a650
Support fused masking in Attention (#1924)
* Update API to allow mask='causal' in fast::sdpa

* Add fallback

* Update steel::AttnParams

* Fix typo

* WIP, basic causal

* Update tests

* Update benchmarking

* Update masking loop limits

* Add bool masking and update tests

* Update additive mask

* Update benchmarks

* Update benchmarks

* Update tests

* Update for bfloat error

* Update early exit

* Add random seed to tests
2025-03-20 11:01:32 -07:00
cpp Remove "using namespace mlx::core" in benchmarks/examples (#1685) 2024-12-11 07:08:29 -08:00
numpy Add isort pre-commit and run (#68) 2023-12-08 11:31:47 -08:00
python Support fused masking in Attention (#1924) 2025-03-20 11:01:32 -07:00