mlx/python
Awni Hannun e613d0eaf0
SDPA support for small batch (over sequence) queries (#1922)
* batch query sdpa

* batch sdpa for query
2025-03-04 10:59:04 -08:00
..
mlx Ring docs (#1829) 2025-02-28 11:34:21 -08:00
src RMS norm without scaling (#1915) 2025-02-28 20:26:57 -08:00
tests SDPA support for small batch (over sequence) queries (#1922) 2025-03-04 10:59:04 -08:00