mlx/benchmarks
Vijay Krish 06072601ce
Scatter optimization : Eliminate 64b integer divide. (#662)
Launch a 2D grid to eliminate the divide and mod in device code,
since 64b integer division is very expensive.

GitHub Issue #506

Co-authored-by: Vijay Krishnamoorthy <vijay_krish@apple.com>
2024-02-10 08:49:51 -08:00
cpp Multi output primitives (#330) 2024-01-08 16:39:08 -08:00
numpy Add isort pre-commit and run (#68) 2023-12-08 11:31:47 -08:00
python Scatter optimization : Eliminate 64b integer divide. (#662) 2024-02-10 08:49:51 -08:00
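
The scatter commit message above describes replacing a per-thread divide and mod with a 2D launch. The following is a minimal sketch of that idea in plain Python, not the actual MLX kernel: the loops stand in for GPU threads, and the names `flat_launch`, `grid_launch`, `num_updates`, and `update_size` are hypothetical, chosen only for illustration.

```python
def flat_launch(num_updates, update_size):
    """Simulate a 1D launch: each thread recovers its two coordinates
    from a single flat id using an integer divide and a modulo."""
    pairs = []
    for gid in range(num_updates * update_size):
        update_idx = gid // update_size  # integer divide per thread
        elem_idx = gid % update_size     # integer modulo per thread
        pairs.append((update_idx, elem_idx))
    return pairs


def grid_launch(num_updates, update_size):
    """Simulate a 2D launch: the grid position already supplies both
    coordinates, so no divide or modulo is needed in the thread body."""
    pairs = []
    for gid_y in range(num_updates):      # second grid dimension
        for gid_x in range(update_size):  # first grid dimension
            pairs.append((gid_y, gid_x))
    return pairs


if __name__ == "__main__":
    # Both launches visit the same (update, element) pairs.
    print(flat_launch(2, 3) == grid_launch(2, 3))
```

Both functions enumerate the same index pairs; the difference is that the 2D version gets them from the launch geometry instead of computing them, which is where the saving in the commit would come from if the kernel follows this pattern.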