mirror of
https://github.com/ml-explore/mlx.git
synced 2025-09-23 05:58:09 +08:00

Launch 2D grid to eliminate divide and mod in device code, since 64b integer division is very expensive. Github Issue #506 Co-authored-by: Vijay Krishnamoorthy <vijay_krish@apple.com>