Mirror of https://github.com/ml-explore/mlx.git (synced 2025-11-08 13:28:15 +08:00)
Launch a 2D grid to eliminate divide and mod in device code

64-bit integer division is very expensive in device code, so launch a 2D grid instead of recovering the indices with divide and mod.

GitHub issue #506

Co-authored-by: Vijay Krishnamoorthy <vijay_krish@apple.com>