mlx/benchmarks/python
Vijay Krish 06072601ce
Scatter optimization : Eliminate 64b integer divide. (#662)
Launch 2D grid to eliminate divide and mod in device code,
since 64b integer division is very expensive.

Github Issue #506

Co-authored-by: Vijay Krishnamoorthy <vijay_krish@apple.com>
2024-02-10 08:49:51 -08:00
..
blas Update GEMM (#424) 2024-01-17 12:42:39 -08:00
comparative Quantize with groups of 32 (#511) 2024-01-21 06:19:05 -08:00
batch_matmul_bench.py Add isort pre-commit and run (#68) 2023-12-08 11:31:47 -08:00
gather_bench.py Scatter optimization : Eliminate 64b integer divide. (#662) 2024-02-10 08:49:51 -08:00
scatter_bench.py Scatter optimization : Eliminate 64b integer divide. (#662) 2024-02-10 08:49:51 -08:00
single_ops.py Propagate nans in binary ops (#579) 2024-01-29 11:19:38 -08:00
time_utils.py Scatter optimization : Eliminate 64b integer divide. (#662) 2024-02-10 08:49:51 -08:00