mlx/benchmarks/python
Vijay Krish 972d9a3aea
Up to 10x faster scatter. (#709)
* Faster scatter.

Add specialization for 1-d index tensors.

* Address review comments.

- Check for row contiguity of index, update tensors
  instead of checking strides.
- Add support for 1d specialization with col contiguous update
  tensor, along with a test.

* Nit1

Co-authored-by: Awni Hannun <awni.hannun@gmail.com>

* Nit2

Co-authored-by: Awni Hannun <awni.hannun@gmail.com>

---------

Co-authored-by: Awni Hannun <awni.hannun@gmail.com>
2024-02-21 11:09:30 -08:00
..
blas Update GEMM (#424) 2024-01-17 12:42:39 -08:00
comparative feat: Update pre-commit-config.yaml (#667) 2024-02-11 06:08:20 -08:00
batch_matmul_bench.py Add isort pre-commit and run (#68) 2023-12-08 11:31:47 -08:00
compile_bench.py Shapeless compilation for some graphs (#687) 2024-02-19 21:43:54 -08:00
gather_bench.py Scatter optimization : Eliminate 64b integer divide. (#662) 2024-02-10 08:49:51 -08:00
rope_bench.py Custom primitive + RoPE fat op (#676) 2024-02-14 14:04:25 -08:00
scatter_bench.py Up to 10x faster scatter. (#709) 2024-02-21 11:09:30 -08:00
single_ops.py Propagate nans in binary ops (#579) 2024-01-29 11:19:38 -08:00
time_utils.py Shapeless compilation for some graphs (#687) 2024-02-19 21:43:54 -08:00