mlx/benchmarks/python
Awni Hannun ccf1645995
Custom primitive + RoPE fat op (#676)
* extensions start

* rope custom op

* fix build

* docs + rope benchmark

* fix test

* Add a Metal kernel for RoPE

* Fix position of traditional

* transform tests

* Move rope computation to float and fix tests

* Fix the test and a typo

* change to fast

* fix no metal build

---------

Co-authored-by: Angelos Katharopoulos <a_katharopoulos@apple.com>
2024-02-14 14:04:25 -08:00
blas                   Update GEMM (#424)                                           2024-01-17 12:42:39 -08:00
comparative            feat: Update pre-commit-config.yaml (#667)                   2024-02-11 06:08:20 -08:00
batch_matmul_bench.py  Add isort pre-commit and run (#68)                           2023-12-08 11:31:47 -08:00
gather_bench.py        Scatter optimization : Eliminate 64b integer divide. (#662)  2024-02-10 08:49:51 -08:00
rope_bench.py          Custom primitive + RoPE fat op (#676)                        2024-02-14 14:04:25 -08:00
scatter_bench.py       Scatter optimization : Eliminate 64b integer divide. (#662)  2024-02-10 08:49:51 -08:00
single_ops.py          Propagate nans in binary ops (#579)                          2024-01-29 11:19:38 -08:00
time_utils.py          Faster gather and scatter. (#682)                            2024-02-13 17:47:41 -08:00