mlx/benchmarks/python
Awni Hannun baf9fa5f42
Einsum (#1269)
* einsum initial

* fix comma break

* sum axis was wrong

* small cleanups

* python binding

* changed bindings to resemble numpy

* remove todo comment

* comment changes

* add count of operands/inputs

* fail fast if operands list is empty

* ignore comma if no output

* einsum path matching numpy

* getting somewhere with path

* remove print

* it passes the first test

* moved einsum tests to seperate file

* seperated einsum path

* moved einsum naive

* remove space from equation

* fast fail if no operands passed

* update tests and remove printf

* small cleanup

* some more cleanups

* removed python helper file

* ack

* utilize std for finding min in vector

* duplicate def

* remove the tuple as it was unreadable

* moved einsum_naive back to ops

* remaining isn't needed

* avoid creating another set

* cleanup

* greedy path, start of naive einsum

* more einsum

* fix some bugs

* some more fixes, tests pass

* benchmark

* some simplify

* fix einsum and test

Co-authored-by: Angelos Katharopoulos <a_katharopoulos@apple.com>

* add a bunch more tests and fix a bunch more bugs

* some docs nits

---------

Co-authored-by: dc-dc-dc <dgcruz983@gmail.com>
Co-authored-by: Angelos Katharopoulos <a_katharopoulos@apple.com>
2024-07-25 09:36:44 -07:00
..
blas Update GEMM (#424) 2024-01-17 12:42:39 -08:00
comparative Fix benchmark (#1175) 2024-06-04 07:50:46 -07:00
batch_matmul_bench.py Add isort pre-commit and run (#68) 2023-12-08 11:31:47 -08:00
compile_bench.py Add softmin, hardshrink, hardtanh (#1180) 2024-06-04 15:48:18 -07:00
conv1d_bench.py Add groups to Conv1d (#948) 2024-04-27 06:24:57 -07:00
conv_bench.py Add softmin, hardshrink, hardtanh (#1180) 2024-06-04 15:48:18 -07:00
einsum_bench.py Einsum (#1269) 2024-07-25 09:36:44 -07:00
fft_bench.py Feature complete Metal FFT (#1102) 2024-06-06 12:57:25 -07:00
gather_bench.py Scatter optimization : Eliminate 64b integer divide. (#662) 2024-02-10 08:49:51 -08:00
hadamard_bench.py Fast Hadamard Transform (#1249) 2024-07-09 20:39:01 -07:00
layer_norm_bench.py Implement vjps for some primitives in the fast namespace (#883) 2024-03-26 16:35:34 -07:00
rms_norm_bench.py Implement vjps for some primitives in the fast namespace (#883) 2024-03-26 16:35:34 -07:00
rope_bench.py Fix copy donation and add partial rope (#881) 2024-03-22 17:28:26 -07:00
scatter_bench.py Up to 10x faster scatter. (#709) 2024-02-21 11:09:30 -08:00
sdpa_bench.py Add softmin, hardshrink, hardtanh (#1180) 2024-06-04 15:48:18 -07:00
single_ops.py Propagate nans in binary ops (#579) 2024-01-29 11:19:38 -08:00
time_utils.py Shapeless compilation for some graphs (#687) 2024-02-19 21:43:54 -08:00