Alex Barron
|
27d70c7d9d
|
Feature complete Metal FFT (#1102)
* feature complete metal fft
* fix contiguity bug
* jit fft
* simplify rader/bluestein constant computation
* remove kernel/utils.h dep
* remove bf16.h dep
* format
---------
Co-authored-by: Alex Barron <abarron22@apple.com>
|
2024-06-06 12:57:25 -07:00 |
|
Nikhil Mehta
|
0b7d71fd2f
|
Add softmin, hardshrink, hardtanh (#1180)
---------
Co-authored-by: Nikhil Mehta <nikmehta@tesla.com>
|
2024-06-04 15:48:18 -07:00 |
|
Alex Barron
|
2e7c02d5cd
|
Metal FFT for powers of 2 up to 2048 (#915)
* add Metal FFT for powers of 2
* skip GPU test on linux
* fix contiguity bug
* address comments
* Update mlx/backend/metal/fft.cpp
* Update mlx/backend/metal/fft.cpp
* fix bug in synch
---------
Co-authored-by: Alex Barron <abarron22@apple.com>
Co-authored-by: Awni Hannun <awni.hannun@gmail.com>
Co-authored-by: Awni Hannun <awni@apple.com>
|
2024-04-11 21:40:06 -07:00 |
|