mlx/mlx
Jagrit Digani 6686e61ca4
Reduce update (#783)
* Split reduction files to reduce compile times

* Add small and medium axis size specializations for row reductions

* Add non-row-reduction options for small and med kernels
2024-03-04 19:09:51 -08:00
..
3rdparty jagrit's commit files 2023-11-29 10:52:08 -08:00
backend Reduce update (#783) 2024-03-04 19:09:51 -08:00
io Remove unused variables (#706) 2024-02-18 12:50:10 -08:00
types Refactor CPU compile preamble (#708) 2024-02-19 06:12:53 -08:00
allocator.cpp Memory allocation (#292) 2024-01-02 11:59:19 -08:00
allocator.h Memory allocation (#292) 2024-01-02 11:59:19 -08:00
array.cpp Fix empty array construction in cpp (#684) 2024-02-13 23:34:17 -08:00
array.h Fix empty array construction in cpp (#684) 2024-02-13 23:34:17 -08:00
CMakeLists.txt Custom primitive + RoPE fat op (#676) 2024-02-14 14:04:25 -08:00
compile.cpp Implement the 'where' primitive for conditional selection (#664) 2024-02-22 15:10:48 -08:00
compile.h Shapeless compilation for some graphs (#687) 2024-02-19 21:43:54 -08:00
device.cpp copyright + ack 2023-11-30 11:12:53 -08:00
device.h copyright + ack 2023-11-30 11:12:53 -08:00
dtype.cpp copyright + ack 2023-11-30 11:12:53 -08:00
dtype.h Pickle + dtype fix for numpy conversion (#763) 2024-03-02 06:09:29 -08:00
fast_primitives.h CPU compile (#691) 2024-02-17 06:54:32 -08:00
fast.cpp Separate fast ops and primitives (#699) 2024-02-16 19:16:39 -08:00
fast.h Separate fast ops and primitives (#699) 2024-02-16 19:16:39 -08:00
fft.cpp Spelling (#342) 2024-01-01 21:08:17 -08:00
fft.h copyright + ack 2023-11-30 11:12:53 -08:00
graph_utils.cpp Kernel generation (#614) 2024-02-07 13:15:59 -08:00
graph_utils.h Kernel generation (#614) 2024-02-07 13:15:59 -08:00
io.h Metadata support for safetensors (#639) 2024-02-08 19:33:15 -08:00
linalg.cpp QR factorization (#310) 2024-01-26 09:27:31 -08:00
linalg.h QR factorization (#310) 2024-01-26 09:27:31 -08:00
mlx.h Custom primitive + RoPE fat op (#676) 2024-02-14 14:04:25 -08:00
ops.cpp nice tensordot for mlx c (#782) 2024-03-04 09:51:02 -08:00
ops.h nice tensordot for mlx c (#782) 2024-03-04 09:51:02 -08:00
primitives.cpp Convolution update (#651) 2024-02-28 20:11:16 -08:00
primitives.h Convolution update (#651) 2024-02-28 20:11:16 -08:00
random.cpp Add loc and scale to random.normal (#638) 2024-02-07 11:49:59 -08:00
random.h Add loc and scale to random.normal (#638) 2024-02-07 11:49:59 -08:00
scheduler.cpp copyright + ack 2023-11-30 11:12:53 -08:00
scheduler.h move all ObjC (via metal-cpp) interaction until post static initializers (#370) 2024-01-04 16:12:00 -08:00
stream.h copyright + ack 2023-11-30 11:12:53 -08:00
transforms_impl.h Shapeless compilation for some graphs (#687) 2024-02-19 21:43:54 -08:00
transforms.cpp Reduce vmap + some fixes (#601) 2024-02-01 11:30:28 -08:00
transforms.h Compile primitive (#571) 2024-02-05 06:51:22 -08:00
utils.cpp Adds device context manager (#679) 2024-02-14 14:14:58 -08:00
utils.h CPU compile (#691) 2024-02-17 06:54:32 -08:00