mlx/mlx/backend/metal/kernels
Jagrit Digani cec8661113
Add a SliceUpdate op and primitive (#850)
* Enable copy to work with int64 strides
* Fix uniform buffer indices or copy kernel arguments
* Update utils.h
* Remove manual unrolling of elem to loc loop
* GPU copy updated to handle negative strides
* Add slice update primitive
2024-03-20 10:39:25 -07:00
..
reduction Strided reduce specialization for small reductions (#826) 2024-03-14 09:16:53 -07:00
steel No copy gems (#801) 2024-03-12 13:13:41 -07:00
arange.metal copyright + ack 2023-11-30 11:12:53 -08:00
arg_reduce.metal Fix threadgroup memory in arg reduce (#723) 2024-02-21 19:42:16 -08:00
atomic.h Fix eye for larger matrices (#463) 2024-01-16 00:51:24 -08:00
bf16_math.h copyright + ack 2023-11-30 11:12:53 -08:00
bf16.h copyright + ack 2023-11-30 11:12:53 -08:00
binary_two.metal Remainder negative numerator bug fixed (#641) 2024-02-09 16:49:14 -08:00
binary.h Remainder negative numerator bug fixed (#641) 2024-02-09 16:49:14 -08:00
binary.metal Fix logsumexp edge case (#740) 2024-02-25 08:39:55 -08:00
CMakeLists.txt Fast Inference SDPA op (#735) 2024-03-04 21:06:11 -08:00
compiled_preamble.h Implement the 'where' primitive for conditional selection (#664) 2024-02-22 15:10:48 -08:00
complex.h Remainder negative numerator bug fixed (#641) 2024-02-09 16:49:14 -08:00
conv.metal Convolution update (#651) 2024-02-28 20:11:16 -08:00
copy.metal Add a SliceUpdate op and primitive (#850) 2024-03-20 10:39:25 -07:00
defines.h copyright + ack 2023-11-30 11:12:53 -08:00
erf.h copyright + ack 2023-11-30 11:12:53 -08:00
gather.metal Update gather and scatter to not use Argument Encoder (#683) 2024-02-14 13:42:13 -08:00
gemv.metal No copy gems (#801) 2024-03-12 13:13:41 -07:00
indexing.h Update gather and scatter to not use Argument Encoder (#683) 2024-02-14 13:42:13 -08:00
quantized.metal Improved quantized matrix vector product (#786) 2024-03-05 17:32:19 -08:00
random.metal copyright + ack 2023-11-30 11:12:53 -08:00
rope.metal Custom primitive + RoPE fat op (#676) 2024-02-14 14:04:25 -08:00
scaled_dot_product_attention_params.h Fast Inference SDPA op (#735) 2024-03-04 21:06:11 -08:00
scaled_dot_product_attention.metal Compile stride bug (#812) 2024-03-11 06:31:31 -07:00
scan.metal copyright + ack 2023-11-30 11:12:53 -08:00
scatter.metal fix scatter (#821) 2024-03-12 11:42:07 -07:00
softmax.metal copyright + ack 2023-11-30 11:12:53 -08:00
sort.metal Spelling (#342) 2024-01-01 21:08:17 -08:00
ternary.h Implement the 'where' primitive for conditional selection (#664) 2024-02-22 15:10:48 -08:00
ternary.metal Fix logsumexp edge case (#740) 2024-02-25 08:39:55 -08:00
unary.h Fix logsumexp edge case (#740) 2024-02-25 08:39:55 -08:00
unary.metal Kernel generation (#614) 2024-02-07 13:15:59 -08:00
utils.h Add a SliceUpdate op and primitive (#850) 2024-03-20 10:39:25 -07:00