mlx/mlx/backend/metal/kernels
2024-11-04 22:25:16 -08:00
fft Feature complete Metal FFT (#1102) 2024-06-06 12:57:25 -07:00
reduction Reductions update (#1351) 2024-11-04 22:25:16 -08:00
steel Gemm update (#1518) 2024-10-30 19:30:28 -07:00
arange.h More jitting (#1132) 2024-05-23 16:23:44 -07:00
arange.metal More jitting (#1132) 2024-05-23 16:23:44 -07:00
arg_reduce.metal More fixes for arrays with large sizes (#1405) 2024-09-17 12:46:31 -07:00
atomic.h Refactor reductions and fix scatter atomics for large sizes (#1300) 2024-08-22 16:03:31 -07:00
bf16_math.h Get metal version from xcode (#1228) 2024-06-26 07:02:11 -07:00
bf16.h Get metal version from xcode (#1228) 2024-06-26 07:02:11 -07:00
binary_ops.h Fix complex power on Metal (#1460) 2024-10-06 19:58:30 -07:00
binary_two.h Faster Metal unary and binary for general case (#1431) 2024-09-25 12:07:43 -07:00
binary_two.metal Faster metal compiled kernels + some fixes (#1486) 2024-10-14 12:45:38 -07:00
binary.h Faster Metal unary and binary for general case (#1431) 2024-09-25 12:07:43 -07:00
binary.metal Faster metal compiled kernels + some fixes (#1486) 2024-10-14 12:45:38 -07:00
CMakeLists.txt Gemm update (#1518) 2024-10-30 19:30:28 -07:00
complex.h Refactor reductions and fix scatter atomics for large sizes (#1300) 2024-08-22 16:03:31 -07:00
conv.metal Conv grad with groups + bugfix (#1449) 2024-10-06 07:08:53 -07:00
copy.h Optimization for general ND copies (#1421) 2024-09-17 17:59:51 -07:00
copy.metal Faster metal compiled kernels + some fixes (#1486) 2024-10-14 12:45:38 -07:00
defines.h Refactor reductions and fix scatter atomics for large sizes (#1300) 2024-08-22 16:03:31 -07:00
erf.h JIT compile option for binary minimization (#1091) 2024-05-22 12:57:13 -07:00
expm1f.h Fix overflow / underflow handling for expm1f (#1278) 2024-07-23 07:29:06 -07:00
fft.h Feature complete Metal FFT (#1102) 2024-06-06 12:57:25 -07:00
fft.metal Add Quantized Ops to the JIT (#1204) 2024-06-12 09:47:12 -07:00
gather.h improvements to scatter / gather (#1541) 2024-10-30 19:30:54 -07:00
gemv_masked.h Add gemv masked to JIT plus some fixes (#1310) 2024-08-07 13:38:07 -07:00
gemv_masked.metal Add gemv masked to JIT plus some fixes (#1310) 2024-08-07 13:38:07 -07:00
gemv.metal Masked gemv (#1211) 2024-06-14 09:52:26 -07:00
hadamard.h Fix bfloat16 Hadamard (#1283) 2024-07-23 14:54:43 -07:00
indexing.h improvements to scatter / gather (#1541) 2024-10-30 19:30:54 -07:00
layer_norm.metal Fixes for large arrays with a few ops (#1299) 2024-07-30 17:18:39 -07:00
quantized.h Batched Quantized Matmul + Fast Small QMV (#1503) 2024-10-21 16:23:17 -07:00
quantized.metal Batched Quantized Matmul + Fast Small QMV (#1503) 2024-10-21 16:23:17 -07:00
random.metal Faster bits and bernoulli (#1535) 2024-10-28 11:11:00 -07:00
reduce_utils.h More jitting (#1132) 2024-05-23 16:23:44 -07:00
reduce.h Fix JIT reductions (#1373) 2024-08-28 16:39:11 -07:00
reduce.metal Reductions update (#1351) 2024-11-04 22:25:16 -08:00
rms_norm.metal Fixes for large arrays with a few ops (#1299) 2024-07-30 17:18:39 -07:00
rope.metal Fix rope (#1340) 2024-08-20 17:37:52 -07:00
scaled_dot_product_attention_params.h Metal shaders for memory efficient self attention on large sequences (#964) 2024-06-03 09:16:19 -07:00
scaled_dot_product_attention.metal Sdpa fix (#1558) 2024-11-02 21:25:46 -07:00
scan.h Working 64-bit scans (#1506) 2024-10-24 11:05:46 -07:00
scan.metal Working 64-bit scans (#1506) 2024-10-24 11:05:46 -07:00
scatter.h improvements to scatter / gather (#1541) 2024-10-30 19:30:54 -07:00
sdpa_vector.h Sdpa fix (#1558) 2024-11-02 21:25:46 -07:00
softmax.h consistently handle all -inf in softmax (#1470) 2024-10-08 09:54:02 -07:00
softmax.metal More jitting (#1132) 2024-05-23 16:23:44 -07:00
sort.h More fixes for arrays with large sizes (#1405) 2024-09-17 12:46:31 -07:00
sort.metal Fix strided sort bug (#1236) 2024-06-26 14:32:11 -07:00
ternary_ops.h JIT compile option for binary minimization (#1091) 2024-05-22 12:57:13 -07:00
ternary.h Faster Metal unary and binary for general case (#1431) 2024-09-25 12:07:43 -07:00
ternary.metal Faster metal compiled kernels + some fixes (#1486) 2024-10-14 12:45:38 -07:00
unary_ops.h Real and Imag (#1490) 2024-10-15 16:23:15 -07:00
unary.h Real and Imag (#1490) 2024-10-15 16:23:15 -07:00
unary.metal Real and Imag (#1490) 2024-10-15 16:23:15 -07:00
utils.h Working 64-bit scans (#1506) 2024-10-24 11:05:46 -07:00