mlx/mlx/backend/metal
2025-02-13 14:21:44 -08:00
jit scatter axis + gather axis primitives (#1813) 2025-01-31 20:48:08 -08:00
kernels Bitwise Inverse (#1862) 2025-02-13 08:44:14 -08:00
allocator.cpp track resource limit and throw if exceeded (#1718) 2024-12-18 18:45:58 -08:00
allocator.h track resource limit and throw if exceeded (#1718) 2024-12-18 18:45:58 -08:00
binary.cpp Allow no copy negative strides in as_strided and slice (#1688) 2024-12-12 08:59:45 -08:00
binary.h Fixes for large arrays with a few ops (#1299) 2024-07-30 17:18:39 -07:00
CMakeLists.txt MinGW support (#1806) 2025-02-01 12:40:06 -08:00
compiled.cpp Dynamic broadcasting for shapeless compile/export (#1722) 2025-01-09 11:04:24 -08:00
conv.cpp More shape type (#1705) 2024-12-19 08:08:20 -08:00
copy.cpp Refactor common into cpu specific and truly common (#1817) 2025-02-03 15:58:02 -08:00
copy.h Dynamic slicing (#1741) 2025-01-07 14:02:16 -08:00
custom_kernel.cpp don't duplicate malloc with custom kernel init (#1830) 2025-02-04 13:20:57 -08:00
device.cpp Faster synchronization Fence primitive (#1773) 2025-01-17 18:42:19 -08:00
device.h Faster synchronization Fence primitive (#1773) 2025-01-17 18:42:19 -08:00
distributed.cpp Fixes output donation for IO ops on the GPU (#1857) 2025-02-12 10:52:30 -08:00
event.cpp fix and cleanup event signal/wait for metal (#1765) 2025-01-10 18:37:26 -08:00
event.h fix and cleanup event signal/wait for metal (#1765) 2025-01-10 18:37:26 -08:00
fence.cpp Faster synchronization Fence primitive (#1773) 2025-01-17 18:42:19 -08:00
fence.h Faster synchronization Fence primitive (#1773) 2025-01-17 18:42:19 -08:00
fft.cpp Refactor common into cpu specific and truly common (#1817) 2025-02-03 15:58:02 -08:00
hadamard.cpp Fully wrap the command encoder (#1572) 2024-11-08 11:50:21 -08:00
indexing.cpp scatter axis + gather axis primitives (#1813) 2025-01-31 20:48:08 -08:00
jit_kernels.cpp Move some kernels to get_template_definition (#1782) 2025-01-21 08:59:44 -08:00
kernels.h Add missing #pragma once (#1838) 2025-02-06 11:11:22 -08:00
make_compiled_preamble.sh Dispatch bf16 at run time when using the JIT (#1584) 2024-11-15 16:54:36 -08:00
matmul.cpp Refactor common into cpu specific and truly common (#1817) 2025-02-03 15:58:02 -08:00
matmul.h Use int64 stride everywhere (#1671) 2024-12-09 11:09:02 -08:00
metal_impl.h Add synchronize function (#1006) 2024-04-22 08:25:46 -07:00
metal.cpp Revert "More buffer donation in some cases (#1858)" (#1863) 2025-02-13 14:21:44 -08:00
metal.h Added missing unordered_map includes (#1635) 2024-12-02 07:03:03 -08:00
nojit_kernels.cpp Dynamic slicing (#1741) 2025-01-07 14:02:16 -08:00
normalization.cpp Fully wrap the command encoder (#1572) 2024-11-08 11:50:21 -08:00
primitives.cpp Fixes output donation for IO ops on the GPU (#1857) 2025-02-12 10:52:30 -08:00
quantized.cpp Faster small batch qmv (#1861) 2025-02-12 22:02:36 -08:00
reduce.cpp More shape type (#1705) 2024-12-19 08:08:20 -08:00
reduce.h Reductions update (#1351) 2024-11-04 22:25:16 -08:00
resident.cpp Fix some leaks and races (#1629) 2024-11-27 20:01:20 -08:00
resident.h Wired (#1510) 2024-10-25 09:35:33 -07:00
rope.cpp Allow offset to be an mx.array for mx.fast.rope (#1724) 2024-12-19 15:51:44 -08:00
scaled_dot_product_attention.cpp Allow different value dimensions in sdpa_vector (#1811) 2025-01-31 20:58:59 -08:00
scan.cpp Fully wrap the command encoder (#1572) 2024-11-08 11:50:21 -08:00
slicing.cpp Fix a couple of slicing bugs (#1827) 2025-02-05 19:50:08 -08:00
slicing.h More shape type (#1705) 2024-12-19 08:08:20 -08:00
softmax.cpp Fully wrap the command encoder (#1572) 2024-11-08 11:50:21 -08:00
sort.cpp fix sort (#1835) 2025-02-05 17:16:27 -08:00
ternary.cpp Allow no copy negative strides in as_strided and slice (#1688) 2024-12-12 08:59:45 -08:00
ternary.h Add some internal GPU apis (#1177) 2024-06-04 09:24:26 -07:00
unary.cpp Bitwise Inverse (#1862) 2025-02-13 08:44:14 -08:00
unary.h Add some internal GPU apis (#1177) 2024-06-04 09:24:26 -07:00
utils.cpp Fp64 on the CPU (#1843) 2025-02-07 15:52:22 -08:00
utils.h Fixes output donation for IO ops on the GPU (#1857) 2025-02-12 10:52:30 -08:00