jit | scatter axis + gather axis primitives (#1813) | 2025-01-31 20:48:08 -08:00
kernels | Bitwise Inverse (#1862) | 2025-02-13 08:44:14 -08:00
allocator.cpp | track resource limit and throw if exceeded (#1718) | 2024-12-18 18:45:58 -08:00
allocator.h | track resource limit and throw if exceeded (#1718) | 2024-12-18 18:45:58 -08:00
binary.cpp | Allow no copy negative strides in as_strided and slice (#1688) | 2024-12-12 08:59:45 -08:00
binary.h | Fixes for large arrays with a few ops (#1299) | 2024-07-30 17:18:39 -07:00
CMakeLists.txt | MinGW support (#1806) | 2025-02-01 12:40:06 -08:00
compiled.cpp | Dynamic broadcasting for shapeless compile/export (#1722) | 2025-01-09 11:04:24 -08:00
conv.cpp | More shape type (#1705) | 2024-12-19 08:08:20 -08:00
copy.cpp | Refactor common into cpu specific and truly common (#1817) | 2025-02-03 15:58:02 -08:00
copy.h | Dynamic slicing (#1741) | 2025-01-07 14:02:16 -08:00
custom_kernel.cpp | don't duplicate malloc with custom kernel init (#1830) | 2025-02-04 13:20:57 -08:00
device.cpp | Faster synchronization Fence primitive (#1773) | 2025-01-17 18:42:19 -08:00
device.h | Faster synchronization Fence primitive (#1773) | 2025-01-17 18:42:19 -08:00
distributed.cpp | Fixes output donation for IO ops on the GPU (#1857) | 2025-02-12 10:52:30 -08:00
event.cpp | fix and cleanup event signal/wait for metal (#1765) | 2025-01-10 18:37:26 -08:00
event.h | fix and cleanup event signal/wait for metal (#1765) | 2025-01-10 18:37:26 -08:00
fence.cpp | Faster synchronization Fence primitive (#1773) | 2025-01-17 18:42:19 -08:00
fence.h | Faster synchronization Fence primitive (#1773) | 2025-01-17 18:42:19 -08:00
fft.cpp | Refactor common into cpu specific and truly common (#1817) | 2025-02-03 15:58:02 -08:00
hadamard.cpp | Fully wrap the command encoder (#1572) | 2024-11-08 11:50:21 -08:00
indexing.cpp | scatter axis + gather axis primitives (#1813) | 2025-01-31 20:48:08 -08:00
jit_kernels.cpp | Move some kernels to get_template_definition (#1782) | 2025-01-21 08:59:44 -08:00
kernels.h | Add missing #pragma once (#1838) | 2025-02-06 11:11:22 -08:00
make_compiled_preamble.sh | Dispatch bf16 at run time when using the JIT (#1584) | 2024-11-15 16:54:36 -08:00
matmul.cpp | Refactor common into cpu specific and truly common (#1817) | 2025-02-03 15:58:02 -08:00
matmul.h | Use int64 stride everywhere (#1671) | 2024-12-09 11:09:02 -08:00
metal_impl.h | Add synchronize function (#1006) | 2024-04-22 08:25:46 -07:00
metal.cpp | Revert "More buffer donation in some cases (#1858)" (#1863) | 2025-02-13 14:21:44 -08:00
metal.h | Added missing unordered_map includes (#1635) | 2024-12-02 07:03:03 -08:00
nojit_kernels.cpp | Dynamic slicing (#1741) | 2025-01-07 14:02:16 -08:00
normalization.cpp | Fully wrap the command encoder (#1572) | 2024-11-08 11:50:21 -08:00
primitives.cpp | Fixes output donation for IO ops on the GPU (#1857) | 2025-02-12 10:52:30 -08:00
quantized.cpp | Faster small batch qmv (#1861) | 2025-02-12 22:02:36 -08:00
reduce.cpp | More shape type (#1705) | 2024-12-19 08:08:20 -08:00
reduce.h | Reductions update (#1351) | 2024-11-04 22:25:16 -08:00
resident.cpp | Fix some leaks and races (#1629) | 2024-11-27 20:01:20 -08:00
resident.h | Wired (#1510) | 2024-10-25 09:35:33 -07:00
rope.cpp | Allow offset to be an mx.array for mx.fast.rope (#1724) | 2024-12-19 15:51:44 -08:00
scaled_dot_product_attention.cpp | Allow different value dimensions in sdpa_vector (#1811) | 2025-01-31 20:58:59 -08:00
scan.cpp | Fully wrap the command encoder (#1572) | 2024-11-08 11:50:21 -08:00
slicing.cpp | Fix a couple of slicing bugs (#1827) | 2025-02-05 19:50:08 -08:00
slicing.h | More shape type (#1705) | 2024-12-19 08:08:20 -08:00
softmax.cpp | Fully wrap the command encoder (#1572) | 2024-11-08 11:50:21 -08:00
sort.cpp | fix sort (#1835) | 2025-02-05 17:16:27 -08:00
ternary.cpp | Allow no copy negative strides in as_strided and slice (#1688) | 2024-12-12 08:59:45 -08:00
ternary.h | Add some internal GPU apis (#1177) | 2024-06-04 09:24:26 -07:00
unary.cpp | Bitwise Inverse (#1862) | 2025-02-13 08:44:14 -08:00
unary.h | Add some internal GPU apis (#1177) | 2024-06-04 09:24:26 -07:00
utils.cpp | Fp64 on the CPU (#1843) | 2025-02-07 15:52:22 -08:00
utils.h | Fixes output donation for IO ops on the GPU (#1857) | 2025-02-12 10:52:30 -08:00