.. |
kernels
|
feat: metal formatting and pre-commit bump (#1038)
|
2024-04-30 07:18:09 -07:00 |
mps
|
copyright + ack
|
2023-11-30 11:12:53 -08:00 |
allocator.cpp
|
Cpp docs (#1036)
|
2024-04-26 12:56:05 -07:00 |
allocator.h
|
Expose function to clear memory cache (#1032)
|
2024-04-24 16:48:51 -07:00 |
CMakeLists.txt
|
Shared events for synchronization + async eval (#998)
|
2024-04-17 06:16:02 -07:00 |
compiled_preamble.h
|
Kernel generation (#614)
|
2024-02-07 13:15:59 -08:00 |
compiled.cpp
|
Explicit barriers with concurrent dispatch (#977)
|
2024-04-10 21:45:31 -07:00 |
conv.cpp
|
Add groups to Conv1d (#948)
|
2024-04-27 06:24:57 -07:00 |
copy.cpp
|
Explicit barriers with concurrent dispatch (#977)
|
2024-04-10 21:45:31 -07:00 |
copy.h
|
Add a SliceUpdate op and primitive (#850)
|
2024-03-20 10:39:25 -07:00 |
device.cpp
|
fix slice update indexing (#1053)
|
2024-04-29 12:17:40 -07:00 |
device.h
|
feat: metal formatting and pre-commit bump (#1038)
|
2024-04-30 07:18:09 -07:00 |
event.cpp
|
Shared events for synchronization + async eval (#998)
|
2024-04-17 06:16:02 -07:00 |
fft.cpp
|
Metal FFT for powers of 2 up to 2048 (#915)
|
2024-04-11 21:40:06 -07:00 |
indexing.cpp
|
Explicit barriers with concurrent dispatch (#977)
|
2024-04-10 21:45:31 -07:00 |
make_compiled_preamble.sh
|
quote file name (#670)
|
2024-02-11 10:33:30 -08:00 |
matmul.cpp
|
Add groups to Conv1d (#948)
|
2024-04-27 06:24:57 -07:00 |
matmul.h
|
Add groups to Conv1d (#948)
|
2024-04-27 06:24:57 -07:00 |
metal_impl.h
|
Add synchronize function (#1006)
|
2024-04-22 08:25:46 -07:00 |
metal.cpp
|
Add synchronize function (#1006)
|
2024-04-22 08:25:46 -07:00 |
metal.h
|
Expose function to clear memory cache (#1032)
|
2024-04-24 16:48:51 -07:00 |
normalization.cpp
|
Explicit barriers with concurrent dispatch (#977)
|
2024-04-10 21:45:31 -07:00 |
primitives.cpp
|
Add bitwise ops (#1037)
|
2024-04-26 22:03:42 -07:00 |
quantized.cpp
|
Simplifying and improving qmm (#1030)
|
2024-04-24 13:07:45 -07:00 |
reduce.cpp
|
Explicit barriers with concurrent dispatch (#977)
|
2024-04-10 21:45:31 -07:00 |
reduce.h
|
Explicit barriers with concurrent dispatch (#977)
|
2024-04-10 21:45:31 -07:00 |
rope.cpp
|
Explicit barriers with concurrent dispatch (#977)
|
2024-04-10 21:45:31 -07:00 |
scaled_dot_product_attention.cpp
|
Explicit barriers with concurrent dispatch (#977)
|
2024-04-10 21:45:31 -07:00 |
scan.cpp
|
Explicit barriers with concurrent dispatch (#977)
|
2024-04-10 21:45:31 -07:00 |
softmax.cpp
|
Explicit barriers with concurrent dispatch (#977)
|
2024-04-10 21:45:31 -07:00 |
sort.cpp
|
Explicit barriers with concurrent dispatch (#977)
|
2024-04-10 21:45:31 -07:00 |
utils.h
|
Metal FFT for powers of 2 up to 2048 (#915)
|
2024-04-11 21:40:06 -07:00 |