.. |
copy
|
rebase + nit (#2260)
|
2025-06-10 10:51:51 -07:00 |
iterators
|
CUDA backend: argreduce (#2270)
|
2025-06-11 13:26:17 -07:00 |
kernels
|
CUDA backend: reduce (#2269)
|
2025-06-11 11:22:25 -07:00 |
reduce
|
CUDA backend: reduce (#2269)
|
2025-06-11 11:22:25 -07:00 |
allocator.cpp
|
Avoid invoking allocator::malloc when creating CUDA event (#2232)
|
2025-06-03 16:48:40 -07:00 |
allocator.h
|
Avoid invoking allocator::malloc when creating CUDA event (#2232)
|
2025-06-03 16:48:40 -07:00 |
arg_reduce.cu
|
CUDA backend: argreduce (#2270)
|
2025-06-11 13:26:17 -07:00 |
binary.cu
|
CUDA backend: binary ops (#2259)
|
2025-06-10 06:37:40 -07:00 |
CMakeLists.txt
|
Fix warnings from latest CUDA toolkit (#2275)
|
2025-06-12 06:03:01 -07:00 |
copy.cu
|
rebase + nit (#2260)
|
2025-06-10 10:51:51 -07:00 |
cuda.cpp
|
start cuda circle config (#2256)
|
2025-06-10 21:19:47 -07:00 |
cuda.h
|
start cuda circle config (#2256)
|
2025-06-10 21:19:47 -07:00 |
device.cpp
|
CUDA backend: matmul (#2241)
|
2025-06-06 12:24:04 -07:00 |
device.h
|
CUDA backend: matmul (#2241)
|
2025-06-06 12:24:04 -07:00 |
eval.cpp
|
CUDA backend: backbone (#2075)
|
2025-05-06 21:26:46 -07:00 |
event.cu
|
Avoid atomic updates across CPU/GPU in CUDA event (#2231)
|
2025-06-03 16:49:06 -07:00 |
event.h
|
CUDA backend: backbone (#2075)
|
2025-05-06 21:26:46 -07:00 |
fence.cpp
|
Avoid atomic updates across CPU/GPU in CUDA event (#2231)
|
2025-06-03 16:49:06 -07:00 |
kernel_utils.cu
|
Move some dims utils to common (#2223)
|
2025-05-29 06:48:30 -07:00 |
kernel_utils.cuh
|
CUDA backend: reduce (#2269)
|
2025-06-11 11:22:25 -07:00 |
layer_norm.cu
|
CUDA backend: layernorm (#2271)
|
2025-06-11 15:48:32 -07:00 |
logsumexp.cu
|
CUDA backend: softmax (#2272)
|
2025-06-11 13:55:22 -07:00 |
matmul.cpp
|
CUDA backend: matmul (#2241)
|
2025-06-06 12:24:04 -07:00 |
no_cuda.cpp
|
start cuda circle config (#2256)
|
2025-06-10 21:19:47 -07:00 |
primitives.cu
|
CUDA backend: layernorm (#2271)
|
2025-06-11 15:48:32 -07:00 |
random.cu
|
CUDA backend: random (#2261)
|
2025-06-10 08:59:56 -07:00 |
reduce.cu
|
CUDA backend: reduce (#2269)
|
2025-06-11 11:22:25 -07:00 |
slicing.cpp
|
rebase + nit (#2260)
|
2025-06-10 10:51:51 -07:00 |
softmax.cu
|
CUDA backend: softmax (#2272)
|
2025-06-11 13:55:22 -07:00 |
sort.cu
|
CUDA backend: sort (#2262)
|
2025-06-10 08:59:47 -07:00 |
unary.cu
|
CUDA backend: unary ops (#2158)
|
2025-06-09 06:45:08 -07:00 |
utils.cpp
|
CUDA backend: backbone (#2075)
|
2025-05-06 21:26:46 -07:00 |
utils.h
|
Move some dims utils to common (#2223)
|
2025-05-29 06:48:30 -07:00 |
worker.cpp
|
CUDA backend: backbone (#2075)
|
2025-05-06 21:26:46 -07:00 |
worker.h
|
CUDA backend: backbone (#2075)
|
2025-05-06 21:26:46 -07:00 |