mlx/mlx/backend/cuda/device
Awni Hannun c9a9180584
Cuda perf tuning (#2307)
* perf tuning

* fix adding inputs arrays in matmul / srot

* format

* fix
2025-06-20 14:50:57 -07:00
arange.cuh CUDA backend: compile (#2276) 2025-06-12 17:08:39 -07:00
atomic_ops.cuh CUDA backend: indexing ops (#2277) 2025-06-12 21:44:19 -07:00
binary_ops.cuh divmod, partition, sort fixes (#2302) 2025-06-16 18:49:32 -07:00
cast_op.cuh [CUDA] Fix back-end bugs and enable corresponding tests (#2296) 2025-06-16 08:45:40 -07:00
config.h divmod, partition, sort fixes (#2302) 2025-06-16 18:49:32 -07:00
cucomplex_math.cuh CUDA backend: compile (#2276) 2025-06-12 17:08:39 -07:00
fp16_math.cuh CUDA backend: compile (#2276) 2025-06-12 17:08:39 -07:00
gather_axis.cuh CUDA backend: indexing ops (#2277) 2025-06-12 21:44:19 -07:00
gather.cuh CUDA backend: indexing ops (#2277) 2025-06-12 21:44:19 -07:00
indexing.cuh CUDA backend: indexing ops (#2277) 2025-06-12 21:44:19 -07:00
scatter_axis.cuh CUDA backend: indexing ops (#2277) 2025-06-12 21:44:19 -07:00
scatter_ops.cuh CUDA backend: indexing ops (#2277) 2025-06-12 21:44:19 -07:00
scatter.cuh CUDA backend: indexing ops (#2277) 2025-06-12 21:44:19 -07:00
ternary_ops.cuh Cuda bug fixes 2 (#2298) 2025-06-16 13:14:46 -07:00
unary_ops.cuh [CUDA] Fix back-end bugs and enable corresponding tests (#2296) 2025-06-16 08:45:40 -07:00
utils.cuh Cuda perf tuning (#2307) 2025-06-20 14:50:57 -07:00