mlx/python
Awni Hannun ec0d5db67b
[CUDA] Switch to CUDA graphs (#2317)
* cuda graph prototype

fix signal bug + start to add dependencies

capture more

capture more ops

remaining ops

fix reduce and rope deps

add concurrent context

try update, but not working

cosistent topology order

use node api

use node api directly to reduce overhead

fix bug

use kernels in unary

cache graph

format

fix synchronization

format

* comment
2025-07-02 15:59:13 -07:00
..
mlx allow parameters to be deleted (#2325) 2025-07-01 21:27:23 -07:00
scripts Build CUDA release in Circle (#2306) 2025-06-19 15:26:36 -07:00
src Compile float64 functions on CPU (#2311) 2025-06-24 10:18:52 -07:00
tests [CUDA] Switch to CUDA graphs (#2317) 2025-07-02 15:59:13 -07:00