Mirror of https://github.com/ml-explore/mlx.git (synced 2025-10-31 07:58:14 +08:00)

	[CUDA] Switch to CUDA graphs (#2317)
* cuda graph prototype

  fix signal bug + start to add dependencies
  capture more
  capture more ops
  remaining ops
  fix reduce and rope deps
  add concurrent context
  try update, but not working
  consistent topology order
  use node api
  use node api directly to reduce overhead
  fix bug
  use kernels in unary
  cache graph
  format
  fix synchronization
  format

* comment
@@ -391,9 +391,11 @@ class TestLoad(mlx_tests.MLXTestCase):
         scale = mx.array(2.0)
         y = mx.load(save_file)
         mx.eval(y)
+        mx.synchronize()
         load_only = mx.get_peak_memory()
         y = mx.load(save_file) * scale
         mx.eval(y)
+        mx.synchronize()
         load_with_binary = mx.get_peak_memory()
 
         self.assertEqual(load_only, load_with_binary)
	 Awni Hannun