张壹 zhangyiss
  • Joined on 2024-09-10
zhangyiss synced commits to refs/pull/1970/merge at zhangyiss/mlx from mirror 2025-07-18 07:01:14 +08:00
fbb3f65a1a fix resource leaks in matmul and graph (#2383)
6b1b8ea91b [CUDA] Add work per thread to compile (#2368)
b2273733ea Test with CUDA 12.2 (#2375)
f409b229a4 fix ring distributed test (#2380)
zhangyiss synced commits to refs/pull/2234/merge at zhangyiss/mlx from mirror 2025-07-18 07:01:14 +08:00
fbb3f65a1a fix resource leaks in matmul and graph (#2383)
6b1b8ea91b [CUDA] Add work per thread to compile (#2368)
zhangyiss synced commits to refs/pull/2300/merge at zhangyiss/mlx from mirror 2025-07-18 07:01:14 +08:00
fbb3f65a1a fix resource leaks in matmul and graph (#2383)
6b1b8ea91b [CUDA] Add work per thread to compile (#2368)
zhangyiss synced and deleted reference refs/tags/refs/pull/2382/merge at zhangyiss/mlx from mirror 2025-07-18 07:01:13 +08:00
zhangyiss synced commits to main at zhangyiss/mlx from mirror 2025-07-18 07:01:13 +08:00
31fc530c76 [CUDA] Add more ways finding CCCL headers in JIT (#2382)
zhangyiss synced commits to refs/pull/1914/head at zhangyiss/mlx from mirror 2025-07-18 07:01:13 +08:00
0a8bb904d7 nits
c535d8c1b5 Merge branch 'ml-explore:main' into adding-Muon-optimizer
4b3d7634cd format
516d172ba5 remove comments
698daee214 replace with mx.addmm
zhangyiss synced commits to refs/pull/1914/merge at zhangyiss/mlx from mirror 2025-07-18 07:01:13 +08:00
0a8bb904d7 nits
c535d8c1b5 Merge branch 'ml-explore:main' into adding-Muon-optimizer
4b3d7634cd format
516d172ba5 remove comments
zhangyiss synced commits to refs/pull/2382/merge at zhangyiss/mlx from mirror 2025-07-17 22:51:21 +08:00
fbb3f65a1a fix resource leaks in matmul and graph (#2383)
6b1b8ea91b [CUDA] Add work per thread to compile (#2368)
zhangyiss synced commits to refs/pull/1914/merge at zhangyiss/mlx from mirror 2025-07-17 22:51:20 +08:00
7f39e9c299 nits
baad6e392b Merge branch 'ml-explore:main' into adding-Muon-optimizer
zhangyiss synced commits to refs/pull/2074/merge at zhangyiss/mlx from mirror 2025-07-17 22:51:20 +08:00
b2273733ea Test with CUDA 12.2 (#2375)
f409b229a4 fix ring distributed test (#2380)
30571e2326 Rename the copy util in cpu/copy.h to copy_cpu (#2378)
zhangyiss synced commits to refs/pull/2104/merge at zhangyiss/mlx from mirror 2025-07-17 22:51:20 +08:00
b2273733ea Test with CUDA 12.2 (#2375)
f409b229a4 fix ring distributed test (#2380)
30571e2326 Rename the copy util in cpu/copy.h to copy_cpu (#2378)
d7734edd9f fix complex reduce + nan propagation in min and max (#2377)
zhangyiss synced commits to refs/pull/2290/merge at zhangyiss/mlx from mirror 2025-07-17 22:51:20 +08:00
6b1b8ea91b [CUDA] Add work per thread to compile (#2368)
zhangyiss synced commits to refs/pull/2379/merge at zhangyiss/mlx from mirror 2025-07-17 22:51:20 +08:00
03db23cc39 Merge 32d84f520ecb3840df26f075061d2c7d3778bbc0 into fbb3f65a1a
fbb3f65a1a fix resource leaks in matmul and graph (#2383)
6b1b8ea91b [CUDA] Add work per thread to compile (#2368)
zhangyiss synced and deleted reference refs/tags/cuda-compile at zhangyiss/mlx from mirror 2025-07-17 22:51:19 +08:00
zhangyiss synced and deleted reference refs/tags/refs/pull/2368/merge at zhangyiss/mlx from mirror 2025-07-17 22:51:19 +08:00
zhangyiss synced and deleted reference refs/tags/refs/pull/2383/merge at zhangyiss/mlx from mirror 2025-07-17 22:51:19 +08:00
zhangyiss synced commits to main at zhangyiss/mlx from mirror 2025-07-17 22:51:19 +08:00
fbb3f65a1a fix resource leaks in matmul and graph (#2383)
6b1b8ea91b [CUDA] Add work per thread to compile (#2368)
zhangyiss synced commits to refs/pull/1914/head at zhangyiss/mlx from mirror 2025-07-17 22:51:19 +08:00
7f39e9c299 nits
baad6e392b Merge branch 'ml-explore:main' into adding-Muon-optimizer
b2273733ea Test with CUDA 12.2 (#2375)
zhangyiss synced commits to refs/pull/2379/head at zhangyiss/mlx from mirror 2025-07-17 14:31:15 +08:00
32d84f520e Add contiguous_copy_gpu util for copying array
b2273733ea Test with CUDA 12.2 (#2375)
f409b229a4 fix ring distributed test (#2380)
30571e2326 Rename the copy util in cpu/copy.h to copy_cpu (#2378)
d7734edd9f fix complex reduce + nan propagation in min and max (#2377)
zhangyiss synced commits to refs/pull/2379/merge at zhangyiss/mlx from mirror 2025-07-17 14:31:15 +08:00
6182d8e948 Merge 32d84f520ecb3840df26f075061d2c7d3778bbc0 into b2273733ea
32d84f520e Add contiguous_copy_gpu util for copying array