张壹 zhangyiss
  • Joined on 2024-09-10
zhangyiss synced and deleted reference refs/tags/refs/pull/2480/merge at zhangyiss/mlx from mirror 2025-08-13 04:36:47 +08:00
zhangyiss synced commits to main at zhangyiss/mlx from mirror 2025-08-13 04:36:47 +08:00
ac207ce7aa make code blocks copyable (#2480)
zhangyiss synced commits to refs/pull/2484/head at zhangyiss/mlx from mirror 2025-08-12 20:26:43 +08:00
3ebc047168 Add RAII managed CudaGraph class
fce53b61d6 Fix reduce sum/prod overflow (#2477)
8ae4a76308 Use CMake <4.1 to avoid the nvpl error (#2489)
Compare 3 commits »
zhangyiss synced commits to refs/pull/2484/merge at zhangyiss/mlx from mirror 2025-08-12 20:26:43 +08:00
3ebc047168 Add RAII managed CudaGraph class
fce53b61d6 Fix reduce sum/prod overflow (#2477)
8ae4a76308 Use CMake <4.1 to avoid the nvpl error (#2489)
Compare 4 commits »
zhangyiss synced commits to refs/pull/2487/merge at zhangyiss/mlx from mirror 2025-08-12 20:26:43 +08:00
fce53b61d6 Fix reduce sum/prod overflow (#2477)
8ae4a76308 Use CMake <4.1 to avoid the nvpl error (#2489)
Compare 3 commits »
zhangyiss synced commits to refs/pull/2472/merge at zhangyiss/mlx from mirror 2025-08-12 20:26:42 +08:00
e6f58cbb46 Merge 7ebb9fe6683b3a431edd2e098bce77bbfd49ab5e into fce53b61d6
fce53b61d6 Fix reduce sum/prod overflow (#2477)
8ae4a76308 Use CMake <4.1 to avoid the nvpl error (#2489)
Compare 3 commits »
zhangyiss synced commits to refs/pull/2480/merge at zhangyiss/mlx from mirror 2025-08-12 20:26:42 +08:00
fce53b61d6 Fix reduce sum/prod overflow (#2477)
8ae4a76308 Use CMake <4.1 to avoid the nvpl error (#2489)
Compare 3 commits »
zhangyiss synced commits to main at zhangyiss/mlx from mirror 2025-08-12 20:26:41 +08:00
fce53b61d6 Fix reduce sum/prod overflow (#2477)
8ae4a76308 Use CMake <4.1 to avoid the nvpl error (#2489)
Compare 2 commits »
zhangyiss synced and deleted reference refs/tags/refs/pull/2477/merge at zhangyiss/mlx from mirror 2025-08-12 20:26:40 +08:00
zhangyiss synced commits to refs/pull/9/merge at zhangyiss/Table-and-Graph-Libs from mirror 2025-08-11 15:22:42 +08:00
271da89153 Added Ruff config.
Compare 2 commits »
zhangyiss synced new reference custom-cuda-kernel to zhangyiss/mlx from mirror 2025-08-10 19:26:43 +08:00
zhangyiss synced commits to simple-gemm at zhangyiss/mlx from mirror 2025-08-10 19:26:43 +08:00
e648148d22 Add a cutlass gemm
f7a76acf18 More pipelining for the sm_80 gemm
Compare 2 commits »
zhangyiss synced commits to refs/pull/1970/merge at zhangyiss/mlx from mirror 2025-08-10 19:26:43 +08:00
7fde1b6a1e Fix logsumexp/softmax not fused for some cases (#2474)
Compare 2 commits »
zhangyiss synced commits to refs/pull/2156/merge at zhangyiss/mlx from mirror 2025-08-10 19:26:43 +08:00
7fde1b6a1e Fix logsumexp/softmax not fused for some cases (#2474)
Compare 2 commits »
zhangyiss synced commits to refs/pull/2300/merge at zhangyiss/mlx from mirror 2025-08-10 19:26:43 +08:00
7fde1b6a1e Fix logsumexp/softmax not fused for some cases (#2474)
Compare 2 commits »
zhangyiss synced commits to custom-cuda-kernel at zhangyiss/mlx from mirror 2025-08-10 19:26:42 +08:00
zhangyiss synced commits to refs/pull/2104/merge at zhangyiss/mlx from mirror 2025-08-10 03:06:40 +08:00
7fde1b6a1e Fix logsumexp/softmax not fused for some cases (#2474)
Compare 2 commits »
zhangyiss synced commits to refs/pull/2290/merge at zhangyiss/mlx from mirror 2025-08-10 03:06:40 +08:00
7fde1b6a1e Fix logsumexp/softmax not fused for some cases (#2474)
Compare 2 commits »
zhangyiss synced commits to refs/pull/2074/merge at zhangyiss/mlx from mirror 2025-08-09 18:56:39 +08:00
7fde1b6a1e Fix logsumexp/softmax not fused for some cases (#2474)
Compare 2 commits »
zhangyiss synced commits to refs/pull/2401/merge at zhangyiss/mlx from mirror 2025-08-09 18:56:39 +08:00
7fde1b6a1e Fix logsumexp/softmax not fused for some cases (#2474)
Compare 2 commits »