张壹 zhangyiss
  • Joined on 2024-09-10
zhangyiss synced commits to cuda_circle at zhangyiss/mlx from mirror 2025-06-09 12:51:14 +08:00
zhangyiss synced new reference cuda_circle to zhangyiss/mlx from mirror 2025-06-09 12:51:14 +08:00
zhangyiss synced commits to cuda_available at zhangyiss/mlx from mirror 2025-06-09 12:51:13 +08:00
zhangyiss synced new reference cuda_available to zhangyiss/mlx from mirror 2025-06-09 12:51:13 +08:00
zhangyiss synced commits to refs/pull/2104/merge at zhangyiss/mlx from mirror 2025-06-09 04:41:13 +08:00
5866b3857b Refactor the lu test (#2250)
1ca616844b Fix unintuitive metal kernel caching (#2242)
2e8cf0b450 Change layernorms to two pass algorithm (#2246)
24f89173d1 CUDA backend: matmul (#2241)
Compare 8 commits »
zhangyiss synced commits to refs/pull/2215/merge at zhangyiss/mlx from mirror 2025-06-09 04:41:13 +08:00
5866b3857b Refactor the lu test (#2250)
1ca616844b Fix unintuitive metal kernel caching (#2242)
2e8cf0b450 Change layernorms to two pass algorithm (#2246)
24f89173d1 CUDA backend: matmul (#2241)
Compare 13 commits »
zhangyiss synced commits to refs/pull/2219/merge at zhangyiss/mlx from mirror 2025-06-08 20:21:15 +08:00
5866b3857b Refactor the lu test (#2250)
1ca616844b Fix unintuitive metal kernel caching (#2242)
Compare 3 commits »
zhangyiss synced commits to main at zhangyiss/mlx from mirror 2025-06-08 03:41:13 +08:00
5866b3857b Refactor the lu test (#2250)
zhangyiss synced commits to refs/pull/2074/merge at zhangyiss/mlx from mirror 2025-06-08 03:41:13 +08:00
5866b3857b Refactor the lu test (#2250)
1ca616844b Fix unintuitive metal kernel caching (#2242)
2e8cf0b450 Change layernorms to two pass algorithm (#2246)
24f89173d1 CUDA backend: matmul (#2241)
Compare 7 commits »
zhangyiss synced commits to refs/pull/2158/merge at zhangyiss/mlx from mirror 2025-06-08 03:41:13 +08:00
5866b3857b Refactor the lu test (#2250)
Compare 2 commits »
zhangyiss synced commits to refs/pull/2234/merge at zhangyiss/mlx from mirror 2025-06-08 03:41:13 +08:00
5866b3857b Refactor the lu test (#2250)
1ca616844b Fix unintuitive metal kernel caching (#2242)
Compare 3 commits »
zhangyiss synced commits to refs/pull/2156/merge at zhangyiss/mlx from mirror 2025-06-07 11:11:16 +08:00
2e8cf0b450 Change layernorms to two pass algorithm (#2246)
24f89173d1 CUDA backend: matmul (#2241)
c6a20b427a Improve metal elementwise kernels (#2247)
a5ac9244c4 fix linux linking error (#2248)
Compare 11 commits »
zhangyiss synced commits to refs/pull/2158/head at zhangyiss/mlx from mirror 2025-06-07 11:11:16 +08:00
4c43ff0591 CUDA backend: unary ops
2e8cf0b450 Change layernorms to two pass algorithm (#2246)
24f89173d1 CUDA backend: matmul (#2241)
c6a20b427a Improve metal elementwise kernels (#2247)
a5ac9244c4 fix linux linking error (#2248)
Compare 15 commits »
zhangyiss synced commits to refs/pull/2158/merge at zhangyiss/mlx from mirror 2025-06-07 11:11:16 +08:00
1ca616844b Fix unintuitive metal kernel caching (#2242)
4c43ff0591 CUDA backend: unary ops
2e8cf0b450 Change layernorms to two pass algorithm (#2246)
24f89173d1 CUDA backend: matmul (#2241)
Compare 5 commits »
zhangyiss synced commits to refs/pull/2219/merge at zhangyiss/mlx from mirror 2025-06-07 11:11:16 +08:00
2e8cf0b450 Change layernorms to two pass algorithm (#2246)
24f89173d1 CUDA backend: matmul (#2241)
Compare 3 commits »
zhangyiss synced commits to refs/pull/2234/merge at zhangyiss/mlx from mirror 2025-06-07 11:11:16 +08:00
2e8cf0b450 Change layernorms to two pass algorithm (#2246)
24f89173d1 CUDA backend: matmul (#2241)
Compare 3 commits »
zhangyiss synced and deleted reference refs/tags/refs/pull/2241/merge at zhangyiss/mlx from mirror 2025-06-07 11:11:15 +08:00
zhangyiss synced and deleted reference refs/tags/refs/pull/2242/merge at zhangyiss/mlx from mirror 2025-06-07 11:11:15 +08:00
zhangyiss synced and deleted reference refs/tags/refs/pull/2246/merge at zhangyiss/mlx from mirror 2025-06-07 11:11:15 +08:00
zhangyiss synced commits to main at zhangyiss/mlx from mirror 2025-06-07 11:11:15 +08:00
1ca616844b Fix unintuitive metal kernel caching (#2242)
2e8cf0b450 Change layernorms to two pass algorithm (#2246)
24f89173d1 CUDA backend: matmul (#2241)
Compare 3 commits »