张壹 zhangyiss
  • Joined on 2024-09-10
zhangyiss synced commits to refs/pull/2156/merge at zhangyiss/mlx from mirror 2025-06-16 03:11:15 +08:00
580776559b RoPE for CUDA (#2293)
a14aaa7c9d Fix cuda arg reduce (#2291)
a6d780154f fix cuda gemm for bf16 (#2288)
6871e2eeb7 fix cuda jit (#2287)
Compare 20 commits »
zhangyiss synced commits to refs/pull/2290/merge at zhangyiss/mlx from mirror 2025-06-16 03:11:15 +08:00
580776559b RoPE for CUDA (#2293)
Compare 2 commits »
zhangyiss synced commits to refs/pull/2290/merge at zhangyiss/mlx from mirror 2025-06-15 18:31:15 +08:00
cb4dc59a9e feat(benchmarks): add comprehensive SVD performance benchmarks
e5c8773371 feat(metal): implement complete Metal SVD with Jacobi algorithm
Compare 3 commits »
zhangyiss synced commits to rope-cuda at zhangyiss/mlx from mirror 2025-06-15 18:31:14 +08:00
zhangyiss synced new reference rope-cuda to zhangyiss/mlx from mirror 2025-06-15 18:31:14 +08:00
zhangyiss synced commits to refs/pull/1970/merge at zhangyiss/mlx from mirror 2025-06-15 18:31:14 +08:00
a14aaa7c9d Fix cuda arg reduce (#2291)
a6d780154f fix cuda gemm for bf16 (#2288)
Compare 3 commits »
zhangyiss synced commits to refs/pull/2234/merge at zhangyiss/mlx from mirror 2025-06-15 18:31:14 +08:00
a14aaa7c9d Fix cuda arg reduce (#2291)
Compare 2 commits »
zhangyiss synced commits to refs/pull/2290/head at zhangyiss/mlx from mirror 2025-06-15 18:31:14 +08:00
cb4dc59a9e feat(benchmarks): add comprehensive SVD performance benchmarks
e5c8773371 feat(metal): implement complete Metal SVD with Jacobi algorithm
Compare 2 commits »
zhangyiss synced and deleted reference refs/tags/refs/pull/2291/merge at zhangyiss/mlx from mirror 2025-06-15 10:21:14 +08:00
zhangyiss synced commits to main at zhangyiss/mlx from mirror 2025-06-15 10:21:14 +08:00
a14aaa7c9d Fix cuda arg reduce (#2291)
zhangyiss synced commits to refs/pull/2290/merge at zhangyiss/mlx from mirror 2025-06-15 10:21:14 +08:00
a14aaa7c9d Fix cuda arg reduce (#2291)
Compare 2 commits »
zhangyiss synced commits to refs/pull/2290/head at zhangyiss/mlx from mirror 2025-06-15 02:17:44 +08:00
8151239116 feat: Replace CPU fallback with real Metal SVD kernels
fdfa2b5b39 fix: Resolve Metal command buffer issues in SVD tests
34db0e3626 test: Add comprehensive Metal SVD test suite
56d2532aad feat: Add JIT kernel support for SVD operations
f2c731c29b feat: Enable GPU support in linalg SVD interface
Compare 8 commits »
zhangyiss synced commits to refs/pull/2290/merge at zhangyiss/mlx from mirror 2025-06-15 02:17:44 +08:00
8151239116 feat: Replace CPU fallback with real Metal SVD kernels
fdfa2b5b39 fix: Resolve Metal command buffer issues in SVD tests
34db0e3626 test: Add comprehensive Metal SVD test suite
56d2532aad feat: Add JIT kernel support for SVD operations
Compare 9 commits »
zhangyiss synced and deleted reference refs/tags/refs/pull/2287/merge at zhangyiss/mlx from mirror 2025-06-14 17:51:16 +08:00
zhangyiss synced commits to main at zhangyiss/mlx from mirror 2025-06-14 17:51:16 +08:00
a6d780154f fix cuda gemm for bf16 (#2288)
6871e2eeb7 fix cuda jit (#2287)
Compare 2 commits »
zhangyiss synced commits to refs/pull/1970/merge at zhangyiss/mlx from mirror 2025-06-14 17:51:16 +08:00
6871e2eeb7 fix cuda jit (#2287)
Compare 2 commits »
zhangyiss synced commits to refs/pull/2074/merge at zhangyiss/mlx from mirror 2025-06-14 17:51:16 +08:00
a6d780154f fix cuda gemm for bf16 (#2288)
6871e2eeb7 fix cuda jit (#2287)
8402a2acf4 Fix complex power and print (#2286)
fddb6933e1 Collection of refactors (#2274)
Compare 13 commits »
zhangyiss synced commits to refs/pull/2104/merge at zhangyiss/mlx from mirror 2025-06-14 17:51:16 +08:00
8402a2acf4 Fix complex power and print (#2286)
fddb6933e1 Collection of refactors (#2274)
Compare 3 commits »
zhangyiss synced commits to refs/pull/2234/merge at zhangyiss/mlx from mirror 2025-06-14 17:51:16 +08:00
a6d780154f fix cuda gemm for bf16 (#2288)
6871e2eeb7 fix cuda jit (#2287)
Compare 3 commits »
zhangyiss synced commits to refs/pull/2219/merge at zhangyiss/mlx from mirror 2025-06-14 09:21:15 +08:00
8402a2acf4 Fix complex power and print (#2286)
fddb6933e1 Collection of refactors (#2274)
Compare 3 commits »