张壹 zhangyiss
  • Joined on 2024-09-10
zhangyiss pushed to dev_yi at zhangyiss/gctl_tutorials 2025-07-23 11:09:11 +08:00
zhangyiss synced commits to refs/pull/2401/merge at zhangyiss/mlx from mirror 2025-07-23 10:02:16 +08:00
d107d8d495 add cuda gemv (#2400)
1e496ddb82 [CUDA] Simplify allocator (#2392)
74eccbf3fa use size option in binary (#2399)
08638223ca Fix including stubs in wheel (#2398)
Compare 5 commits »
zhangyiss synced commits to refs/pull/2297/merge at zhangyiss/mlx from mirror 2025-07-23 10:02:15 +08:00
d107d8d495 add cuda gemv (#2400)
1e496ddb82 [CUDA] Simplify allocator (#2392)
74eccbf3fa use size option in binary (#2399)
08638223ca Fix including stubs in wheel (#2398)
Compare 5 commits »
zhangyiss synced commits to refs/pull/2385/head at zhangyiss/mlx from mirror 2025-07-23 10:02:15 +08:00
bd4c7edb1d Use tf32 for conv
92cbefc539 Zero-initilizing array
872c222f47 Do error check for cublas handle
c118efaa35 Make LRUCache more like a normal container
a053eeec3a Set cudnn stream before execution
Compare 22 commits »
zhangyiss synced commits to refs/pull/2385/merge at zhangyiss/mlx from mirror 2025-07-23 10:02:15 +08:00
6d416e729a Merge bd4c7edb1d6f84d90745c0b573d419b4e030f8e8 into d107d8d495
bd4c7edb1d Use tf32 for conv
92cbefc539 Zero-initilizing array
872c222f47 Do error check for cublas handle
c118efaa35 Make LRUCache more like a normal container
Compare 20 commits »
zhangyiss synced commits to refs/pull/2074/merge at zhangyiss/mlx from mirror 2025-07-23 10:02:14 +08:00
d107d8d495 add cuda gemv (#2400)
1e496ddb82 [CUDA] Simplify allocator (#2392)
74eccbf3fa use size option in binary (#2399)
08638223ca Fix including stubs in wheel (#2398)
Compare 5 commits »
zhangyiss synced commits to sdpa_full_mask at zhangyiss/mlx from mirror 2025-07-23 10:02:12 +08:00
zhangyiss synced new reference sdpa_full_mask to zhangyiss/mlx from mirror 2025-07-23 10:02:12 +08:00
zhangyiss synced commits to refs/pull/2401/merge at zhangyiss/mlx from mirror 2025-07-23 01:51:20 +08:00
7df3a2887d fix mismatch
Compare 2 commits »
zhangyiss synced commits to refs/pull/1970/merge at zhangyiss/mlx from mirror 2025-07-23 01:51:19 +08:00
74eccbf3fa use size option in binary (#2399)
08638223ca Fix including stubs in wheel (#2398)
Compare 3 commits »
zhangyiss synced commits to refs/pull/2234/merge at zhangyiss/mlx from mirror 2025-07-23 01:51:19 +08:00
d107d8d495 add cuda gemv (#2400)
1e496ddb82 [CUDA] Simplify allocator (#2392)
74eccbf3fa use size option in binary (#2399)
08638223ca Fix including stubs in wheel (#2398)
Compare 5 commits »
zhangyiss synced commits to refs/pull/2300/merge at zhangyiss/mlx from mirror 2025-07-23 01:51:19 +08:00
d107d8d495 add cuda gemv (#2400)
1e496ddb82 [CUDA] Simplify allocator (#2392)
74eccbf3fa use size option in binary (#2399)
08638223ca Fix including stubs in wheel (#2398)
Compare 7 commits »
zhangyiss synced commits to refs/pull/2385/merge at zhangyiss/mlx from mirror 2025-07-23 01:51:19 +08:00
a156d7c340 Merge 57073e3dca4f7905162aa372b57374f68c491e8d into d107d8d495
d107d8d495 add cuda gemv (#2400)
1e496ddb82 [CUDA] Simplify allocator (#2392)
74eccbf3fa use size option in binary (#2399)
08638223ca Fix including stubs in wheel (#2398)
Compare 5 commits »
zhangyiss synced commits to refs/pull/2392/head at zhangyiss/mlx from mirror 2025-07-23 01:51:19 +08:00
f4556ac385 comment
b1a44ef240 comment
4fd39d662d use cuda buffer in small pool
60e20bedb6 Don't use shared event in worker
b62368f292 simplify allocator and fixe race with small pool
Compare 9 commits »
zhangyiss synced commits to refs/pull/2400/head at zhangyiss/mlx from mirror 2025-07-23 01:51:19 +08:00
beef3f42cc add cuda gemv
74eccbf3fa use size option in binary (#2399)
08638223ca Fix including stubs in wheel (#2398)
56cc858af9 Add contiguous_copy_cpu util for copying array (#2397)
f55c4ed1d6 Remove thrust iterators (#2396)
Compare 5 commits »
zhangyiss synced commits to refs/pull/2401/head at zhangyiss/mlx from mirror 2025-07-23 01:51:19 +08:00
7df3a2887d fix mismatch
zhangyiss synced and deleted reference refs/tags/refs/pull/2398/merge at zhangyiss/mlx from mirror 2025-07-23 01:51:18 +08:00
zhangyiss synced and deleted reference refs/tags/refs/pull/2399/merge at zhangyiss/mlx from mirror 2025-07-23 01:51:18 +08:00
zhangyiss synced and deleted reference refs/tags/refs/pull/2400/merge at zhangyiss/mlx from mirror 2025-07-23 01:51:18 +08:00
zhangyiss synced commits to main at zhangyiss/mlx from mirror 2025-07-23 01:51:18 +08:00
d107d8d495 add cuda gemv (#2400)
1e496ddb82 [CUDA] Simplify allocator (#2392)
74eccbf3fa use size option in binary (#2399)
08638223ca Fix including stubs in wheel (#2398)
Compare 4 commits »