张壹 zhangyiss
  • Joined on 2024-09-10
zhangyiss synced commits to fix_stubs at zhangyiss/mlx from mirror 2025-07-22 01:21:15 +08:00
zhangyiss synced and deleted reference refs/tags/refs/pull/2396/merge at zhangyiss/mlx from mirror 2025-07-22 01:21:14 +08:00
zhangyiss synced and deleted reference refs/tags/refs/pull/2397/merge at zhangyiss/mlx from mirror 2025-07-22 01:21:14 +08:00
zhangyiss synced commits to refs/pull/2385/head at zhangyiss/mlx from mirror 2025-07-21 08:41:14 +08:00
368dfbbdeb Make LRUCache more like a normal container
zhangyiss synced commits to refs/pull/2385/merge at zhangyiss/mlx from mirror 2025-07-21 08:41:14 +08:00
c256f94c20 Merge 368dfbbdebddb35c0bb2b66bb6d4d1bfed13267e into 93d70419e7
368dfbbdeb Make LRUCache more like a normal container
Compare 2 commits »
zhangyiss synced commits to refs/pull/2392/head at zhangyiss/mlx from mirror 2025-07-21 00:31:13 +08:00
764d2195e1 use cuda buffer in small pool
zhangyiss synced commits to refs/pull/2392/merge at zhangyiss/mlx from mirror 2025-07-21 00:31:13 +08:00
7c10b2a93e Merge 764d2195e13ce6767e2117083696389f8e8df8ab into 93d70419e7
764d2195e1 use cuda buffer in small pool
Compare 2 commits »
zhangyiss synced commits to refs/pull/2290/merge at zhangyiss/mlx from mirror 2025-07-21 00:31:12 +08:00
93d70419e7 [CUDA] speedup handling scalars (#2389)
63f663d9c6 fix cuda manylinux version to match others (#2388)
84b4d96efa fix release build + patch bump (#2387)
aec67f2fa6 patch bump (#2386)
Compare 9 commits »
zhangyiss synced commits to refs/pull/2385/head at zhangyiss/mlx from mirror 2025-07-21 00:31:12 +08:00
1c11d6c41e Set cudnn stream before execution
d6c0173c41 Test the native cuda graph api
cf2a36f5f1 Add cache
Compare 3 commits »
zhangyiss synced commits to refs/pull/2385/merge at zhangyiss/mlx from mirror 2025-07-21 00:31:12 +08:00
5b8596a677 Merge 1c11d6c41e9746e54bf6e5cc60afc22f949bebad into 93d70419e7
1c11d6c41e Set cudnn stream before execution
d6c0173c41 Test the native cuda graph api
cf2a36f5f1 Add cache
Compare 4 commits »
zhangyiss synced commits to refs/pull/2385/head at zhangyiss/mlx from mirror 2025-07-20 16:11:14 +08:00
d801de7c06 Turn off tf32
b20e764c72 Plan needs to be kept alive
2d3766e919 Switch to backend apis
Compare 3 commits »
zhangyiss synced commits to refs/pull/2385/merge at zhangyiss/mlx from mirror 2025-07-20 16:11:14 +08:00
a63c00a360 Merge d801de7c060027b894bcf75e291865fb82b81889 into 93d70419e7
d801de7c06 Turn off tf32
b20e764c72 Plan needs to be kept alive
2d3766e919 Switch to backend apis
Compare 4 commits »
zhangyiss synced commits to refs/pull/2385/merge at zhangyiss/mlx from mirror 2025-07-19 23:51:16 +08:00
49cb1d885d Merge cdfded878114b5365d5d93bd01dee2fa8c666df0 into 93d70419e7
cdfded8781 cudnn only accepts contiguous inputs
4d944e41b2 Install libcudnn9-dev-cuda-12 in CI
00bbf9a663 include cudnn as python dep
bbfdc0e1ec Fix C++ conv tests
Compare 11 commits »
zhangyiss synced commits to refs/pull/2104/merge at zhangyiss/mlx from mirror 2025-07-19 23:51:15 +08:00
93d70419e7 [CUDA] speedup handling scalars (#2389)
63f663d9c6 fix cuda manylinux version to match others (#2388)
84b4d96efa fix release build + patch bump (#2387)
aec67f2fa6 patch bump (#2386)
Compare 10 commits »
zhangyiss synced commits to refs/pull/2219/merge at zhangyiss/mlx from mirror 2025-07-19 23:51:15 +08:00
93d70419e7 [CUDA] speedup handling scalars (#2389)
63f663d9c6 fix cuda manylinux version to match others (#2388)
84b4d96efa fix release build + patch bump (#2387)
aec67f2fa6 patch bump (#2386)
Compare 13 commits »
zhangyiss synced commits to refs/pull/2300/merge at zhangyiss/mlx from mirror 2025-07-19 23:51:15 +08:00
93d70419e7 [CUDA] speedup handling scalars (#2389)
63f663d9c6 fix cuda manylinux version to match others (#2388)
84b4d96efa fix release build + patch bump (#2387)
aec67f2fa6 patch bump (#2386)
Compare 6 commits »
zhangyiss synced commits to refs/pull/2385/head at zhangyiss/mlx from mirror 2025-07-19 23:51:15 +08:00
cdfded8781 cudnn only accepts contiguous inputs
4d944e41b2 Install libcudnn9-dev-cuda-12 in CI
00bbf9a663 include cudnn as python dep
bbfdc0e1ec Fix C++ conv tests
fa11f7edda More unused backend apis
Compare 15 commits »
zhangyiss synced commits to refs/pull/2234/merge at zhangyiss/mlx from mirror 2025-07-19 15:41:14 +08:00
93d70419e7 [CUDA] speedup handling scalars (#2389)
63f663d9c6 fix cuda manylinux version to match others (#2388)
Compare 3 commits »
zhangyiss synced commits to refs/pull/2297/merge at zhangyiss/mlx from mirror 2025-07-19 15:41:14 +08:00
84b4d96efa fix release build + patch bump (#2387)
aec67f2fa6 patch bump (#2386)
deee214a95 Adding support for the Muon Optimizer (#1914)
Compare 4 commits »
zhangyiss synced commits to refs/pull/2385/merge at zhangyiss/mlx from mirror 2025-07-19 15:41:14 +08:00
7ccc1f6961 Merge 7e253098248bf28655fd86125c5675f212e34adf into 63f663d9c6
63f663d9c6 fix cuda manylinux version to match others (#2388)
Compare 2 commits »