张壹 zhangyiss
  • Joined on 2024-09-10
zhangyiss synced commits to refs/pull/2476/merge at zhangyiss/mlx from mirror 2025-08-20 18:39:50 +08:00
512281781c Remove state return from function example in compile documentation (#2518)
ac85ddfdb7 [CUDA] Add GEMM-based fallback convolution kernels (#2511)
65d0d40232 Split cuDNN helpers into a separate header (#2491)
Compare 4 commits »
zhangyiss synced commits to refs/pull/1970/merge at zhangyiss/mlx from mirror 2025-08-20 18:39:49 +08:00
ac85ddfdb7 [CUDA] Add GEMM-based fallback convolution kernels (#2511)
65d0d40232 Split cuDNN helpers into a separate header (#2491)
cea9369610 fix lapack svd (#2515)
Compare 4 commits »
zhangyiss synced commits to refs/pull/2300/merge at zhangyiss/mlx from mirror 2025-08-20 18:39:49 +08:00
ac85ddfdb7 [CUDA] Add GEMM-based fallback convolution kernels (#2511)
65d0d40232 Split cuDNN helpers into a separate header (#2491)
Compare 3 commits »
zhangyiss synced commits to main at zhangyiss/mlx from mirror 2025-08-20 18:39:48 +08:00
512281781c Remove state return from function example in compile documentation (#2518)
zhangyiss synced new reference fix-conv-large-filter to zhangyiss/mlx from mirror 2025-08-20 18:39:47 +08:00
zhangyiss synced commits to custom-cuda-kernel at zhangyiss/mlx from mirror 2025-08-20 18:39:46 +08:00
d6b204b528 comments
fa56bf2feb Remove completion handler from custom kernel
39dbd92df5 Make threadgroup size less or equal to grid size
432c02dabc Typo in test
fa555c536a Remove regex
Compare 22 commits »
zhangyiss synced commits to fix-conv-large-filter at zhangyiss/mlx from mirror 2025-08-20 18:39:46 +08:00
zhangyiss synced and deleted reference refs/tags/refs/pull/2518/merge at zhangyiss/mlx from mirror 2025-08-20 18:39:45 +08:00
zhangyiss synced commits to refs/pull/1070/merge at zhangyiss/FTXUI from mirror 2025-08-20 17:01:19 +08:00
e56ff89cf3 Improve example style. (#1101)
Compare 2 commits »
zhangyiss synced commits to main at zhangyiss/FTXUI from mirror 2025-08-20 17:01:12 +08:00
e56ff89cf3 Improve example style. (#1101)
zhangyiss synced commits to gh-pages at zhangyiss/FTXUI from mirror 2025-08-20 17:01:11 +08:00
zhangyiss synced commits to quantize_mode at zhangyiss/mlx from mirror 2025-08-20 10:27:39 +08:00
e6437b7dd8 mxfp4 works
zhangyiss synced commits to refs/pull/2074/merge at zhangyiss/mlx from mirror 2025-08-20 10:27:39 +08:00
65d0d40232 Split cuDNN helpers into a separate header (#2491)
cea9369610 fix lapack svd (#2515)
Compare 3 commits »
zhangyiss synced commits to refs/pull/2499/head at zhangyiss/mlx from mirror 2025-08-20 10:27:39 +08:00
e6437b7dd8 mxfp4 works
zhangyiss synced commits to refs/pull/2499/merge at zhangyiss/mlx from mirror 2025-08-20 10:27:39 +08:00
d5eaa7635c Merge e6437b7dd8df2a381ad5815f78e1ef5f9b3b1ba9 into 65d0d40232
65d0d40232 Split cuDNN helpers into a separate header (#2491)
e6437b7dd8 mxfp4 works
Compare 3 commits »
zhangyiss synced commits to refs/pull/2511/head at zhangyiss/mlx from mirror 2025-08-20 10:27:39 +08:00
849fee90f3 Add gemm_grouped_conv
c81aeedec5 Add gemm_conv
65d0d40232 Split cuDNN helpers into a separate header (#2491)
cea9369610 fix lapack svd (#2515)
e7c6e1db82 no segfault with uninitialized array.at (#2514)
Compare 8 commits »
zhangyiss synced commits to refs/pull/2517/head at zhangyiss/mlx from mirror 2025-08-20 10:27:39 +08:00
77e5a5ae3c Remove completion handler from custom kernel
zhangyiss synced commits to refs/pull/2517/merge at zhangyiss/mlx from mirror 2025-08-20 10:27:39 +08:00
87b8c41af3 Merge 77e5a5ae3c75371e6f43594b860c0829bfe788df into cea9369610
77e5a5ae3c Remove completion handler from custom kernel
Compare 2 commits »
zhangyiss synced commits to custom-cuda-kernel at zhangyiss/mlx from mirror 2025-08-20 10:27:38 +08:00
77e5a5ae3c Remove completion handler from custom kernel
zhangyiss synced commits to main at zhangyiss/mlx from mirror 2025-08-20 10:27:38 +08:00
ac85ddfdb7 [CUDA] Add GEMM-based fallback convolution kernels (#2511)
65d0d40232 Split cuDNN helpers into a separate header (#2491)
Compare 2 commits »