张壹 zhangyiss
  • Joined on 2024-09-10
zhangyiss synced commits to refs/pull/2290/merge at zhangyiss/mlx from mirror 2025-08-03 09:16:44 +08:00
aaf78f4c6b Use LRU cache for cuda graph (#2448)
Compare 2 commits »
zhangyiss synced commits to main at zhangyiss/mlx from mirror 2025-08-03 01:06:44 +08:00
aaf78f4c6b Use LRU cache for cuda graph (#2448)
zhangyiss synced commits to refs/pull/1970/merge at zhangyiss/mlx from mirror 2025-08-03 01:06:44 +08:00
8831064493 Fix arctan2 grads (#2453)
be9bc96da4 [CUDA] Matmul utils initial commit (#2441)
Compare 3 commits »
zhangyiss synced commits to refs/pull/2074/merge at zhangyiss/mlx from mirror 2025-08-03 01:06:44 +08:00
aaf78f4c6b Use LRU cache for cuda graph (#2448)
Compare 2 commits »
zhangyiss synced commits to refs/pull/2290/merge at zhangyiss/mlx from mirror 2025-08-03 01:06:44 +08:00
8831064493 Fix arctan2 grads (#2453)
be9bc96da4 [CUDA] Matmul utils initial commit (#2441)
Compare 3 commits »
zhangyiss synced commits to refs/pull/2450/merge at zhangyiss/mlx from mirror 2025-08-03 01:06:44 +08:00
aaf78f4c6b Use LRU cache for cuda graph (#2448)
Compare 2 commits »
zhangyiss synced and deleted reference refs/tags/refs/pull/2448/merge at zhangyiss/mlx from mirror 2025-08-03 01:06:43 +08:00
zhangyiss synced commits to refs/pull/2104/merge at zhangyiss/mlx from mirror 2025-08-02 16:56:49 +08:00
be9bc96da4 [CUDA] Matmul utils initial commit (#2441)
86258f292f [CUDA] Vectorize generated kernels (#2444)
b26d88591c [CUDA] Save primitive inputs faster (#2449)
86c6a15571 [CUDA] Backward convolution (#2431)
Compare 9 commits »
zhangyiss synced commits to refs/pull/2401/merge at zhangyiss/mlx from mirror 2025-08-02 16:56:49 +08:00
8831064493 Fix arctan2 grads (#2453)
be9bc96da4 [CUDA] Matmul utils initial commit (#2441)
Compare 3 commits »
zhangyiss synced commits to refs/pull/2434/merge at zhangyiss/mlx from mirror 2025-08-02 16:56:49 +08:00
8831064493 Fix arctan2 grads (#2453)
be9bc96da4 [CUDA] Matmul utils initial commit (#2441)
Compare 3 commits »
zhangyiss synced commits to refs/pull/2448/merge at zhangyiss/mlx from mirror 2025-08-02 16:56:49 +08:00
8831064493 Fix arctan2 grads (#2453)
Compare 2 commits »
zhangyiss synced commits to refs/pull/2450/merge at zhangyiss/mlx from mirror 2025-08-02 16:56:49 +08:00
8831064493 Fix arctan2 grads (#2453)
Compare 2 commits »
zhangyiss synced commits to refs/pull/2074/merge at zhangyiss/mlx from mirror 2025-08-02 16:56:48 +08:00
8831064493 Fix arctan2 grads (#2453)
be9bc96da4 [CUDA] Matmul utils initial commit (#2441)
Compare 3 commits »
zhangyiss synced and deleted reference refs/tags/fix-arctan2-grads at zhangyiss/mlx from mirror 2025-08-02 16:56:46 +08:00
zhangyiss synced commits to main at zhangyiss/mlx from mirror 2025-08-02 16:56:46 +08:00
8831064493 Fix arctan2 grads (#2453)
zhangyiss synced commits to refs/pull/2448/merge at zhangyiss/mlx from mirror 2025-08-02 08:46:45 +08:00
24a9dd4fd0 Remove unused destructor
be9bc96da4 [CUDA] Matmul utils initial commit (#2441)
Compare 3 commits »
zhangyiss synced commits to refs/pull/2450/merge at zhangyiss/mlx from mirror 2025-08-02 08:46:45 +08:00
be9bc96da4 [CUDA] Matmul utils initial commit (#2441)
Compare 2 commits »
zhangyiss synced commits to simple-gemm at zhangyiss/mlx from mirror 2025-08-02 08:46:44 +08:00
ecb954341f Simple gemm example
be9bc96da4 [CUDA] Matmul utils initial commit (#2441)
86258f292f [CUDA] Vectorize generated kernels (#2444)
b26d88591c [CUDA] Save primitive inputs faster (#2449)
86c6a15571 [CUDA] Backward convolution (#2431)
Compare 13 commits »
zhangyiss synced commits to refs/pull/2441/head at zhangyiss/mlx from mirror 2025-08-02 08:46:44 +08:00
29d78af759 Guard PTX with architecture defines
zhangyiss synced commits to refs/pull/2448/head at zhangyiss/mlx from mirror 2025-08-02 08:46:44 +08:00
24a9dd4fd0 Remove unused destructor