Awni Hannun
|
a36fec5cb2
|
docs update
|
2025-07-18 22:25:35 +00:00 |
|
Awni Hannun
|
d44f06ae79
|
docs up
|
2025-07-18 22:25:35 +00:00 |
|
Awni Hannun
|
9da49a07a4
|
docs update
|
2025-07-18 22:25:35 +00:00 |
|
Awni Hannun
|
f5dcb1c2b9
|
docs update
|
2025-07-18 22:25:34 +00:00 |
|
Awni Hannun
|
6e9288a41c
|
docs update
|
2025-07-18 22:25:34 +00:00 |
|
Awni Hannun
|
b95224115c
|
docs update
|
2025-07-18 22:25:34 +00:00 |
|
Awni Hannun
|
0e688cbd0f
|
docs update
|
2025-07-18 22:25:34 +00:00 |
|
Awni Hannun
|
ba4eff9520
|
docs update
|
2025-07-18 22:25:33 +00:00 |
|
Awni Hannun
|
2360620475
|
docs
|
2025-07-18 22:25:33 +00:00 |
|
Awni Hannun
|
c620a28b16
|
docs update
|
2025-07-18 22:25:33 +00:00 |
|
Awni Hannun
|
3e724a7c98
|
docs update
|
2025-07-18 22:25:33 +00:00 |
|
Awni Hannun
|
f946f689a6
|
docs update
|
2025-07-18 22:25:32 +00:00 |
|
Awni Hannun
|
f77d99b285
|
docs update
|
2025-07-18 22:25:32 +00:00 |
|
Awni Hannun
|
1d2cadbc78
|
docs update
|
2025-07-18 22:25:32 +00:00 |
|
Awni Hannun
|
1b03079c4a
|
use proper version
|
2025-07-18 22:25:32 +00:00 |
|
Awni Hannun
|
bc695f2050
|
docs update
|
2025-07-18 22:25:32 +00:00 |
|
Awni Hannun
|
1fbcfa159f
|
docs update
|
2025-07-18 22:25:32 +00:00 |
|
Awni Hannun
|
2c714de62a
|
docs update
|
2025-07-18 22:25:32 +00:00 |
|
Awni Hannun
|
e492638dff
|
docs update
|
2025-07-18 22:25:31 +00:00 |
|
Awni Hannun
|
06b6d9a4d4
|
remove uneeded files in docs
|
2025-07-18 22:25:31 +00:00 |
|
Awni Hannun
|
f12615680d
|
update docs
|
2025-07-18 22:25:31 +00:00 |
|
Awni Hannun
|
c465c51cbb
|
docs update
|
2025-07-18 22:25:31 +00:00 |
|
Awni Hannun
|
7534da7269
|
docs up
|
2025-07-18 22:25:31 +00:00 |
|
Awni Hannun
|
2aeb6df29c
|
docs up
|
2025-07-18 22:25:31 +00:00 |
|
Awni Hannun
|
f1dfa257d2
|
docs update
|
2025-07-18 22:25:31 +00:00 |
|
Awni Hannun
|
de4f3e72fd
|
docs
|
2025-07-18 22:25:31 +00:00 |
|
Awni Hannun
|
9d7133097f
|
docs
|
2025-07-18 22:25:31 +00:00 |
|
Awni Hannun
|
65b0bdffe1
|
update docs
|
2025-07-18 22:25:31 +00:00 |
|
Awni Hannun
|
6f38552812
|
docs
|
2025-07-18 22:25:31 +00:00 |
|
Awni Hannun
|
5aa62e5553
|
docs
|
2025-07-18 22:25:31 +00:00 |
|
Awni Hannun
|
e4d33acace
|
docs
|
2025-07-18 22:25:30 +00:00 |
|
Awni Hannun
|
901a4ba68a
|
docs
|
2025-07-18 22:25:30 +00:00 |
|
Awni Hannun
|
8fdf710d38
|
docs
|
2025-07-18 22:25:30 +00:00 |
|
Awni Hannun
|
86cae0f191
|
docs
|
2025-07-18 22:25:30 +00:00 |
|
Awni Hannun
|
1a9bcc0ef1
|
docs
|
2025-07-18 22:25:30 +00:00 |
|
Awni Hannun
|
07f0207405
|
docs
|
2025-07-18 22:25:30 +00:00 |
|
Awni Hannun
|
24b9da7d61
|
docs
|
2025-07-18 22:25:30 +00:00 |
|
Awni Hannun
|
84b4d96efa
|
fix release build + patch bump (#2387)
|
2025-07-18 14:47:37 -07:00 |
|
Awni Hannun
|
aec67f2fa6
|
patch bump (#2386)
|
2025-07-18 12:25:48 -07:00 |
|
Gökdeniz Gülmez
|
deee214a95
|
Adding support for the Muon Optimizer (#1914)
* initial commit with workong optmimizer
* update ACKNOWLEDGMENTS.md
* nits and adding it to test
* nits
* G.astype(mx.bfloat16) to G.astype(G.dtype)
* G.ndim >= 2 to assert G.ndim == 2
* remove coments
* replace with mx.addmm
* remove comments
* format
* nits
* match muon
* fix addmm
---------
Co-authored-by: Awni Hannun <awni@apple.com>
|
2025-07-18 12:25:28 -07:00 |
|
Cheng
|
45adec102c
|
Add contiguous_copy_gpu util for copying array (#2379)
|
2025-07-18 06:44:25 -07:00 |
|
Cheng
|
31fc530c76
|
[CUDA] Add more ways finding CCCL headers in JIT (#2382)
|
2025-07-17 15:25:34 -07:00 |
|
Awni Hannun
|
fbb3f65a1a
|
fix resource leaks in matmul and graph (#2383)
|
2025-07-17 06:50:15 -07:00 |
|
Angelos Katharopoulos
|
6b1b8ea91b
|
[CUDA] Add work per thread to compile (#2368)
|
2025-07-17 06:47:52 -07:00 |
|
Awni Hannun
|
b2273733ea
|
Test with CUDA 12.2 (#2375)
* Test with CUDA 12.0
* try older image
* fix cpu sort
|
2025-07-16 13:00:37 -07:00 |
|
Awni Hannun
|
f409b229a4
|
fix ring distributed test (#2380)
|
2025-07-16 11:25:24 -07:00 |
|
Cheng
|
30571e2326
|
Rename the copy util in cpu/copy.h to copy_cpu (#2378)
|
2025-07-16 07:34:24 -07:00 |
|
Awni Hannun
|
d7734edd9f
|
fix complex reduce + nan propagation in min and max (#2377)
|
2025-07-15 18:19:47 -07:00 |
|
Awni Hannun
|
2ba69bc8fa
|
lower memory uniform sampling (#2361)
* lower memory uniform
* use fp32
* fix
|
2025-07-15 14:22:07 -07:00 |
|
Cheng
|
cb349a291c
|
[CUDA] Use cuda::std::complex in place of cuComplex (#2372)
|
2025-07-15 00:36:13 -07:00 |
|