Commit Graph

1295 Commits

Author SHA1 Message Date
Awni Hannun
c620a28b16 docs update 2025-07-18 22:25:33 +00:00
Awni Hannun
3e724a7c98 docs update 2025-07-18 22:25:33 +00:00
Awni Hannun
f946f689a6 docs update 2025-07-18 22:25:32 +00:00
Awni Hannun
f77d99b285 docs update 2025-07-18 22:25:32 +00:00
Awni Hannun
1d2cadbc78 docs update 2025-07-18 22:25:32 +00:00
Awni Hannun
1b03079c4a use proper version 2025-07-18 22:25:32 +00:00
Awni Hannun
bc695f2050 docs update 2025-07-18 22:25:32 +00:00
Awni Hannun
1fbcfa159f docs update 2025-07-18 22:25:32 +00:00
Awni Hannun
2c714de62a docs update 2025-07-18 22:25:32 +00:00
Awni Hannun
e492638dff docs update 2025-07-18 22:25:31 +00:00
Awni Hannun
06b6d9a4d4 remove uneeded files in docs 2025-07-18 22:25:31 +00:00
Awni Hannun
f12615680d update docs 2025-07-18 22:25:31 +00:00
Awni Hannun
c465c51cbb docs update 2025-07-18 22:25:31 +00:00
Awni Hannun
7534da7269 docs up 2025-07-18 22:25:31 +00:00
Awni Hannun
2aeb6df29c docs up 2025-07-18 22:25:31 +00:00
Awni Hannun
f1dfa257d2 docs update 2025-07-18 22:25:31 +00:00
Awni Hannun
de4f3e72fd docs 2025-07-18 22:25:31 +00:00
Awni Hannun
9d7133097f docs 2025-07-18 22:25:31 +00:00
Awni Hannun
65b0bdffe1 update docs 2025-07-18 22:25:31 +00:00
Awni Hannun
6f38552812 docs 2025-07-18 22:25:31 +00:00
Awni Hannun
5aa62e5553 docs 2025-07-18 22:25:31 +00:00
Awni Hannun
e4d33acace docs 2025-07-18 22:25:30 +00:00
Awni Hannun
901a4ba68a docs 2025-07-18 22:25:30 +00:00
Awni Hannun
8fdf710d38 docs 2025-07-18 22:25:30 +00:00
Awni Hannun
86cae0f191 docs 2025-07-18 22:25:30 +00:00
Awni Hannun
1a9bcc0ef1 docs 2025-07-18 22:25:30 +00:00
Awni Hannun
07f0207405 docs 2025-07-18 22:25:30 +00:00
Awni Hannun
24b9da7d61 docs 2025-07-18 22:25:30 +00:00
Awni Hannun
84b4d96efa
fix release build + patch bump (#2387) 2025-07-18 14:47:37 -07:00
Awni Hannun
aec67f2fa6
patch bump (#2386) 2025-07-18 12:25:48 -07:00
Gökdeniz Gülmez
deee214a95
Adding support for the Muon Optimizer (#1914)
* initial commit with workong optmimizer

* update ACKNOWLEDGMENTS.md

* nits and adding it to test

* nits

* G.astype(mx.bfloat16) to G.astype(G.dtype)

* G.ndim >= 2 to assert G.ndim == 2

* remove coments

* replace with  mx.addmm

* remove comments

* format

* nits

* match muon

* fix addmm

---------

Co-authored-by: Awni Hannun <awni@apple.com>
2025-07-18 12:25:28 -07:00
Cheng
45adec102c
Add contiguous_copy_gpu util for copying array (#2379) 2025-07-18 06:44:25 -07:00
Cheng
31fc530c76
[CUDA] Add more ways finding CCCL headers in JIT (#2382) 2025-07-17 15:25:34 -07:00
Awni Hannun
fbb3f65a1a
fix resource leaks in matmul and graph (#2383) 2025-07-17 06:50:15 -07:00
Angelos Katharopoulos
6b1b8ea91b
[CUDA] Add work per thread to compile (#2368) 2025-07-17 06:47:52 -07:00
Awni Hannun
b2273733ea
Test with CUDA 12.2 (#2375)
* Test with CUDA 12.0

* try older image

* fix cpu sort
2025-07-16 13:00:37 -07:00
Awni Hannun
f409b229a4
fix ring distributed test (#2380) 2025-07-16 11:25:24 -07:00
Cheng
30571e2326
Rename the copy util in cpu/copy.h to copy_cpu (#2378) 2025-07-16 07:34:24 -07:00
Awni Hannun
d7734edd9f
fix complex reduce + nan propagation in min and max (#2377) 2025-07-15 18:19:47 -07:00
Awni Hannun
2ba69bc8fa
lower memory uniform sampling (#2361)
* lower memory uniform

* use fp32

* fix
2025-07-15 14:22:07 -07:00
Cheng
cb349a291c
[CUDA] Use cuda::std::complex in place of cuComplex (#2372) 2025-07-15 00:36:13 -07:00
Awni Hannun
f0a0b077a0
Install linux with mlx[cuda] and mlx[cpu] (#2356)
* install linux with mlx[cuda] and mlx[cpu]

* temp for testing

* cleanup circle, fix cuda repair

* update circle

* update circle

* decouple python bindings from core libraries
2025-07-14 17:17:33 -07:00
Awni Hannun
49114f28ab
fix flaky test (#2371) 2025-07-14 17:16:18 -07:00
Awni Hannun
e7d2ebadd2
[CUDA] Affine quantize (#2354)
* affine quantize and dequantize kernels

* format

* fix

* format
2025-07-14 15:45:44 -07:00
Awni Hannun
e569803d7c
update linux build (#2370) 2025-07-14 15:13:56 -07:00
Cheng
d34f887abc
Add Primitive::name and remove Primitive::print (#2365) 2025-07-14 14:06:35 -07:00
Angelos Katharopoulos
5201df5030
Fix imag() vjp (#2367) 2025-07-14 13:11:16 -07:00
Cheng
2d3c26c565
[CUDA] Do not put kernels in annoymous namespace (#2362) 2025-07-12 14:24:45 -07:00
Cheng
6325f60d52
[CUDA] Bundle CCCL for JIT compilation (#2357)
* Ship CCCL for JIT compilation

* Remove cexpf
2025-07-11 18:45:37 -07:00
Awni Hannun
42cc9cfbc7
fix copy dispatch (#2360) 2025-07-11 10:59:35 -07:00