Anastasiia Filippova
|
27778156dc
|
Nccl reduce scatter, all gather (#2727)
* Added reduce scatter and all gather for nccl
* fix unused import, delete unused file
* small fix
* deleted useless condition
* fixed comments
* fix bug in eval_gpu, renamed to sum_scatter, fix docs
* final fix docs
* remove and
* Update mlx/distributed/mpi/mpi.cpp
Co-authored-by: Awni Hannun <awni.hannun@gmail.com>
* fix broken set input output
* fixes set output
* typo
* fix typo
* no cpu, no gpu for reduce scatter
---------
Co-authored-by: Awni Hannun <awni.hannun@gmail.com>
|
2025-11-05 08:21:11 -08:00 |
|
Awni Hannun
|
343e33b6d5
|
fix all_gather vjp (#2654)
|
2025-10-07 06:05:23 -07:00 |
|
Awni Hannun
|
f409b229a4
|
fix ring distributed test (#2380)
|
2025-07-16 11:25:24 -07:00 |
|
Awni Hannun
|
4fda5fbdf9
|
add python testing for cuda with ability to skip list of tests (#2295)
|
2025-06-15 10:56:48 -07:00 |
|
Angelos Katharopoulos
|
ddaa4b7dcb
|
Fix the test and add custom min/max reductions for uncommon MPI types (#2060)
|
2025-04-10 17:01:17 -07:00 |
|
Anastasiia Filippova
|
515f104926
|
Min / max reductions (#2041)
|
2025-04-09 23:22:20 -07:00 |
|
Angelos Katharopoulos
|
4eef8102c9
|
Distributed layers (#1270)
|
2025-03-21 13:52:17 -07:00 |
|
Angelos Katharopoulos
|
69e4dd506b
|
Add a ring all gather (#1985)
|
2025-03-21 13:36:51 -07:00 |
|
Angelos Katharopoulos
|
10b271d963
|
Ring update (#1885)
|
2025-02-20 14:32:31 -08:00 |
|
Angelos Katharopoulos
|
ccb61d7aae
|
Ring distributed backend (#1784)
|
2025-01-27 22:15:01 -08:00 |
|