[CUDA] Fix conv grads with groups (#2495)

* Put reshape utils in one file * [CUDA] Fix conv grads with groups * Put the reshape utils in gpu/copy.h
2025-12-16 01:49:05 +08:00 · 2025-08-16 10:09:18 +09:00
parent 37b440faa8
commit 1ba18ff7d9
8 changed files with 119 additions and 62 deletions
--- a/mlx/backend/common/utils.h
+++ b/mlx/backend/common/utils.h
@@ -196,9 +196,6 @@ void shared_buffer_reshape(
    const Strides& out_strides,
    array& out);

-// Like the swapaxes op but safe to call in eval_gpu.
-array swapaxes_in_eval(const array& x, int axis1, int axis2);
-
 template <typename T>
 inline SmallVector<T> remove_index(SmallVector<T> vec, size_t index) {
  vec.erase(std::next(vec.begin(), index));