[CUDA] synch properly waits for all tasks to finish and clear (#2303)

* cuda synch properly waits for all tasks to finish and clear

* fix copy
This commit is contained in:
Awni Hannun
2025-06-17 12:03:25 -07:00
committed by GitHub
parent b8022c578a
commit cad5c0241c
7 changed files with 27 additions and 8 deletions

View File

@@ -123,6 +123,9 @@ class CommandEncoder {
return has_gpu_work_;
}
// Wait until kernels and completion handlers are finished
void synchronize();
private:
Device& device_;
DeviceStream& stream_;