Faster gather and scatter. (#682)

Reduce unnecessary integer ops, especially since
these kernels are integer bound.

Increase number of iterations for benchmarks for
better smoothing.

GitHub issue #506

Co-authored-by: Vijay Krishnamoorthy <vijay_krish@apple.com>
Vijay Krish
2024-02-13 17:47:41 -08:00
committed by GitHub
parent be6e9d6a9f
commit 2fdc2462c3
2 changed files with 3 additions and 3 deletions

@@ -28,7 +28,7 @@ def measure_runtime(fn, **kwargs):
         fn(**kwargs)
     tic = time.time()
-    iters = 10
+    iters = 100
     for _ in range(iters):
         fn(**kwargs)
     return (time.time() - tic) * 1000 / iters
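
For reference, a minimal standalone sketch of the timing pattern this hunk touches. It is not the repository's benchmark file: the `warmup` parameter, its default of 5, the use of `time.perf_counter()` in place of `time.time()`, and the `busy_work` workload are all assumptions made for illustration. Only the `iters = 100` default and the milliseconds-per-call return value come from the diff.

```python
import time


def measure_runtime(fn, iters=100, warmup=5, **kwargs):
    """Return the mean wall-clock time per call of fn(**kwargs), in milliseconds."""
    # Untimed warm-up calls (count is an assumption, not taken from the diff).
    for _ in range(warmup):
        fn(**kwargs)

    # perf_counter() is a monotonic, high-resolution clock; the original
    # hunk uses time.time(), which is swapped out here on purpose.
    tic = time.perf_counter()
    for _ in range(iters):
        fn(**kwargs)
    # Total elapsed seconds -> milliseconds, averaged over all timed calls.
    return (time.perf_counter() - tic) * 1000 / iters


def busy_work(n):
    # Hypothetical stand-in for a gather/scatter benchmark body.
    return sum(i * i for i in range(n))


if __name__ == "__main__":
    for iters in (10, 100):
        ms = measure_runtime(busy_work, iters=iters, n=10_000)
        print(f"iters={iters}: {ms:.4f} ms per call")
```

Averaging over more timed calls reduces the influence of any single slow iteration on the reported mean, which is the smoothing the commit message refers to. The cost is a proportionally longer benchmark run.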