mirror of https://github.com/ml-explore/mlx.git, synced 2025-06-24 17:31:16 +08:00
* Add conv1d grouped convs on CPU * Add GPU support * Parallelize inside metal kernel * clenaup * Update mlx/ops.cpp Co-authored-by: Awni Hannun <awni.hannun@gmail.com> * New unfold kernel + remove unused code * Remove copy and refactor * Update vjp and reuse steel gemm * Fixed groups on cpu * Fix metal validation --------- Co-authored-by: Awni Hannun <awni.hannun@gmail.com> |
||
---|---|---|
blas | ||
comparative | ||
batch_matmul_bench.py | ||
compile_bench.py | ||
conv1d_bench.py | ||
conv_bench.py | ||
fft_bench.py | ||
gather_bench.py | ||
layer_norm_bench.py | ||
rms_norm_bench.py | ||
rope_bench.py | ||
scatter_bench.py | ||
single_ops.py | ||
time_utils.py |