mirror of https://github.com/ml-explore/mlx.git synced 2025-10-31 16:21:27 +08:00

Files

Angelos Katharopoulos dfa9f4bc58 An initial quantized matmul implementation (#205 )

* Add quantized matvec
* Add quantized matrix matrix with 2nd matrix transposed
* Add quantized matmul tests
* Add a slow cpu quantized matmul
* Add a slightly faster vectorized cpu version

2023-12-18 23:18:57 -08:00

bench_mlx.py

An initial quantized matmul implementation (#205 )

2023-12-18 23:18:57 -08:00

bench_torch.py

An initial quantized matmul implementation (#205 )

2023-12-18 23:18:57 -08:00

compare.py

Activations LeakyReLU / PReLU / Softplus / Mish (#109 )

2023-12-11 19:40:57 -08:00

README.md

awni's commit files

2023-11-29 10:30:41 -08:00

README.md

Microbenchmarks comparing MLX to PyTorch

Implement the same microbenchmarks in MLX and PyTorch to compare and make a list of the biggest possible performance improvements and/or regressions.

Run with python bench_mlx.py sum_axis --size 8x1024x128 --axis 2 --cpu for instance to measure the times it takes to sum across the 3rd axis of the above tensor on the cpu.

compare.py runs several benchmarks and compares the speed-up or lack thereof in comparison to PyTorch.

Each bench script can be run with --print-pid to print the PID and wait for a key in order to ease attaching a debugger.