mlx/python at dfa9f4bc58a9ebae6264027f6fe5402d908fea00 - mlx

mirror of https://github.com/ml-explore/mlx.git synced 2025-12-12 07:18:52 +08:00


Angelos Katharopoulos dfa9f4bc58 An initial quantized matmul implementation (#205)

* Add quantized matvec
* Add quantized matrix matrix with 2nd matrix transposed
* Add quantized matmul tests
* Add a slow cpu quantized matmul
* Add a slightly faster vectorized cpu version

2023-12-18 23:18:57 -08:00

mlx

Fix cross-attention (#210)

2023-12-18 12:27:27 -08:00

src

An initial quantized matmul implementation (#205)

2023-12-18 23:18:57 -08:00

tests

An initial quantized matmul implementation (#205)

2023-12-18 23:18:57 -08:00

README.md

awni's commit files

2023-11-29 10:30:41 -08:00

README.md

Packaging for PyPI

Install build and twine:

pip install --user --upgrade build
pip install --user --upgrade twine

Generate the source distribution and wheel:

python -m build
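
Before uploading, it can help to confirm that the build produced both artifacts. The sketch below assumes the default dist/ output directory of python -m build and a POSIX-style shell such as sh or bash; check_dist is a hypothetical helper name, not part of build or twine.

```shell
# Sketch: confirm that a directory holds both a source distribution and a wheel.
# Assumes the default dist/ output directory unless another path is given.
check_dist() {
    dir="${1:-dist}"
    # ls exits non-zero when the glob matches no file
    ls "$dir"/*.tar.gz >/dev/null 2>&1 || { echo "no sdist in $dir" >&2; return 1; }
    ls "$dir"/*.whl >/dev/null 2>&1 || { echo "no wheel in $dir" >&2; return 1; }
    echo "ok"
}
```

Run check_dist from the directory that contains dist/. Running python -m twine check dist/* can additionally validate that the package description will render on PyPI.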

Warning: upload to a test server first.

Test Upload

Upload to the test server:

python -m twine upload --repository testpypi dist/*

Install from the test server and check that it works:

python -m pip install --index-url https://test.pypi.org/simple/ --no-deps mlx
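
One way to check that the install works is a quick import from outside the repository, so that the import resolves to the installed package rather than the local source tree. This is a minimal sketch that assumes the package exposes the mlx.core module; smoke_test and the PYTHON override are hypothetical names introduced for illustration.

```shell
# Sketch: try importing a module from a scratch directory.
# PYTHON is an assumed override for the interpreter; it defaults to `python`.
smoke_test() {
    module="${1:-mlx.core}"
    (
        # Run in a subshell so the caller's working directory is unchanged.
        cd "$(mktemp -d)" || exit 1
        "${PYTHON:-python}" -c "import $module"
    ) && echo "import of $module ok"
}
```

Calling smoke_test with no arguments tries mlx.core; a non-zero exit status means the import failed.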

Upload

Once the install from the test server works, upload to PyPI:

python -m twine upload dist/*