mlx/python
Angelos Katharopoulos dfa9f4bc58
An initial quantized matmul implementation (#205)
* Add quantized matvec
* Add quantized matrix matrix with 2nd matrix transposed
* Add quantized matmul tests
* Add a slow cpu quantized matmul
* Add a slightly faster vectorized cpu version
2023-12-18 23:18:57 -08:00
..
mlx Fix cross-attention (#210) 2023-12-18 12:27:27 -08:00
src An initial quantized matmul implementation (#205) 2023-12-18 23:18:57 -08:00
tests An initial quantized matmul implementation (#205) 2023-12-18 23:18:57 -08:00
README.md awni's commit files 2023-11-29 10:30:41 -08:00

Packaging for PyPI

Install build and twine:

pip install --user --upgrade build
pip install --user --upgrade twine

Generate the source distribution and wheel:

python -m build

Warning use a test server first

Test Upload

Upload to test server:

python -m twine upload --repository testpypi dist/*

Install from test server and check that it works:

python -m pip install --index-url https://test.pypi.org/simple/ --no-deps mlx

Upload

python -m twine upload dist/*