mirror of
https://github.com/ml-explore/mlx-examples.git
synced 2025-08-29 07:30:06 +08:00
move
parent
feb7f10888
commit
02649b30e5
@@ -77,9 +77,7 @@ to see how to use the API in more detail.
 
 The `mlx-lm` package also comes with functionality to quantize and optionally
 upload models to the Hugging Face Hub.
 
-You can convert models in the [Hugging Face
-Space](https://huggingface.co/spaces/mlx-community/mlx-my-repo) or using the
-Python API:
+You can convert models using the Python API:
 
 ```python
 from mlx_lm import convert
@@ -165,6 +163,10 @@ mlx_lm.convert \
 --upload-repo mlx-community/my-4bit-mistral
 ```
 
+Models can also be converted and quantized directly in the
+[mlx-my-repo](https://huggingface.co/spaces/mlx-community/mlx-my-repo) Hugging
+Face Space.
+
 ### Long Prompts and Generations
 
 `mlx-lm` has some tools to scale efficiently to long prompts and generations:
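The second hunk shows only the tail of an `mlx_lm.convert` command line. As a rough sketch of how such an invocation might be assembled from Python, the helper below builds the argument list without running it. The helper name `build_convert_command` is hypothetical, the `--hf-path` and `-q` flags and the source repo `mistralai/Mistral-7B-Instruct-v0.3` do not appear in this excerpt and are assumptions, and only `--upload-repo mlx-community/my-4bit-mistral` is taken from the diff.

```python
import shlex


def build_convert_command(hf_path, upload_repo=None, quantize=True):
    """Assemble an argv list for the mlx_lm.convert CLI (hypothetical helper).

    Assumed flags (not shown in the diff excerpt): --hf-path, -q.
    Flag taken from the diff: --upload-repo.
    """
    argv = ["mlx_lm.convert", "--hf-path", hf_path]
    if quantize:
        # Assumption: -q requests quantization during conversion.
        argv.append("-q")
    if upload_repo is not None:
        argv += ["--upload-repo", upload_repo]
    return argv


command = build_convert_command(
    "mistralai/Mistral-7B-Instruct-v0.3",  # example source repo, an assumption
    upload_repo="mlx-community/my-4bit-mistral",
)

# Print a shell-safe version of the command for inspection; nothing is executed.
print(shlex.join(command))
```

Building the list first and printing it keeps the sketch free of side effects; passing `command` to `subprocess.run` would be the natural next step on a machine where `mlx-lm` is installed.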