# Generate Text with MLX and Hugging Face
This is an example of Llama-style large language model text generation that can pull models from the Hugging Face Hub.
## Setup
Install the dependencies:
```shell
pip install -r requirements.txt
```
## Run
```shell
python generate.py --model <model_path> --prompt "hello"
```
For example:
```shell
python generate.py --model mistralai/Mistral-7B-v0.1 --prompt "hello"
```
will download the Mistral 7B model and generate text using the given prompt.
The `<model_path>` should be either a path to a local directory or a Hugging Face repo with weights stored in `safetensors` format. If you use a repo from the Hugging Face Hub, then the model will be downloaded and cached the first time you run it. See the [Models](#models) section for a full list of supported models.
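If you want to fetch the weights ahead of time, the `huggingface-cli` tool that ships with the `huggingface_hub` package can populate the same cache (whether `requirements.txt` already installs it is an assumption; install `huggingface_hub` if not). A minimal sketch:

```shell
# Pre-download the repo into the local Hugging Face cache;
# generate.py will then find it without downloading again.
huggingface-cli download mistralai/Mistral-7B-v0.1
```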
Run `python generate.py --help` to see all the options.
## Models
The example supports Hugging Face format Mistral and Llama-style models. If the model you want to run is not supported, file an issue or, better yet, submit a pull request.
Here are a few more Hugging Face model repos that work with this example:
- `mistralai/Mistral-7B-v0.1`
- `meta-llama/Llama-2-7b-hf`
- `TinyLlama/TinyLlama-1.1B-intermediate-step-1431k-3T`
Most Mistral- and Llama-style models should work out of the box.
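If you're not sure whether a repo ships `safetensors` weights, one quick check is to query the public Hugging Face model API for the repo's file listing. This is a sketch that assumes the API's current JSON shape:

```shell
# Print any .safetensors files listed in the repo metadata.
# No output suggests the repo only ships pickle (.bin) weights.
curl -s https://huggingface.co/api/models/mistralai/Mistral-7B-v0.1 \
  | grep -o '"rfilename":"[^"]*\.safetensors"'
```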
## Convert new models
You can convert new models to the MLX format using the `convert.py` script. The script takes a Hugging Face repo as input and outputs an MLX-formatted model (which you can then upload to Hugging Face).
To convert a model, run:
```shell
python convert.py --hf-model <hf_repo>
```
To make a 4-bit quantized model, use `-q`. For more options, run:

```shell
python convert.py --help
```
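For example, a 4-bit quantized conversion of Mistral 7B might look like this (where the converted weights land depends on the script's defaults; check `--help`):

```shell
python convert.py --hf-model mistralai/Mistral-7B-v0.1 -q
```

The converted model directory can then be passed to `generate.py` via `--model`.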
You can upload new models to the Hugging Face MLX Community by specifying `--upload-name` to `convert.py`.
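Putting it together, a conversion that also uploads the quantized result might look like the following (the exact repo name under `mlx-community` is a hypothetical example):

```shell
python convert.py --hf-model mistralai/Mistral-7B-v0.1 -q \
  --upload-name mlx-community/Mistral-7B-v0.1-4bit
```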