mlx-examples/mixtral at 078fed3d8d8cb24c1eda31f0009edf327659b914 - mlx-examples - Gitea for Geophysics

zhangyiss/mlx-examples

mirror of https://github.com/ml-explore/mlx-examples.git synced 2025-10-31 10:58:07 +08:00

Files

History

Awni Hannun 078fed3d8d use official HF for mixtral

2023-12-14 15:30:32 -08:00

..

convert.py

use official HF for mixtral

2023-12-14 15:30:32 -08:00

mixtral.py

use official HF for mixtral

2023-12-14 15:30:32 -08:00

params.json

use official HF for mixtral

2023-12-14 15:30:32 -08:00

README.md

use official HF for mixtral

2023-12-14 15:30:32 -08:00

requirements.txt

initial mixtral

2023-12-12 07:44:23 -08:00

README.md

Mixtral 8x7B

Run the Mixtral¹ 8x7B mixture-of-experts (MoE) model in MLX on Apple silicon.

Note, for 16-bit precision this model needs a machine with substantial RAM (~100GB) to run.

Setup

Install Git Large File Storage. For example with Homebrew:

brew install git-lfs

Download the models from Hugging Face:

GIT_LFS_SKIP_SMUDGE=1 git clone https://huggingface.co/mistralai/Mixtral-8x7B-v0.1/
cd Mixtral-8x7B-v0.1/ && \
  git lfs pull --include "consolidated.*.pt" && \
  git lfs pull --include "tokenizer.model"

Now from mlx-exmaples/mixtral convert and save the weights as NumPy arrays so MLX can read them:

python convert.py --model_path Mixtral-8x7B-v0.1/

The conversion script will save the converted weights in the same location.

Generate

As easy as:

python mixtral.py --model_path Mixtral-8x7B-v0.1/

Refer to Mistral's blog post for more details. ↩︎