mirror of
https://github.com/ml-explore/mlx-examples.git
synced 2025-08-30 02:53:41 +08:00
.. | ||
convert.py | ||
qwen.py | ||
README.md | ||
requirements.txt |
Qwen
Qwen (通义千问) is a language model proposed by Alibaba Cloud1. The architecture of Qwen is similar to Llama except for the bias in the attention layers.
Setup
Download (from huggingface) and conver the model. By default, the model is Qwen/Qwen-1_8B
.
python convert.py
This will make the weights.npz
file which MLX can read.
Generate
To generate text with the default prompt (default tokenizer is Qwen/Qwen-1_8B
):
python qwen.py
To see a list of options, run:
python qwen.py --help
-
For more details on the model see the official repo of Qwen and the Hugging Face. ↩︎