Add readme and requirements for qwen example

2025-08-30 02:53:41 +08:00 · 2023-12-18 15:29:06 +08:00 · 2023-12-18 15:29:06 +08:00 · a8ef549546
commit a8ef549546
parent ec94fcf430
2 changed files with 33 additions and 0 deletions
--- a/qwen/README.md
+++ b/qwen/README.md
@ -0,0 +1,29 @@
+# Qwen
+
+Qwen (通义千问) is a language model proposed by Alibaba Cloud[^1]. The architecture of Qwen is similar to Llama except for the bias in the attention layers.
+
+## Setup
+
+Download (from huggingface) and conver the model. By default, the model is `Qwen/Qwen-1_8B`.
+
+```sh
+python convert.py
+```
+
+This will make the `weights.npz` file which MLX can read.
+
+## Generate
+
+To generate text with the default prompt (default tokenizer is `Qwen/Qwen-1_8B`):
+
+```sh
+python qwen.py
+```
+
+To see a list of options, run:
+
+```sh
+python qwen.py --help
+```
+
+[^1]: For more details on the model see the official repo of [Qwen](https://github.com/QwenLM/Qwen) and the [Hugging Face](https://huggingface.co/Qwen).
--- a/qwen/requirements.txt
+++ b/qwen/requirements.txt
@ -0,0 +1,4 @@
+mlx
+numpy
+transformers>=4.35
+torch