Add readme and requirements for qwen example

This commit is contained in:
Juni May 2023-12-18 15:29:06 +08:00
parent ec94fcf430
commit a8ef549546
2 changed files with 33 additions and 0 deletions

29
qwen/README.md Normal file
View File

@ -0,0 +1,29 @@
# Qwen
Qwen (通义千问) is a language model proposed by Alibaba Cloud[^1]. The architecture of Qwen is similar to Llama except for the bias in the attention layers.
## Setup
Download (from huggingface) and conver the model. By default, the model is `Qwen/Qwen-1_8B`.
```sh
python convert.py
```
This will make the `weights.npz` file which MLX can read.
## Generate
To generate text with the default prompt (default tokenizer is `Qwen/Qwen-1_8B`):
```sh
python qwen.py
```
To see a list of options, run:
```sh
python qwen.py --help
```
[^1]: For more details on the model see the official repo of [Qwen](https://github.com/QwenLM/Qwen) and the [Hugging Face](https://huggingface.co/Qwen).

4
qwen/requirements.txt Normal file
View File

@ -0,0 +1,4 @@
mlx
numpy
transformers>=4.35
torch