From 7d923b41f116968563c18b6aedf94ce1a8a7e828 Mon Sep 17 00:00:00 2001
From: Awni Hannun
Date: Thu, 28 Dec 2023 21:40:18 -0800
Subject: [PATCH] nits in README

---
 llms/deepseek-coder/README.md | 19 +++++++++----------
 1 file changed, 9 insertions(+), 10 deletions(-)

diff --git a/llms/deepseek-coder/README.md b/llms/deepseek-coder/README.md
index bc5226cc..dae20da4 100644
--- a/llms/deepseek-coder/README.md
+++ b/llms/deepseek-coder/README.md
@@ -1,7 +1,7 @@
 # Deepseek Coder
 
 Deepseek Coder is a family of code generating language models based on the
-LLama architecture.[^1] The models were trained from scratch on a corpus of 2T
+Llama architecture.[^1] The models were trained from scratch on a corpus of 2T
 tokens, with a composition of 87% code and 13% natural language containing both
 English and Chinese.
 
@@ -16,7 +16,7 @@ pip install -r requirements.txt
 Next, download and convert the model.
 
 ```sh
-python convert.py --hf-path --mlx-path
+python convert.py --hf-path
 ```
 
 To generate a 4-bit quantized model, use `-q`. For a full list of options run:
@@ -26,20 +26,19 @@ python convert.py --help
 ```
 
 The converter downloads the model from Hugging Face. The default model is
-`deepseek-ai/deepseek-coder-6.7b-instruct`. Check out the Hugging Face
-page[^1] to see a list of available models.
+`deepseek-ai/deepseek-coder-6.7b-instruct`. Check out the [Hugging Face
+page]((https://huggingface.co/deepseek-ai) to see a list of available models.
 
 By default, the conversion script will save the converted `weights.npz`,
-`tokenizer`, and `config.json` in the path provided by `--mlx-path`.
-
+tokenizer, and `config.json` in the `mlx_model` directory.
 
 ### Run
 
-Once you've converted the weights to MLX format, you can interact with the
-Deepseek coder model:
+Once you've converted the weights, you can interact with the Deepseek coder
+model:
 
 ```
-python deepseek-coder.py --model-path --prompt "write a quick sort algorithm in python."
+python deepseek_coder.py --prompt "write a quick sort algorithm in python."
 ```
 
-[^1] For more information see the [Hugging Face page](https://huggingface.co/deepseek-ai).
+[^1]: For more information [blog post](https://deepseekcoder.github.io/) by DeepSeek AI