From 7300b3d871fe227af17d0bba718f938d711582b3 Mon Sep 17 00:00:00 2001 From: Goekdeniz-Guelmez Date: Tue, 11 Mar 2025 10:56:37 +0100 Subject: [PATCH] udpate LORA.md --- llms/mlx_lm/LORA.md | 4 ++++ 1 file changed, 4 insertions(+) diff --git a/llms/mlx_lm/LORA.md b/llms/mlx_lm/LORA.md index 1d918c0a..5969cd1f 100644 --- a/llms/mlx_lm/LORA.md +++ b/llms/mlx_lm/LORA.md @@ -135,6 +135,8 @@ mlx_lm.lora \ - `--reference-model-path`: Path to reference model weights. If not specified, uses the same model - `--temperature`: Sampling temperature for generations. Higher values increase randomness (default: 1.0) - `--reward-weights`: Optional list of weights for multiple reward functions. Must match number of reward functions. If not specified, all rewards weighted equally with 1.0 +- `--use-chat-template`: Whether to use the model's chat template for formatting prompts (default: False) +- `--use-prompt`: Whether to use the prompt as part of the input for generation (default: False) ## Training Process @@ -170,6 +172,8 @@ mlx_lm.lora \ --reference-model-path \ --temperature 1.0 \ --reward-weights 1.0 1.0 \ + --use-chat-template False \ + --use-prompt False \ --batch-size 4 \ --learning-rate 1e-5 \ --num-epochs 3