* initial commit
* adding more customized YAML configuration
* update YAML example file
* Changed the switch to set opt_class
* removing muon
* using default arguments
* update
- Optional completion-only fine-tuning with `--mask-prompt`
- Collections of Hugging Face datasets
---------
Co-authored-by: Awni Hannun <awni@apple.com>
* Adding full model weights finetuning
* Updating the LORA.md and ACKNOWLEDGMENTS.md files.
* removing --use-dora and --full-training and adding --fine-tune-type
* some clean up
* reformatting and fixing dora training
* updated CONFIG_DEFAULTS
* update config example
* update in the config example file
* Update LORA.md
* merge and commit
* adding argument for dora linear layer
* clean up
* clean up in the example yaml file
* fix
* final fix before sending
* small addition to the md file
* fix for loading the fully trained model by saving all the files and configs correctly
* clean up
* removing the unnecessary files
* changing lora layers back to 16
* removed max file size
* nits
* resolve merge
* some consistency changes
---------
Co-authored-by: Awni Hannun <awni@apple.com>
* LoRA: Extract pre_processing_model function
* LoRA: Extract small functions (train_model, evaluate_model)
* move test case to test_tuner_utils.py
* nits
* nits
* remove extra param, validate at iteration 0
* version
* fix test
---------
Co-authored-by: Awni Hannun <awni@apple.com>
* support dora finetune
* solve problems in lora.py and tuner/utils.py
* add use_dora (bool) to the adapter-loading functions
* delete all unsupported quantization code and fix all the calculation problems in mlx_lm/tuner/dora.py
* Using stop_gradient to prevent gradients from flowing through `norm` during backpropagation (sketched after this block)
* set DEFAULT_USE_DORA in mlx_lm/generate.py
* add annotation for all the use_dora
* mlx_lm/fuse.py: support fusing dora layers and fix a bug in to_linear() in mlx_lm/tuner/dora.py
* simplify code for judging the type of a fused layer in mlx_lm/fuse.py
* add use_dora in mlx_lm/fuse.py when calling apply_lora_layers()
* style + nits
* style + nits
* more updates
---------
Co-authored-by: chenyifei08 <chenyifei08@baidu.com>
Co-authored-by: Awni Hannun <awni@apple.com>
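A minimal sketch of the stop_gradient idea mentioned above, assuming a DoRA-style update where a learned magnitude rescales the column-normalized adapted weight. The function and variable names are illustrative, not the actual mlx_lm/tuner/dora.py API:

```python
import mlx.core as mx

def dora_weight(base_weight, lora_a, lora_b, magnitude):
    # Illustrative DoRA-style weight: frozen base weight plus the low-rank delta.
    # Shapes assumed: base_weight (out, in), lora_b (out, r), lora_a (r, in),
    # magnitude (out, 1).
    adapted = base_weight + lora_b @ lora_a
    # Stop gradients from flowing through the norm during backpropagation,
    # as described in the commit above; only the magnitude and the low-rank
    # matrices receive gradients from the rescaling.
    norm = mx.stop_gradient(mx.linalg.norm(adapted, axis=1, keepdims=True))
    return magnitude * adapted / norm
```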
* Initial config handler and test
* Added means to run from CLI
* Update lora config loading and tests
* Constrain scheduler config (warmup and minimum LR) for each kind
* Update reference to moved schedule_config module
* Minor fix
* Fix typos
* Moved build_schedule and tests (a warmup-handling sketch follows this block)
* nits in schedule config
* flake
* fix path
---------
Co-authored-by: Awni Hannun <awni@apple.com>
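A rough sketch of how a YAML-driven schedule with a warmup constraint could be assembled from the standard MLX schedule helpers. The config keys shown (`name`, `warmup`, `warmup_init`, `arguments`) are assumptions for illustration, not necessarily the exact schema used here:

```python
import mlx.optimizers as optim

def build_schedule_sketch(schedule_config: dict):
    # Example config: {"name": "cosine_decay", "warmup": 100,
    #                  "warmup_init": 1e-7, "arguments": [1e-5, 1000, 1e-7]}
    base_fn = getattr(optim, schedule_config["name"])(*schedule_config["arguments"])
    warmup_steps = schedule_config.get("warmup", 0)
    if warmup_steps == 0:
        return base_fn
    # Linear warmup from warmup_init up to the schedule's initial learning
    # rate, then hand off to the main schedule at the boundary step.
    warmup_fn = optim.linear_schedule(
        schedule_config.get("warmup_init", 0.0),
        schedule_config["arguments"][0],
        warmup_steps,
    )
    return optim.join_schedules([warmup_fn, base_fn], [warmup_steps])
```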
* Add dropout parameter to lora configuration
A dropout parameter has been added to the LoRA configuration settings in lora_config.yaml, and the LoRALinear class in utils.py has been updated to accept this new parameter (a sketch follows this block). Additionally, the `args.prompt` reference that caused an `AttributeError: 'types.SimpleNamespace' object has no attribute 'prompt'` has been removed from lora.py.
* Update lora_config.yaml
Set dropout to 0.0 in the sample config file
* format
---------
Co-authored-by: Awni Hannun <awni@apple.com>
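A hedged sketch of where a dropout knob slots into a LoRA linear layer; the class name, defaults, and initialization shown are illustrative, not the exact LoRALinear implementation in utils.py:

```python
import math
import mlx.core as mx
import mlx.nn as nn

class LoRALinearSketch(nn.Module):
    def __init__(self, linear: nn.Linear, rank: int = 8, dropout: float = 0.0, scale: float = 20.0):
        super().__init__()
        self.linear = linear                  # frozen base projection
        self.dropout = nn.Dropout(p=dropout)  # the new config-driven knob
        self.scale = scale
        out_dims, in_dims = linear.weight.shape
        bound = 1.0 / math.sqrt(in_dims)
        self.lora_a = mx.random.uniform(low=-bound, high=bound, shape=(in_dims, rank))
        self.lora_b = mx.zeros(shape=(rank, out_dims))

    def __call__(self, x):
        y = self.linear(x)
        # Dropout is applied on the adapter branch only; the base path is untouched.
        z = (self.dropout(x) @ self.lora_a) @ self.lora_b
        return y + self.scale * z
```

The matching YAML entry is then just `dropout: 0.0`, as set in the sample config.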
* chore(mlx-lm): fix print_trainable_parameters for quant models
* chore: clean up
* refactor: use layer type to check quant bits
* chore: address comment
* Add --lora-all-linear option to apply LoRA to all linear transformer block layers
* Moved to YAML config and added specification of rank & alpha
* nits in config, more tests
* nit
* run tests for prs
---------
Co-authored-by: Awni Hannun <awni@apple.com>
* Convert mlx_lm.lora to use YAML configuration
* pre-commit run fixes
* Fix loading of config file
* Remove invalid YAML from doc
* Update command-line options and YAML parameter overriding, per feedback in #503
* Minor wording change
* Positional argument
* Moved config to a (-c/--config) flag
* Removed CLI option defaults (since CLI options take precedence and their defaults are in CONFIG_DEFAULTS)
* pre-commit format updates
* Fix handling of CLI option defaults
* Prevent None values of unspecified CLI options from overwriting values from CONFIG_DEFAULTS (sketched after this block)
* nits
---------
Co-authored-by: Awni Hannun <awni@apple.com>
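A minimal sketch of the precedence described above, assuming CLI options default to `None`: built-in defaults are overridden by the YAML file given via `-c/--config`, and only explicitly set CLI flags override the YAML values. The names and defaults here are illustrative:

```python
import argparse
import yaml

CONFIG_DEFAULTS = {"lora_layers": 16, "learning_rate": 1e-5}  # illustrative defaults

def parse_config(parser: argparse.ArgumentParser) -> argparse.Namespace:
    args = parser.parse_args()
    config = dict(CONFIG_DEFAULTS)
    # YAML config (if given) overrides the built-in defaults.
    if getattr(args, "config", None) is not None:
        with open(args.config) as f:
            config.update(yaml.safe_load(f) or {})
    # Only explicitly provided CLI options (non-None) take precedence over YAML.
    for key, value in vars(args).items():
        if value is not None:
            config[key] = value
    return argparse.Namespace(**config)
```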
* lazy model import in mlx_lm
* change lora loading
* fix olmo lora
* remove a bunch of unused stuff from plamo
* move phixtral to mlx-lm and out of llms/
A new argument `--max_seq_length` has been added to the command-line parser and is passed as a parameter to the main function of the lora.py script, allowing users to specify and control the maximum sequence length during training.
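As a sketch of what that addition might look like (the default value and help text are assumed for illustration):

```python
import argparse

parser = argparse.ArgumentParser(description="LoRA fine-tuning")
parser.add_argument(
    "--max_seq_length",
    type=int,
    default=2048,  # assumed default, for illustration only
    help="Maximum sequence length to use when batching training examples.",
)
args = parser.parse_args()
# main(args) then receives args.max_seq_length, as described above.
```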