* initial commit
* update ACKNOWLEDGMENTS.md
* adding OLMoE to training
* clean up
* faster generation
* remove sanitize method
* more clean ups
* adding SwitchGLU
* clean up
* a little faster and adding norm_topk_prob
* formatted
- Optional completion-only fine-tuning with `--mask-prompt` (see the loss-masking sketch below)
- Collections of Hugging Face datasets
---------
Co-authored-by: Awni Hannun <awni@apple.com>
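
The `--mask-prompt` option above enables completion-only training by dropping prompt tokens from the loss. A minimal sketch of that idea, assuming a simple batch layout with per-example prompt lengths; it is illustrative, not mlx-lm's actual trainer code:

```python
# Sketch: completion-only loss that masks out prompt tokens (illustrative, not mlx-lm's trainer).
import mlx.core as mx
import mlx.nn as nn

def completion_only_loss(model, inputs, targets, prompt_lengths):
    # inputs, targets: (batch, seq_len) token ids; prompt_lengths: (batch,)
    logits = model(inputs)
    per_token = nn.losses.cross_entropy(logits, targets, reduction="none")
    # Positions that fall inside the prompt contribute nothing to the loss.
    positions = mx.arange(targets.shape[1])[None, :]
    mask = (positions >= prompt_lengths[:, None]).astype(per_token.dtype)
    return (per_token * mask).sum() / mask.sum()
```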
* Generalize prompt_feature and completion_feature for local datasets, so many more training dataset formats are compatible (see the sketch below).
* Persist configured prompt/completion key
* rebase + nits
---------
Co-authored-by: Awni Hannun <awni@apple.com>
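
The prompt_feature/completion_feature generalization above lets a local dataset keep its own field names. A tiny sketch of the idea; the record fields and the helper name are hypothetical:

```python
# Sketch: selecting prompt/completion text via configurable keys (helper and fields are hypothetical).
def to_prompt_completion(record, prompt_feature="prompt", completion_feature="completion"):
    return record[prompt_feature], record[completion_feature]

record = {"question": "What is MLX?", "answer": "An array framework for Apple silicon."}
prompt, completion = to_prompt_completion(
    record, prompt_feature="question", completion_feature="answer"
)
```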
* Adds EXAONE architecture.
* nits + format
* format
* clean up and fix rope
* clean up and fix rope
---------
Co-authored-by: Awni Hannun <awni@apple.com>
* feat: QDoRA with tests and a small bug fix for the recalculation of self.m (see the sketch below)
* some simplifications and fixes
---------
Co-authored-by: Awni Hannun <awni@apple.com>
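
For context on `self.m` above: DoRA keeps a per-output magnitude vector alongside the LoRA update, and that magnitude is the norm of the merged weight. A hedged sketch following the DoRA paper's formulation, not the repository's exact code:

```python
# Sketch: recomputing the DoRA magnitude vector from the merged weight (paper-style, not repo code).
import mlx.core as mx

def dora_magnitude(weight, lora_a, lora_b, scale):
    # weight: (out_features, in_features), lora_a: (rank, in_features), lora_b: (out_features, rank)
    merged = weight + scale * (lora_b @ lora_a)
    # L2 norm over the input dimension, one magnitude per output feature.
    return mx.sqrt(mx.sum(merged * merged, axis=1))
```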
* Adding full model weights finetuning
* Updating the LORA.md and ACKNOWLEDGMENTS.md files.
* removing --use-dora and --full-training and adding --fine-tune-type (see the sketch below)
* some clean up
* reformatting and fixing DoRA training
* updated CONFIG_DEFAULTS
* update config example
* update the config example file
* Update LORA.md
* merge and commit
* adding argument for dora linear layer
* clean up
* clean up in the example yaml file
* fix
* final fix before sending
* small addition to the md file
* fix for loading the fully trained model by saving all the files and configs correctly
* clean up
* removing the unnecessary files
* changing LoRA layers back to 16
* removed max file size
* nits
* resolve merge
* some consistency changes
---------
Co-authored-by: Awni Hannun <awni@apple.com>
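
The `--fine-tune-type` flag above folds the old `--use-dora` and full-training switches into a single choice. A minimal sketch of that kind of dispatch, with illustrative names only:

```python
# Sketch: one flag selecting the training mode (illustrative, not mlx-lm's argument parser).
import argparse

parser = argparse.ArgumentParser()
parser.add_argument(
    "--fine-tune-type",
    choices=["lora", "dora", "full"],
    default="lora",
    help="Train LoRA adapters, DoRA adapters, or all model weights.",
)
args = parser.parse_args()

if args.fine_tune_type == "full":
    pass  # keep every model parameter trainable
else:
    pass  # freeze the base model and attach LoRA/DoRA adapter layers
```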
* LoRA: support fine-tuning tools datasets
* LoRA: Split small function
* LoRA: add tools format to lora docs
* LoRA: pre-commit fix
* Revert "LoRA: pre-commit fix"
This reverts commit b94b7e0fe7.
* Revert "LoRA: Split small function"
This reverts commit 3f6a5f19fd.
* LoRA: remove ToolsDataset
In a JSONL file, not every record is required to include the tools field (see the example below).
* nit in readme
* nit in readme
* nit in readme
---------
Co-authored-by: Awni Hannun <awni@apple.com>
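
As noted above, a tools-style record can carry an optional `tools` field next to its chat messages. A hypothetical example record in that spirit; the exact schema expected by the trainer may differ:

```python
# Hypothetical chat record with an optional "tools" field; the schema is illustrative.
record_with_tools = {
    "messages": [
        {"role": "user", "content": "What is the weather in Paris?"},
        {"role": "assistant", "content": "Let me check the forecast."},
    ],
    "tools": [
        {
            "type": "function",
            "function": {
                "name": "get_weather",
                "description": "Look up the current weather for a city.",
                "parameters": {
                    "type": "object",
                    "properties": {"city": {"type": "string"}},
                    "required": ["city"],
                },
            },
        }
    ],
}

# A record without a "tools" field is also valid.
record_without_tools = {
    "messages": [
        {"role": "user", "content": "Hello"},
        {"role": "assistant", "content": "Hi there!"},
    ]
}
```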
* initial commit
* initial commit
* Adding first lines
* adding x and dt projection layers
* adding the clamping mechanism
* First successful inference
* last commit for today - added a custom generate function and it works as expected; will try training and then loading a model from the hub
* clean up
* save up
* almost
* update
* update
* fixed cache handling
* fixed loading
* added a separate generate_step method in the model and also in the utils to automatically use the generate_step method in the model class
* quick update
* still not working
* save
* still not working
* initial commit
* utils.py: `logits = logits[:, -1, :]` raised `TypeError: tuple indices must be integers or slices, not tuple`
* update
* update
* Fixing the batching of the depthwise convolution and multi-token input
* fixing generate and logits outputs
* Done!
* Fixing the cache handling; generation works, now trying training
* update ACKNOWLEDGMENTS
* removing the model_type checks in the _step loop of generate_step, adding MambaCache in base.py for easier training and generation, and removing mamba from tuner/utils (see the cache sketch below).
* quick clean up
* update trainer/utils for the right initialisation of the LoRA layers, but not working yet.
* clean up
* Further update to trainer/utils for correct layer selection. Successful training
* removing extra mamba-infer.py file
* clean up, reformatting will come later
* reformat and big clean up, final commit
* some speedups and cleanups
* fix test
* nits
* nits
---------
Co-authored-by: Awni Hannun <awni@apple.com>
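
For context on the `MambaCache` mentioned above: Mamba decoding carries two pieces of per-layer state between steps, the rolling input window for the depthwise convolution and the SSM hidden state. A minimal sketch with assumed attribute names, not the actual class in base.py:

```python
# Sketch: per-layer recurrent state for Mamba-style decoding (attribute names are assumptions).
class MambaCacheSketch:
    def __init__(self):
        self.conv_state = None  # last (kernel_size - 1) inputs for the depthwise convolution
        self.ssm_state = None   # hidden state of the selective state-space recurrence

    @property
    def state(self):
        return self.conv_state, self.ssm_state
```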
* feat: Nemotron
https://huggingface.co/nvidia/Minitron-4B-Base
This is basically Llama with partial RoPE and LayerNorm instead of
RMSNorm. Also they add 1 to the LayerNorm weight (a sketch follows below).
* fixup! feat: Nemotron
* nits
---------
Co-authored-by: Awni Hannun <awni@apple.com>
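
A sketch of the "+1" LayerNorm described above: the stored weight is zero-centered and the offset is added at apply time (partial RoPE, which rotates only a fraction of each head's dimensions, is not shown). This is a hedged reimplementation for illustration, assuming standard LayerNorm statistics, not the repository's module:

```python
# Sketch: LayerNorm whose stored weight is zero-centered; 1 is added when it is applied.
import mlx.core as mx
import mlx.nn as nn

class LayerNorm1P(nn.Module):
    def __init__(self, dims, eps=1e-5):
        super().__init__()
        self.weight = mx.zeros((dims,))
        self.bias = mx.zeros((dims,))
        self.eps = eps

    def __call__(self, x):
        mu = mx.mean(x, axis=-1, keepdims=True)
        var = mx.var(x, axis=-1, keepdims=True)
        x_hat = (x - mu) * mx.rsqrt(var + self.eps)
        # Scale by (1 + weight) rather than weight, matching the zero-centered convention.
        return (1.0 + self.weight) * x_hat + self.bias
```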