mlx-examples

mirror of https://github.com/ml-explore/mlx-examples.git synced 2025-12-16 02:08:55 +08:00

Author	SHA1	Message	Date
Goekdeniz-Guelmez	1beefd58a0	add create_dataset	2025-02-04 11:06:57 +01:00
Gökdeniz Gülmez	c33c245c11	Merge branch 'ml-explore:main' into adding-orpo-training	2025-02-04 11:04:40 +01:00
Awni Hannun	d9924d08d1	Fix no validation in lora (#1241 )	2025-02-03 09:55:24 -08:00
Goekdeniz-Guelmez	541677aa7f	cleaning up	2025-01-31 21:36:24 +01:00
Gökdeniz Gülmez	294d189eed	Merge branch 'main' into adding-orpo-training	2025-01-26 16:59:37 +01:00
Gökdeniz Gülmez	77faa14ba4	adding support for kyutai's helium (#1208 ) * initial commit * adding helium into training * Update ACKNOWLEDGMENTS.md * nits * nits * fixes / nits --------- Co-authored-by: Awni Hannun <awni@apple.com>	2025-01-26 07:19:07 -08:00
Goekdeniz-Guelmez	2f2ddd4811	clean up	2025-01-26 15:17:06 +01:00
Goekdeniz-Guelmez	d8e7834345	Removed rejected_rewards handling, Updated batch unpacking to match iterator, Updated batch unpacking to match iterator, Added preference score scaling, Simplified reward calculation, Removed redundant rejected_rewards	2025-01-25 21:35:37 +01:00
Goekdeniz-Guelmez	09ed837896	updates	2025-01-24 16:57:18 +01:00
Goekdeniz-Guelmez	e3688293ed	removing dpo and fixing some stuff for orpo	2025-01-24 16:09:22 +01:00
Goekdeniz-Guelmez	0bb001121e	niits	2025-01-22 21:39:29 +01:00
Gökdeniz Gülmez	4098c3bd2f	Merge branch 'ml-explore:main' into adding-orpo-training	2025-01-22 14:18:38 +01:00
Victor Nogueira	df1406735b	Fix dataset variable name, in `datasets.py` (#1212 )	2025-01-21 14:12:43 -08:00
Goekdeniz-Guelmez	363bde634e	fixes	2025-01-19 13:45:33 +01:00
Goekdeniz-Guelmez	ea0d11cd2f	update	2025-01-19 02:05:43 +01:00
Goekdeniz-Guelmez	424cb854e9	nits	2025-01-19 02:03:50 +01:00
Goekdeniz-Guelmez	9ede9db19b	nits	2025-01-19 02:03:31 +01:00
Goekdeniz-Guelmez	fa80d081f2	finish	2025-01-19 01:58:29 +01:00
Goekdeniz-Guelmez	7d279b51ef	remerge with dpo	2025-01-19 01:14:08 +01:00
Goekdeniz-Guelmez	a9b7609118	initial commit	2025-01-19 01:09:43 +01:00
Goekdeniz-Guelmez	582f979dfd	fixing reference model loading and freezing	2025-01-19 00:41:27 +01:00
Goekdeniz-Guelmez	1ff788821c	initial commit	2025-01-19 00:19:36 +01:00
Awni Hannun	50f0a7f6d9	add internlm3 (#1206 )	2025-01-15 14:55:41 -08:00
Ivan Fioravanti	6ae6c72c2e	reduction moved to CPU in case of distributed training (#1200 )	2025-01-14 17:20:42 -08:00
Chime Ogbuji	0228c46434	Custom local dataset features (#1085 ) * Generalize prompt_feature and completion_feature for use in local datasets to facilitate compatibility with many other training dataset formats. * Persist configured prompt/completion key * rebase + nits --------- Co-authored-by: Awni Hannun <awni@apple.com>	2025-01-13 10:01:18 -08:00
Awni Hannun	c4833a2f55	fix encoding with special tokens + chat template (#1189 )	2025-01-03 10:50:59 -08:00
Prince Canuma	dfa4dd6c93	Add support for cohere2 (#1157 ) * add support for cohere2 * revert to act_fn to silu * fix tests and sliding window attention * add tests * add to tuner * fix sliding window * add coauthor :) Co-authored-by: n8programs <43304488+N8python@users.noreply.github.com> * Add rotating kvcache to save space * some nits * style * nits --------- Co-authored-by: n8programs <43304488+N8python@users.noreply.github.com> Co-authored-by: N8 <n8@n8programs.com> Co-authored-by: Awni Hannun <awni@apple.com>	2024-12-16 08:01:03 -08:00
n8programs	5687d5b99b	Adds EXAONE architecture. (#1145 ) * Adds EXAONE architecture. * nits + format * format * clean up and fix rope * clean up and fix rope --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-12-09 07:58:25 -08:00
Alex Barron	2211b27388	Mixed Quantizations (#1132 ) * saving/loading mixed quantizations * comment * add bits per weight * more concise bpw * count bias too	2024-12-08 14:21:50 -08:00
Awni Hannun	8801beb66f	Add olmo2 (#1128 ) * add olmo2 * add olmo2	2024-12-02 11:42:58 -08:00
Awni Hannun	657b4cc0aa	[MLX LM] Sampler refactor + a few improvements (#1094 ) * starting * refactor sampler/processor and a few improvements * fix stream * fix stream generate * fix eos handling in stream generate	2024-11-07 16:15:24 -08:00
Angelos Katharopoulos	331148d8ec	Enable distributed LoRA training (#821 )	2024-11-02 18:02:31 -07:00
Zai Thottakath	418d9a5511	Feature: QDoRA (#891 ) * feat: QDoRA with tests and a small bug fix for recalculation of self.m * some simplifications and fixes --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-09-30 08:01:11 -07:00
madroid	aa1c8abdc6	LoRA: Support HuggingFace dataset via data parameter (#996 ) * LoRA: support huggingface dataset via `data` argument * LoRA: Extract the load_custom_hf_dataset function * LoRA: split small functions * fix spelling errors * handle load hf dataset error * fix pre-commit lint * update data argument help * nits and doc --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-09-30 07:36:21 -07:00
Gökdeniz Gülmez	50e5ca81a8	Adding full finetuning (#903 ) * Adding full model weights finetuning * Updating the LORA.md and ACKNOWLEDGMENTS.md files. * removing --use-dora and --fulll-training and adding --fine-tune-type * some clean up * reformating and fixing dora training * updated CONFIG_DEFAULTS * update config example * update in the config example fie * Update LORA.md * merge and commit * adding argument for dora linear layer * clean up * clean up in the example yaml file * fix * final fix before sending * small addition to re md file * fix for loading the fully trained model by saving all the files and configs correctly * clean up * removing the unnesesairy files * changing lora layers back to 16 * removed max file size * nits * resolve merge * some consistency changes --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-09-29 17:12:47 -07:00
madroid	7ec2021bb9	LoRA: support tools(function calling) format datasets (#995 ) * LoRA: support fine-tuning tools datasets * LoRA: Split small function * LoRA: add tools format to lora docs * LoRA: pre-commit fix * Revert "LoRA: pre-commit fix" This reverts commit `b94b7e0fe7`. * Revert "LoRA: Split small function" This reverts commit `3f6a5f19fd`. * LoRA: remove ToolsDataset In a JSONL file, not all data is required to include the tools value. * nit in readme * nit in readme * nit in readme --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-09-28 10:41:36 -07:00
Gökdeniz Gülmez	76710f61af	Adding support for mamba (#940 ) * initial commit * initial commit * Adding first lines * adding x, and dt projection layers * adding the clamping mechanism * First succesful inference * last commit for today - added custom geenrate function and it works as expected, will try training and then with loading a model from the hub * clean up * save up * almost * update * update * fixed cache handeling * fixed loading * added seperate generat_step method in the model and also in the utils to automaticaly use the generate step mthod in the model class * quick update * still not working * save * still not working * initial commit * utils.py logits = logits[:, -1, :] TypeError: tuple indices must be integers or slices, not tuple * update * update * Fixing the Batching Depfwise Comnvolution and multi token input * fixing generate and logits outputs * Done! * Fixing the cache handling, generating works now trying training * update ACKNOWLEDGEMENTS * removing the model_type if stuff in the _step loop in generate_step and adding MambaCache in base.py for training easier generations and removing mamba in tuner/utils. * quick clean up * update trainer/utils for right initialisation of the layers for LoRA, but not working. * clean up * Forther update to trainer/utils for correct layer selection. Successfull training * removing extra mamba-infer.py file * clean up, reformating will come later * reformat and big clean up, final commit * some speedups and cleanups * fix test * nits * nits --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-09-28 07:02:53 -07:00
L	fc93c55723	feat(mlx_lm): Nemotron (#949 ) * feat: Nemotron https://huggingface.co/nvidia/Minitron-4B-Base This is basically Llama with partial RoPE and LayerNorm instead of BatchNorm. Also they add 1 to the LayerNorm weight for some reason. * fixup! feat: Nemotron * nits --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-08-29 21:08:57 -07:00
Prince Canuma	b5e18ef1e3	Add Phi-3.5-MoE (#946 ) * add phimoe * add phimoe to tunner * add switch_mlp * fix SuScaled args * nits --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-08-24 06:52:33 -07:00
Awni Hannun	58591a1b41	fine tune deepseek (#932 )	2024-08-22 10:41:21 -07:00
L	0164d2058b	feat: DeepSeek MoE v1 (#942 ) * feat: deepseek v1 DeepSeek is still releasing models on the DeepSeek V1 architecture. ```sh mlx_lm.convert --hf-path deepseek-ai/DeepSeek-Prover-V1.5-RL --mlx-path DeepSeek-Prover-V1.5-RL-8bit --q-bits 8 -q mlx_lm.generate --model DeepSeek-Prover-V1.5-RL-8bit --ignore-chat-template --max-tokens 512 --prompt 'import Mathlib import Aesop set_option maxHeartbeats 0 open BigOperators Real Nat Topology Rat /-- The second and fourth terms of a geometric sequence are $2$ and $6$. Which of the following is a possible first term? Show that it is $\frac{2\sqrt{3}}{3}$.-/ theorem amc12b_2003_p6 (a r : ℝ) (u : ℕ → ℝ) (h₀ : ∀ k, u k = a * r ^ k) (h₁ : u 1 = 2) (h₂ : u 3 = 6) : u 0 = 2 / Real.sqrt 3 ∨ u 0 = -(2 / Real.sqrt 3) := by' ``` * nits * nits * nits --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-08-17 07:18:09 -07:00
Zai Thottakath	4e01700816	Allow the entire model to be targed for LoRA and DoRA fine tuning: LoRA and DoRA embeddings with small DoRALinear bug fix (#914 ) * feature: LoRA adapter for Embeddings * feature: wire in LoRAEmbedding into the tuner. Allow the embedding and non model.layers Linear layers to be targeted for fine tuning * feature: DoRA adapter for Embeddings * feature: wire in DoRAEmbedding * bugfix: ensure self.m is recalculated when the linear layer is changed in DoRALinear.from_linear * refactor: prefer from_base over from_linear or from_embedding. prefer fuse over to_linear or to_embedding * cleanup: remove unused imports in test_dora.py * refactor: remove unnecessary non_layer_modules * cleanup: remove wrong comments for lora embedding dropout. remove uncessary parens in dora embedding dropout * nits --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-08-16 07:38:36 -07:00
nicolov	fbe3247772	Add GPT-neox model (#863 )	2024-07-11 06:13:17 -07:00
Awni Hannun	538339b599	gemma2 (#855 )	2024-06-27 10:06:28 -07:00
Chime Ogbuji	df6bc09d74	Configuration-based use of HF hub-hosted datasets for training (#701 ) * Add hf_dataset configuration for using HF hub-hosted datasets for (Q)LoRA training * Pre-commit formatting * Fix YAML config example * Print DS info * Include name * Add hf_dataset parameter default * Remove TextHFDataset and CompletionsHFDataset and use Dataset and CompletionsDataset instead, adding a text_key constructor argument to the former (and changing it to work with a provided data structure instead of just from a JSON file), and prompt_key and completion_key arguments to the latter with defaults for backwards compatibility. * nits * update docs --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-06-26 10:20:50 -07:00
Awni Hannun	d8b073e3a7	Add eos token to lora fine-tunes (#818 ) * add eos token to lora fine-tunes * Comment	2024-06-12 07:44:21 -07:00
Derek Lewis	89b0b75250	GPT2 Support (#798 ) * GPT-2 model support * Add test for gpt2 model * Fix weight sanitizing for quantization * use approx gelu --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-06-02 16:33:20 -07:00
madroid	c457a3f88b	LoRA: Extract small function (#614 ) * LoRA: Extract pre_processing_model function * LoRA: Extract small functions(train_model,evaluate_model) * move test case to test_tuner_utils.py * nits * nits * remove extra param, validate at it 0 * version * fix test --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-06-02 06:38:42 -07:00
Chen Xin	aac98ca6f4	support internlm2 (#797 ) * support internlm2 * only attention projections --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-05-27 06:22:21 -07:00
Prince Canuma	b044ce2acf	Add support for ibm granite (#758 ) * add support for granite 3-8B config * add gpt_bigcode * add positional embedding condition. * add support for granite 3-8B config * add gpt_bigcode * add positional embedding condition. * remove unused function * rebase fix * move position emebedding to mask creation * add to tuner and format * add support for granite 3-8B config * add gpt_bigcode * add positional embedding condition. * add support for granite 3-8B config * add gpt_bigcode * add positional embedding condition. * rebase fix * move position emebedding to mask creation * add to tuner and format * refactor mask * remove dropout layers	2024-05-21 20:16:31 -07:00

95 Commits