mlx-examples

mirror of https://github.com/ml-explore/mlx-examples.git synced 2025-06-26 18:51:18 +08:00

Author	SHA1	Message	Date
Goekdeniz-Guelmez	e96afe9e9f	updates	2025-02-11 09:09:28 +01:00
Goekdeniz-Guelmez	88ca747e9e	nits	2025-02-10 19:46:19 +01:00
Goekdeniz-Guelmez	b7bc811507	nits	2025-02-10 19:45:19 +01:00
Goekdeniz-Guelmez	e5aa2c3b5d	nits	2025-02-10 17:51:14 +01:00
Goekdeniz-Guelmez	f88e897019	removing helper functions	2025-02-10 16:07:28 +01:00
Goekdeniz-Guelmez	d9da35f458	nits	2025-02-10 10:52:32 +01:00
Gökdeniz Gülmez	0dac286539	Merge branch 'main' into adding-GRPO-training	2025-02-10 10:43:22 +01:00
Chime Ogbuji	5865899c81	Completion only fine-tuning of instruction models with collections of HF datasets (#1103) - Optional completion only fine-tuning with `--mask-prompt` - Collections of Hugging Face datasets --------- Co-authored-by: Awni Hannun <awni@apple.com>	2025-02-09 20:12:34 -08:00
Goekdeniz-Guelmez	00712522ba	rebase loss calculation	2025-02-09 17:13:05 +01:00
Goekdeniz-Guelmez	a527cdb39b	fix: prevent gradients from flowing through the reference model's logits	2025-02-09 17:02:58 +01:00
Goekdeniz-Guelmez	54179901b5	fix	2025-02-09 15:41:47 +01:00
Goekdeniz-Guelmez	9ba6146a76	fix	2025-02-09 14:32:50 +01:00
Awni Hannun	31611b62d7	Add IBM granite model (#1265) * add granite * add thinking option	2025-02-08 15:46:15 -08:00
Gökdeniz Gülmez	94dcd0f63e	Merge branch 'ml-explore:main' into adding-GRPO-training	2025-02-06 08:15:58 +01:00
Goekdeniz-Guelmez	bcfa55d882	updates	2025-02-05 15:02:12 +01:00
Goekdeniz-Guelmez	0a19522ec4	updates	2025-02-05 14:38:09 +01:00
Goekdeniz-Guelmez	35a2d99cf9	smoll fix	2025-02-05 11:30:21 +01:00
Goekdeniz-Guelmez	a33cad84b4	udpates	2025-02-05 09:48:00 +01:00
Goekdeniz-Guelmez	2a8e6f6e44	udpate	2025-02-05 08:47:03 +01:00
Goekdeniz-Guelmez	0a09a93454	fix cache handling	2025-02-05 08:44:06 +01:00
Goekdeniz-Guelmez	7b0141455e	better create_dataset	2025-02-04 10:43:00 +01:00
Goekdeniz-Guelmez	bd1a42ec2f	adding args into dataset handling	2025-02-04 10:22:34 +01:00
Goekdeniz-Guelmez	7173840283	first succesfull training run	2025-02-04 09:18:45 +01:00
Goekdeniz-Guelmez	ca32424043	updates	2025-02-03 21:57:26 +01:00
Goekdeniz-Guelmez	54e295ea80	fix name funcs	2025-02-03 19:56:11 +01:00
Goekdeniz-Guelmez	06f9c29c94	print func name	2025-02-03 19:47:40 +01:00
Goekdeniz-Guelmez	40bca770ae	fixes	2025-02-03 19:43:49 +01:00
Goekdeniz-Guelmez	05d921b788	optims	2025-02-03 19:37:05 +01:00
Awni Hannun	d9924d08d1	Fix no validation in lora (#1241)	2025-02-03 09:55:24 -08:00
Goekdeniz-Guelmez	1d9e4802f0	first working prototype, will try training out at home	2025-02-03 12:05:29 +01:00
Goekdeniz-Guelmez	23d75cd7ad	starting fist training test run	2025-02-03 10:08:28 +01:00
Goekdeniz-Guelmez	a3ed632422	dataset wrapper done	2025-02-03 09:13:17 +01:00
Goekdeniz-Guelmez	d034ca369e	adding function for R1	2025-02-03 08:26:42 +01:00
Goekdeniz-Guelmez	243c9621d9	update lora.py	2025-01-31 21:10:44 +01:00
Goekdeniz-Guelmez	a57d553fc1	update	2025-01-31 16:57:43 +01:00
Goekdeniz-Guelmez	80bcf68956	grpo_trainer shoudl be done	2025-01-31 16:54:18 +01:00
Goekdeniz-Guelmez	6c58aa995c	updates	2025-01-31 16:27:31 +01:00
Goekdeniz-Guelmez	93370ff1c3	updates ans fixing the KL div lines	2025-01-30 23:55:40 +01:00
Gökdeniz Gülmez	b1e573d6e8	Merge branch 'ml-explore:main' into adding-GRPO-training	2025-01-29 15:07:52 +01:00
Goekdeniz-Guelmez	5e0ae83487	initial commit, gn	2025-01-29 00:19:07 +01:00
Gökdeniz Gülmez	77faa14ba4	adding support for kyutai's helium (#1208) * initial commit * adding helium into training * Update ACKNOWLEDGMENTS.md * nits * nits * fixes / nits --------- Co-authored-by: Awni Hannun <awni@apple.com>	2025-01-26 07:19:07 -08:00
Victor Nogueira	df1406735b	Fix dataset variable name, in `datasets.py` (#1212)	2025-01-21 14:12:43 -08:00
Awni Hannun	50f0a7f6d9	add internlm3 (#1206)	2025-01-15 14:55:41 -08:00
Ivan Fioravanti	6ae6c72c2e	reduction moved to CPU in case of distributed training (#1200)	2025-01-14 17:20:42 -08:00
Chime Ogbuji	0228c46434	Custom local dataset features (#1085) * Generalize prompt_feature and completion_feature for use in local datasets to facilitate compatibility with many other training dataset formats. * Persist configured prompt/completion key * rebase + nits --------- Co-authored-by: Awni Hannun <awni@apple.com>	2025-01-13 10:01:18 -08:00
Awni Hannun	c4833a2f55	fix encoding with special tokens + chat template (#1189)	2025-01-03 10:50:59 -08:00
Prince Canuma	dfa4dd6c93	Add support for cohere2 (#1157) * add support for cohere2 * revert to act_fn to silu * fix tests and sliding window attention * add tests * add to tuner * fix sliding window * add coauthor :) Co-authored-by: n8programs <43304488+N8python@users.noreply.github.com> * Add rotating kvcache to save space * some nits * style * nits --------- Co-authored-by: n8programs <43304488+N8python@users.noreply.github.com> Co-authored-by: N8 <n8@n8programs.com> Co-authored-by: Awni Hannun <awni@apple.com>	2024-12-16 08:01:03 -08:00
n8programs	5687d5b99b	Adds EXAONE architecture. (#1145) * Adds EXAONE architecture. * nits + format * format * clean up and fix rope * clean up and fix rope --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-12-09 07:58:25 -08:00
Alex Barron	2211b27388	Mixed Quantizations (#1132) * saving/loading mixed quantizations * comment * add bits per weight * more concise bpw * count bias too	2024-12-08 14:21:50 -08:00
Awni Hannun	8801beb66f	Add olmo2 (#1128) * add olmo2 * add olmo2	2024-12-02 11:42:58 -08:00

115 Commits