Goekdeniz-Guelmez
1beefd58a0
add create_dataset
2025-02-04 11:06:57 +01:00
Gökdeniz Gülmez
c33c245c11
Merge branch 'ml-explore:main' into adding-orpo-training
2025-02-04 11:04:40 +01:00
Awni Hannun
21d0ab6e8a
fix deepseek sharding ( #1242 )
2025-02-03 16:59:50 -08:00
Gökdeniz Gülmez
0989c073b0
Optimizations for mamba1 ( #1213 )
...
* added mx.einsum() operations: before: 41.293 tokens-per-sec, after: 57.822 tokens-per-sec
* Fused operations in delta, B, C = ...: before: 57.822 tokens-per-sec, after: 83.890 tokens-per-sec
* Pre-computing A_log: before: 83.890 tokens-per-sec, after: 85.848 tokens-per-sec
* Update MambaBlock, Batched Input Processing, Improved Cache Handling, Pre-computed Constants, Cleaner State Management, Explicit Return Values: before: 82.442 tokens-per-sec, after: 129.130 tokens-per-sec
* cleaning up and adding apple copyright to helium modelfile
* update Copyright to this year
* nits + even faster
---------
Co-authored-by: Awni Hannun <awni.hannun@gmail.com>
2025-02-03 13:36:08 -08:00
Awni Hannun
d9924d08d1
Fix no validation in lora ( #1241 )
2025-02-03 09:55:24 -08:00
Gökdeniz Gülmez
2c96da5155
Merge branch 'ml-explore:main' into adding-orpo-training
2025-02-03 09:07:52 +01:00
Awni Hannun
9c2ef38d4d
only download local shard ( #1240 )
2025-02-02 13:58:44 -08:00
Goekdeniz-Guelmez
541677aa7f
cleaning up
2025-01-31 21:36:24 +01:00
Gökdeniz Gülmez
ceccb4c9e9
Merge branch 'ml-explore:main' into adding-orpo-training
2025-01-29 15:07:22 +01:00
Awni Hannun
e8afb59de4
better overflow correction ( #1229 )
2025-01-28 14:37:30 -08:00
Anchen
7a83077cd7
chore(mlx-lm): support text type content in messages ( #1225 )
...
* chore(mlx-lm): support text type content
* chore: optimize the message content processing
* nits + format
---------
Co-authored-by: Awni Hannun <awni@apple.com>
2025-01-27 17:13:50 -08:00
Awni Hannun
f44a52e2dc
batched min p and fix spec gen sampling ( #1222 )
2025-01-27 15:40:31 -08:00
Goekdeniz-Guelmez
649d3f82ae
fix ACKNOWLEDGMENTS
2025-01-26 17:05:34 +01:00
Gökdeniz Gülmez
294d189eed
Merge branch 'main' into adding-orpo-training
2025-01-26 16:59:37 +01:00
Gökdeniz Gülmez
77faa14ba4
adding support for kyutai's helium ( #1208 )
...
* initial commit
* adding helium into training
* Update ACKNOWLEDGMENTS.md
* nits
* nits
* fixes / nits
---------
Co-authored-by: Awni Hannun <awni@apple.com>
2025-01-26 07:19:07 -08:00
Goekdeniz-Guelmez
2f2ddd4811
clean up
2025-01-26 15:17:06 +01:00
Goekdeniz-Guelmez
d8e7834345
Removed redundant rejected_rewards handling, Updated batch unpacking to match iterator, Added preference score scaling, Simplified reward calculation
2025-01-25 21:35:37 +01:00
Goekdeniz-Guelmez
09ed837896
updates
2025-01-24 16:57:18 +01:00
Goekdeniz-Guelmez
e3688293ed
removing dpo and fixing some stuff for orpo
2025-01-24 16:09:22 +01:00
Goekdeniz-Guelmez
0bb001121e
nits
2025-01-22 21:39:29 +01:00
Gökdeniz Gülmez
4098c3bd2f
Merge branch 'ml-explore:main' into adding-orpo-training
2025-01-22 14:18:38 +01:00
Awni Hannun
9a3ddc3e65
some fixes for pipeline parallel deep seek r1 ( #1216 )
2025-01-21 19:40:29 -08:00
Victor Nogueira
df1406735b
Fix dataset variable name in datasets.py ( #1212 )
2025-01-21 14:12:43 -08:00
Goekdeniz-Guelmez
61cd25362c
nits
2025-01-19 13:46:20 +01:00
Goekdeniz-Guelmez
363bde634e
fixes
2025-01-19 13:45:33 +01:00
Goekdeniz-Guelmez
ea0d11cd2f
update
2025-01-19 02:05:43 +01:00
Goekdeniz-Guelmez
2a5b315f60
update ACKNOWLEDGMENTS.md
2025-01-19 02:05:06 +01:00
Goekdeniz-Guelmez
424cb854e9
nits
2025-01-19 02:03:50 +01:00
Goekdeniz-Guelmez
9ede9db19b
nits
2025-01-19 02:03:31 +01:00
Goekdeniz-Guelmez
fa80d081f2
finish
2025-01-19 01:58:29 +01:00
Goekdeniz-Guelmez
7d279b51ef
remerge with dpo
2025-01-19 01:14:08 +01:00
Goekdeniz-Guelmez
a9b7609118
initial commit
2025-01-19 01:09:43 +01:00
Goekdeniz-Guelmez
51fd621fdb
nits
2025-01-19 01:03:07 +01:00
Goekdeniz-Guelmez
040f7c38ac
update ACKNOWLEDGMENTS.md
2025-01-19 01:01:08 +01:00
Goekdeniz-Guelmez
06a9f5d106
update lora_config.yaml
2025-01-19 00:53:41 +01:00
Goekdeniz-Guelmez
1b4e19675d
update LORA.md
2025-01-19 00:48:45 +01:00
Goekdeniz-Guelmez
582f979dfd
fixing reference model loading and freezing
2025-01-19 00:41:27 +01:00
Goekdeniz-Guelmez
1ff788821c
initial commit
2025-01-19 00:19:36 +01:00
Jarrett
07f88f8057
fix(lora): add back store_true default args ( #1205 )
2025-01-16 11:15:42 -08:00
Awni Hannun
50f0a7f6d9
add internlm3 ( #1206 )
2025-01-15 14:55:41 -08:00
Ivan Fioravanti
6ae6c72c2e
reduction moved to CPU in case of distributed training ( #1200 )
2025-01-14 17:20:42 -08:00
Awni Hannun
c117af83b8
fix gpt bigcode ( #1204 )
2025-01-13 10:22:32 -08:00
Chime Ogbuji
0228c46434
Custom local dataset features ( #1085 )
...
* Generalize prompt_feature and completion_feature for use in local datasets to facilitate compatibility with many other training dataset formats.
* Persist configured prompt/completion key
* rebase + nits
---------
Co-authored-by: Awni Hannun <awni@apple.com>
2025-01-13 10:01:18 -08:00
Prince Canuma
bf2da36fc6
Fix Cohere2: mask shape error (long context) ( #1202 )
...
* fix mask shape error (long context)
* Update llms/mlx_lm/models/cohere2.py
Co-authored-by: Awni Hannun <awni.hannun@gmail.com>
* revert layer_idx
* black formatting
* Update cohere2.py
* format
---------
Co-authored-by: Awni Hannun <awni.hannun@gmail.com>
Co-authored-by: Awni Hannun <awni@apple.com>
2025-01-12 12:58:08 -08:00
Xingjun.Wang
514502da22
Support snapshot_download for ModelScope ( #1194 )
...
* add MLX_USE_MODELSCOPE env
* update
* update snapshot_download
* update
* remove modelscope dependency and add import check
* update
* nits
* fix
---------
Co-authored-by: wangxingjun778 <jason@U-C7X6TX5G-2239.local>
Co-authored-by: Awni Hannun <awni@apple.com>
2025-01-10 15:29:34 -08:00
Awni Hannun
93c5cfd781
Add a speculative decoding generator ( #1155 )
...
* add a speculative decoding generator
* fix
* fixes
* optional kwarg pop
2025-01-10 15:27:08 -08:00
Awni Hannun
5cae0a60e6
deepseek v3 model with pipeline parallelism ( #1191 )
...
* deepseekv3
* use upload_large_file instead of deprecated multi commit
* add pipeline generation and example
* comment
* get fp16 working
* use mlx==0.22
2025-01-09 15:55:53 -08:00
Jarrett
40b88eff48
fix(lora): config yaml & arg default merge bug ( #1196 )
2025-01-09 11:33:54 -08:00
Pedro Cuenca
b8f0cacfa8
Use upload_large_folder ( #1193 )
2025-01-07 09:18:31 -08:00
Awni Hannun
9183fe8b6d
fix ( #1192 )
2025-01-06 10:12:07 -08:00