Goekdeniz-Guelmez
e96afe9e9f
updates
2025-02-11 09:09:28 +01:00
Goekdeniz-Guelmez
88ca747e9e
nits
2025-02-10 19:46:19 +01:00
Goekdeniz-Guelmez
b7bc811507
nits
2025-02-10 19:45:19 +01:00
Goekdeniz-Guelmez
e5aa2c3b5d
nits
2025-02-10 17:51:14 +01:00
Goekdeniz-Guelmez
f88e897019
removing helper functions
2025-02-10 16:07:28 +01:00
Goekdeniz-Guelmez
d9da35f458
nits
2025-02-10 10:52:32 +01:00
Gökdeniz Gülmez
0dac286539
Merge branch 'main' into adding-GRPO-training
2025-02-10 10:43:22 +01:00
Chime Ogbuji
5865899c81
Completion only fine-tuning of instruction models with collections of HF datasets (#1103)
- Optional completion only fine-tuning with `--mask-prompt`
- Collections of Hugging Face datasets
---------
Co-authored-by: Awni Hannun <awni@apple.com>
2025-02-09 20:12:34 -08:00
Goekdeniz-Guelmez
00712522ba
rebase loss calculation
2025-02-09 17:13:05 +01:00
Goekdeniz-Guelmez
a527cdb39b
fix: prevent gradients from flowing through the reference model's logits
2025-02-09 17:02:58 +01:00
Goekdeniz-Guelmez
54179901b5
fix
2025-02-09 15:41:47 +01:00
Goekdeniz-Guelmez
9ba6146a76
fix
2025-02-09 14:32:50 +01:00
Awni Hannun
31611b62d7
Add IBM granite model (#1265)
* add granite
* add thinking option
2025-02-08 15:46:15 -08:00
Gökdeniz Gülmez
94dcd0f63e
Merge branch 'ml-explore:main' into adding-GRPO-training
2025-02-06 08:15:58 +01:00
Goekdeniz-Guelmez
bcfa55d882
updates
2025-02-05 15:02:12 +01:00
Goekdeniz-Guelmez
0a19522ec4
updates
2025-02-05 14:38:09 +01:00
Goekdeniz-Guelmez
35a2d99cf9
smoll fix
2025-02-05 11:30:21 +01:00
Goekdeniz-Guelmez
a33cad84b4
updates
2025-02-05 09:48:00 +01:00
Goekdeniz-Guelmez
2a8e6f6e44
update
2025-02-05 08:47:03 +01:00
Goekdeniz-Guelmez
0a09a93454
fix cache handling
2025-02-05 08:44:06 +01:00
Goekdeniz-Guelmez
7b0141455e
better create_dataset
2025-02-04 10:43:00 +01:00
Goekdeniz-Guelmez
bd1a42ec2f
adding args into dataset handling
2025-02-04 10:22:34 +01:00
Goekdeniz-Guelmez
7173840283
first successful training run
2025-02-04 09:18:45 +01:00
Goekdeniz-Guelmez
ca32424043
updates
2025-02-03 21:57:26 +01:00
Goekdeniz-Guelmez
54e295ea80
fix name funcs
2025-02-03 19:56:11 +01:00
Goekdeniz-Guelmez
06f9c29c94
print func name
2025-02-03 19:47:40 +01:00
Goekdeniz-Guelmez
40bca770ae
fixes
2025-02-03 19:43:49 +01:00
Goekdeniz-Guelmez
05d921b788
optims
2025-02-03 19:37:05 +01:00
Awni Hannun
d9924d08d1
Fix no validation in lora (#1241)
2025-02-03 09:55:24 -08:00
Goekdeniz-Guelmez
1d9e4802f0
first working prototype, will try training out at home
2025-02-03 12:05:29 +01:00
Goekdeniz-Guelmez
23d75cd7ad
starting first training test run
2025-02-03 10:08:28 +01:00
Goekdeniz-Guelmez
a3ed632422
dataset wrapper done
2025-02-03 09:13:17 +01:00
Goekdeniz-Guelmez
d034ca369e
adding function for R1
2025-02-03 08:26:42 +01:00
Goekdeniz-Guelmez
243c9621d9
update lora.py
2025-01-31 21:10:44 +01:00
Goekdeniz-Guelmez
a57d553fc1
update
2025-01-31 16:57:43 +01:00
Goekdeniz-Guelmez
80bcf68956
grpo_trainer should be done
2025-01-31 16:54:18 +01:00
Goekdeniz-Guelmez
6c58aa995c
updates
2025-01-31 16:27:31 +01:00
Goekdeniz-Guelmez
93370ff1c3
updates and fixing the KL div lines
2025-01-30 23:55:40 +01:00
Gökdeniz Gülmez
b1e573d6e8
Merge branch 'ml-explore:main' into adding-GRPO-training
2025-01-29 15:07:52 +01:00
Goekdeniz-Guelmez
5e0ae83487
initial commit, gn
2025-01-29 00:19:07 +01:00
Gökdeniz Gülmez
77faa14ba4
adding support for kyutai's helium (#1208)
* initial commit
* adding helium into training
* Update ACKNOWLEDGMENTS.md
* nits
* nits
* fixes / nits
---------
Co-authored-by: Awni Hannun <awni@apple.com>
2025-01-26 07:19:07 -08:00
Victor Nogueira
df1406735b
Fix dataset variable name, in datasets.py (#1212)
2025-01-21 14:12:43 -08:00
Awni Hannun
50f0a7f6d9
add internlm3 (#1206)
2025-01-15 14:55:41 -08:00
Ivan Fioravanti
6ae6c72c2e
reduction moved to CPU in case of distributed training (#1200)
2025-01-14 17:20:42 -08:00
Chime Ogbuji
0228c46434
Custom local dataset features (#1085)
* Generalize prompt_feature and completion_feature for use in local datasets to facilitate compatibility with many other training dataset formats.
* Persist configured prompt/completion key
* rebase + nits
---------
Co-authored-by: Awni Hannun <awni@apple.com>
2025-01-13 10:01:18 -08:00
Awni Hannun
c4833a2f55
fix encoding with special tokens + chat template (#1189)
2025-01-03 10:50:59 -08:00
Prince Canuma
dfa4dd6c93
Add support for cohere2 (#1157)
* add support for cohere2
* revert to act_fn to silu
* fix tests and sliding window attention
* add tests
* add to tuner
* fix sliding window
* add coauthor :)
Co-authored-by: n8programs <43304488+N8python@users.noreply.github.com>
* Add rotating kvcache to save space
* some nits
* style
* nits
---------
Co-authored-by: n8programs <43304488+N8python@users.noreply.github.com>
Co-authored-by: N8 <n8@n8programs.com>
Co-authored-by: Awni Hannun <awni@apple.com>
2024-12-16 08:01:03 -08:00
n8programs
5687d5b99b
Adds EXAONE architecture. (#1145)
* Adds EXAONE architecture.
* nits + format
* format
* clean up and fix rope
* clean up and fix rope
---------
Co-authored-by: Awni Hannun <awni@apple.com>
2024-12-09 07:58:25 -08:00
Alex Barron
2211b27388
Mixed Quantizations (#1132)
* saving/loading mixed quantizations
* comment
* add bits per weight
* more concise bpw
* count bias too
2024-12-08 14:21:50 -08:00
Awni Hannun
8801beb66f
Add olmo2 (#1128)
* add olmo2
* add olmo2
2024-12-02 11:42:58 -08:00