Commit Graph

736 Commits

Author SHA1 Message Date
Goekdeniz-Guelmez
f13a0d04ca seperate functions 2025-03-05 15:28:12 +01:00
Goekdeniz-Guelmez
d723ddfeda updates 2025-03-05 14:49:56 +01:00
Goekdeniz-Guelmez
9a36452519 updates 2025-03-05 14:42:34 +01:00
Goekdeniz-Guelmez
326935be49 updates 2025-03-05 14:40:23 +01:00
Goekdeniz-Guelmez
2d2f39f96e updates 2025-03-05 14:25:55 +01:00
Goekdeniz-Guelmez
1f89453295 eos token return fix 2025-03-05 14:00:51 +01:00
Goekdeniz-Guelmez
2bde97fe13 minor speed improvement 2025-03-05 13:55:24 +01:00
Goekdeniz-Guelmez
3dfb21267b updates 2025-03-05 12:59:41 +01:00
Gökdeniz Gülmez
c817743333
Merge branch 'ml-explore:main' into adding-GRPO-training 2025-03-03 22:13:42 +01:00
Pierre-Louis
1bc3476a46
chore(lora): Add real-time log buffering fix for nohup execution (#1311)
* chore(lora): Add real-time log buffering fix for nohup execution

Disable Python stdout buffering to ensure logs appear in nohup.out in real-time instead of only after script completion.

* chore(lora): remove python 3.7+ check

* chore(lora): running pre-commit hook

---------

Co-authored-by: Pierre-Louis Létoquart <randlgint@proton.me>
2025-03-03 06:12:33 -08:00
Shunta Saito
269faa5fa4
Fix plamo2 model to use rms_norm (#1308)
* Fix plamo2 model to use rms_norm and enable sliding window attention

* Fix missing variable

* Remove sliding window attention impl. cause it should be done by using RotatingKVCache

* Remove unused imports
2025-03-03 06:12:02 -08:00
Goekdeniz-Guelmez
132225a018 updates 2025-03-01 22:23:33 +01:00
Goekdeniz-Guelmez
925e11439b updates 2025-02-28 22:07:24 +01:00
Gökdeniz Gülmez
80e10a59d7
Merge branch 'main' into adding-GRPO-training 2025-02-28 21:16:02 +01:00
Awni Hannun
845cd8c01e
support kimi + more options in chat mode (#1312) 2025-02-28 11:33:18 -08:00
Awni Hannun
b2108a0de6
Allow mask prompt in config (#1314) 2025-02-28 11:33:04 -08:00
Goekdeniz-Guelmez
15d53279ae batching fix 2025-02-28 16:02:40 +01:00
Gökdeniz Gülmez
a04eb02257
Merge branch 'ml-explore:main' into adding-GRPO-training 2025-02-28 11:18:32 +01:00
madroid
eb73549631
Generate: Support Prefill Response (#1299)
* Generate: Support Prefill Prompt

python -m mlx_lm.generate \
       --model mlx-community/DeepSeek-R1-Distill-Qwen-1.5B-4bit \
       --prompt "hello" \
       --prefill-prompt "<think>\n"

* Generate: rename prefill-prompt to prefill-response

* nits

---------

Co-authored-by: Awni Hannun <awni.hannun@gmail.com>
2025-02-27 07:44:00 -08:00
Gökdeniz Gülmez
f27ed26b32
Merge branch 'ml-explore:main' into adding-GRPO-training 2025-02-27 11:23:20 +01:00
Awni Hannun
00a7379070
Fixes for phi4 mini (#1305) 2025-02-26 16:21:54 -08:00
Awni Hannun
0f240a4c7e
Use max tokens from options in mlx_lm evaluate (#1302) 2025-02-26 15:46:16 -08:00
Awni Hannun
56e60ad5a6
fix manage for new transformers (#1304) 2025-02-26 15:44:57 -08:00
Pedro Cuenca
b7f742ef56
Mixed quant recipes (#1300)
* Mixed 3/6 and 2/6 recipes based on Alex Barron's

* format / nits

---------

Co-authored-by: Awni Hannun <awni.hannun@gmail.com>
2025-02-26 11:32:36 -08:00
Goekdeniz-Guelmez
fab2dc2688 smoll fix 2025-02-26 15:21:57 +01:00
Gökdeniz Gülmez
ef6ff92add
Merge branch 'ml-explore:main' into adding-GRPO-training 2025-02-25 13:26:22 +01:00
Shunta Saito
c37e26a1a3
Add plamo-2-1b model (#1283)
* Add pfnet/plamo-2-1b

* Fix cache.py to support non-top level layers

* Use mlx's BaseModelArgs

* Fix model

* Use sanitize()

* Remove unnecessary changes

* Add plamo2.py

* Apply formatter

* Fix some part

* Allow a cache obj defined externally

* Fix channel first weights to channel last for right use of MLX's conv1d

* Remove unused code part

* Give all inputs when it's the first time call of model

* Fix import

* Include .jsonl files to download from Huggingface hub

* Fix reference to layers

* Remove unnecessary code and add a test for plamo2

* Do not pass mask to prepare_inputs_for_generation

* Fix to use repeat instead of tile

* Add state property to PlamoCache

* Add __iter__ and __next__ methods to PlamoCache

* cleanup

* cleanup

* fix

---------

Co-authored-by: Awni Hannun <awni.hannun@gmail.com>
2025-02-24 19:24:43 -08:00
Goekdeniz-Guelmez
53185c7f3d last update, gn 2025-02-24 22:20:07 +01:00
Goekdeniz-Guelmez
e4eac9c97b adding custom system message integration in dataset, more opimizations (generates now faster, while same RAM usage), fix for the identical generatrions, seperated the reward functions into a seperate file. 2025-02-24 20:49:22 +01:00
Gökdeniz Gülmez
bd5f081ca5
Merge branch 'ml-explore:main' into adding-GRPO-training 2025-02-22 19:11:26 +01:00
Goekdeniz-Guelmez
c0bd89aa15 add usage in LORA.md 2025-02-22 17:29:06 +01:00
Goekdeniz-Guelmez
9705ed908e fix wrong generation in train 2025-02-22 17:21:08 +01:00
Usama Ahmed
09b641aaa7
Fix FutureWarning in torch.load by setting weights_only=True (#1295) 2025-02-22 06:08:54 -08:00
Goekdeniz-Guelmez
d9c4c6e60c clean up and readding temperature argument 2025-02-22 02:34:56 +01:00
Goekdeniz-Guelmez
d653371e3d nits 2025-02-22 02:12:02 +01:00
Goekdeniz-Guelmez
235348c211 generation speed improvement in training too from 3 t/s to 15 t/s 2025-02-22 02:03:01 +01:00
Goekdeniz-Guelmez
79de353530 nits 2025-02-22 01:05:58 +01:00
Goekdeniz-Guelmez
c51b0a2715 fix 2025-02-22 00:21:47 +01:00
Goekdeniz-Guelmez
710bc1490e training mode working too got from 2 toks/sec to 30 toks/sec with raw 1.5B model 2025-02-21 22:42:15 +01:00
Goekdeniz-Guelmez
6086137131 Huge speed improvement in validation mode. 2025-02-21 22:08:49 +01:00
Goekdeniz-Guelmez
2f20107d9b little faster generation + prints ot a examplke generatino in validation mode, more optimization in trianing function 2025-02-21 16:02:27 +01:00
Awni Hannun
3d793ecf68
Fix logits processor bugs with spec dec (#1291)
* Fix logits processor bugs with spec dec

* bump patch
2025-02-20 15:55:55 -08:00
Awni Hannun
85669451d0
Fix num layers in fine tune (#1294) 2025-02-20 13:32:01 -08:00
Gökdeniz Gülmez
11c8991476
Merge branch 'ml-explore:main' into adding-GRPO-training 2025-02-19 21:24:22 +01:00
Awni Hannun
1cbf5cdac7
use more standard window strategy (#1287) 2025-02-19 06:22:51 -08:00
Goekdeniz-Guelmez
541f0be937 fix generation cutoff in evaluation 2025-02-17 14:39:38 +01:00
Gökdeniz Gülmez
1eea135a20
Merge branch 'ml-explore:main' into adding-GRPO-training 2025-02-17 14:25:03 +01:00
Goekdeniz-Guelmez
6a6bd53e43 removing print and switching some variables in the math 2025-02-15 15:38:51 +01:00
Goekdeniz-Guelmez
5ec4790656 removing comments + adding temperature + reward weighting 2025-02-15 15:29:22 +01:00
Goekdeniz-Guelmez
baeb9f117f reduncancy fix + nits 2025-02-14 09:09:59 +01:00