Commit Graph

  • ef6ff92add
    Merge branch 'ml-explore:main' into adding-GRPO-training Gökdeniz Gülmez 2025-02-25 13:26:22 +0100
  • 834c087479
    Avoid importing private function chaihahaha 2025-02-25 15:07:11 +0800
  • c37e26a1a3
    Add plamo-2-1b model (#1283) Shunta Saito 2025-02-25 12:24:43 +0900
  • 53185c7f3d last update, gn Goekdeniz-Guelmez 2025-02-24 22:20:07 +0100
  • e4eac9c97b adding custom system message integration in dataset, more optimizations (generation is now faster with the same RAM usage), fix for the identical generations, separated the reward functions into a separate file. Goekdeniz-Guelmez 2025-02-24 20:49:11 +0100
  • 7900a6c22c fix Awni Hannun 2025-02-24 09:10:24 -0800
  • 2edcc0355f cleanup Awni Hannun 2025-02-24 09:07:07 -0800
  • 9392bc70f7 cleanup Awni Hannun 2025-02-24 08:51:12 -0800
  • 09f5add151 Update lora.py paNikitin 2025-02-24 19:30:17 +0300
  • b2ab37238e added adapter additional tokens load on fuse paNikitin 2025-02-24 12:21:30 +0300
  • e2ace6fb0f Update trainer.py paNikitin 2025-02-24 09:12:31 +0300
  • 675c322978 Merge remote-tracking branch 'upstream/main' into mitmul/add-plamo2-1b-support Shunta Saito 2025-02-24 13:37:43 +0900
  • 431ece6c5b Generate: Support Prefill Prompt madroid 2025-02-24 12:19:49 +0800
  • 231f5e870e fixed full dequantization mem leak paNikitin 2025-02-23 15:24:59 +0300
  • 5b7581f41c Update trainer.py paNikitin 2025-02-23 12:56:09 +0300
  • a2b61afd05 update upstream paNikitin 2025-02-23 12:48:25 +0300
  • 0f790c4c84 Update lora.py paNikitin 2025-02-23 12:37:49 +0300
  • 95d44228c9 Update new_tokens.py paNikitin 2025-02-23 12:34:17 +0300
  • 68403f5577 added cot loss masking training paNikitin 2025-02-23 12:31:44 +0300
  • fb1559e1f3 Add __iter__ and __next__ methods to PlamoCache Shunta Saito 2025-02-23 17:33:19 +0900
  • e2d9d619c4 Add state property to PlamoCache Shunta Saito 2025-02-23 16:06:38 +0900
  • 21c0abaf23 Fix to use repeat instead of tile Shunta Saito 2025-02-23 14:54:23 +0900
  • d7426c7750 Do not pass mask to prepare_inputs_for_generation Shunta Saito 2025-02-23 14:47:49 +0900
  • 31225f4960 Remove unnecessary code and add a test for plamo2 Shunta Saito 2025-02-23 14:43:14 +0900
  • 12d1e968cc
    Add support for IPv6 chaihahaha 2025-02-23 02:13:11 +0800
  • bd5f081ca5
    Merge branch 'ml-explore:main' into adding-GRPO-training Gökdeniz Gülmez 2025-02-22 19:11:26 +0100
  • ea7ca22d10
    Merge branch 'ml-explore:main' into adding-dpo-training Gökdeniz Gülmez 2025-02-22 19:11:15 +0100
  • 2d4895d68b
    Merge branch 'ml-explore:main' into adding-orpo-training Gökdeniz Gülmez 2025-02-22 19:11:02 +0100
  • c0bd89aa15 add usage in LORA.md Goekdeniz-Guelmez 2025-02-22 17:29:06 +0100
  • 9705ed908e fix wrong generation in train Goekdeniz-Guelmez 2025-02-22 17:21:08 +0100
  • 09b641aaa7
    Fix FutureWarning in torch.load by setting weights_only=True (#1295) Usama Ahmed 2025-02-22 17:08:54 +0300
  • e4e7678c2c
    Fix FutureWarning in torch.load by setting weights_only=True Usama Ahmed 2025-02-22 14:36:24 +0300
  • d4e688edc3 Fix reference to layers Shunta Saito 2025-02-22 19:55:42 +0900
  • d9c4c6e60c clean up and re-adding temperature argument Goekdeniz-Guelmez 2025-02-22 02:34:56 +0100
  • d653371e3d nits Goekdeniz-Guelmez 2025-02-22 02:12:02 +0100
  • 235348c211 generation speed improvement in training too from 3 t/s to 15 t/s Goekdeniz-Guelmez 2025-02-22 02:03:01 +0100
  • 79de353530 nits Goekdeniz-Guelmez 2025-02-22 01:05:58 +0100
  • c51b0a2715 fix Goekdeniz-Guelmez 2025-02-22 00:21:47 +0100
  • 710bc1490e training mode working too; went from 2 toks/sec to 30 toks/sec with raw 1.5B model Goekdeniz-Guelmez 2025-02-21 22:42:15 +0100
  • 6086137131 Huge speed improvement in validation mode. Goekdeniz-Guelmez 2025-02-21 22:08:49 +0100
  • 5704136791 better dataset handling Goekdeniz-Guelmez 2025-02-21 21:12:45 +0100
  • de147187c1
    Merge branch 'ml-explore:main' into adding-orpo-training Gökdeniz Gülmez 2025-02-21 19:59:43 +0100
  • d050911ab5
    Merge branch 'ml-explore:main' into adding-dpo-training Gökdeniz Gülmez 2025-02-21 18:35:52 +0100
  • 2f20107d9b slightly faster generation + prints out an example generation in validation mode, more optimization in training function Goekdeniz-Guelmez 2025-02-21 16:02:27 +0100
  • 3d793ecf68
    Fix logits processor bugs with spec dec (#1291) Awni Hannun 2025-02-20 15:55:55 -0800
  • f83d02369d bump patch Awni Hannun 2025-02-20 15:48:43 -0800
  • 85669451d0
    Fix num layers in fine tune (#1294) Awni Hannun 2025-02-20 13:32:01 -0800
  • 5fc071a33b Fix num layers in fine tune Awni Hannun 2025-02-20 08:39:55 -0800
  • b1eb530531 Fix logits processor bugs with spec dec Awni Hannun 2025-02-19 14:03:02 -0800
  • 11c8991476
    Merge branch 'ml-explore:main' into adding-GRPO-training Gökdeniz Gülmez 2025-02-19 21:24:22 +0100
  • 6d5432b143
    Merge branch 'ml-explore:main' into adding-dpo-training Gökdeniz Gülmez 2025-02-19 21:24:10 +0100
  • 65d777b868
    Merge branch 'ml-explore:main' into adding-orpo-training Gökdeniz Gülmez 2025-02-19 21:23:49 +0100
  • 1cbf5cdac7
    use more standard window strategy (#1287) Awni Hannun 2025-02-19 06:22:51 -0800
  • 348f728c10
    Merge branch 'ml-explore:main' into adding-orpo-training Gökdeniz Gülmez 2025-02-18 17:19:07 +0100
  • ab35c87911
    Merge branch 'ml-explore:main' into adding-dpo-training Gökdeniz Gülmez 2025-02-18 17:17:49 +0100
  • 541f0be937 fix generation cutoff in evaluation Goekdeniz-Guelmez 2025-02-17 14:39:38 +0100
  • 1eea135a20
    Merge branch 'ml-explore:main' into adding-GRPO-training Gökdeniz Gülmez 2025-02-17 14:25:03 +0100
  • 6a6bd53e43 removing print and switching some variables in the math Goekdeniz-Guelmez 2025-02-15 15:38:51 +0100
  • 5ec4790656 removing comments + adding temperature + reward weighting Goekdeniz-Guelmez 2025-02-15 15:29:22 +0100
  • 1e75bf184c Include .jsonl files to download from Huggingface hub Shunta Saito 2025-02-15 10:08:33 +0900
  • 9f422b4729 Fix import Shunta Saito 2025-02-14 23:35:10 +0900
  • 28f3f3adab Give all inputs on the first call of the model Shunta Saito 2025-02-14 23:24:45 +0900
  • 103c6616c4 Remove unused code part Shunta Saito 2025-02-14 23:06:42 +0900
  • 66dd97ed3d Fix channel first weights to channel last for right use of MLX's conv1d Shunta Saito 2025-02-14 22:38:13 +0900
  • 81917d41d5 Allow a cache obj defined externally Shunta Saito 2025-02-14 21:11:30 +0900
  • 00d13ebd40 Fix some part Shunta Saito 2025-02-14 01:51:06 +0900
  • fb5e225523 Apply formatter Shunta Saito 2025-02-13 20:02:38 +0900
  • 07cf4336b3 Add plamo2.py Shunta Saito 2025-02-13 19:54:07 +0900
  • ebea6928a3 Remove unnecessary changes Shunta Saito 2025-02-13 19:53:34 +0900
  • 72269c306c Use sanitize() Shunta Saito 2025-02-13 19:45:01 +0900
  • 197fd6aad8 Fix model Shunta Saito 2025-02-13 18:57:00 +0900
  • 40c7ce8048 Use mlx's BaseModelArgs Shunta Saito 2025-02-13 14:47:08 +0900
  • 9a6e6541de Fix cache.py to support non-top level layers Shunta Saito 2025-02-13 13:44:31 +0900
  • 58686bbcac Add pfnet/plamo-2-1b Shunta Saito 2025-02-13 13:06:44 +0900
  • 7efc5f8c5e use more standard window strategy Awni Hannun 2025-02-14 06:53:31 -0800
  • baeb9f117f redundancy fix + nits Goekdeniz-Guelmez 2025-02-14 09:09:59 +0100
  • 96bf37008e
    Update README.md to include how to set temperature (#1280) Matthias Neumayer 2025-02-14 04:32:56 +0100
  • 2229775369 nits Awni Hannun 2025-02-13 19:31:51 -0800
  • 7b07b14e67
    add logits processor to spec gen (#1260) Awni Hannun 2025-02-13 19:19:53 -0800
  • 65a49dda0e nits Goekdeniz-Guelmez 2025-02-13 21:46:30 +0100
  • 8179b99436 quick prompting fix Goekdeniz-Guelmez 2025-02-12 19:24:35 +0100
  • a7273f6a56 small fix Goekdeniz-Guelmez 2025-02-12 18:30:12 +0100
  • 0408925f0d
    Update README.md to include how to set temperature Matthias Neumayer 2025-02-12 12:14:50 +0100
  • 3823154014
    Merge branch 'ml-explore:main' into adding-GRPO-training Gökdeniz Gülmez 2025-02-12 11:10:10 +0100
  • 4b44434c54
    Merge branch 'ml-explore:main' into adding-dpo-training Gökdeniz Gülmez 2025-02-12 11:09:58 +0100
  • 80c64da960
    Merge branch 'ml-explore:main' into adding-orpo-training Gökdeniz Gülmez 2025-02-12 11:09:43 +0100
  • c26e188417
    Merge branch 'ml-explore:main' into adding-support-for-mamba2 Gökdeniz Gülmez 2025-02-12 11:09:20 +0100
  • e33d9d509b updates Goekdeniz-Guelmez 2025-02-12 11:07:53 +0100
  • 607c300e18 Add Direct Preference Optimization (DPO) method Anupam Mediratta 2025-02-12 15:21:21 +0530
  • c42e858d7e Merge branch 'adding-GRPO-training' of https://github.com/Goekdeniz-Guelmez/mlx-examples into adding-GRPO-training Goekdeniz-Guelmez 2025-02-12 08:57:33 +0100
  • 5aeefc8c47 update new iterate batches function + nits Goekdeniz-Guelmez 2025-02-12 08:57:26 +0100
  • ec30dc3538
    hunyuan finetune (#1270) Awni Hannun 2025-02-11 16:49:35 -0800
  • 42413c5d85
    fix lora timings after validation (#1278) Awni Hannun 2025-02-11 16:48:55 -0800
  • 423badeb2b fix lora timings after validation Awni Hannun 2025-02-11 16:44:28 -0800
  • f8cbf159e0
    fix sharding for more even number of layers (#1276) Awni Hannun 2025-02-11 16:26:59 -0800
  • e879ea70e1
    fix generation evaluations (#1277) Awni Hannun 2025-02-11 16:10:30 -0800
  • 3d677f0870
    Add "from_draft" to GenerationResponse (#1272) Matt Clayton 2025-02-11 18:41:02 -0500
  • e1ee189ad1 fix generation evaluations Awni Hannun 2025-02-11 15:24:54 -0800
  • 361e3547e8 fix sharding for more even number of layers Awni Hannun 2025-02-11 13:41:36 -0800
  • 978deab589 small fix Goekdeniz-Guelmez 2025-02-11 17:48:42 +0100