Commit Graph

  • 5a8ba492e7 changing lora layers back to 16 Goekdeniz-Guelmez 2024-09-05 09:48:07 +0200
  • 74f743bc23 fix Goekdeniz-Guelmez 2024-09-05 09:44:57 +0200
  • b184700912 removing the unnesesairy files Goekdeniz-Guelmez 2024-09-05 09:36:43 +0200
  • f965aef271 clean up Goekdeniz-Guelmez 2024-09-05 09:36:14 +0200
  • e8f5a6b213 quick clean up Goekdeniz-Guelmez 2024-09-05 09:17:10 +0200
  • 9d14ea57e3 removing the model_type if stuff in the _step loop in generate_step and adding MambaCache in base.py for training easier generations and removing mamba in tuner/utils. Goekdeniz-Guelmez 2024-09-04 23:23:59 +0200
  • 290c1a4dda update ACKNOWLEDGEMENTS Goekdeniz-Guelmez 2024-09-04 23:03:19 +0200
  • fd3bd6d9aa Fixing the cache handling, generating works now trying training Goekdeniz-Guelmez 2024-09-04 23:00:25 +0200
  • 107575133e Done! Goekdeniz-Guelmez 2024-09-04 22:58:55 +0200
  • de1fdc7fdf fixing generate and logits outputs Goekdeniz-Guelmez 2024-09-04 22:46:45 +0200
  • 236acb16a8 Fixing the Batching Depfwise Comnvolution and multi token input Goekdeniz-Guelmez 2024-09-04 22:08:32 +0200
  • e6c96f2b7a update Goekdeniz-Guelmez 2024-09-04 20:43:55 +0200
  • 95e1690fc3
    Merge branch 'ml-explore:main' into adding-support-for-mamba Gökdeniz Gülmez 2024-09-04 20:23:19 +0200
  • 974cfe7ef8
    Merge branch 'ml-explore:main' into adding-full-finetuning Gökdeniz Gülmez 2024-09-04 20:23:08 +0200
  • bd29aec299
    Support HuggingFace model tree (#957) madroid 2024-09-04 21:19:32 +0800
  • 282304a87d fix mapping for sdxl turbo Pranav Veldurthi 2024-09-04 09:09:52 -0400
  • 83a209e200
    Add prompt piping (#962) Chime Ogbuji 2024-09-03 16:29:10 -0400
  • bf921afcbe
    Make sure to import the correct "version" module when installing mlx_whisper and mlx_lm from local source code. (#969) James Zhao 2024-09-03 23:16:21 +0300
  • 5be8b96014 fix Awni Hannun 2024-09-03 13:08:22 -0700
  • a57b470829 Update exception text Chime Ogbuji 2024-09-03 15:40:29 -0400
  • aa14faf22b Fix reference to changed option name Chime Ogbuji 2024-09-03 15:18:13 -0400
  • 62655dfa2f Merge remote-tracking branch 'origin/generation_pipes' into generation_pipes Chime Ogbuji 2024-09-03 13:14:35 -0400
  • 16d51590a3 Fix capitalization typo Chime Ogbuji 2024-09-03 13:14:29 -0400
  • f1dbe6424d Make sure to import the correct "version" module when installing the mlx_lm package from local source code zhaoyafei 2024-08-30 23:46:51 +0300
  • 1fbc9361e4 Fix attention layers map for SD-2-1-Base Pranav Veldurthi 2024-09-01 14:34:35 -0400
  • f3733cf5f8 update Goekdeniz-Guelmez 2024-09-01 13:39:03 +0200
  • c9dc00b1a6 Merge branch 'adding-support-for-mamba' of https://github.com/Goekdeniz-Guelmez/mlx-examples into adding-support-for-mamba Goekdeniz-Guelmez 2024-09-01 13:38:33 +0200
  • d1a0ca0c30
    Merge branch 'ml-explore:main' into adding-support-for-mamba Gökdeniz Gülmez 2024-08-31 12:09:02 +0200
  • 9a5dc3716e
    Merge branch 'ml-explore:main' into adding-full-finetuning Gökdeniz Gülmez 2024-08-31 12:08:48 +0200
  • 4eaa36e994 Make sure to import the correct "version" module when installing the mlx_whisper package from local source code. zhaoyafei 2024-08-30 23:42:47 +0300
  • 9b05bc2795
    Merge branch 'ml-explore:main' into generation_pipes Chime Ogbuji 2024-08-30 13:08:10 -0400
  • 12603bb792 Switch to using --verbose instead of --prompt-only Chime Ogbuji 2024-08-30 13:07:55 -0400
  • 3c6e8b11af
    fix (#965) Awni Hannun 2024-08-30 05:56:27 -0700
  • 9284682a1f fix Awni Hannun 2024-08-29 21:16:40 -0700
  • fc93c55723
    feat(mlx_lm): Nemotron (#949) L 2024-08-29 21:08:57 -0700
  • 8ba0a606cb nits Awni Hannun 2024-08-29 20:13:06 -0700
  • fac1f8b444 Hub: remove config print madroid 2024-08-30 09:53:07 +0800
  • d94187eb34 Hub: update quantization_config value madroid 2024-08-30 09:50:26 +0800
  • c14570a896 Hub: add quantization_config for model tree Quantized type madroid 2024-08-30 09:46:50 +0800
  • b1186e2a81
    Docs on prompt scaling (#963) Awni Hannun 2024-08-29 15:05:17 -0700
  • b0ba935043
    Merge branch 'ml-explore:main' into adding-support-for-mamba Gökdeniz Gülmez 2024-08-29 22:34:05 +0200
  • b30900f0ef
    Merge branch 'ml-explore:main' into adding-full-finetuning Gökdeniz Gülmez 2024-08-29 22:33:36 +0200
  • d4b11f6d27 nits Awni Hannun 2024-08-29 13:06:32 -0700
  • 8b7fcfbe9f remove unused var Awni Hannun 2024-08-29 12:55:34 -0700
  • afe143cd76 docs on prompt scaling Awni Hannun 2024-08-29 12:53:18 -0700
  • 8dfc76be0f Initial commit of --prompt-only and prompt from STDIN feature Chime Ogbuji 2024-08-29 11:03:33 -0400
  • 2caa8329c0
    feat: show batch generation progress L Lllvvuu 2024-08-23 16:27:50 +0900
  • 280b3784d4
    feat: support batch input in generate() L Lllvvuu 2024-08-21 00:46:19 +0800
  • 1003a8b2dd
    Add the ability to load the KV cache from a file (#956) Angelos Katharopoulos 2024-08-28 22:11:45 -0700
  • 7f8c961287
    Fix setattr for the TokenizerWrapper (#961) Angelos Katharopoulos 2024-08-28 14:47:33 -0700
  • cbb4b78b03 Add metadata to the kv cache Angelos Katharopoulos 2024-08-28 14:24:37 -0700
  • 45db052856 Fix setattr for the TokenizerWrapper Angelos Katharopoulos 2024-08-28 13:42:29 -0700
  • 213a950e6e
    feat(clip): add linear probe evaluation script Saurav Maheshkar 2024-08-28 20:54:38 +0100
  • 7d0e1cc4e0
    feat: basic speculative decoding support in mlx_lm.generate / mlx_lm.server L Lllvvuu 2024-08-26 01:36:10 -0700
  • 01c8cbf71b Hub: add base_model metadata madroid 2024-08-27 15:04:55 +0800
  • 8cdc91a92e Hub: Update quantization configuration fields madroid 2024-08-27 15:04:19 +0800
  • 920efec17e Load kv cache from a file Angelos Katharopoulos 2024-08-23 18:33:26 -0700
  • 6cf995cd32
    Merge branch 'ml-explore:main' into adding-full-finetuning Gökdeniz Gülmez 2024-08-26 20:54:50 +0200
  • bf21789b17
    chore: update black pre-commit hooks to latest versions (#955) Nripesh Niketan 2024-08-26 20:24:23 +0530
  • 64a6b0383a
    Merge branch 'main' into black-25.8 Nripesh Niketan 2024-08-26 20:07:23 +0530
  • 0c34402c22 chore: update black pre-commit hooks to latest versions Nripesh Niketan 2024-08-26 20:06:33 +0530
  • b5e18ef1e3
    Add Phi-3.5-MoE (#946) Prince Canuma 2024-08-24 15:52:33 +0200
  • 0e7f44a875 nits Awni Hannun 2024-08-24 06:45:20 -0700
  • d76417197f fix SuScaled args Prince Canuma 2024-08-24 09:13:03 +0200
  • 300fbc1c52
    Merge branch 'ml-explore:main' into pc/phimoe Prince Canuma 2024-08-24 09:04:34 +0200
  • e3df32430e add switch_mlp Prince Canuma 2024-08-24 08:48:10 +0200
  • 0b20d08d1b add phimoe to tunner Prince Canuma 2024-08-24 08:47:11 +0200
  • 6731254e76
    Use fast rope (#945) Awni Hannun 2024-08-23 13:18:51 -0700
  • 5105b31cf7
    feat: show batch generation progress L Lllvvuu 2024-08-23 16:27:50 +0900
  • 9aabf08b23 hard code freqs Awni Hannun 2024-08-22 16:48:46 -0700
  • d87293428b
    Merge branch 'ml-explore:main' into adding-support-for-mamba Gökdeniz Gülmez 2024-08-22 21:37:47 +0200
  • 327eb0d257
    Merge branch 'ml-explore:main' into adding-full-finetuning Gökdeniz Gülmez 2024-08-22 21:37:36 +0200
  • 58591a1b41
    fine tune deepseek (#932) Awni Hannun 2024-08-22 10:41:21 -0700
  • 01e1f3c9e4
    fixup! feat: Nemotron L Lllvvuu 2024-08-23 00:21:23 +0900
  • 0eaec79dfb
    Merge branch 'main' into adding-full-finetuning Gökdeniz Gülmez 2024-08-21 22:00:30 +0200
  • 9c86658aff
    feat: Nemotron L Lllvvuu 2024-08-21 09:25:30 +0800
  • ef92993abb
    feat: support batch input in generate() L Lllvvuu 2024-08-21 00:46:19 +0800
  • 0a52a9d55a fix Awni Hannun 2024-08-20 17:24:05 -0700
  • 6b1d27a39f add phimoe Prince Canuma 2024-08-20 23:47:13 +0200
  • 15975697d2 nit Awni Hannun 2024-08-20 06:33:53 -0700
  • b35b086278 only one of base or freqs Awni Hannun 2024-08-19 17:32:03 -0700
  • 3f8c1aca20 fix deepseek v2 Awni Hannun 2024-08-19 16:09:17 -0700
  • a3431ccc25 fix su Awni Hannun 2024-08-19 16:02:25 -0700
  • fdc1c707c3 requires unreleased mlx Awni Hannun 2024-08-19 14:17:36 -0700
  • 3822d6bfc3 use fast rope for llama3.1 Awni Hannun 2024-08-19 14:15:39 -0700
  • 8b5c9ce6d2 fix llama Awni Hannun 2024-08-19 14:13:18 -0700
  • 9ac9fa6798 use fast rope Awni Hannun 2024-08-19 14:11:17 -0700
  • 6c79ceb452
    Merge branch 'ml-explore:main' into adding-support-for-mamba Gökdeniz Gülmez 2024-08-19 17:59:40 +0200
  • c02d462a94 update Goekdeniz-Guelmez 2024-08-19 17:59:25 +0200
  • 0164d2058b
    feat: DeepSeek MoE v1 (#942) L 2024-08-17 23:18:09 +0900
  • 6e1dc51e9a nits Awni Hannun 2024-08-17 07:17:53 -0700
  • 102a0bb7ac nits Awni Hannun 2024-08-17 06:56:24 -0700
  • 7bb77fd5bd nits Awni Hannun 2024-08-17 06:55:30 -0700
  • 8af5fc0315 Merge branch 'adding-support-for-mamba' of https://github.com/Goekdeniz-Guelmez/mlx-examples into adding-support-for-mamba Goekdeniz-Guelmez 2024-08-17 14:33:31 +0200
  • 3d3dfa39f1 utils.py logits = logits[:, -1, :] TypeError: tuple indices must be integers or slices, not tuple Goekdeniz-Guelmez 2024-08-17 14:32:19 +0200
  • ea6b9467e0
    feat: deepseek v1 L Lllvvuu 2024-08-17 01:38:19 +0900
  • 7be292c0c9
    Handle longer prompt/generation (#931) Awni Hannun 2024-08-16 15:28:39 -0700
  • 67c8f802fc
    Merge branch 'ml-explore:main' into adding-support-for-mamba Gökdeniz Gülmez 2024-08-16 23:02:36 +0200
  • d42893eebc
    Merge branch 'ml-explore:main' into adding-full-finetuning Gökdeniz Gülmez 2024-08-16 23:02:26 +0200
  • df04e0c03b update version Awni Hannun 2024-08-16 13:52:26 -0700